Your browser doesn't support javascript.
loading
A feature selection method based on graph theory for cancer classification.
Zhou, Kai; Yin, Zhixiang; Gu, Jiaying; Zeng, Zhiliang.
Afiliação
  • Zhou K; School of Mathematics Physics and Statistics, Shanghai University of Engineering Science, Shanghai 201620, OR China.
  • Yin Z; School of Mathematics Physics and Statistics, Shanghai University of Engineering Science, Shanghai 201620, OR China.
  • Gu J; School of Mathematics Physics and Statistics, Shanghai University of Engineering Science, Shanghai 201620, OR China.
  • Zeng Z; School of Mathematics Physics and Statistics, Shanghai University of Engineering Science, Shanghai 201620, OR China.
Article em En | MEDLINE | ID: mdl-37056061
ABSTRACT

OBJECTIVE:

Gene expression profile data is a good data source for people to study tumors, but gene expression data has the characteristics of high dimension and redundancy. Therefore, gene selection is a very important step in microarray data classification.

METHOD:

In this paper, a feature selection method based on the maximum mutual information coefficient and graph theory is proposed. Each feature of gene expression data is treated as a vertex of the graph, and the maximum mutual information coefficient between genes is used to measure the relationship between the vertices to construct an undirected graph, and then the core and coritivity theory is used to determine the feature subset of gene data.

RESULTS:

In this work, we used three different classification models and three different evaluation metrics such as accuracy, F1-Score, and AUC to evaluate the classification performance to avoid reliance on any one classifier or evaluation metric. The experimental results on six different types of genetic data show that our proposed algorithm has high accuracy and robustness compared to other advanced feature selection methods.

CONCLUSION:

In this method, the importance and correlation of features are considered at the same time, and the problem of gene selection in microarray data classification is solved.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article