Your browser doesn't support javascript.
loading
FS-GBDT: identification multicancer-risk module via a feature selection algorithm by integrating Fisher score and GBDT.
Zhang, Jialin; Xu, Da; Hao, Kaijing; Zhang, Yusen; Chen, Wei; Liu, Jiaguo; Gao, Rui; Wu, Chuanyan; De Marinis, Yang.
Afiliação
  • Zhang J; School of Mathematics and Statistics at Shandong University, China.
  • Xu D; School of Mathematics and Statistics at Shandong University, China.
  • Hao K; School of Mathematics and Statistics at Shandong University, China.
  • Zhang Y; academic leader of Computer Engineering in Shandong University, China.
  • Chen W; School of Mathematics and Statistics at Shandong University, China.
  • Liu J; School of Mathematics and Statistics at Shandong University, China.
  • Gao R; School of Control Science and Engineering, Shandong University.
  • Wu C; School of Intelligent Engineering in Shandong Management University.
  • De Marinis Y; Diabetes Centre at Lund University, Sweden.
Brief Bioinform ; 22(3)2021 05 20.
Article em En | MEDLINE | ID: mdl-34020547
ABSTRACT
Cancer is a highly heterogeneous disease caused by dysregulation in different cell types and tissues. However, different cancers may share common mechanisms. It is critical to identify decisive genes involved in the development and progression of cancer, and joint analysis of multiple cancers may help to discover overlapping mechanisms among different cancers. In this study, we proposed a fusion feature selection framework attributed to ensemble method named Fisher score and Gradient Boosting Decision Tree (FS-GBDT) to select robust and decisive feature genes in high-dimensional gene expression datasets. Joint analysis of 11 human cancers types was conducted to explore the key feature genes subset of cancer. To verify the efficacy of FS-GBDT, we compared it with four other common feature selection algorithms by Support Vector Machine (SVM) classifier. The algorithm achieved highest indicators, outperforms other four methods. In addition, we performed gene ontology analysis and literature validation of the key gene subset, and this subset were classified into several functional modules. Functional modules can be used as markers of disease to replace single gene which is difficult to be found repeatedly in applications of gene chip, and to study the core mechanisms of cancer.
Assuntos
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Algoritmos / Regulação Neoplásica da Expressão Gênica / Biologia Computacional / Perfilação da Expressão Gênica / Máquina de Vetores de Suporte / Neoplasias Tipo de estudo: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2021 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Algoritmos / Regulação Neoplásica da Expressão Gênica / Biologia Computacional / Perfilação da Expressão Gênica / Máquina de Vetores de Suporte / Neoplasias Tipo de estudo: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2021 Tipo de documento: Article País de afiliação: China