Your browser doesn't support javascript.
loading
An Efficient Mixed-Model for Screening Differentially Expressed Genes of Breast Cancer Based on LR-RF.
Article em En | MEDLINE | ID: mdl-29993693
ABSTRACT
To screen differentially expressed genes quickly and efficiently in breast cancer, two gene microarray datasets of breast cancer, GSE15852 and GSE45255, were downloaded from GEO. By combining the Logistic Regression and Random Forest algorithm, this paper proposed a novel method named LR-RF to select differentially expressed genes of breast cancer on microarray data by the Bonferroni test of FWER error measure. Comparing with Logistic Regression and Random Forest, our study shows that LR-FR has a great facility in selecting differentially expressed genes. The average prediction accuracy of the proposed LR-RF from replicating random test 10 times surprisingly reaches 93.11 percent with variance as low as 0.00045. The prediction accuracy rate reaches a maximum 95.57 percent when threshold value α = 0.2 in the random forest algorithm process of ranking genes' importance score, and the differentially expressed genes are relatively few in number. In addition, through analyzing the gene interaction networks, most of the top 20 genes we selected were found to involve in the development of breast cancer. All of these results demonstrate the reliability and efficiency of LR-RF. It is anticipated that LR-RF would provide new knowledge and method for biologists, medical scientists, and cognitive computing researchers to identify disease-related genes of breast cancer.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias da Mama / Modelos Logísticos / Perfilação da Expressão Gênica / Aprendizado de Máquina Tipo de estudo: Diagnostic_studies / Prognostic_studies / Screening_studies Limite: Female / Humans Idioma: En Revista: ACM Trans Comput Biol Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias da Mama / Modelos Logísticos / Perfilação da Expressão Gênica / Aprendizado de Máquina Tipo de estudo: Diagnostic_studies / Prognostic_studies / Screening_studies Limite: Female / Humans Idioma: En Revista: ACM Trans Comput Biol Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article