Your browser doesn't support javascript.
loading
Identification of lung cancer gene markers through kernel maximum mean discrepancy and information entropy.
Zhao, Zhixun; Peng, Hui; Zhang, Xiaocai; Zheng, Yi; Chen, Fang; Fang, Liang; Li, Jinyan.
Afiliação
  • Zhao Z; Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, PO Box 123, Broadway, Sydney, 2007, NSW, Australia.
  • Peng H; Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, PO Box 123, Broadway, Sydney, 2007, NSW, Australia.
  • Zhang X; Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, PO Box 123, Broadway, Sydney, 2007, NSW, Australia.
  • Zheng Y; Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, PO Box 123, Broadway, Sydney, 2007, NSW, Australia.
  • Chen F; Faculty of Engineering and Information Technology, University of Technology Sydney, PO Box 123, Broadway, Sydney, 2007, NSW, Australia.
  • Fang L; School of Computer, National University of Defense Technology, Changsha, 410073, China.
  • Li J; Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, PO Box 123, Broadway, Sydney, 2007, NSW, Australia. Jinyan.Li@uts.edu.au.
BMC Med Genomics ; 12(Suppl 8): 183, 2019 12 20.
Article em En | MEDLINE | ID: mdl-31856830
BACKGROUND: The early diagnosis of lung cancer has been a critical problem in clinical practice for a long time and identifying differentially expressed gene as disease marker is a promising solution. However, the most existing gene differential expression analysis (DEA) methods have two main drawbacks: First, these methods are based on fixed statistical hypotheses and not always effective; Second, these methods can not identify a certain expression level boundary when there is no obvious expression level gap between control and experiment groups. METHODS: This paper proposed a novel approach to identify marker genes and gene expression level boundary for lung cancer. By calculating a kernel maximum mean discrepancy, our method can evaluate the expression differences between normal, normal adjacent to tumor (NAT) and tumor samples. For the potential marker genes, the expression level boundaries among different groups are defined with the information entropy method. RESULTS: Compared with two conventional methods t-test and fold change, the top average ranked genes selected by our method can achieve better performance under all metrics in the 10-fold cross-validation. Then GO and KEGG enrichment analysis are conducted to explore the biological function of the top 100 ranked genes. At last, we choose the top 10 average ranked genes as lung cancer markers and their expression boundaries are calculated and reported. CONCLUSION: The proposed approach is effective to identify gene markers for lung cancer diagnosis. It is not only more accurate than conventional DEA methods but also provides a reliable method to identify the gene expression level boundaries.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Marcadores Genéticos / Biologia Computacional / Entropia / Neoplasias Pulmonares Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Marcadores Genéticos / Biologia Computacional / Entropia / Neoplasias Pulmonares Idioma: En Ano de publicação: 2019 Tipo de documento: Article