Your browser doesn't support javascript.
loading
A Computational Framework Based on Ensemble Deep Neural Networks for Essential Genes Identification.
Le, Nguyen Quoc Khanh; Do, Duyen Thi; Hung, Truong Nguyen Khanh; Lam, Luu Ho Thanh; Huynh, Tuan-Tu; Nguyen, Ngan Thi Kim.
Affiliation
  • Le NQK; Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei 106, Taiwan.
  • Do DT; Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei 106, Taiwan.
  • Hung TNK; Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan.
  • Lam LHT; Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei 106, Taiwan.
  • Huynh TT; International Master/Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan.
  • Nguyen NTK; Department of Orthopedic and Trauma, Cho Ray Hospital, Ho Chi Minh 70000, Vietnam.
Int J Mol Sci ; 21(23)2020 Nov 28.
Article in En | MEDLINE | ID: mdl-33260643
ABSTRACT
Essential genes contain key information of genomes that could be the key to a comprehensive understanding of life and evolution. Because of their importance, studies of essential genes have been considered a crucial problem in computational biology. Computational methods for identifying essential genes have become increasingly popular to reduce the cost and time-consumption of traditional experiments. A few models have addressed this problem, but performance is still not satisfactory because of high dimensional features and the use of traditional machine learning algorithms. Thus, there is a need to create a novel model to improve the predictive performance of this problem from DNA sequence features. This study took advantage of a natural language processing (NLP) model in learning biological sequences by treating them as natural language words. To learn the NLP features, a supervised learning model was consequentially employed by an ensemble deep neural network. Our proposed method could identify essential genes with sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC), and area under the receiver operating characteristic curve (AUC) values of 60.2%, 84.6%, 76.3%, 0.449, and 0.814, respectively. The overall performance outperformed the single models without ensemble, as well as the state-of-the-art predictors on the same benchmark dataset. This indicated the effectiveness of the proposed method in determining essential genes, in particular, and other sequencing problems, in general.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Neural Networks, Computer / Genes, Essential / Deep Learning Type of study: Diagnostic_studies / Prognostic_studies Language: En Journal: Int J Mol Sci Year: 2020 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Neural Networks, Computer / Genes, Essential / Deep Learning Type of study: Diagnostic_studies / Prognostic_studies Language: En Journal: Int J Mol Sci Year: 2020 Document type: Article Affiliation country: