Your browser doesn't support javascript.
loading
BiLSTM-5mC: A Bidirectional Long Short-Term Memory-Based Approach for Predicting 5-Methylcytosine Sites in Genome-Wide DNA Promoters.
Cheng, Xin; Wang, Jun; Li, Qianyue; Liu, Taigang.
  • Cheng X; College of Information Technology, Shanghai Ocean University, Shanghai 201306, China.
  • Wang J; School of Software Technology, Zhejiang University, Ningbo 315048, China.
  • Li Q; College of Information Technology, Shanghai Ocean University, Shanghai 201306, China.
  • Liu T; College of Information Technology, Shanghai Ocean University, Shanghai 201306, China.
Molecules ; 26(24)2021 Dec 07.
Article en En | MEDLINE | ID: mdl-34946497
An important reason of cancer proliferation is the change in DNA methylation patterns, characterized by the localized hypermethylation of the promoters of tumor-suppressor genes together with an overall decrease in the level of 5-methylcytosine (5mC). Therefore, identifying the 5mC sites in the promoters is a critical step towards further understanding the diverse functions of DNA methylation in genetic diseases such as cancers and aging. However, most wet-lab experimental techniques are often time consuming and laborious for detecting 5mC sites. In this study, we proposed a deep learning-based approach, called BiLSTM-5mC, for accurately identifying 5mC sites in genome-wide DNA promoters. First, we randomly divided the negative samples into 11 subsets of equal size, one of which can form the balance subset by combining with the positive samples in the same amount. Then, two types of feature vectors encoded by the one-hot method, and the nucleotide property and frequency (NPF) methods were fed into a bidirectional long short-term memory (BiLSTM) network and a full connection layer to train the 22 submodels. Finally, the outputs of these models were integrated to predict 5mC sites by using the majority vote strategy. Our experimental results demonstrated that BiLSTM-5mC outperformed existing methods based on the same independent dataset.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Envejecimiento / ADN / 5-Metilcitosina / Aprendizaje Profundo / Neoplasias Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Año: 2021 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Envejecimiento / ADN / 5-Metilcitosina / Aprendizaje Profundo / Neoplasias Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Año: 2021 Tipo del documento: Article