Your browser doesn't support javascript.
loading
iDNA-ABT: advanced deep learning model for detecting DNA methylation with adaptive features and transductive information maximization.
Yu, Yingying; He, Wenjia; Jin, Junru; Xiao, Guobao; Cui, Lizhen; Zeng, Rao; Wei, Leyi.
Afiliação
  • Yu Y; School of Software, Shandong University, Jinan, China.
  • He W; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China.
  • Jin J; Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University), Fuzhou, China.
  • Xiao G; School of Software, Shandong University, Jinan, China.
  • Cui L; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China.
  • Zeng R; School of Software, Shandong University, Jinan, China.
  • Wei L; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China.
Bioinformatics ; 37(24): 4603-4610, 2021 12 11.
Article em En | MEDLINE | ID: mdl-34601568
ABSTRACT
MOTIVATION DNA methylation plays an important role in epigenetic modification, the occurrence, and the development of diseases. Therefore, identification of DNA methylation sites is critical for better understanding and revealing their functional mechanisms. To date, several machine learning and deep learning methods have been developed for the prediction of different DNA methylation types. However, they still highly rely on manual features, which can largely limit the high-latent information extraction. Moreover, most of them are designed for one specific DNA methylation type, and therefore cannot predict multiple methylation sites in multiple species simultaneously. In this study, we propose iDNA-ABT, an advanced deep learning model that utilizes adaptive embedding based on Bidirectional Encoder Representations from Transformers (BERT) together with transductive information maximization (TIM).

RESULTS:

Benchmark results show that our proposed iDNA-ABT can automatically and adaptively learn the distinguishing features of biological sequences from multiple species, and thus perform significantly better than the state-of-the-art methods in predicting three different DNA methylation types. In addition, TIM loss is proven to be effective in dichotomous tasks via the comparison experiment. Furthermore, we verify that our features have strong adaptability and robustness to different species through comparison of adaptive embedding and six handcrafted feature encodings. Importantly, our model shows great generalization ability in different species, demonstrating that our model can adaptively capture the cross-species differences and improve the predictive performance. For the convenient use of our method, we further established an online webserver as the implementation of the proposed iDNA-ABT. AVAILABILITY AND IMPLEMENTATION Our proposed iDNA-ABT and data are freely accessible via http//server.wei-group.net/iDNA_ABT and our source codes are available for downloading in the GitHub repository (https//github.com/YUYING07/iDNA_ABT). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Metilação de DNA / Aprendizado Profundo Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Metilação de DNA / Aprendizado Profundo Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2021 Tipo de documento: Article