Your browser doesn't support javascript.
loading
Modeling multi-species RNA modification through multi-task curriculum learning.
Xiong, Yuanpeng; He, Xuan; Zhao, Dan; Tian, Tingzhong; Hong, Lixiang; Jiang, Tao; Zeng, Jianyang.
Afiliação
  • Xiong Y; Bioinformatics Division, BNRIST/Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China.
  • He X; Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China.
  • Zhao D; Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China.
  • Tian T; Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China.
  • Hong L; Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China.
  • Jiang T; Department of Computer Science and Engineering, University of California, Riverside, CA 92521, USA.
  • Zeng J; Bioinformatics Division, BNRIST/Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China.
Nucleic Acids Res ; 49(7): 3719-3734, 2021 04 19.
Article em En | MEDLINE | ID: mdl-33744973
ABSTRACT
N6-methyladenosine (m6A) is the most pervasive modification in eukaryotic mRNAs. Numerous biological processes are regulated by this critical post-transcriptional mark, such as gene expression, RNA stability, RNA structure and translation. Recently, various experimental techniques and computational methods have been developed to characterize the transcriptome-wide landscapes of m6A modification for understanding its underlying mechanisms and functions in mRNA regulation. However, the experimental techniques are generally costly and time-consuming, while the existing computational models are usually designed only for m6A site prediction in a single-species and have significant limitations in accuracy, interpretability and generalizability. Here, we propose a highly interpretable computational framework, called MASS, based on a multi-task curriculum learning strategy to capture m6A features across multiple species simultaneously. Extensive computational experiments demonstrate the superior performances of MASS when compared to the state-of-the-art prediction methods. Furthermore, the contextual sequence features of m6A captured by MASS can be explained by the known critical binding motifs of the related RNA-binding proteins, which also help elucidate the similarity and difference among m6A features across species. In addition, based on the predicted m6A profiles, we further delineate the relationships between m6A and various properties of gene regulation, including gene expression, RNA stability, translation, RNA structure and histone modification. In summary, MASS may serve as a useful tool for characterizing m6A modification and studying its regulatory code. The source code of MASS can be downloaded from https//github.com/mlcb-thu/MASS.
Assuntos

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: RNA / Adenosina / Aprendizado de Máquina Tipo de estudo: Prognostic_studies Limite: Animals / Humans Idioma: En Revista: Nucleic Acids Res Ano de publicação: 2021 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: RNA / Adenosina / Aprendizado de Máquina Tipo de estudo: Prognostic_studies Limite: Animals / Humans Idioma: En Revista: Nucleic Acids Res Ano de publicação: 2021 Tipo de documento: Article País de afiliação: China