Your browser doesn't support javascript.
loading
m5UMCB: Prediction of RNA 5-methyluridine sites using multi-scale convolutional neural network with BiLSTM.
Ji, Yingshan; Sun, Jianqiang; Xie, Jingxuan; Wu, Wei; Shuai, Stella C; Zhao, Qi; Chen, Wei.
Affiliation
  • Ji Y; School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
  • Sun J; School of Information Science and Engineering, Linyi University, Linyi, 276000, China.
  • Xie J; School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
  • Wu W; School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
  • Shuai SC; Biological Science, Northwestern University, Evanston, IL, 60208, USA.
  • Zhao Q; School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China. Electronic address: zhaoqi@lnu.edu.cn.
  • Chen W; Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, 611137, China. Electronic address: greatchen@ncst.edu.cn.
Comput Biol Med ; 168: 107793, 2024 01.
Article in En | MEDLINE | ID: mdl-38048661
ABSTRACT
As a prevalent RNA modification, 5-methyluridine (m5U) plays a critical role in diverse biological processes and disease pathogenesis. High-throughput identification of m5U typically relies on labor-intensive biochemical experiments using various sequencing-based techniques, which are not only time-consuming but also expensive. Consequently, there is a pressing need for more efficient and cost-effective computational methods to complement these high-throughput techniques. In this study, we present m5UMCB, a novel approach that harnesses a multi-scale convolutional neural network (CNN) in tandem with bidirectional long short-term memory (BiLSTM) to recognize m5U sites. Our method involves segmenting RNA sequences into smaller fragments based on a 3-mer length and subsequently mapping each fragment to a lower-dimensional vector representation using the global vectors for word representation (GloVe) technique. Through a series of multi-scale convolution and pooling operations, local features are extracted from RNA sequences and transformed into abstract, high-level features. The feature matrix is then inputted into a BiLSTM network, enabling the capture of contextual information and long-term dependencies within the sequence. Ultimately, a fully connected layer is employed to classify m5U sites. The validation results from 5-fold cross-validation (5-fold CV) test indicate that m5UMCB outperforms existing state-of-the-art predictive methods, demonstrating a 1.98% increase in the area under ROC curve (AUC) and significant improvements in relevant evaluation metrics. We are confident that m5UMCB will serve as a valuable tool for m5U prediction.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: RNA / Neural Networks, Computer Language: En Journal: Comput Biol Med Year: 2024 Type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: RNA / Neural Networks, Computer Language: En Journal: Comput Biol Med Year: 2024 Type: Article Affiliation country: China