Your browser doesn't support javascript.
loading
Improved prediction of DNA and RNA binding proteins with deep learning models.
Wu, Siwen; Guo, Jun-Tao.
Affiliation
  • Wu S; Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, United States.
  • Guo JT; Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, United States.
Brief Bioinform ; 25(4)2024 May 23.
Article in En | MEDLINE | ID: mdl-38856168
ABSTRACT
Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play important roles in essential biological processes. To facilitate functional annotation and accurate prediction of different types of NABPs, many machine learning-based computational approaches have been developed. However, the datasets used for training and testing as well as the prediction scopes in these studies have limited their applications. In this paper, we developed new strategies to overcome these limitations by generating more accurate and robust datasets and developing deep learning-based methods including both hierarchical and multi-class approaches to predict the types of NABPs for any given protein. The deep learning models employ two layers of convolutional neural network and one layer of long short-term memory. Our approaches outperform existing DBP and RBP predictors with a balanced prediction between DBPs and RBPs, and are more practically useful in identifying novel NABPs. The multi-class approach greatly improves the prediction accuracy of DBPs and RBPs, especially for the DBPs with ~12% improvement. Moreover, we explored the prediction accuracy of single-stranded DNA binding proteins and their effect on the overall prediction accuracy of NABP predictions.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: RNA-Binding Proteins / Computational Biology / DNA-Binding Proteins / Deep Learning Limits: Humans Language: En Journal: Brief Bioinform / Brief. bioinform / Briefings in bioinformatics Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2024 Type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: RNA-Binding Proteins / Computational Biology / DNA-Binding Proteins / Deep Learning Limits: Humans Language: En Journal: Brief Bioinform / Brief. bioinform / Briefings in bioinformatics Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2024 Type: Article Affiliation country: United States