Your browser doesn't support javascript.
loading
cnnAlpha: Protein disordered regions prediction by reduced amino acid alphabets and convolutional neural networks.
Oberti, Mauricio; Vaisman, Iosif I.
Affiliation
  • Oberti M; School of Systems Biology, George Mason University, Manassas, Virginia, USA.
  • Vaisman II; Novartis Institutes for BioMedical Research, Cambridge, Massachussets, USA.
Proteins ; 88(11): 1472-1481, 2020 11.
Article in En | MEDLINE | ID: mdl-32535960
ABSTRACT
Intrinsically disordered regions (IDR) play an important role in key biological processes and are closely related to human diseases. IDRs have great potential to serve as targets for drug discovery, most notably in disordered binding regions. Accurate prediction of IDRs is challenging because their genome wide occurrence and a low ratio of disordered residues make them difficult targets for traditional classification techniques. Existing computational methods mostly rely on sequence profiles to improve accuracy which is time consuming and computationally expensive. This article describes an ab initio sequence-only prediction method-which tries to overcome the challenge of accurate prediction posed by IDRs-based on reduced amino acid alphabets and convolutional neural networks (CNNs). We experiment with six different 3-letter reduced alphabets. We argue that the dimensional reduction in the input alphabet facilitates the detection of complex patterns within the sequence by the convolutional step. Experimental results show that our proposed IDR predictor performs at the same level or outperforms other state-of-the-art methods in the same class, achieving accuracy levels of 0.76 and AUC of 0.85 on the publicly available Critical Assessment of protein Structure Prediction dataset (CASP10). Therefore, our method is suitable for proteome-wide disorder prediction yielding similar or better accuracy than existing approaches at a faster speed.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Neural Networks, Computer / Computational Biology / Data Mining / Intrinsically Disordered Proteins / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Proteins Journal subject: BIOQUIMICA Year: 2020 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Neural Networks, Computer / Computational Biology / Data Mining / Intrinsically Disordered Proteins / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Proteins Journal subject: BIOQUIMICA Year: 2020 Document type: Article Affiliation country: