Your browser doesn't support javascript.
loading
econvRBP: Improved ensemble convolutional neural networks for RNA binding protein prediction directly from sequence.
Zhao, Yuze; Du, Xiuquan.
Afiliação
  • Zhao Y; School of Computer Science and Technology, Anhui University, Hefei, Anhui, China.
  • Du X; School of Computer Science and Technology, Anhui University, Hefei, Anhui, China. Electronic address: dxqllp@163.com.
Methods ; 181-182: 15-23, 2020 10 01.
Article em En | MEDLINE | ID: mdl-31513916
ABSTRACT
RNA binding proteins (RBPs) determine RNA process from synthesis to decay, which play a key role in RNA transport, translation and degradation. Therefore, exploring RBPs' function from the amino acid sequence using computational methods has become one of the momentous topics in genome annotation. However, there still have some challenges (1) shallow feature Although the sequence determines structure is self-evident, it is difficult to analyze the essential features from simple sequence. (2) Poorly understand feature-based prediction methods mainly emphasize feature extraction, while in-depth understanding of protein mysteries limits the application of feature engineering. (3) Feature fusion multi-feature fusion is often used, but the features are not well integrated. In view of these challenges, we propose a novel ensemble convolutional neural network (econvRBP) to predict RBPs. In order to capture the local and global features of RNA binding proteins simultaneously, first of all, One Hot and Conjoint Triad encoding methods are used to transform amino acid sequence into local and global features, respectively. After that the local and global features are combined for further high-level feature extraction using convolutional neural networks. Some experiments are constructed to evaluate our method with 10-fold cross validation and the results show that it has achieved the best performance among all the predictors so far. We correctly predicted 99% of 2875 RBPs and 99% of 6782 non-RBPs with accuracy of 0.99. In addition, the datasets provided by RBPPred are also used to validate our models with an accuracy of 0.87. These results indicate that the econvRBP is the most excellent method at present, and will provide reliable guidance for the detection of RBPs. econvRBP is available at http//47.100.203.2183389/home.html/.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Proteínas de Ligação a RNA / Biologia Computacional / Análise de Sequência de Proteína / Aprendizado Profundo Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Methods Assunto da revista: BIOQUIMICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Proteínas de Ligação a RNA / Biologia Computacional / Análise de Sequência de Proteína / Aprendizado Profundo Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Methods Assunto da revista: BIOQUIMICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: China