QUATgo: Protein quaternary structural attributes predicted by two-stage machine learning approaches with heterogeneous feature encoding.
Tung, Chi-Hua; Chien, Ching-Hsuan; Chen, Chi-Wei; Huang, Lan-Ying; Liu, Yu-Nan; Chu, Yen-Wei.
Affiliation
  • Tung CH; Department of Bioinformatics, Chung-Hua University, Hsinchu, Taiwan (R.O.C.).
  • Chien CH; Ph.D. Program in Medical Biotechnology, National Chung Hsing University, Taichung City, Taiwan (R.O.C.).
  • Chen CW; Institute of Genomics and Bioinformatics, National Chung Hsing University, Taichung City, Taiwan (R.O.C.).
  • Huang LY; Institute of Genomics and Bioinformatics, National Chung Hsing University, Taichung City, Taiwan (R.O.C.).
  • Liu YN; Department of Computer Science and Engineering, National Chung Hsing University, Taichung City, Taiwan (R.O.C.).
  • Chu YW; Institute of Genomics and Bioinformatics, National Chung Hsing University, Taichung City, Taiwan (R.O.C.).
PLoS One; 15(4): e0232087, 2020.
Article in English | MEDLINE | ID: mdl-32348325
ABSTRACT
Many proteins exist in nature as oligomers with various quaternary structural attributes rather than as single chains. Predicting these attributes is an essential task in computational biology for the advancement of proteomics. However, existing methods do not consider the integration of heterogeneous coding or the accuracy of subunit categories with limited data. To this end, we propose QUATgo, a tool that can predict protein oligomers with more than 12 subunits. Three kinds of sequence coding were used: dipeptide composition, which was used for the first time to predict protein quaternary structural attributes; protein half-life characteristics; and functional domain composition, for which we modified the coding method proposed by predecessors to solve the problem of large feature vectors. QUATgo addresses the problem of insufficient data for a single subunit using a two-stage architecture, and 10-fold cross-validation was used to test the predictive accuracy of the classifier. QUATgo has 49.0% cross-validation accuracy and 31.1% independent test accuracy. In the case study, QUATgo reaches an accuracy of 61.5% in predicting the quaternary structure of influenza virus hemagglutinin proteins. Finally, QUATgo is freely accessible to the public as a web server at http://predictor.nchu.edu.tw/QUATgo.
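To make the dipeptide composition encoding mentioned in the abstract concrete, the following is a minimal sketch of the conventional definition: a 400-dimensional vector holding the relative frequency of each ordered pair of adjacent standard amino acids. The abstract does not describe QUATgo's exact normalization or its handling of non-standard residues, so the function name `dipeptide_composition` and the choice to skip pairs containing non-standard letters are assumptions made for illustration only.

```python
from itertools import product

# The 20 standard amino acids, in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# All 400 ordered dipeptides: "AA", "AC", ..., "YY".
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(sequence):
    """Return the 400-dimensional dipeptide frequency vector of a sequence.

    Assumption: pairs containing a non-standard residue (e.g. 'X') are
    skipped and do not count toward the normalizing total.
    """
    seq = sequence.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    # Slide a window of width 2 over the sequence.
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        # Sequence too short or without valid pairs: all-zero vector.
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


if __name__ == "__main__":
    vec = dipeptide_composition("ACAC")
    # "ACAC" contains the adjacent pairs AC, CA, AC.
    print(len(vec))
    print(vec[DIPEPTIDES.index("AC")], vec[DIPEPTIDES.index("CA")])
```

A vector of this kind has a fixed length regardless of sequence length, which is why it can be concatenated with other encodings before being passed to a classifier.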
Subjects

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Viral Proteins / Software / Proteins / Computational Biology / Sequence Analysis, Protein / Protein Structure, Quaternary / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Limits: Animals / Humans Language: En Journal: PLoS One Publication year: 2020 Document type: Article