Your browser doesn't support javascript.
loading
Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction.
de Oliveira, Saulo H P; Law, Eleanor C; Shi, Jiye; Deane, Charlotte M.
Afiliação
  • de Oliveira SHP; Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.
  • Law EC; Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.
  • Shi J; Department of Informatics, UCB Pharma, Slough SL1 3WE, UK.
  • Deane CM; Division of Physical Biology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, Shanghai 201800, China.
Bioinformatics ; 34(7): 1132-1140, 2018 04 01.
Article em En | MEDLINE | ID: mdl-29136098
ABSTRACT
Motivation Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally.

Results:

We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5-2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. Availability and implementation Data are available for download from http//opig.stats.ox.ac.uk/resources. SAINT2 is available for download from https//github.com/sauloho/SAINT2. Contact saulo.deoliveira@dtc.ox.ac.uk. Supplementary information Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Conformação Proteica / Software / Biologia Computacional / Análise de Sequência de Proteína Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Animals / Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2018 Tipo de documento: Article País de afiliação: Reino Unido

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Conformação Proteica / Software / Biologia Computacional / Análise de Sequência de Proteína Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Animals / Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2018 Tipo de documento: Article País de afiliação: Reino Unido