Structural discrimination analysis for constraint selection in protein modeling.
Bottino, Guilherme F; Ferrari, Allan J R; Gozzo, Fabio C; Martínez, Leandro.
Affiliation
  • Bottino GF; Institute of Chemistry, University of Campinas, Campinas, SP, Brazil.
  • Ferrari AJR; Center for Computational Engineering & Science, University of Campinas, Campinas, SP, Brazil.
  • Gozzo FC; Institute of Chemistry, University of Campinas, Campinas, SP, Brazil.
  • Martínez L; Center for Computational Engineering & Science, University of Campinas, Campinas, SP, Brazil.
Bioinformatics; 37(21): 3766-3773, 2021 Nov 05.
Article in En | MEDLINE | ID: mdl-34086840
ABSTRACT
MOTIVATION:

Protein structure modeling can be improved by the use of distance constraints between amino acid residues, provided such data reflects, at least partially, the native tertiary structure of the target system. In fact, only a small subset of the native contact map is necessary to successfully drive the model conformational search, so one important goal is to obtain the set of constraints with the highest true-positive rate, lowest redundancy and greatest amount of information. In this work, we introduce a constraint evaluation and selection method based on the point-biserial correlation coefficient, which utilizes structural information from an ensemble of models to indirectly measure the power of each constraint in biasing the conformational search toward consensus structures.
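As an illustration of the statistic named above, the following minimal sketch computes a point-biserial correlation between a binary variable (whether a constraint is satisfied in each model of an ensemble) and a continuous per-model score. It is not the authors' implementation: the function names `point_biserial` and `rank_constraints`, the use of a generic "consensus score" per model, and the toy data are all assumptions made for the example.

```python
import math


def point_biserial(satisfied, scores):
    """Point-biserial correlation between a binary and a continuous variable.

    satisfied: list of bool, one per model (hypothetical: constraint met or not).
    scores:    list of float, one per model (hypothetical consensus score).
    Returns 0.0 when the correlation is undefined (one group empty or no variance).
    """
    n = len(scores)
    if n != len(satisfied):
        raise ValueError("satisfied and scores must have the same length")

    group1 = [s for flag, s in zip(satisfied, scores) if flag]
    group0 = [s for flag, s in zip(satisfied, scores) if not flag]
    n1, n0 = len(group1), len(group0)
    if n1 == 0 or n0 == 0:
        return 0.0

    mean_all = sum(scores) / n
    # Population standard deviation of the scores over all models.
    std = math.sqrt(sum((s - mean_all) ** 2 for s in scores) / n)
    if std == 0.0:
        return 0.0

    mean1 = sum(group1) / n1
    mean0 = sum(group0) / n0
    return (mean1 - mean0) / std * math.sqrt(n1 * n0 / (n * n))


def rank_constraints(satisfaction, scores):
    """Rank constraints by decreasing correlation with the model score.

    satisfaction: dict mapping a constraint label to its per-model bool list.
    Returns a list of (constraint, correlation) pairs.
    """
    pairs = [(c, point_biserial(flags, scores)) for c, flags in satisfaction.items()]
    return sorted(pairs, key=lambda pair: pair[1], reverse=True)


# Toy ensemble of four models with made-up scores.
scores = [0.8, 0.6, 0.4, 0.2]
satisfaction = {
    (10, 45): [True, True, False, False],   # met mostly in high-scoring models
    (3, 80): [False, False, True, True],    # met mostly in low-scoring models
}
ranking = rank_constraints(satisfaction, scores)
```

In this toy case the constraint satisfied mainly by high-scoring models ranks first with a positive coefficient, and the other receives the same magnitude with the opposite sign. How the per-model score is defined and how the coefficient is thresholded are choices the abstract does not specify.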

RESULTS:

Residue contact maps obtained by direct coupling analysis are systematically improved by means of discriminant analysis, reaching in some cases accuracies often seen only in modern deep-learning-based approaches. When combined with an iterative modeling workflow, the proposed constraint classification optimizes the selection of the constraint set and maximizes the probability of obtaining successful models. The use of discriminant analysis for the valorization of the information of constraint datasets is a general concept with possible applications to other constraint types and modeling problems.

AVAILABILITY AND IMPLEMENTATION:

MSA for the targets in this work is available at https://github.com/m3g/2021_Bottino_Biserial. Modeling data supporting the findings of this study was generated at the Center for Computing in Engineering and Sciences, and is available from the corresponding author LM on request.

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.
Subjects

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Proteins / Amino Acids Type of study: Prognostic_studies Language: En Year of publication: 2021 Document type: Article
