Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 46
Filtrar
1.
Nature ; 561(7724): 485-491, 2018 09.
Artículo en Inglés | MEDLINE | ID: mdl-30209393

RESUMEN

The regular arrangements of ß-strands around a central axis in ß-barrels and of α-helices in coiled coils contrast with the irregular tertiary structures of most globular proteins, and have fascinated structural biologists since they were first discovered. Simple parametric models have been used to design a wide range of α-helical coiled-coil structures, but to date there has been no success with ß-barrels. Here we show that accurate de novo design of ß-barrels requires considerable symmetry-breaking to achieve continuous hydrogen-bond connectivity and eliminate backbone strain. We then build ensembles of ß-barrel backbone models with cavity shapes that match the fluorogenic compound DFHBI, and use a hierarchical grid-based search method to simultaneously optimize the rigid-body placement of DFHBI in these cavities and the identities of the surrounding amino acids to achieve high shape and chemical complementarity. The designs have high structural accuracy and bind and fluorescently activate DFHBI in vitro and in Escherichia coli, yeast and mammalian cells. This de novo design of small-molecule binding activity, using backbones custom-built to bind the ligand, should enable the design of increasingly sophisticated ligand-binding proteins, sensors and catalysts that are not limited by the backbone geometries available in known protein structures.


Asunto(s)
Compuestos de Bencilo/química , Fluorescencia , Imidazolinas/química , Proteínas/química , Animales , Compuestos de Bencilo/análisis , Células COS , Chlorocebus aethiops , Escherichia coli , Proteínas Fluorescentes Verdes/genética , Proteínas Fluorescentes Verdes/metabolismo , Enlace de Hidrógeno , Imidazolinas/análisis , Ligandos , Unión Proteica , Dominios Proteicos , Pliegue de Proteína , Estabilidad Proteica , Estructura Secundaria de Proteína , Reproducibilidad de los Resultados , Levaduras
2.
J Comput Chem ; 44(14): 1360-1368, 2023 05 30.
Artículo en Inglés | MEDLINE | ID: mdl-36847771

RESUMEN

Cryo-electron microscopy (cryo-EM) is gaining large attention for high-resolution protein structure determination in solutions. However, a very high percentage of cryo-EM structures correspond to resolutions of 3-5 Å, making the structures difficult to be used in in silico drug design. In this study, we analyze how useful cryo-EM protein structures are for in silico drug design by evaluating ligand docking accuracy. From realistic cross-docking scenarios using medium resolution (3-5 Å) cryo-EM structures and a popular docking tool Autodock-Vina, only 20% of docking succeeded, when the success rate doubles in the same kind of cross-docking but using high-resolution (<2 Å) crystal structures instead. We decipher the reason for failures by decomposing the contribution from resolution-dependent and independent factors. The heterogeneity in the protein side-chain and backbone conformations is identified as the major resolution-dependent factor causing docking difficulty from our analysis, while intrinsic receptor flexibility mainly comprises the resolution-independent factor. We demonstrate the flexibility implementation in current ligand docking tools is able to rescue only a portion of failures (10%), and the limited performance was majorly due to potential structural errors than conformational changes. Our work suggests the strong necessity of more robust method developments on ligand docking and EM modeling techniques in order to fully utilize cryo-EM structures for in silico drug design.


Asunto(s)
Benchmarking , Proteínas , Microscopía por Crioelectrón/métodos , Ligandos , Proteínas/química , Diseño de Fármacos , Conformación Proteica
3.
Proc Natl Acad Sci U S A ; 117(3): 1496-1503, 2020 01 21.
Artículo en Inglés | MEDLINE | ID: mdl-31896580

RESUMEN

The prediction of interresidue contacts and distances from coevolutionary data using deep learning has considerably advanced protein structure prediction. Here, we build on these advances by developing a deep residual network for predicting interresidue orientations, in addition to distances, and a Rosetta-constrained energy-minimization protocol for rapidly and accurately generating structure models guided by these restraints. In benchmark tests on 13th Community-Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP13)- and Continuous Automated Model Evaluation (CAMEO)-derived sets, the method outperforms all previously described structure-prediction methods. Although trained entirely on native proteins, the network consistently assigns higher probability to de novo-designed proteins, identifying the key fold-determining residues and providing an independent quantitative measure of the "ideality" of a protein structure. The method promises to be useful for a broad range of protein structure prediction and design problems.


Asunto(s)
Conformación Proteica , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Animales , Aprendizaje Profundo , Humanos
4.
J Am Chem Soc ; 144(34): 15519-15528, 2022 08 31.
Artículo en Inglés | MEDLINE | ID: mdl-35972994

RESUMEN

Although interest in stabilized α-helical peptides as next-generation therapeutics for modulating biomolecular interfaces is increasing, peptides have limited functionality and stability due to their small size. In comparison, α-helical ligands based on proteins can make steric clash with targets due to their large size. Here, we report the design of a monomeric pseudo-isolated α-helix (mPIH) system in which proteins behave as if they are peptides. The designed proteins contain α-helix ligands that do not require any covalent chemical modification, do not have frayed ends, and importantly can make sterically favorable interactions similar to isolated peptides. An optimal mPIH showed a more than 100-fold increase in target selectivity, which might be related to the advantages in conformational selection due to the absence of frayed ends. The α-helical ligand in the mPIH displayed high thermal stability well above human body temperature and showed reversible and rapid folding/unfolding transitions. Thus, mPIH can become a promising protein-based platform for developing stabilized α-helix pharmaceuticals.


Asunto(s)
Péptidos , Proteínas , Secuencia de Aminoácidos , Dicroismo Circular , Humanos , Péptidos/química , Conformación Proteica en Hélice alfa , Pliegue de Proteína , Estructura Secundaria de Proteína
5.
Proteins ; 89(12): 1824-1833, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34324224

RESUMEN

For CASP14, we developed deep learning-based methods for predicting homo-oligomeric and hetero-oligomeric contacts and used them for oligomer modeling. To build structure models, we developed an oligomer structure generation method that utilizes predicted interchain contacts to guide iterative restrained minimization from random backbone structures. We supplemented this gradient-based fold-and-dock method with template-based and ab initio docking approaches using deep learning-based subunit predictions on 29 assembly targets. These methods produced oligomer models with summed Z-scores 5.5 units higher than the next best group, with the fold-and-dock method having the best relative performance. Over the eight targets for which this method was used, the best of the five submitted models had average oligomer TM-score of 0.71 (average oligomer TM-score of the next best group: 0.64), and explicit modeling of inter-subunit interactions improved modeling of six out of 40 individual domains (ΔGDT-TS > 2.0).


Asunto(s)
Modelos Moleculares , Conformación Proteica , Proteínas , Programas Informáticos , Biología Computacional , Bases de Datos de Proteínas , Aprendizaje Profundo , Unión Proteica , Subunidades de Proteína/química , Subunidades de Proteína/metabolismo , Proteínas/química , Proteínas/metabolismo , Análisis de Secuencia de Proteína
6.
Proteins ; 89(12): 1722-1733, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34331359

RESUMEN

The trRosetta structure prediction method employs deep learning to generate predicted residue-residue distance and orientation distributions from which 3D models are built. We sought to improve the method by incorporating as inputs (in addition to sequence information) both language model embeddings and template information weighted by sequence similarity to the target. We also developed a refinement pipeline that recombines models generated by template-free and template utilizing versions of trRosetta guided by the DeepAccNet accuracy predictor. Both benchmark tests and CASP results show that the new pipeline is a considerable improvement over the original trRosetta, and it is faster and requires less computing resources, completing the entire modeling process in a median < 3 h in CASP14. Our human group improved results with this pipeline primarily by identifying additional homologous sequences for input into the network. We also used the DeepAccNet accuracy predictor to guide Rosetta high-resolution refinement for submissions in the regular and refinement categories; although performance was quite good on a CASP relative scale, the overall improvements were rather modest in part due to missing inter-domain or inter-chain contacts.


Asunto(s)
Biología Computacional/métodos , Aprendizaje Profundo , Estructura Terciaria de Proteína , Proteínas , Programas Informáticos , Humanos , Metagenoma/genética , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Análisis de Secuencia de Proteína
7.
PLoS Comput Biol ; 16(9): e1008103, 2020 09.
Artículo en Inglés | MEDLINE | ID: mdl-32956350

RESUMEN

Highly coordinated water molecules are frequently an integral part of protein-protein and protein-ligand interfaces. We introduce an updated energy model that efficiently captures the energetic effects of these ordered water molecules on the surfaces of proteins. A two-stage method is developed in which polar groups arranged in geometries suitable for water placement are first identified, then a modified Monte Carlo simulation allows highly coordinated waters to be placed on the surface of a protein while simultaneously sampling amino acid side chain orientations. This "semi-explicit" water model is implemented in Rosetta and is suitable for both structure prediction and protein design. We show that our new approach and energy model yield significant improvements in native structure recovery of protein-protein and protein-ligand docking discrimination tests.


Asunto(s)
Sitios de Unión/fisiología , Simulación del Acoplamiento Molecular , Unión Proteica/fisiología , Proteínas , Agua , Algoritmos , Aminoácidos/química , Aminoácidos/metabolismo , Enlace de Hidrógeno , Ligandos , Método de Montecarlo , Proteínas/química , Proteínas/metabolismo , Agua/química , Agua/metabolismo
8.
Proc Natl Acad Sci U S A ; 115(12): 3054-3059, 2018 03 20.
Artículo en Inglés | MEDLINE | ID: mdl-29507254

RESUMEN

Proteins fold to their lowest free-energy structures, and hence the most straightforward way to increase the accuracy of a partially incorrect protein structure model is to search for the lowest-energy nearby structure. This direct approach has met with little success for two reasons: first, energy function inaccuracies can lead to false energy minima, resulting in model degradation rather than improvement; and second, even with an accurate energy function, the search problem is formidable because the energy only drops considerably in the immediate vicinity of the global minimum, and there are a very large number of degrees of freedom. Here we describe a large-scale energy optimization-based refinement method that incorporates advances in both search and energy function accuracy that can substantially improve the accuracy of low-resolution homology models. The method refined low-resolution homology models into correct folds for 50 of 84 diverse protein families and generated improved models in recent blind structure prediction experiments. Analyses of the basis for these improvements reveal contributions from both the improvements in conformational sampling techniques and the energy function.


Asunto(s)
Simulación por Computador , Modelos Químicos , Biología Computacional/métodos , Modelos Moleculares , Simulación de Dinámica Molecular , Conformación Proteica , Pliegue de Proteína , Termodinámica
9.
Proteins ; 87(12): 1276-1282, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-31325340

RESUMEN

Because proteins generally fold to their lowest free energy states, energy-guided refinement in principle should be able to systematically improve the quality of protein structure models generated using homologous structure or co-evolution derived information. However, because of the high dimensionality of the search space, there are far more ways to degrade the quality of a near native model than to improve it, and hence, refinement methods are very sensitive to energy function errors. In the 13th Critial Assessment of techniques for protein Structure Prediction (CASP13), we sought to carry out a thorough search for low energy states in the neighborhood of a starting model using restraints to avoid straying too far. The approach was reasonably successful in improving both regions largely incorrect in the starting models as well as core regions that started out closer to the correct structure. Models with GDT-HA over 70 were obtained for five targets and for one of those, an accuracy of 0.5 å backbone root-mean-square deviation (RMSD) was achieved. An important current challenge is to improve performance in refining oligomers and larger proteins, for which the search problem remains extremely difficult.


Asunto(s)
Biología Computacional/métodos , Conformación Proteica , Pliegue de Proteína , Proteínas/química , Algoritmos , Modelos Moleculares , Reproducibilidad de los Resultados , Termodinámica
10.
Proteins ; 86 Suppl 1: 113-121, 2018 03.
Artículo en Inglés | MEDLINE | ID: mdl-28940798

RESUMEN

We describe several notable aspects of our structure predictions using Rosetta in CASP12 in the free modeling (FM) and refinement (TR) categories. First, we had previously generated (and published) models for most large protein families lacking experimentally determined structures using Rosetta guided by co-evolution based contact predictions, and for several targets these models proved better starting points for comparative modeling than any known crystal structure-our model database thus starts to fulfill one of the goals of the original protein structure initiative. Second, while our "human" group simply submitted ROBETTA models for most targets, for six targets expert intervention improved predictions considerably; the largest improvement was for T0886 where we correctly parsed two discontinuous domains guided by predicted contact maps to accurately identify a structural homolog of the same fold. Third, Rosetta all atom refinement followed by MD simulations led to consistent but small improvements when starting models were close to the native structure, and larger but less consistent improvements when starting models were further away.


Asunto(s)
Biología Computacional/métodos , Modelos Moleculares , Conformación Proteica , Pliegue de Proteína , Proteínas/química , Algoritmos , Cristalografía por Rayos X , Humanos , Análisis de Secuencia de Proteína
11.
Proteins ; 86 Suppl 1: 283-291, 2018 03.
Artículo en Inglés | MEDLINE | ID: mdl-28913931

RESUMEN

Many naturally occurring protein systems function primarily as symmetric assemblies. Prediction of the quaternary structure of these assemblies is an important biological problem. This article describes automated tools we have developed for predicting the structures of symmetric protein assemblies in the Robetta structure prediction server. We assess the performance of this pipeline on a set of targets from the recent CASP12/CAPRI blind quaternary structure prediction experiment. Our approach successfully predicted 5 of 7 symmetric assemblies in this challenge, and was assessed as the best participating server group, and 1 of only 2 groups (human or server) with 2 predictions judged as high quality by the assessors. We also assess the method on a broader set of 22 natively symmetric CASP12 targets, where we show that oligomeric modeling can improve the accuracy of monomeric structure determination, particularly in highly intertwined oligomers.


Asunto(s)
Biología Computacional/métodos , Bases de Datos de Proteínas , Modelos Moleculares , Conformación Proteica , Multimerización de Proteína , Proteínas/química , Programas Informáticos , Humanos , Análisis de Secuencia de Proteína
12.
J Chem Inf Model ; 58(6): 1234-1243, 2018 06 25.
Artículo en Inglés | MEDLINE | ID: mdl-29786430

RESUMEN

The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In this study, a new ECL2 conformational sampling method involving both template-based and ab initio sampling was developed. Inspired by the observation of similar ECL2 structures of closely related GPCRs, a template-based sampling method employing loop structure templates selected from the structure database was developed. A new metric for evaluating similarity of the target loop to templates was introduced for template selection. An ab initio loop sampling method was also developed to treat cases without highly similar templates. The ab initio method is based on the previously developed fragment assembly and loop closure method. A new sampling component that takes advantage of secondary structure prediction was added. In addition, a conserved disulfide bridge restraining ECL2 conformation was predicted and analytically incorporated into sampling, reducing the effective dimension of the conformational search space. The sampling method was combined with an existing energy function for comparison with previously reported loop structure prediction methods, and the benchmark test demonstrated outstanding performance.


Asunto(s)
Receptores Acoplados a Proteínas G/química , Animales , Bases de Datos de Proteínas , Disulfuros/química , Humanos , Modelos Moleculares , Conformación Proteica , Estructura Secundaria de Proteína
13.
Proteins ; 84 Suppl 1: 314-22, 2016 09.
Artículo en Inglés | MEDLINE | ID: mdl-26205421

RESUMEN

We report new Rosetta-based approaches to tackling the major issues that confound protein structure refinement, and the testing of these approaches in the CASP11 experiment. Automated refinement protocols were developed that integrate a range of sampling methods using parallel computation and multiobjective optimization. In CASP11, we used a more aggressive large-scale structure rebuilding approach for poor starting models, and a less aggressive local rebuilding plus core refinement approach for starting models likely to be closer to the native structure. The more incorrectly modeled a structure was predicted to be, the more it was allowed to vary during refinement. The CASP11 experiment revealed strengths and weaknesses of the approaches: the high-resolution strategy incorporating local rebuilding with core refinement consistently improved starting structures, while the low-resolution strategy incorporating the reconstruction of large parts of the structures improved starting models in some cases but often considerably worsened them, largely because of model selection issues. Overall, the results suggest the high-resolution refinement protocol is a promising method orthogonal to other approaches, while the low-resolution refinement method clearly requires further development. Proteins 2016; 84(Suppl 1):314-322. © 2015 Wiley Periodicals, Inc.


Asunto(s)
Biología Computacional/estadística & datos numéricos , Modelos Estadísticos , Simulación de Dinámica Molecular , Proteínas/química , Programas Informáticos , Algoritmos , Secuencias de Aminoácidos , Benchmarking , Biología Computacional/métodos , Humanos , Internet , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Pliegue de Proteína , Dominios y Motivos de Interacción de Proteínas , Estructura Terciaria de Proteína , Homología de Secuencia de Aminoácido , Termodinámica
14.
Proteins ; 84 Suppl 1: 181-8, 2016 09.
Artículo en Inglés | MEDLINE | ID: mdl-26857542

RESUMEN

In CASP11 we generated protein structure models using simulated ambiguous and unambiguous nuclear Overhauser effect (NOE) restraints with a two stage protocol. Low resolution models were generated guided by the unambiguous restraints using continuous chain folding for alpha and alpha-beta proteins, and iterative annealing for all beta proteins to take advantage of the strand pairing information implicit in the restraints. The Rosetta fragment/model hybridization protocol was then used to recombine and regularize these models, and refine them in the Rosetta full atom energy function guided by both the unambiguous and the ambiguous restraints. Fifteen out of 19 targets were modeled with GDT-TS quality scores greater than 60 for Model 1, significantly improving upon the non-assisted predictions. Our results suggest that atomic level accuracy is achievable using sparse NOE data when there is at least one correctly assigned NOE for every residue. Proteins 2016; 84(Suppl 1):181-188. © 2016 Wiley Periodicals, Inc.


Asunto(s)
Biología Computacional/estadística & datos numéricos , Modelos Moleculares , Modelos Estadísticos , Proteínas/química , Programas Informáticos , Algoritmos , Secuencias de Aminoácidos , Biología Computacional/métodos , Simulación por Computador , Bases de Datos de Proteínas , Cooperación Internacional , Internet , Resonancia Magnética Nuclear Biomolecular , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Pliegue de Proteína , Dominios y Motivos de Interacción de Proteínas
15.
Nucleic Acids Res ; 41(Web Server issue): W384-8, 2013 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-23737448

RESUMEN

The quality of model structures generated by contemporary protein structure prediction methods strongly depends on the degree of similarity between the target and available template structures. Therefore, the importance of improving template-based model structures beyond the accuracy available from template information has been emphasized in the structure prediction community. The GalaxyRefine web server, freely available at http://galaxy.seoklab.org/refine, is based on a refinement method that has been successfully tested in CASP10. The method first rebuilds side chains and performs side-chain repacking and subsequent overall structure relaxation by molecular dynamics simulation. According to the CASP10 assessment, this method showed the best performance in improving the local structure quality. The method can improve both global and local structure quality on average, when used for refining the models generated by state-of-the-art protein structure prediction servers.


Asunto(s)
Conformación Proteica , Programas Informáticos , Internet , Simulación de Dinámica Molecular
16.
Proteins ; 82(4): 620-32, 2014 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-24155158

RESUMEN

We report the first assessment of blind predictions of water positions at protein-protein interfaces, performed as part of the critical assessment of predicted interactions (CAPRI) community-wide experiment. Groups submitting docking predictions for the complex of the DNase domain of colicin E2 and Im2 immunity protein (CAPRI Target 47), were invited to predict the positions of interfacial water molecules using the method of their choice. The predictions-20 groups submitted a total of 195 models-were assessed by measuring the recall fraction of water-mediated protein contacts. Of the 176 high- or medium-quality docking models-a very good docking performance per se-only 44% had a recall fraction above 0.3, and a mere 6% above 0.5. The actual water positions were in general predicted to an accuracy level no better than 1.5 Å, and even in good models about half of the contacts represented false positives. This notwithstanding, three hotspot interface water positions were quite well predicted, and so was one of the water positions that is believed to stabilize the loop that confers specificity in these complexes. Overall the best interface water predictions was achieved by groups that also produced high-quality docking models, indicating that accurate modelling of the protein portion is a determinant factor. The use of established molecular mechanics force fields, coupled to sampling and optimization procedures also seemed to confer an advantage. Insights gained from this analysis should help improve the prediction of protein-water interactions and their role in stabilizing protein complexes.


Asunto(s)
Colicinas/química , Mapeo de Interacción de Proteínas , Agua/química , Algoritmos , Biología Computacional , Modelos Moleculares , Simulación del Acoplamiento Molecular , Conformación Proteica
17.
Bioinformatics ; 29(8): 1078-80, 2013 Apr 15.
Artículo en Inglés | MEDLINE | ID: mdl-23413437

RESUMEN

SUMMARY: A large number of proteins function as homo-oligomers; therefore, predicting homo-oligomeric structure of proteins is of primary importance for understanding protein function at the molecular level. Here, we introduce a web server for prediction of protein homo-oligomer structure. The server takes a protein monomer structure as input and predicts its homo-oligomer structure from oligomer templates selected based on sequence and tertiary/quaternary structure similarity. Using protein model structures as input, the server shows clear improvement over the best methods of CASP9 in predicting oligomeric structures from amino acid sequences. AVAILABILITY: http://galaxy.seoklab.org/gemini. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Multimerización de Proteína , Programas Informáticos , Internet , Proteínas/química , Análisis de Secuencia de Proteína , Homología de Secuencia de Aminoácido , Homología Estructural de Proteína
18.
Nucleic Acids Res ; 40(Web Server issue): W294-7, 2012 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-22649060

RESUMEN

Three-dimensional protein structures provide invaluable information for understanding and regulating biological functions of proteins. The GalaxyWEB server predicts protein structure from sequence by template-based modeling and refines loop or terminus regions by ab initio modeling. This web server is based on the method tested in CASP9 (9th Critical Assessment of techniques for protein Structure Prediction) as 'Seok-server', which was assessed to be among top performing template-based modeling servers. The method generates reliable core structures from multiple templates and re-builds unreliable loops or termini by using an optimization-based refinement method. In addition to structure prediction, a user can also submit a refinement only job by providing a starting model structure and locations of loops or termini to refine. The web server can be freely accessed at http://galaxy.seoklab.org/.


Asunto(s)
Conformación Proteica , Programas Informáticos , Internet , Análisis de Secuencia de Proteína , Interfaz Usuario-Computador
19.
J Chem Theory Comput ; 20(7): 2689-2695, 2024 Apr 09.
Artículo en Inglés | MEDLINE | ID: mdl-38547871

RESUMEN

Mapping the ensemble of protein conformations that contribute to function and can be targeted by small molecule drugs remains an outstanding challenge. Here, we explore the use of variational autoencoders for reducing the challenge of dimensionality in the protein structure ensemble generation problem. We convert high-dimensional protein structural data into a continuous, low-dimensional representation, carry out a search in this space guided by a structure quality metric, and then use RoseTTAFold guided by the sampled structural information to generate 3D structures. We use this approach to generate ensembles for the cancer relevant protein K-Ras, train the VAE on a subset of the available K-Ras crystal structures and MD simulation snapshots, and assess the extent of sampling close to crystal structures withheld from training. We find that our latent space sampling procedure rapidly generates ensembles with high structural quality and is able to sample within 1 Å of held-out crystal structures, with a consistency higher than that of MD simulation or AlphaFold2 prediction. The sampled structures sufficiently recapitulate the cryptic pockets in the held-out K-Ras structures to allow for small molecule docking.


Asunto(s)
Proteínas , Proteínas/química , Conformación Proteica , Simulación por Computador
20.
Proteins ; 81(11): 1980-7, 2013 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-23843247

RESUMEN

Community-wide blind prediction experiments such as CAPRI and CASP provide an objective measure of the current state of predictive methodology. Here we describe a community-wide assessment of methods to predict the effects of mutations on protein-protein interactions. Twenty-two groups predicted the effects of comprehensive saturation mutagenesis for two designed influenza hemagglutinin binders and the results were compared with experimental yeast display enrichment data obtained using deep sequencing. The most successful methods explicitly considered the effects of mutation on monomer stability in addition to binding affinity, carried out explicit side-chain sampling and backbone relaxation, evaluated packing, electrostatic, and solvation effects, and correctly identified around a third of the beneficial mutations. Much room for improvement remains for even the best techniques, and large-scale fitness landscapes should continue to provide an excellent test bed for continued evaluation of both existing and new prediction methodologies.


Asunto(s)
Bases de Datos de Proteínas , Mapeo de Interacción de Proteínas , Algoritmos , Mutación , Unión Proteica
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA