Results 1 - 20 of 36
1.
Sci Rep ; 13(1): 11612, 2023 07 18.
Article in English | MEDLINE | ID: mdl-37463925

ABSTRACT

Antibodies with similar amino acid sequences, especially across their complementarity-determining regions, often share properties. Finding that an antibody of interest has a similar sequence to naturally expressed antibodies in healthy or diseased repertoires is a powerful approach for the prediction of antibody properties, such as immunogenicity or antigen specificity. However, as the number of available antibody sequences is now in the billions and continuing to grow, repertoire mining for similar sequences has become increasingly computationally expensive. Existing approaches are limited by being low-throughput, non-exhaustive, not antibody-specific, or restricted to searching against entire chain sequences. Therefore, there is a need for a specialized tool, optimized for a rapid and exhaustive search of any antibody region against all known antibodies, to better utilize the full breadth of available repertoire sequences. We introduce Known Antibody Search (KA-Search), a tool that allows for the rapid search of billions of antibody variable domains by amino acid sequence identity across either the variable domain, the complementarity-determining regions, or a user-defined antibody region. We show KA-Search in operation on the approximately 2.4 billion antibody sequences available in the OAS database. KA-Search can be used to find the most similar sequences from OAS within 30 minutes and from a representative subset of 10 million sequences in less than 9 seconds. We give examples of how KA-Search can be used to obtain new insights about an antibody of interest. KA-Search is freely available at https://github.com/oxpig/kasearch.


Subject(s)
Antibodies , Complementarity Determining Regions , Complementarity Determining Regions/chemistry , Amino Acid Sequence
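
As a rough illustration of the identity measure described in the abstract above, the sketch below scores a query against a small in-memory set of sequences, restricted to a chosen region. It is not the KA-Search implementation or its API: the function names, the toy sequences, and the representation of a region as a list of index positions are all assumptions made for illustration, and the sequences are assumed to be already aligned to a common numbering.

```python
from typing import List, Sequence, Tuple


def region_identity(query: str, target: str, positions: Sequence[int]) -> float:
    """Fraction of the selected aligned positions at which the residues match."""
    if len(query) != len(target):
        raise ValueError("sequences must be aligned to the same length")
    if not positions:
        raise ValueError("region must contain at least one position")
    # Gap characters ('-') never count as a match.
    matches = sum(
        1 for i in positions if query[i] == target[i] and query[i] != "-"
    )
    return matches / len(positions)


def most_similar(
    query: str, database: Sequence[str], positions: Sequence[int], keep: int = 2
) -> List[Tuple[float, str]]:
    """Exhaustively score every database entry and return the top `keep`."""
    scored = [(region_identity(query, t, positions), t) for t in database]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:keep]


# Toy, made-up aligned sequences; positions 2-4 stand in for a "region".
toy_db = ["ACDEFG", "ACDQFG", "TTTTTT"]
hits = most_similar("ACDEFG", toy_db, positions=[2, 3, 4])
```

A linear scan like this is exhaustive but would be far too slow for billions of sequences; it only shows what "identity across a user-defined region" means.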
2.
Bioinformatics ; 39(1), 2023 01 01.
Article in English | MEDLINE | ID: mdl-36370083

ABSTRACT

SUMMARY: The development of new vaccines and antibody therapeutics typically takes several years and requires over $1bn in investment. Accurate knowledge of the paratope (antibody binding site) can speed up and reduce the cost of this process by improving our understanding of antibody-antigen binding. We present Paragraph, a structure-based paratope prediction tool that outperforms current state-of-the-art tools using simpler feature vectors and no antigen information. AVAILABILITY AND IMPLEMENTATION: Source code is freely available at www.github.com/oxpig/Paragraph. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Antibodies , Neural Networks, Computer , Binding Sites, Antibody , Software , Antigens
3.
Bioinform Adv ; 2(1): vbac046, 2022.
Article in English | MEDLINE | ID: mdl-36699403

ABSTRACT

Motivation: General protein language models have been shown to summarize the semantics of protein sequences into representations that are useful for state-of-the-art predictive methods. However, for antibody-specific problems, such as restoring residues lost due to sequencing errors, a model trained solely on antibodies may be more powerful. Antibodies are one of the few protein types where the volume of sequence data needed for such language models is available, e.g. in the Observed Antibody Space (OAS) database. Results: Here, we introduce AbLang, a language model trained on the antibody sequences in the OAS database. We demonstrate the power of AbLang by using it to restore missing residues in antibody sequence data, a key issue with B-cell receptor repertoire sequencing, e.g. over 40% of OAS sequences are missing the first 15 amino acids. AbLang restores the missing residues of antibody sequences better than using IMGT germlines or the general protein language model ESM-1b. Further, AbLang does not require knowledge of the germline of the antibody and is seven times faster than ESM-1b. Availability and implementation: AbLang is a Python package available at https://github.com/oxpig/AbLang. Supplementary information: Supplementary data are available at Bioinformatics Advances online.

4.
Structure ; 29(6): 606-621.e5, 2021 06 03.
Article in English | MEDLINE | ID: mdl-33539768

ABSTRACT

Accurate predictive modeling of antibody-antigen complex structures and structure-based antibody design remain major challenges in computational biology, with implications for biotherapeutics, immunity, and vaccines. Through a systematic search for high-resolution structures of antibody-antigen complexes and unbound antibody and antigen structures, in conjunction with identification of experimentally determined binding affinities, we have assembled a non-redundant set of test cases for antibody-antigen docking and affinity prediction. This benchmark more than doubles the number of antibody-antigen complexes and corresponding affinities available in our previous benchmarks, providing an unprecedented view of the determinants of antibody recognition and insights into molecular flexibility. Initial assessments of docking and affinity prediction tools highlight the challenges posed by this diverse set of cases, which includes camelid nanobodies, therapeutic monoclonal antibodies, and broadly neutralizing antibodies targeting viral glycoproteins. This dataset will enable development of advanced predictive modeling and design methods for this therapeutically relevant class of protein-protein interactions.


Subject(s)
Antibodies/chemistry , Antibodies/metabolism , Antigens/chemistry , Antigens/metabolism , Algorithms , Antibodies, Monoclonal/chemistry , Antibodies, Monoclonal/metabolism , Antibodies, Viral/chemistry , Antibodies, Viral/metabolism , Antigen-Antibody Complex/chemistry , Benchmarking , Broadly Neutralizing Antibodies/chemistry , Broadly Neutralizing Antibodies/metabolism , Computational Biology/methods , Molecular Docking Simulation , Protein Binding , Protein Conformation , Single-Domain Antibodies/chemistry , Single-Domain Antibodies/metabolism , Software , Structure-Activity Relationship
5.
Methods Mol Biol ; 2165: 199-216, 2020.
Article in English | MEDLINE | ID: mdl-32621226

ABSTRACT

Many of the biological functions of the cell are driven by protein-protein interactions. However, determining which proteins interact, and exactly how they do so to enable their functions, remains a major research question. Functional interactions are dependent on a number of complicated factors; therefore, modeling the three-dimensional structure of protein-protein complexes is still considered a complex endeavor. Nevertheless, the rewards for modeling protein interactions to atomic level detail are substantial, and there are numerous examples of how models can provide useful information for drug design, protein engineering, systems biology, and understanding of the immune system. Here, we provide practical guidelines for docking proteins using the web server SwarmDock, a flexible protein-protein docking method. Moreover, we provide an overview of the factors that need to be considered when deciding whether docking is likely to be successful.


Subject(s)
Molecular Docking Simulation/methods , Protein Conformation , Software , Binding Sites , Protein Binding
6.
Mol Biol Evol ; 36(9): 2086-2103, 2019 09 01.
Article in English | MEDLINE | ID: mdl-31114882

ABSTRACT

Few models of sequence evolution incorporate parameters describing protein structure, despite its high conservation, essential functional role and increasing availability. We present a structurally aware empirical substitution model for amino acid sequence evolution in which proteins are expressed using an expanded alphabet that relays both amino acid identity and structural information. Each character specifies an amino acid as well as information about the rotamer configuration of its side-chain: the discrete geometric pattern of permitted side-chain atomic positions, as defined by the dihedral angles between covalently linked atoms. By assigning rotamer states in 251,194 protein structures and identifying 4,508,390 substitutions between closely related sequences, we generate a 55-state "Dayhoff-like" model that shows that the evolutionary properties of amino acids depend strongly upon side-chain geometry. The model performs as well as or better than traditional 20-state models for divergence time estimation, tree inference, and ancestral state reconstruction. We conclude that not only is rotamer configuration a valuable source of information for phylogenetic studies, but that modeling the concomitant evolution of sequence and structure may have important implications for understanding protein folding and function.


Subject(s)
Evolution, Molecular , Models, Biological , Protein Conformation , Amino Acid Substitution , Markov Chains
7.
Bioinformatics ; 35(3): 462-469, 2019 02 01.
Article in English | MEDLINE | ID: mdl-30020414

ABSTRACT

Motivation: Understanding the relationship between the sequence, structure, binding energy, binding kinetics and binding thermodynamics of protein-protein interactions is crucial to understanding cellular signaling, the assembly and regulation of molecular complexes, the mechanisms through which mutations lead to disease, and protein engineering. Results: We present SKEMPI 2.0, a major update to our database of binding free energy changes upon mutation for structurally resolved protein-protein interactions. This version now contains manually curated binding data for 7085 mutations, an increase of 133%, including changes in kinetics for 1844 mutations, enthalpy and entropy changes for 443 mutations, and 440 mutations that abolish detectable binding. Availability and implementation: The database is available as supplementary data and at https://life.bsc.es/pid/skempi2/. Supplementary information: Supplementary data are available at Bioinformatics online.


Subject(s)
Databases, Protein , Mutation , Protein Binding , Kinetics , Thermodynamics
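
The "binding free energy changes upon mutation" recorded in a database of this kind are commonly derived from measured dissociation constants through the standard relation ΔG = RT ln(Kd). The sketch below shows that conversion only; it is not code from SKEMPI, the function names are hypothetical, and the sign convention (positive ΔΔG means weaker binding by the mutant) and the 1 M standard state are assumptions stated here rather than taken from the abstract.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal mol^-1 K^-1


def binding_free_energy(kd_molar: float, temperature_k: float = 298.15) -> float:
    """Binding free energy (kcal/mol) from a dissociation constant in molar units."""
    if kd_molar <= 0:
        raise ValueError("Kd must be positive")
    return R_KCAL * temperature_k * math.log(kd_molar)


def ddg_upon_mutation(
    kd_wild_type: float, kd_mutant: float, temperature_k: float = 298.15
) -> float:
    """ddG = dG(mutant) - dG(wild type); positive means the mutant binds more weakly."""
    return binding_free_energy(kd_mutant, temperature_k) - binding_free_energy(
        kd_wild_type, temperature_k
    )


# Made-up example: a mutation that weakens binding ten-fold (1 nM -> 10 nM).
example_ddg = ddg_upon_mutation(kd_wild_type=1e-9, kd_mutant=1e-8)
```

At 298.15 K a ten-fold loss in affinity corresponds to roughly 1.4 kcal/mol, which gives a feel for the scale of the values such a database holds.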
8.
Methods Mol Biol ; 1764: 413-428, 2018.
Article in English | MEDLINE | ID: mdl-29605931

ABSTRACT

The atomic structures of protein complexes can provide useful information for drug design, protein engineering, systems biology, and understanding pathology. Obtaining this information experimentally can be challenging. However, if the structures of the subunits are known, then it is often possible to model the complex computationally. This chapter provides practical guidelines for docking proteins using the SwarmDock flexible protein-protein docking method, providing an overview of the factors that need to be considered when deciding whether docking is likely to be successful, the preparation of structural input, generation of docked poses, analysis and ranking of docked poses, and the validation of models using external data.


Subject(s)
Adaptor Proteins, Signal Transducing/metabolism , Filamins/metabolism , Molecular Docking Simulation , Phosphoproteins/metabolism , Protein Interaction Domains and Motifs , Software , Adaptor Proteins, Signal Transducing/chemistry , Algorithms , Filamins/chemistry , Humans , Models, Molecular , Phosphoproteins/chemistry , Protein Binding , Protein Conformation
9.
Proteins ; 85(7): 1287-1297, 2017 07.
Article in English | MEDLINE | ID: mdl-28342242

ABSTRACT

Protein-protein interactions play fundamental roles in biological processes including signaling, metabolism, and trafficking. While the structure of a protein complex reveals crucial details about the interaction, it is often difficult to acquire this information experimentally. As the number of interactions discovered increases faster than they can be characterized, protein-protein docking calculations may be able to reduce this disparity by providing models of the interacting proteins. Rigid-body docking is a widely used docking approach, and is often capable of generating a pool of models within which a near-native structure can be found. These models need to be scored in order to select the acceptable ones from the set of poses. Recently, more than 100 scoring functions from the CCharPPI server were evaluated for this task using decoy structures generated with SwarmDock. Here, we extend this analysis to identify the predictive success rates of the scoring functions on decoys from three rigid-body docking programs, ZDOCK, FTDock, and SDOCK, allowing us to assess the transferability of the functions. We also apply a set-theoretic measure to test whether the scoring functions are capable of identifying near-native poses within different subsets of the benchmark. This information can guide the choice of the most efficient scoring function for each docking method, as well as inform future scoring function development efforts. Proteins 2017; 85:1287-1297. © 2017 Wiley Periodicals, Inc.


Subject(s)
Models, Statistical , Molecular Docking Simulation/statistics & numerical data , Proteins/chemistry , Research Design , Benchmarking , Internet , Protein Interaction Mapping , Software
10.
J Chem Theory Comput ; 13(3): 1401-1410, 2017 Mar 14.
Article in English | MEDLINE | ID: mdl-28230364

ABSTRACT

Many proteins can adopt multiple distinct conformational states which often play different functional roles. Previous studies have shown that the underlying global dynamics through which these states are accessed are, at least in part, encoded by the protein's topology. In this work we present a method for generating transition pathways between states by perturbing the protein toward a target conformational state along thermally accessible collective motions calculated from the starting conformation. Specifically, the least absolute shrinkage and selection operator (LASSO) is used to identify the most parsimonious route along soft modes calculated using the anisotropic network model. In a survey of 436 conformational changes following protein-protein interaction, we show that such a path exists for most cases and that selected paths are low in energy. We discuss the implications for the atomic modeling of protein recognition and provide soft energy and parameter bounds which can be employed to efficiently constrain the sampling of such pathways.

11.
Bioinformatics ; 33(12): 1806-1813, 2017 Jun 15.
Article in English | MEDLINE | ID: mdl-28200016

ABSTRACT

MOTIVATION: In order to function, proteins frequently bind to one another and form 3D assemblies. Knowledge of the atomic details of these structures helps our understanding of how proteins work together, how mutations can lead to disease, and facilitates the designing of drugs which prevent or mimic the interaction. RESULTS: Atomic modeling of protein-protein interactions requires the selection of near-native structures from a set of docked poses based on their calculable properties. By considering this as an information retrieval problem, we have adapted methods developed for Internet search ranking and electoral voting into IRaPPA, a pipeline integrating biophysical properties. The approach enhances the identification of near-native structures when applied to four docking methods, resulting in a near-native appearing in the top 10 solutions for up to 50% of complexes benchmarked, and up to 70% in the top 100. AVAILABILITY AND IMPLEMENTATION: IRaPPA has been implemented in the SwarmDock server (http://bmm.crick.ac.uk/~SwarmDock/), pyDock server (http://life.bsc.es/pid/pydockrescoring/) and ZDOCK server (http://zdock.umassmed.edu/), with code available on request. CONTACT: moal@ebi.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Information Storage and Retrieval/methods , Molecular Docking Simulation , Protein Conformation , Protein Interaction Mapping/methods , Software , Internet
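
To make the idea of combining several rankings through electoral voting more concrete, here is a minimal sketch of rank aggregation over docked poses. The abstract does not say which voting scheme IRaPPA uses, so a Borda count is swapped in purely as a simple, well-known stand-in; the function name and the pose identifiers are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def borda_aggregate(rankings: Sequence[Sequence[str]]) -> List[str]:
    """Merge several best-first rankings of the same poses into one consensus order.

    Each ranking awards a pose (n - 1 - position) points, where n is the
    number of poses in that ranking; poses are then sorted by total points.
    """
    scores: Dict[str, int] = defaultdict(int)
    for ranking in rankings:
        n = len(ranking)
        for position, pose in enumerate(ranking):
            scores[pose] += n - 1 - position
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(scores, key=lambda pose: (-scores[pose], pose))


# Three made-up scoring functions, each ranking the same three poses.
consensus = borda_aggregate(
    [
        ["pose_a", "pose_b", "pose_c"],
        ["pose_b", "pose_a", "pose_c"],
        ["pose_b", "pose_c", "pose_a"],
    ]
)
```

Here `pose_b` collects 5 points, `pose_a` 3 and `pose_c` 1, so the consensus places `pose_b` first even though one scoring function preferred `pose_a`.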
12.
Proteins ; 85(3): 528-543, 2017 03.
Article in English | MEDLINE | ID: mdl-27935158

ABSTRACT

Reliable identification of near-native poses of docked protein-protein complexes is still an unsolved problem. The intrinsic heterogeneity of protein-protein interactions is challenging for traditional biophysical or knowledge-based potentials, and the identification of many false positive binding sites is not unusual. Often, ranking protocols are based on initial clustering of docked poses followed by the application of an energy function to rank each cluster according to its lowest energy member. Here, we present an approach to cluster ranking based not on a single molecular descriptor (e.g., an energy function) but on a large number of descriptors integrated in a machine learning model, whereby an extremely randomized tree classifier based on 109 molecular descriptors is trained. The protocol first locally enriches clusters with additional poses; the clusters are then characterized using features describing the distribution of molecular descriptors within the cluster, which are combined into a pairwise cluster comparison model to discriminate near-native from incorrect clusters. The results show that our approach is able to identify clusters containing near-native protein-protein complexes. In addition, we present an analysis of the descriptors with respect to their power to discriminate near-native from incorrect clusters and how data transformations and recursive feature elimination can improve the ranking performance. Proteins 2017; 85:528-543. © 2016 Wiley Periodicals, Inc.


Subject(s)
Computational Biology/methods , Machine Learning , Molecular Docking Simulation/methods , Proteins/chemistry , Software , Benchmarking , Binding Sites , Cluster Analysis , Protein Binding , Protein Conformation , Protein Interaction Mapping , Research Design , Structural Homology, Protein , Thermodynamics
13.
Proteins ; 85(3): 487-496, 2017 03.
Article in English | MEDLINE | ID: mdl-27701776

ABSTRACT

The sixth CAPRI edition included new modeling challenges, such as the prediction of protein-peptide complexes, and the modeling of homo-oligomers and domain-domain interactions as part of the first joint CASP-CAPRI experiment. Other non-standard targets included the prediction of interfacial water positions and the modeling of the interactions between proteins and nucleic acids. We have participated in all proposed targets of this CAPRI edition both as predictors and as scorers, with new protocols to efficiently use our docking and scoring scheme pyDock in a large variety of scenarios. In addition, we have participated for the first time in the servers section, with our recently developed webserver, pyDockWeb. Excluding the CASP-CAPRI cases, we submitted acceptable models (or better) for 7 out of the 18 evaluated targets as predictors, 4 out of the 11 targets as scorers, and 6 out of the 18 targets as servers. The overall success rates were below those in past CAPRI editions. This shows the challenging nature of this last edition, with many difficult targets for which no participant submitted a single acceptable model. Interestingly, we submitted acceptable models for 83% of the evaluated protein-peptide targets. As for the 25 cases of the CASP-CAPRI experiment, in which we used a larger variety of modeling techniques (template-based, symmetry restraints, literature information, etc.), we submitted acceptable models for 56% of the targets. In summary, this CAPRI edition showed that the pyDock scheme can be efficiently adapted to the increasing variety of problems that the protein interactions field is currently facing. Proteins 2017; 85:487-496. © 2016 Wiley Periodicals, Inc.


Subject(s)
Algorithms , Computational Biology/methods , Molecular Docking Simulation/methods , Peptides/chemistry , Proteins/chemistry , Software , Amino Acid Sequence , Benchmarking , Binding Sites , Crystallography, X-Ray , Protein Binding , Protein Conformation , Protein Interaction Mapping , Protein Multimerization , Research Design , Structural Homology, Protein , Thermodynamics , Water/chemistry
14.
J Mol Biol ; 427(19): 3031-41, 2015 Sep 25.
Article in English | MEDLINE | ID: mdl-26231283

ABSTRACT

We present an updated and integrated version of our widely used protein-protein docking and binding affinity benchmarks. The benchmarks consist of non-redundant, high-quality structures of protein-protein complexes along with the unbound structures of their components. Fifty-five new complexes were added to the docking benchmark, 35 of which have experimentally measured binding affinities. These updated docking and affinity benchmarks now contain 230 and 179 entries, respectively. In particular, the number of antibody-antigen complexes has increased significantly, by 67% and 74% in the docking and affinity benchmarks, respectively. We tested previously developed docking and affinity prediction algorithms on the new cases. Considering only the top 10 docking predictions per benchmark case, a prediction accuracy of 38% is achieved on all 55 cases and up to 50% for the 32 rigid-body cases only. Predicted affinity scores are found to correlate with experimental binding energies up to r=0.52 overall and r=0.72 for the rigid complexes.


Subject(s)
Molecular Docking Simulation , Protein Interaction Mapping/methods , Proteins/metabolism , Algorithms , Animals , Humans , Polynucleotide Adenylyltransferase/chemistry , Polynucleotide Adenylyltransferase/metabolism , Protein Binding , Protein Conformation , Proteins/chemistry , Software , Thermodynamics , Vaccinia virus/chemistry , Vaccinia virus/metabolism , Viral Proteins/chemistry , Viral Proteins/metabolism
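
Figures such as "38% when considering only the top 10 predictions" are top-N success rates: the share of benchmark cases for which at least one acceptable model appears among the N best-ranked predictions. The sketch below computes that statistic under the assumption that each case is summarized by the rank of its first acceptable model (or `None` if there is none); the function name and the input values are made up for illustration and are not taken from the benchmark.

```python
from typing import Optional, Sequence


def top_n_success_rate(first_hit_ranks: Sequence[Optional[int]], n: int) -> float:
    """Fraction of cases whose first acceptable model is ranked within the top n.

    Ranks are 1-based; None means no acceptable model was produced at all.
    """
    if not first_hit_ranks:
        raise ValueError("at least one benchmark case is required")
    successes = sum(
        1 for rank in first_hit_ranks if rank is not None and rank <= n
    )
    return successes / len(first_hit_ranks)


# Five hypothetical benchmark cases.
ranks = [1, 12, None, 7, 150]
top10 = top_n_success_rate(ranks, 10)
top100 = top_n_success_rate(ranks, 100)
```

With these made-up ranks, two of five cases succeed in the top 10 and three of five in the top 100, which mirrors how success rates grow as N is relaxed.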
15.
Proteins ; 83(4): 640-50, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25586563

ABSTRACT

Mutations at protein-protein recognition sites alter binding strength by altering the chemical nature of the interacting surfaces. We present a simple surface energy model, parameterized with empirical ΔΔG values, yielding mean energies of -48 cal mol⁻¹ Å⁻² for interactions between hydrophobic surfaces, -51 to -80 cal mol⁻¹ Å⁻² for surfaces of complementary charge, and 66-83 cal mol⁻¹ Å⁻² for electrostatically repelling surfaces, relative to the aqueous phase. This places the mean energy of hydrophobic surface burial at -24 cal mol⁻¹ Å⁻². Despite neglecting configurational entropy and intramolecular changes, the model correlates with empirical binding free energies of a functionally diverse set of rigid-body interactions (r = 0.66). When used to rerank docking poses, it can place near-native solutions in the top 10 for 37% of the complexes evaluated, and 82% in the top 100. The method shows that hydrophobic burial is the driving force for protein association, accounting for 50-95% of the cohesive energy. The model is available open-source from http://life.bsc.es/pid/web/surface_energy/ and via the CCharPPI web server http://life.bsc.es/pid/ccharppi/.


Subject(s)
Mutation/physiology , Protein Binding , Proteins/chemistry , Proteins/metabolism , Hydrophobic and Hydrophilic Interactions , Molecular Docking Simulation , Static Electricity , Thermodynamics
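
The per-area energies quoted in the abstract can be turned into a back-of-the-envelope estimate by multiplying by a contact area. The sketch below does only that arithmetic for the hydrophobic-hydrophobic term (-48 cal mol⁻¹ Å⁻²); it is not the published model, the function name is hypothetical, the 500 Å² area is invented, and treating the energy as a simple product of area and a single mean value is an assumption.

```python
# Mean energy for hydrophobic-hydrophobic contact, as quoted in the abstract.
HYDROPHOBIC_CAL_PER_MOL_A2 = -48.0


def hydrophobic_contact_energy(contact_area_a2: float) -> float:
    """Estimated energy in kcal/mol for a hydrophobic contact area given in square angstroms."""
    if contact_area_a2 < 0:
        raise ValueError("area cannot be negative")
    energy_cal = contact_area_a2 * HYDROPHOBIC_CAL_PER_MOL_A2
    return energy_cal / 1000.0  # convert cal/mol to kcal/mol


# Invented example: 500 square angstroms of hydrophobic contact.
estimate = hydrophobic_contact_energy(500.0)
```

For the invented 500 Å² contact this gives -24 kcal/mol, which illustrates how quickly small per-area terms add up across an interface.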
16.
Bioinformatics ; 31(1): 123-5, 2015 Jan 01.
Article in English | MEDLINE | ID: mdl-25183488

ABSTRACT

SUMMARY: The atomic structures of protein-protein interactions are central to understanding their role in biological systems, and a wide variety of biophysical functions and potentials have been developed for their characterization and the construction of predictive models. These tools are scattered across a multitude of stand-alone programs, and are often available only as model parameters requiring reimplementation. This acts as a significant barrier to their widespread adoption. CCharPPI integrates many of these tools into a single web server. It calculates up to 108 parameters, including models of electrostatics, desolvation and hydrogen bonding, as well as interface packing and complementarity scores, empirical potentials at various resolutions, docking potentials and composite scoring functions. AVAILABILITY AND IMPLEMENTATION: The server does not require registration by the user and is freely available for non-commercial academic use at http://life.bsc.es/pid/ccharppi.


Subject(s)
Internet , Molecular Docking Simulation/methods , Multiprotein Complexes/chemistry , Protein Interaction Mapping , Software , Humans , Hydrogen Bonding , Static Electricity
17.
Mol Microbiol ; 95(1): 17-30, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25354037

ABSTRACT

σ⁵⁴-dependent transcription controls a wide range of stress-related genes in bacteria and is tightly regulated. In contrast to σ⁷⁰, the σ⁵⁴-RNA polymerase holoenzyme forms a stable closed complex at the promoter site that rarely isomerises into transcriptionally competent open complexes. The conversion into open complexes requires the ATPase activity of activator proteins that bind remotely upstream of the transcriptional start site. These activators belong to the large AAA protein family and the majority of them consist of an N-terminal regulatory domain, a central AAA domain and a C-terminal DNA binding domain. Here we use a functional variant of the NorR activator, a dedicated NO sensor, to provide the first structural and functional characterisation of a full length AAA activator in complex with its enhancer DNA. Our data suggest an inter-dependent and synergistic relationship of all three functional domains and provide an explanation for the dependence of NorR on enhancer DNA. Our results show that NorR readily assembles into higher order oligomers upon enhancer binding, independent of activating signals. Upon inducing signals, the N-terminal regulatory domain relocates to the periphery of the AAA ring. Together our data provide an assembly and activation mechanism for NorR.


Subject(s)
Bacteria/metabolism , RNA Polymerase Sigma 54/genetics , Trans-Activators/chemistry , Trans-Activators/genetics , Bacteria/genetics , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , DNA, Bacterial/metabolism , Models, Molecular , Molecular Docking Simulation , Nitric Oxide/metabolism , RNA Polymerase Sigma 54/metabolism , Regulatory Sequences, Nucleic Acid , Trans-Activators/metabolism
19.
Ticks Tick Borne Dis ; 5(6): 947-50, 2014 Oct.
Article in English | MEDLINE | ID: mdl-25108785

ABSTRACT

In the next-generation sequencing era we are encountering hundreds of thousands of sequences from specific organisms. Such massive data must be accurately classified both functionally and structurally. Determining appropriate sequences with a specific function from next-generation sequencing, however, is a daunting experimental task. A recent salivary gland transcriptome from the hard tick Ixodes ricinus, a European disease vector, has been made publicly available. Among the protein families sequenced by the salivary gland transcriptome of I. ricinus, the Kunitz-domain is one of the highly represented protein families. Thus far, recent tick transcriptomes solely classify (computationally) Kunitz sequences as putative serine protease inhibitors. We present here a novel method using a machine-learning algorithm to "fish" for candidate ion-channel effectors and loss of serine protease inhibitor function within the Kunitz-domain protein family of the I. ricinus salivary gland transcriptome. The models, data and scripts used in this work are available online from http://life.bsc.es/pid/web/imoal/kunitz-classification.html.


Subject(s)
Ion Channels/genetics , Ixodes/genetics , Transcriptome , Algorithms , Amino Acid Sequence , Animals , Arthropod Proteins/genetics , Cluster Analysis , High-Throughput Nucleotide Sequencing , Protease Inhibitors , Protein Domains , Salivary Proteins and Peptides/genetics , Sequence Analysis, DNA
20.
Proteins ; 82(4): 620-32, 2014 Apr.
Article in English | MEDLINE | ID: mdl-24155158

ABSTRACT

We report the first assessment of blind predictions of water positions at protein-protein interfaces, performed as part of the critical assessment of predicted interactions (CAPRI) community-wide experiment. Groups submitting docking predictions for the complex of the DNase domain of colicin E2 and Im2 immunity protein (CAPRI Target 47) were invited to predict the positions of interfacial water molecules using the method of their choice. The predictions (20 groups submitted a total of 195 models) were assessed by measuring the recall fraction of water-mediated protein contacts. Of the 176 high- or medium-quality docking models (a very good docking performance per se), only 44% had a recall fraction above 0.3, and a mere 6% above 0.5. The actual water positions were in general predicted to an accuracy level no better than 1.5 Å, and even in good models about half of the contacts represented false positives. This notwithstanding, three hotspot interface water positions were quite well predicted, and so was one of the water positions that is believed to stabilize the loop that confers specificity in these complexes. Overall, the best interface water predictions were achieved by groups that also produced high-quality docking models, indicating that accurate modelling of the protein portion is a determinant factor. The use of established molecular mechanics force fields, coupled to sampling and optimization procedures, also seemed to confer an advantage. Insights gained from this analysis should help improve the prediction of protein-water interactions and their role in stabilizing protein complexes.


Subject(s)
Colicins/chemistry , Protein Interaction Mapping , Water/chemistry , Algorithms , Computational Biology , Models, Molecular , Molecular Docking Simulation , Protein Conformation
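
The recall fraction used in the assessment above can be read as the share of water-mediated contacts in the target structure that a model reproduces. The sketch below computes that quantity, together with the fraction of predicted contacts that are false positives, from two sets of residue pairs. The exact contact definition used by the CAPRI assessors is not given in the abstract, so representing a contact as a pair of residue labels is an assumption, and the function names and data are hypothetical.

```python
from typing import Set, Tuple

Contact = Tuple[str, str]  # (residue on protein A, residue on protein B)


def recall_fraction(predicted: Set[Contact], target: Set[Contact]) -> float:
    """Fraction of target water-mediated contacts that the model reproduces."""
    if not target:
        raise ValueError("target must contain at least one contact")
    return len(predicted & target) / len(target)


def false_positive_fraction(predicted: Set[Contact], target: Set[Contact]) -> float:
    """Fraction of predicted contacts that are absent from the target."""
    if not predicted:
        return 0.0
    return len(predicted - target) / len(predicted)


# Made-up contact sets for a single model.
target_contacts = {("A10", "B5"), ("A12", "B7"), ("A15", "B9"), ("A20", "B3")}
model_contacts = {("A10", "B5"), ("A12", "B7"), ("A99", "B1")}

recall = recall_fraction(model_contacts, target_contacts)
false_positives = false_positive_fraction(model_contacts, target_contacts)
```

In this invented case the model recovers two of four target contacts (recall 0.5), while one of its three predicted contacts is a false positive.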