RESUMO
The IntFOLD server based at the University of Reading has been a leading method over the past decade in providing free access to accurate prediction of protein structures and functions. In a post-AlphaFold2 world, accurate models of tertiary structures are widely available for even more protein targets, so there has been a refocus in the prediction community towards the accurate modelling of protein-ligand interactions as well as modelling quaternary structure assemblies. In this paper, we describe the latest improvements to IntFOLD, which maintains its competitive structure prediction performance by including the latest deep learning methods while also integrating accurate model quality estimates and 3D models of protein-ligand interactions. Furthermore, we also introduce our two new server methods: MultiFOLD for accurately modelling both tertiary and quaternary structures, with performance which has been independently verified to outperform the standard AlphaFold2 methods, and ModFOLDdock, which provides world-leading quality estimates for quaternary structure models. The IntFOLD7, MultiFOLD and ModFOLDdock servers are available at: https://www.reading.ac.uk/bioinf/.
Assuntos
Proteínas , Software , Ligantes , Estrutura Terciária de Proteína , Modelos Moleculares , Proteínas/química , Conformação ProteicaRESUMO
In CASP15, there was a greater emphasis on multimeric modeling than in previous experiments, with assembly structures nearly doubling in number (41 up from 22) since the previous round. CASP15 also included a new estimation of model accuracy (EMA) category in recognition of the importance of objective quality assessment (QA) for quaternary structure models. ModFOLDdock is a multimeric model QA server developed by the McGuffin group at the University of Reading, which brings together a range of single-model, clustering, and deep learning methods to form a consensus of approaches. For CASP15, three variants of ModFOLDdock were developed to optimize for the different facets of the quality estimation problem. The standard ModFOLDdock variant produced predicted scores optimized for positive linear correlations with the observed scores. The ModFOLDdockR variant produced predicted scores optimized for ranking, that is, the top-ranked models have the highest accuracy. In addition, the ModFOLDdockS variant used a quasi-single model approach to score each model on an individual basis. The scores from all three variants achieved strongly positive Pearson correlation coefficients with the CASP observed scores (oligo-lDDT) in excess of 0.70, which were maintained across both homomeric and heteromeric model populations. In addition, at least one of the ModFOLDdock variants was consistently ranked in the top two methods across all three EMA categories. Specifically, for overall global fold prediction accuracy, ModFOLDdock placed second and ModFOLDdockR placed third; for overall interface quality prediction accuracy, ModFOLDdockR, ModFOLDdock, and ModFOLDdockS were placed above all other predictor methods, and ModFOLDdockR and ModFOLDdockS were placed second and third respectively for individual residue confidence scores. The ModFOLDdock server is available at: https://www.reading.ac.uk/bioinf/ModFOLDdock/. ModFOLDdock is also available as part of the MultiFOLD docker package: https://hub.docker.com/r/mcguffin/multifold.
Assuntos
Proteínas , Software , Conformação Proteica , Proteínas/genética , Proteínas/química , Modelos Moleculares , Biologia ComputacionalRESUMO
Connexins oligomerise to form hexameric hemichannels in the plasma membrane that can further dock together on adjacent cells to form gap junctions and facilitate intercellular trafficking of molecules. In this study, we report the expression and function of an orphan connexin, connexin-62 (Cx62), in human and mouse (Cx57, mouse homolog) platelets. A novel mimetic peptide (62Gap27) was developed to target the second extracellular loop of Cx62, and 3-dimensional structural models predicted its interference with gap junction and hemichannel function. The ability of 62Gap27 to regulate both gap junction and hemichannel-mediated intercellular communication was observed using fluorescence recovery after photobleaching analysis and flow cytometry. Cx62 inhibition by 62Gap27 suppressed a range of agonist-stimulated platelet functions and impaired thrombosis and hemostasis. This was associated with elevated protein kinase A-dependent signaling in a cyclic adenosine monophosphate-independent manner and was not observed in Cx57-deficient mouse platelets (in which the selectivity of 62Gap27 for this connexin was also confirmed). Notably, Cx62 hemichannels were observed to function independently of Cx37 and Cx40 hemichannels. Together, our data reveal a fundamental role for a hitherto uncharacterized connexin in regulating the function of circulating cells.
Assuntos
Plaquetas/metabolismo , Conexinas/fisiologia , Animais , Comunicação Celular/fisiologia , Linhagem Celular , Conexinas/sangue , Conexinas/química , Conexinas/deficiência , Conexinas/genética , Proteínas Quinases Dependentes de AMP Cíclico/metabolismo , Junções Comunicantes/fisiologia , Hemostasia/fisiologia , Humanos , Integrinas/sangue , Megacariócitos/metabolismo , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Transgênicos , Modelos Moleculares , Simulação de Acoplamento Molecular , Fragmentos de Peptídeos/síntese química , Fragmentos de Peptídeos/farmacologia , Adesividade Plaquetária , Agregação Plaquetária , Conformação Proteica , Multimerização Proteica , Relação Estrutura-Atividade , Trombose/sangueRESUMO
ReFOLD3 is unique in its application of gradual restraints, calculated from local model quality estimates and contact predictions, which are used to guide the refinement of theoretical 3D protein models towards the native structures. ReFOLD3 achieves improved performance by using an iterative refinement protocol to fix incorrect residue contacts and local errors, including unusual bonds and angles, which are identified in the submitted models by our leading ModFOLD8 model quality assessment method. Following refinement, the likely resulting improvements to the submitted models are recognized by ModFOLD8, which produces both global and local quality estimates. During the CASP14 prediction season (May-Aug 2020), we used the ReFOLD3 protocol to refine hundreds of 3D models, for both the refinement and the main tertiary structure prediction categories. Our group improved the global and local quality scores for numerous starting models in the refinement category, where we ranked in the top 10 according to the official assessment. The ReFOLD3 protocol was also used for the refinement of the SARS-CoV-2 targets as a part of the CASP Commons COVID-19 initiative, and we provided a significant number of the top 10 models. The ReFOLD3 web server is freely available at https://www.reading.ac.uk/bioinf/ReFOLD/.
Assuntos
Computadores , Internet , Modelos Moleculares , Conformação Proteica , Dobramento de Proteína , Proteínas/química , Software , Reprodutibilidade dos Testes , SARS-CoV-2/química , Proteínas Virais/químicaRESUMO
Methods for estimating the quality of 3D models of proteins are vital tools for driving the acceptance and utility of predicted tertiary structures by the wider bioscience community. Here we describe the significant major updates to ModFOLD, which has maintained its position as a leading server for the prediction of global and local quality of 3D protein models, over the past decade (>20 000 unique external users). ModFOLD8 is the latest version of the server, which combines the strengths of multiple pure-single and quasi-single model methods. Improvements have been made to the web server interface and there has been successive increases in prediction accuracy, which were achieved through integration of newly developed scoring methods and advanced deep learning-based residue contact predictions. Each version of the ModFOLD server has been independently blind tested in the biennial CASP experiments, as well as being continuously evaluated via the CAMEO project. In CASP13 and CASP14, the ModFOLD7 and ModFOLD8 variants ranked among the top 10 quality estimation methods according to almost every official analysis. Prior to CASP14, ModFOLD8 was also applied for the evaluation of SARS-CoV-2 protein models as part of CASP Commons 2020 initiative. The ModFOLD8 server is freely available at: https://www.reading.ac.uk/bioinf/ModFOLD/.
Assuntos
Computadores , Modelos Moleculares , Redes Neurais de Computação , Conformação Proteica , Dobramento de Proteína , Proteínas/química , Software , Reprodutibilidade dos Testes , Projetos de Pesquisa , SARS-CoV-2/química , Proteínas Virais/químicaRESUMO
The IntFOLD server provides a unified resource for the automated prediction of: protein tertiary structures with built-in estimates of model accuracy (EMA), protein structural domain boundaries, natively unstructured or disordered regions in proteins, and protein-ligand interactions. The component methods have been independently evaluated via the successive blind CASP experiments and the continual CAMEO benchmarking project. The IntFOLD server has established its ranking as one of the best performing publicly available servers, based on independent official evaluation metrics. Here, we describe significant updates to the server back end, where we have focused on performance improvements in tertiary structure predictions, in terms of global 3D model quality and accuracy self-estimates (ASE), which we achieve using our newly improved ModFOLD7_rank algorithm. We also report on various upgrades to the front end including: a streamlined submission process, enhanced visualization of models, new confidence scores for ranking, and links for accessing all annotated model data. Furthermore, we now include an option for users to submit selected models for further refinement via convenient push buttons. The IntFOLD server is freely available at: http://www.reading.ac.uk/bioinf/IntFOLD/.
Assuntos
Algoritmos , Proteínas/química , Software , Sequência de Aminoácidos , Animais , Benchmarking , Sítios de Ligação , Ontologia Genética , Humanos , Internet , Ligantes , Modelos Moleculares , Anotação de Sequência Molecular , Ligação Proteica , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Dobramento de Proteína , Domínios e Motivos de Interação entre Proteínas , Estrutura Terciária de Proteína , Proteínas/metabolismo , Alinhamento de Sequência , Homologia de Sequência de AminoácidosRESUMO
The refinement of predicted 3D protein models is crucial in bringing them closer towards experimental accuracy for further computational studies. Refinement approaches can be divided into two main stages: The sampling and scoring stages. Sampling strategies, such as the popular Molecular Dynamics (MD)-based protocols, aim to generate improved 3D models. However, generating 3D models that are closer to the native structure than the initial model remains challenging, as structural deviations from the native basin can be encountered due to force-field inaccuracies. Therefore, different restraint strategies have been applied in order to avoid deviations away from the native structure. For example, the accurate prediction of local errors and/or contacts in the initial models can be used to guide restraints. MD-based protocols, using physics-based force fields and smart restraints, have made significant progress towards a more consistent refinement of 3D models. The scoring stage, including energy functions and Model Quality Assessment Programs (MQAPs) are also used to discriminate near-native conformations from non-native conformations. Nevertheless, there are often very small differences among generated 3D models in refinement pipelines, which makes model discrimination and selection problematic. For this reason, the identification of the most native-like conformations remains a major challenge.
Assuntos
Biologia Computacional/métodos , Modelos Moleculares , Proteínas/químicaRESUMO
The refinement of predicted 3D models aims to bring them closer to the native structure by fixing errors including unusual bonds and torsion angles and irregular hydrogen bonding patterns. Refinement approaches based on molecular dynamics (MD) simulations using different types of restraints have performed well since CASP10. ReFOLD, developed by the McGuffin group, was one of the many MD-based refinement approaches, which were tested in CASP 12. When the performance of the ReFOLD method in CASP12 was evaluated, it was observed that ReFOLD suffered from the absence of a reliable guidance mechanism to reach consistent improvement for the quality of predicted 3D models, particularly in the case of template-based modelling (TBM) targets. Therefore, here we propose to utilize the local quality assessment score produced by ModFOLD6 to guide the MD-based refinement approach to further increase the accuracy of the predicted 3D models. The relative performance of the new local quality assessment guided MD-based refinement protocol and the original MD-based protocol ReFOLD are compared utilizing many different official scoring methods. By using the per-residue accuracy (or local quality) score to guide the refinement process, we are able to prevent the refined models from undesired structural deviations, thereby leading to more consistent improvements. This chapter will include a detailed analysis of the performance of the local quality assessment guided MD-based protocol versus that deployed in the original ReFOLD method.
Assuntos
Biologia Computacional , Simulação de Dinâmica Molecular , Conformação Proteica , Biologia Computacional/métodos , Proteínas/química , Ligação de HidrogênioRESUMO
Motivation: The accuracy gap between predicted and experimental structures has been significantly reduced following the development of AlphaFold2 (AF2). However, for many targets, AF2 models still have room for improvement. In previous CASP experiments, highly computationally intensive MD simulation-based methods have been widely used to improve the accuracy of single 3D models. Here, our ReFOLD pipeline was adapted to refine AF2 predictions while maintaining high model accuracy at a modest computational cost. Furthermore, the AF2 recycling process was utilized to improve 3D models by using them as custom template inputs for tertiary and quaternary structure predictions. Results: According to the Molprobity score, 94% of the generated 3D models by ReFOLD were improved. AF2 recycling showed an improvement rate of 87.5% (using MSAs) and 81.25% (using single sequences) for monomeric AF2 models and 100% (MSA) and 97.8% (single sequence) for monomeric non-AF2 models, as measured by the average change in lDDT. By the same measure, the recycling of multimeric models showed an improvement rate of as much as 80% for AF2-Multimer (AF2M) models and 94% for non-AF2M models. Availability and implementation: Refinement using AlphaFold2-Multimer recycling is available as part of the MultiFOLD docker package (https://hub.docker.com/r/mcguffin/multifold). The ReFOLD server is available at https://www.reading.ac.uk/bioinf/ReFOLD/ and the modified scripts can be downloaded from https://www.reading.ac.uk/bioinf/downloads/. Supplementary information: Supplementary data are available at Bioinformatics Advances online.