RESUMEN
As of now, more than 60 years have passed since the first determination of protein structures through crystallography, and a significant portion of protein structures can be predicted by computers. This is due to the groundbreaking enhancement in protein structure prediction achieved through neural network training utilizing extensive sequence and structure data. However, substantial challenges persist in structure prediction due to limited data availability, with antibody structure prediction standing as one such challenge. In this paper, we propose a novel neural network architecture that effectively enables structure prediction by reflecting the inherent combinatorial nature involved in protein structure formation. The core idea of this neural network architecture is not solely to track and generate a single structure but rather to form a community of multiple structures and pursue accurate structure prediction by exchanging information among community members. Applying this concept to antibody CDR H3 loop structure prediction resulted in improved structure sampling. Such an approach could be applied in the structural and functional studies of proteins, particularly in exploring various physiological processes mediated by loops. Moreover, it holds potential in addressing various other types of combinatorial structure prediction and design problems.
Asunto(s)
Biología Computacional , Aprendizaje Profundo , Modelos Moleculares , Conformación Proteica , Biología Computacional/métodos , Regiones Determinantes de Complementariedad/química , Redes Neurales de la Computación , Anticuerpos/química , Bases de Datos de Proteínas , Humanos , AlgoritmosRESUMEN
While deep learning (DL) has brought a revolution in the protein structure prediction field, still an important question remains how the revolution can be transferred to advances in structure-based drug discovery. Because the lessons from the recent GPCR dock challenge were inconclusive primarily due to the size of the dataset, in this work we further elaborated on 70 diverse GPCR complexes bound to either small molecules or peptides to investigate the best-practice modeling and docking strategies for GPCR drug discovery. From our quantitative analysis, it is shown that substantial improvements in docking and virtual screening have been possible by the advance in DL-based protein structure predictions with respect to the expected results from the combination of best pre-DL tools. The success rate of docking on DL-based model structures approaches that of cross-docking on experimental structures, showing over 30% improvement from the best pre-DL protocols. This amount of performance could be achieved only when two modeling points were considered properly: 1) correct functional-state modeling of receptors and 2) receptor-flexible docking. Best-practice modeling strategies and the model confidence estimation metric suggested in this work may serve as a guideline for future computer-aided GPCR drug discovery scenarios.
RESUMEN
Somatostatin is a peptide hormone that regulates endocrine systems by binding to G-protein-coupled somatostatin receptors. Somatostatin receptor 2 (SSTR2) is a human somatostatin receptor and is highly implicated in hormone disorders, cancers, and neurological diseases. Here, we report the high-resolution cryo-EM structure of full-length human SSTR2 bound to the agonist somatostatin (SST-14) in complex with inhibitory G (Gi) proteins. Our structural and mutagenesis analyses show that seven transmembrane helices form a deep pocket for ligand binding and that SSTR2 recognizes the highly conserved Trp-Lys motif of SST-14 at the bottom of the pocket. Furthermore, our sequence analysis combined with AlphaFold modeled structures of other SSTR isoforms provide a structural basis for the mechanism by which SSTR family proteins specifically interact with their cognate ligands. This work provides the first glimpse into the molecular recognition mechanism of somatostatin receptors and a crucial resource to develop therapeutics targeting somatostatin receptors.
Asunto(s)
Receptores de Somatostatina , Somatostatina , Microscopía por Crioelectrón , Humanos , Ligandos , Receptores de Somatostatina/agonistas , Receptores de Somatostatina/metabolismo , Somatostatina/metabolismoRESUMEN
Neuropeptide Y (NPY) is highly abundant in the brain and involved in various physiological processes related to food intake and anxiety, as well as human diseases such as obesity and cancer. However, the molecular details of the interactions between NPY and its receptors are poorly understood. Here, we report a cryo-electron microscopy structure of the NPY-bound neuropeptide Y1 receptor (Y1R) in complex with Gi1 protein. The NPY C-terminal segment forming the extended conformation binds deep into the Y1R transmembrane core, where the amidated C-terminal residue Y36 of NPY is located at the base of the ligand-binding pocket. Furthermore, the helical region and two N-terminal residues of NPY interact with Y1R extracellular loops, contributing to the high affinity of NPY for Y1R. The structural analysis of NPY-bound Y1R and mutagenesis studies provide molecular insights into the activation mechanism of Y1R upon NPY binding.
Asunto(s)
Neuropéptido Y/metabolismo , Receptores de Neuropéptido Y/metabolismo , Animales , Encéfalo/metabolismo , Línea Celular , Microscopía por Crioelectrón , Activación Enzimática/fisiología , Humanos , Neuropéptido Y/genética , Unión Proteica/fisiología , Conformación Proteica , Receptores de Neuropéptido Y/genética , Células Sf9 , Transducción de SeñalRESUMEN
The spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) presents a public health crisis, and the vaccines that can induce highly potent neutralizing antibodies are essential for ending the pandemic. The spike (S) protein on the viral envelope mediates human angiotensin-converting enzyme 2 binding and thus is the target of a variety of neutralizing antibodies. In this work, we built various S trimer-antibody complex structures on the basis of the fully glycosylated S protein models described in our previous work and performed all-atom molecular dynamics simulations to gain insight into the structural dynamics and interactions between S protein and antibodies. Investigation of the residues critical for S-antibody binding allows us to predict the potential influence of mutations in SARS-CoV-2 variants. Comparison of the glycan conformations between S-only and S-antibody systems reveals the roles of glycans in S-antibody binding. In addition, we explored the antibody binding modes and the influences of antibody on the motion of S protein receptor binding domains. Overall, our analyses provide a better understanding of S-antibody interactions, and the simulation-based S-antibody interaction maps could be used to predict the influences of S mutation on S-antibody interactions, which will be useful for the development of vaccine and antibody-based therapy.
Asunto(s)
Anticuerpos Neutralizantes/química , Glicoproteína de la Espiga del Coronavirus/química , Anticuerpos Neutralizantes/inmunología , Reacciones Antígeno-Anticuerpo , COVID-19 , Simulación por Computador , Glicosilación , Humanos , Simulación de Dinámica Molecular , Estructura Molecular , Mutación , Polisacáridos/química , Unión Proteica , Glicoproteína de la Espiga del Coronavirus/genética , Glicoproteína de la Espiga del Coronavirus/inmunologíaRESUMEN
Proteins perform their functions by interacting with other biomolecules. For these interactions, proteins often form homo- or hetero-oligomers as well. Thus, oligomer protein structures provide important clues regarding the biological roles of proteins. To this end, computational prediction of oligomer structures may be a useful tool in the absence of experimentally resolved structures. Here, we describe our server and human-expert methods used to predict oligomer structures in the CASP14 experiment. Examples are provided for cases in which manual domain-splitting led to improved oligomeric domain structures by ab initio docking, automated oligomer structure refinement led to improved subunit orientation and terminal structure, and manual oligomer modeling utilizing literature information generated a reasonable oligomer model. We also discussed the results of post-prediction docking calculations with AlphaFold2 monomers as input in comparison to our blind prediction results. Overall, ab initio docking of AlphaFold2 models did not lead to better oligomer structure prediction, which may be attributed to the interfacial structural difference between the AlphaFold2 monomer structures and the crystal oligomer structures. This result poses a next-stage challenge in oligomer structure prediction after the success of AlphaFold2. For successful protein assembly structure prediction, a different approach that exploits further evolutionary information on the interface and/or flexible docking taking the interfacial conformational flexibilities of subunit structures into account is needed.
Asunto(s)
Modelos Moleculares , Conformación Proteica , Subunidades de Proteína , Programas Informáticos , Biología Computacional , Simulación del Acoplamiento Molecular , Pliegue de Proteína , Subunidades de Proteína/química , Subunidades de Proteína/metabolismo , Proteínas/química , Proteínas/metabolismo , Análisis de Secuencia de ProteínaRESUMEN
The spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) presents a public health crisis, and the vaccines that can induce highly potent neutralizing antibodies are essential for ending the pandemic. The spike (S) protein on the viral envelope mediates human angiotensin-converting enzyme 2 (ACE2) binding and thus is the target of a variety of neutralizing antibodies. In this work, we built various S trimer-antibody complex structures on the basis of the fully glycosylated S protein models described in our previous work, and performed all-atom molecular dynamics simulations to get insight into the structural dynamics and interactions between S protein and antibodies. Investigation of the residues critical for S-antibody binding allows us to predict the potential influence of mutations in SARS-CoV-2 variants. Comparison of the glycan conformations between S-only and S-antibody systems reveals the roles of glycans in S-antibody binding. In addition, we explored the antibody binding modes, and the influences of antibody on the motion of S protein receptor binding domains. Overall, our analyses provide a better understanding of S-antibody interactions, and the simulation-based S-antibody interaction maps could be used to predict the influences of S mutation on S-antibody interactions, which will be useful for the development of vaccine and antibody-based therapy.
RESUMEN
The spike (S) protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) mediates host cell entry by binding to angiotensin-converting enzyme 2 (ACE2) and is considered the major target for drug and vaccine development. We previously built fully glycosylated full-length SARS-CoV-2 S protein models in a viral membrane including both open and closed conformations of the receptor-binding domain (RBD) and different templates for the stalk region. In this work, multiple µs-long all-atom molecular dynamics simulations were performed to provide deeper insights into the structure and dynamics of S protein and glycan functions. Our simulations reveal that the highly flexible stalk is composed of two independent joints and most probable S protein orientations are competent for ACE2 binding. We identify multiple glycans stabilizing the open and/or closed states of the RBD and demonstrate that the exposure of antibody epitopes can be captured by detailed antibody-glycan clash analysis instead of commonly used accessible surface area analysis that tends to overestimate the impact of glycan shielding and neglect possible detailed interactions between glycan and antibodies. Overall, our observations offer structural and dynamic insights into the SARS-CoV-2 S protein and potentialize for guiding the design of effective antiviral therapeutics.
Asunto(s)
Enzima Convertidora de Angiotensina 2/metabolismo , COVID-19/metabolismo , SARS-CoV-2/metabolismo , Glicoproteína de la Espiga del Coronavirus/metabolismo , Anticuerpos/metabolismo , Glicosilación , Humanos , Simulación de Dinámica Molecular , Unión Proteica , Conformación Proteica , Multimerización de Proteína , SARS-CoV-2/química , Glicoproteína de la Espiga del Coronavirus/químicaRESUMEN
This technical study describes all-atom modeling and simulation of a fully glycosylated full-length SARS-CoV-2 spike (S) protein in a viral membrane. First, starting from PDB: 6VSB and 6VXX, full-length S protein structures were modeled using template-based modeling, de-novo protein structure prediction, and loop modeling techniques in GALAXY modeling suite. Then, using the recently determined most occupied glycoforms, 22 N-glycans and 1 O-glycan of each monomer were modeled using Glycan Reader & Modeler in CHARMM-GUI. These fully glycosylated full-length S protein model structures were assessed and further refined against the low-resolution data in their respective experimental maps using ISOLDE. We then used CHARMM-GUI Membrane Builder to place the S proteins in a viral membrane and performed all-atom molecular dynamics simulations. All structures are available in CHARMM-GUI COVID-19 Archive (http://www.charmm-gui.org/docs/archive/covid19) so that researchers can use these models to carry out innovative and novel modeling and simulation research for the prevention and treatment of COVID-19.
Asunto(s)
Glicoproteína de la Espiga del Coronavirus/química , Betacoronavirus/química , Betacoronavirus/genética , Betacoronavirus/metabolismo , Cristalografía por Rayos X , Glicosilación , Humanos , Modelos Moleculares , Simulación de Dinámica Molecular , Polisacáridos/química , Estructura Secundaria de Proteína , SARS-CoV-2RESUMEN
This technical study describes all-atom modeling and simulation of a fully-glycosylated full-length SARS-CoV-2 spike (S) protein in a viral membrane. First, starting from PDB:6VSB and 6VXX, full-length S protein structures were modeled using template-based modeling, de-novo protein structure prediction, and loop modeling techniques in GALAXY modeling suite. Then, using the recently-determined most occupied glycoforms, 22 N-glycans and 1 O-glycan of each monomer were modeled using Glycan Reader & Modeler in CHARMM-GUI. These fully-glycosylated full-length S protein model structures were assessed and further refined against the low-resolution data in their respective experimental maps using ISOLDE. We then used CHARMM-GUI Membrane Builder to place the S proteins in a viral membrane and performed all-atom molecular dynamics simulations. All structures are available in CHARMM-GUI COVID-19 Archive (http://www.charmm-gui.org/docs/archive/covid19), so researchers can use these models to carry out innovative and novel modeling and simulation research for the prevention and treatment of COVID-19.
RESUMEN
We participated in CARPI rounds 38-45 both as a server predictor and a human predictor. These CAPRI rounds provided excellent opportunities for testing prediction methods for three classes of protein interactions, that is, protein-protein, protein-peptide, and protein-oligosaccharide interactions. Both template-based methods (GalaxyTBM for monomer protein, GalaxyHomomer for homo-oligomer protein, GalaxyPepDock for protein-peptide complex) and ab initio docking methods (GalaxyTongDock and GalaxyPPDock for protein oligomer, GalaxyPepDock-ab-initio for protein-peptide complex, GalaxyDock2 and Galaxy7TM for protein-oligosaccharide complex) have been tested. Template-based methods depend heavily on the availability of proper templates and template-target similarity, and template-target difference is responsible for inaccuracy of template-based models. Inaccurate template-based models could be improved by our structure refinement and loop modeling methods based on physics-based energy optimization (GalaxyRefineComplex and GalaxyLoop) for several CAPRI targets. Current ab initio docking methods require accurate protein structures as input. Small conformational changes from input structure could be accounted for by our docking methods, producing one of the best models for several CAPRI targets. However, predicting large conformational changes involving protein backbone is still challenging, and full exploration of physics-based methods for such problems is still to come.
Asunto(s)
Simulación del Acoplamiento Molecular , Oligosacáridos/química , Péptidos/química , Proteínas/química , Programas Informáticos , Secuencia de Aminoácidos , Sitios de Unión , Humanos , Ligandos , Oligosacáridos/metabolismo , Péptidos/metabolismo , Unión Proteica , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Dominios y Motivos de Interacción de Proteínas , Mapeo de Interacción de Proteínas , Multimerización de Proteína , Proteínas/metabolismo , Proyectos de Investigación , Homología Estructural de Proteína , TermodinámicaRESUMEN
Many proteins need to form oligomers to be functional, so oligomer structures provide important clues to biological roles of proteins. Prediction of oligomer structures therefore can be a useful tool in the absence of experimentally resolved structures. In this article, we describe the server and human methods that we used to predict oligomer structures in the CASP13 experiment. Performances of the methods on the 42 CASP13 oligomer targets consisting of 30 homo-oligomers and 12 hetero-oligomers are discussed. Our server method, Seok-assembly, generated models with interface contact similarity measure greater than 0.2 as model 1 for 11 homo-oligomer targets when proper templates existed in the database. Model refinement methods such as loop modeling and molecular dynamics (MD)-based overall refinement failed to improve model qualities when target proteins have domains not covered by templates or when chains have very small interfaces. In human predictions, additional experimental data such as low-resolution electron microscopy (EM) map were utilized. EM data could assist oligomer structure prediction by providing a global shape of the complex structure.