Búsqueda | Biblioteca Virtual en Salud Odontología. Uruguay

1.

De novo design of high-affinity binders of bioactive helical peptides.

Vázquez Torres, Susana; Leung, Philip J Y; Venkatesh, Preetham; Lutz, Isaac D; Hink, Fabian; Huynh, Huu-Hien; Becker, Jessica; Yeh, Andy Hsien-Wei; Juergens, David; Bennett, Nathaniel R; Hoofnagle, Andrew N; Huang, Eric; MacCoss, Michael J; Expòsit, Marc; Lee, Gyu Rie; Bera, Asim K; Kang, Alex; De La Cruz, Joshmyn; Levine, Paul M; Li, Xinting; Lamb, Mila; Gerben, Stacey R; Murray, Analisa; Heine, Piper; Korkmaz, Elif Nihal; Nivala, Jeff; Stewart, Lance; Watson, Joseph L; Rogers, Joseph M; Baker, David.

Nature ; 626(7998): 435-442, 2024 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-38109936

RESUMEN

Many peptide hormones form an α-helix on binding their receptors1-4, and sensitive methods for their detection could contribute to better clinical management of disease5. De novo protein design can now generate binders with high affinity and specificity to structured proteins6,7. However, the design of interactions between proteins and short peptides with helical propensity is an unmet challenge. Here we describe parametric generation and deep learning-based methods for designing proteins to address this challenge. We show that by extending RFdiffusion8 to enable binder design to flexible targets, and to refining input structure models by successive noising and denoising (partial diffusion), picomolar-affinity binders can be generated to helical peptide targets by either refining designs generated with other methods, or completely de novo starting from random noise distributions without any subsequent experimental optimization. The RFdiffusion designs enable the enrichment and subsequent detection of parathyroid hormone and glucagon by mass spectrometry, and the construction of bioluminescence-based protein biosensors. The ability to design binders to conformationally variable targets, and to optimize by partial diffusion both natural and designed proteins, should be broadly useful.

Asunto(s)

Diseño Asistido por Computadora , Aprendizaje Profundo , Péptidos , Proteínas , Técnicas Biosensibles , Difusión , Glucagón/química , Glucagón/metabolismo , Mediciones Luminiscentes , Espectrometría de Masas , Hormona Paratiroidea/química , Hormona Paratiroidea/metabolismo , Péptidos/química , Péptidos/metabolismo , Estructura Secundaria de Proteína , Proteínas/química , Proteínas/metabolismo , Especificidad por Sustrato , Modelos Moleculares

2.

De novo design of luciferases using deep learning.

Yeh, Andy Hsien-Wei; Norn, Christoffer; Kipnis, Yakov; Tischer, Doug; Pellock, Samuel J; Evans, Declan; Ma, Pengchen; Lee, Gyu Rie; Zhang, Jason Z; Anishchenko, Ivan; Coventry, Brian; Cao, Longxing; Dauparas, Justas; Halabiya, Samer; DeWitt, Michelle; Carter, Lauren; Houk, K N; Baker, David.

Nature ; 614(7949): 774-780, 2023 02.

Artículo en Inglés | MEDLINE | ID: mdl-36813896

RESUMEN

De novo enzyme design has sought to introduce active sites and substrate-binding pockets that are predicted to catalyse a reaction of interest into geometrically compatible native scaffolds1,2, but has been limited by a lack of suitable protein structures and the complexity of native protein sequence-structure relationships. Here we describe a deep-learning-based 'family-wide hallucination' approach that generates large numbers of idealized protein structures containing diverse pocket shapes and designed sequences that encode them. We use these scaffolds to design artificial luciferases that selectively catalyse the oxidative chemiluminescence of the synthetic luciferin substrates diphenylterazine3 and 2-deoxycoelenterazine. The designed active sites position an arginine guanidinium group adjacent to an anion that develops during the reaction in a binding pocket with high shape complementarity. For both luciferin substrates, we obtain designed luciferases with high selectivity; the most active of these is a small (13.9 kDa) and thermostable (with a melting temperature higher than 95 °C) enzyme that has a catalytic efficiency on diphenylterazine (kcat/Km = 106 M-1 s-1) comparable to that of native luciferases, but a much higher substrate specificity. The creation of highly active and specific biocatalysts from scratch with broad applications in biomedicine is a key milestone for computational enzyme design, and our approach should enable generation of a wide range of luciferases and other enzymes.

Asunto(s)

Aprendizaje Profundo , Luciferasas , Biocatálisis , Dominio Catalítico , Estabilidad de Enzimas , Calor , Luciferasas/química , Luciferasas/metabolismo , Luciferinas/metabolismo , Luminiscencia , Oxidación-Reducción , Especificidad por Sustrato

3.

GalaxyRefine2: simultaneous refinement of inaccurate local regions and overall protein structure.

Lee, Gyu Rie; Won, Jonghun; Heo, Lim; Seok, Chaok.

Nucleic Acids Res ; 47(W1): W451-W455, 2019 07 02.

Artículo en Inglés | MEDLINE | ID: mdl-31001635

RESUMEN

The 3D structure of a protein can be predicted from its amino acid sequence with high accuracy for a large fraction of cases because of the availability of large quantities of experimental data and the advance of computational algorithms. Recently, deep learning methods exploiting the coevolution information obtained by comparing related protein sequences have been successfully used to generate highly accurate model structures even in the absence of template structure information. However, structures predicted based on either template structures or related sequences require further improvement in regions for which information is missing. Refining a predicted protein structure with insufficient information on certain regions is critical because these regions may be connected to functional specificity that is not conserved among related proteins. The GalaxyRefine2 web server, freely available via http://galaxy.seoklab.org/refine2, is an upgraded version of the GalaxyRefine protein structure refinement server and reflects recent developments successfully tested through CASP blind prediction experiments. This method adopts an iterative optimization approach involving various structure move sets to refine both local and global structures. The estimation of local error and hybridization of available homolog structures are also employed for effective conformation search.

Asunto(s)

Conformación Proteica , Programas Informáticos , Modelos Moleculares , Análisis de Secuencia de Proteína

4.

Biophysical and functional characterization of Norrin signaling through Frizzled4.

Bang, Injin; Kim, Hee Ryung; Beaven, Andrew H; Kim, Jinuk; Ko, Seung-Bum; Lee, Gyu Rie; Kan, Wei; Lee, Hasup; Im, Wonpil; Seok, Chaok; Chung, Ka Young; Choi, Hee-Jung.

Proc Natl Acad Sci U S A ; 115(35): 8787-8792, 2018 08 28.

Artículo en Inglés | MEDLINE | ID: mdl-30104375

RESUMEN

Wnt signaling is initiated by Wnt ligand binding to the extracellular ligand binding domain, called the cysteine-rich domain (CRD), of a Frizzled (Fzd) receptor. Norrin, an atypical Fzd ligand, specifically interacts with Fzd4 to activate ß-catenin-dependent canonical Wnt signaling. Much of the molecular basis that confers Norrin selectivity in binding to Fzd4 was revealed through the structural study of the Fzd4CRD-Norrin complex. However, how the ligand interaction, seemingly localized at the CRD, is transmitted across full-length Fzd4 to the cytoplasm remains largely unknown. Here, we show that a flexible linker domain, which connects the CRD to the transmembrane domain, plays an important role in Norrin signaling. The linker domain directly contributes to the high-affinity interaction between Fzd4 and Norrin as shown by â¼10-fold higher binding affinity of Fzd4CRD to Norrin in the presence of the linker. Swapping the Fzd4 linker with the Fzd5 linker resulted in the loss of Norrin signaling, suggesting the importance of the linker in ligand-specific cellular response. In addition, structural dynamics of Fzd4 associated with Norrin binding investigated by hydrogen/deuterium exchange MS revealed Norrin-induced conformational changes on the linker domain and the intracellular loop 3 (ICL3) region of Fzd4. Cell-based functional assays showed that linker deletion, L430A and L433A mutations at ICL3, and C-terminal tail truncation displayed reduced ß-catenin-dependent signaling activity, indicating the functional significance of these sites. Together, our results provide functional and biochemical dissection of Fzd4 in Norrin signaling.

Asunto(s)

Proteínas del Ojo/química , Receptores Frizzled/química , Proteínas del Tejido Nervioso/química , Vía de Señalización Wnt , Animales , Proteínas del Ojo/metabolismo , Receptores Frizzled/metabolismo , Ratones , Proteínas del Tejido Nervioso/metabolismo , Unión Proteica , Dominios Proteicos , Estructura Cuaternaria de Proteína , Estructura Secundaria de Proteína , Relación Estructura-Actividad

5.

High-accuracy refinement using Rosetta in CASP13.

Park, Hahnbeom; Lee, Gyu Rie; Kim, David E; Anishchenko, Ivan; Cong, Qian; Baker, David.

Proteins ; 87(12): 1276-1282, 2019 12.

Artículo en Inglés | MEDLINE | ID: mdl-31325340

RESUMEN

Because proteins generally fold to their lowest free energy states, energy-guided refinement in principle should be able to systematically improve the quality of protein structure models generated using homologous structure or co-evolution derived information. However, because of the high dimensionality of the search space, there are far more ways to degrade the quality of a near native model than to improve it, and hence, refinement methods are very sensitive to energy function errors. In the 13th Critial Assessment of techniques for protein Structure Prediction (CASP13), we sought to carry out a thorough search for low energy states in the neighborhood of a starting model using restraints to avoid straying too far. The approach was reasonably successful in improving both regions largely incorrect in the starting models as well as core regions that started out closer to the correct structure. Models with GDT-HA over 70 were obtained for five targets and for one of those, an accuracy of 0.5 å backbone root-mean-square deviation (RMSD) was achieved. An important current challenge is to improve performance in refining oligomers and larger proteins, for which the search problem remains extremely difficult.

Asunto(s)

Biología Computacional/métodos , Conformación Proteica , Pliegue de Proteína , Proteínas/química , Algoritmos , Modelos Moleculares , Reproducibilidad de los Resultados , Termodinámica

6.

Cell-cell adhesion in metazoans relies on evolutionarily conserved features of the α-catenin·ß-catenin-binding interface.

Shao, Xiangqiang; Kang, Hyunook; Loveless, Timothy; Lee, Gyu Rie; Seok, Chaok; Weis, William I; Choi, Hee-Jung; Hardin, Jeff.

J Biol Chem ; 292(40): 16477-16490, 2017 10 06.

Artículo en Inglés | MEDLINE | ID: mdl-28842483

RESUMEN

Stable tissue integrity during embryonic development relies on the function of the cadherin·catenin complex (CCC). The Caenorhabditis elegans CCC is a useful paradigm for analyzing in vivo requirements for specific interactions among the core components of the CCC, and it provides a unique opportunity to examine evolutionarily conserved mechanisms that govern the interaction between α- and ß-catenin. HMP-1, unlike its mammalian homolog α-catenin, is constitutively monomeric, and its binding affinity for HMP-2/ß-catenin is higher than that of α-catenin for ß-catenin. A crystal structure shows that the HMP-1·HMP-2 complex forms a five-helical bundle structure distinct from the structure of the mammalian α-catenin·ß-catenin complex. Deletion analysis based on the crystal structure shows that the first helix of HMP-1 is necessary for binding HMP-2 avidly in vitro and for efficient recruitment of HMP-1 to adherens junctions in embryos. HMP-2 Ser-47 and Tyr-69 flank its binding interface with HMP-1, and we show that phosphomimetic mutations at these two sites decrease binding affinity of HMP-1 to HMP-2 by 40-100-fold in vitro. In vivo experiments using HMP-2 S47E and Y69E mutants showed that they are unable to rescue hmp-2(zu364) mutants, suggesting that phosphorylation of HMP-2 on Ser-47 and Tyr-69 could be important for regulating CCC formation in C. elegans Our data provide novel insights into how cadherin-dependent cell-cell adhesion is modulated in metazoans by conserved elements as well as features unique to specific organisms.

Asunto(s)

Proteínas de Caenorhabditis elegans/metabolismo , Caenorhabditis elegans/embriología , Comunicación Celular/fisiología , Proteínas del Citoesqueleto/metabolismo , Complejos Multiproteicos/metabolismo , alfa Catenina/metabolismo , Sustitución de Aminoácidos , Animales , Caenorhabditis elegans/química , Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/química , Proteínas de Caenorhabditis elegans/genética , Adhesión Celular/fisiología , Cristalografía por Rayos X , Proteínas del Citoesqueleto/química , Proteínas del Citoesqueleto/genética , Complejos Multiproteicos/química , Complejos Multiproteicos/genética , Mutación Missense , Estructura Cuaternaria de Proteína , alfa Catenina/química , alfa Catenina/genética

7.

Simultaneous refinement of inaccurate local regions and overall structure in the CASP12 protein model refinement experiment.

Lee, Gyu Rie; Heo, Lim; Seok, Chaok.

Proteins ; 86 Suppl 1: 168-176, 2018 03.

Artículo en Inglés | MEDLINE | ID: mdl-29044810

RESUMEN

Advances in protein model refinement techniques are required as diverse sources of protein structure information are available from low-resolution experiments or informatics-based computations such as cryo-EM, NMR, homology models, or predicted residue contacts. Given semi-reliable or incomplete structural information, structure quality of a protein model has to be improved by ab initio methods such as energy-based simulation. In this study, we describe a new automatic refinement server method designed to improve locally inaccurate regions and overall structure simultaneously. Locally inaccurate regions may occur in protein structures due to non-convergent or missing information in template structures used in homology modeling or due to intrinsic structural flexibilities not resolved by experimental techniques. However, such variable or dynamic regions often play important functional roles by participating in interactions with other biomolecules or in transitions between different functional states. The new refinement method introduced here utilizes diverse types of geometric operators which drive both local and global changes, and the effect of structure changes and relaxations are accumulated. This resulted in consistent refinement of both local and global structural features. Performance of this method in CASP12 is discussed.

Asunto(s)

Biología Computacional/métodos , Aprendizaje Automático , Modelos Moleculares , Conformación Proteica , Dominios y Motivos de Interacción de Proteínas , Proteínas/química , Algoritmos , Cristalografía por Rayos X , Humanos , Simulación de Dinámica Molecular , Análisis de Secuencia de Proteína

8.

GalaxyGPCRloop: Template-Based and Ab Initio Structure Sampling of the Extracellular Loops of G-Protein-Coupled Receptors.

Won, Jonghun; Lee, Gyu Rie; Park, Hahnbeom; Seok, Chaok.

J Chem Inf Model ; 58(6): 1234-1243, 2018 06 25.

Artículo en Inglés | MEDLINE | ID: mdl-29786430

RESUMEN

The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In this study, a new ECL2 conformational sampling method involving both template-based and ab initio sampling was developed. Inspired by the observation of similar ECL2 structures of closely related GPCRs, a template-based sampling method employing loop structure templates selected from the structure database was developed. A new metric for evaluating similarity of the target loop to templates was introduced for template selection. An ab initio loop sampling method was also developed to treat cases without highly similar templates. The ab initio method is based on the previously developed fragment assembly and loop closure method. A new sampling component that takes advantage of secondary structure prediction was added. In addition, a conserved disulfide bridge restraining ECL2 conformation was predicted and analytically incorporated into sampling, reducing the effective dimension of the conformational search space. The sampling method was combined with an existing energy function for comparison with previously reported loop structure prediction methods, and the benchmark test demonstrated outstanding performance.

Asunto(s)

Receptores Acoplados a Proteínas G/química , Animales , Bases de Datos de Proteínas , Disulfuros/química , Humanos , Modelos Moleculares , Conformación Proteica , Estructura Secundaria de Proteína

9.

Galaxy7TM: flexible GPCR-ligand docking by structure refinement.

Lee, Gyu Rie; Seok, Chaok.

Nucleic Acids Res ; 44(W1): W502-6, 2016 07 08.

Artículo en Inglés | MEDLINE | ID: mdl-27131365

RESUMEN

G-protein-coupled receptors (GPCRs) play important physiological roles related to signal transduction and form a major group of drug targets. Prediction of GPCR-ligand complex structures has therefore important implications to drug discovery. With previously available servers, it was only possible to first predict GPCR structures by homology modeling and then perform ligand docking on the model structures. However, model structures generated without explicit consideration of specific ligands of interest can be inaccurate because GPCR structures can be affected by ligand binding. The Galaxy7TM server, freely accessible at http://galaxy.seoklab.org/7TM, improves an input GPCR structure by simultaneous ligand docking and flexible structure refinement using GALAXY methods. The server shows better performance in both ligand docking and GPCR structure refinement than commonly used programs AutoDock Vina and Rosetta MPrelax, respectively.

Asunto(s)

Internet , Simulación del Acoplamiento Molecular , Receptores Acoplados a Proteínas G/química , Receptores Acoplados a Proteínas G/metabolismo , Programas Informáticos , Azepinas/química , Azepinas/metabolismo , Humanos , Ligandos , Receptores de Orexina/química , Receptores de Orexina/metabolismo , Triazoles/química , Triazoles/metabolismo

10.

Benchmarking predictions of allostery in liver pyruvate kinase in CAGI4.

Xu, Qifang; Tang, Qingling; Katsonis, Panagiotis; Lichtarge, Olivier; Jones, David; Bovo, Samuele; Babbi, Giulia; Martelli, Pier L; Casadio, Rita; Lee, Gyu Rie; Seok, Chaok; Fenton, Aron W; Dunbrack, Roland L.

Hum Mutat ; 38(9): 1123-1131, 2017 09.

Artículo en Inglés | MEDLINE | ID: mdl-28370845

RESUMEN

The Critical Assessment of Genome Interpretation (CAGI) is a global community experiment to objectively assess computational methods for predicting phenotypic impacts of genomic variation. One of the 2015-2016 competitions focused on predicting the influence of mutations on the allosteric regulation of human liver pyruvate kinase. More than 30 different researchers accessed the challenge data. However, only four groups accepted the challenge. Features used for predictions ranged from evolutionary constraints, mutant site locations relative to active and effector binding sites, and computational docking outputs. Despite the range of expertise and strategies used by predictors, the best predictions were marginally greater than random for modified allostery resulting from mutations. In contrast, several groups successfully predicted which mutations severely reduced enzymatic activity. Nonetheless, poor predictions of allostery stands in stark contrast to the impression left by more than 700 PubMed entries identified using the identifiers "computational + allosteric." This contrast highlights a specialized need for new computational tools and utilization of benchmarks that focus on allosteric regulation.

Asunto(s)

Benchmarking/métodos , Piruvato Quinasa/química , Piruvato Quinasa/genética , Regulación Alostérica , Sitio Alostérico , Biología Computacional/métodos , Bases de Datos Genéticas , Fructosadifosfatos/metabolismo , Humanos , Modelos Moleculares , Mutación , Piruvato Quinasa/metabolismo

11.

Template-based modeling and ab initio refinement of protein oligomer structures using GALAXY in CAPRI round 30.

Lee, Hasup; Baek, Minkyung; Lee, Gyu Rie; Park, Sangwoo; Seok, Chaok.

Proteins ; 85(3): 399-407, 2017 03.

Artículo en Inglés | MEDLINE | ID: mdl-27770545

RESUMEN

Many proteins function as homo- or hetero-oligomers; therefore, attempts to understand and regulate protein functions require knowledge of protein oligomer structures. The number of available experimental protein structures is increasing, and oligomer structures can be predicted using the experimental structures of related proteins as templates. However, template-based models may have errors due to sequence differences between the target and template proteins, which can lead to functional differences. Such structural differences may be predicted by loop modeling of local regions or refinement of the overall structure. In CAPRI (Critical Assessment of PRotein Interactions) round 30, we used recently developed features of the GALAXY protein modeling package, including template-based structure prediction, loop modeling, model refinement, and protein-protein docking to predict protein complex structures from amino acid sequences. Out of the 25 CAPRI targets, medium and acceptable quality models were obtained for 14 and 1 target(s), respectively, for which proper oligomer or monomer templates could be detected. Symmetric interface loop modeling on oligomer model structures successfully improved model quality, while loop modeling on monomer model structures failed. Overall refinement of the predicted oligomer structures consistently improved the model quality, in particular in interface contacts. Proteins 2017; 85:399-407. © 2016 Wiley Periodicals, Inc.

Asunto(s)

Algoritmos , Biología Computacional/métodos , Simulación del Acoplamiento Molecular/métodos , Proteínas/química , Secuencia de Aminoácidos , Benchmarking , Sitios de Unión , Unión Proteica , Conformación Proteica , Multimerización de Proteína , Proyectos de Investigación , Programas Informáticos , Homología Estructural de Proteína

12.

Effective protein model structure refinement by loop modeling and overall relaxation.

Lee, Gyu Rie; Heo, Lim; Seok, Chaok.

Proteins ; 84 Suppl 1: 293-301, 2016 09.

Artículo en Inglés | MEDLINE | ID: mdl-26172288

RESUMEN

Protein structures predicted by state-of-the-art template-based methods may still have errors when the template proteins are not similar enough to the target protein. Overall target structure may deviate from the template structures owing to differences in sequences. Structural information for some local regions such as loops may not be available when there are sequence insertions or deletions. Those structural aspects that originate from deviations from templates can be dealt with by ab initio structure refinement methods to further improve model accuracy. In the CASP11 refinement experiment, we tested three different refinement methods that utilize overall structure relaxation, loop modeling, and quality assessment of multiple initial structures. From this experiment, we conclude that the overall relaxation method can consistently improve model quality. Loop modeling is the most useful when the initial model structure is high quality, with GDT-HA >60. The method that used multiple initial structures further refined the already refined models; the minor improvements with this method raise the issue of problem with the current energy function. Future research directions are also discussed. Proteins 2016; 84(Suppl 1):293-301. © 2015 Wiley Periodicals, Inc.

Asunto(s)

Biología Computacional/estadística & datos numéricos , Modelos Moleculares , Modelos Estadísticos , Proteínas/química , Programas Informáticos , Algoritmos , Secuencias de Aminoácidos , Biología Computacional/métodos , Simulación por Computador , Humanos , Internet , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Pliegue de Proteína , Dominios y Motivos de Interacción de Proteínas , Estructura Terciaria de Proteína , Homología de Secuencia de Aminoácido , Termodinámica

13.

Evaluation of GalaxyDock Based on the Community Structure-Activity Resource 2013 and 2014 Benchmark Studies.

Shin, Woong-Hee; Lee, Gyu Rie; Seok, Chaok.

J Chem Inf Model ; 56(6): 988-95, 2016 06 27.

Artículo en Inglés | MEDLINE | ID: mdl-26583962

RESUMEN

We analyze the results of the GalaxyDock protein-ligand docking program in the two recent experiments of Community Structure-Activity Resource (CSAR), CSAR 2013 and 2014. GalaxyDock performs global optimization of a modified AutoDock3 energy function by employing the conformational space annealing method. The energy function of GalaxyDock is quite sensitive to atomic clashes. Such energy functions can be effective for sampling physically correct conformations but may not be effective for scoring when conformations are not fully optimized. In phase 1 of CSAR 2013, we successfully selected all four true binders of digoxigenin along with three false positives. However, the energy values were rather high due to insufficient optimization of the conformations docked to homology models. A posteriori relaxation of the model complex structures by GalaxyRefine improved the docking energy values and differentiated the true binders from the false positives better. In the scoring test of CSAR 2013 phase 2, we selected the best poses for each of the two targets. The results of CSAR 2013 phase 3 suggested that an improved method for generating initial conformations for GalaxyDock is necessary for targets involving bulky ligands. Finally, combining existing binding information with GalaxyDock energy-based optimization may be needed for more accurate binding affinity prediction.

Asunto(s)

Simulación del Acoplamiento Molecular , Benchmarking , Ligandos , Conformación Proteica , Proteínas/química , Proteínas/metabolismo , Relación Estructura-Actividad

14.

Protein Ensemble Generation Through Variational Autoencoder Latent Space Sampling.

Mansoor, Sanaa; Baek, Minkyung; Park, Hahnbeom; Lee, Gyu Rie; Baker, David.

J Chem Theory Comput ; 20(7): 2689-2695, 2024 Apr 09.

Artículo en Inglés | MEDLINE | ID: mdl-38547871

RESUMEN

Mapping the ensemble of protein conformations that contribute to function and can be targeted by small molecule drugs remains an outstanding challenge. Here, we explore the use of variational autoencoders for reducing the challenge of dimensionality in the protein structure ensemble generation problem. We convert high-dimensional protein structural data into a continuous, low-dimensional representation, carry out a search in this space guided by a structure quality metric, and then use RoseTTAFold guided by the sampled structural information to generate 3D structures. We use this approach to generate ensembles for the cancer relevant protein K-Ras, train the VAE on a subset of the available K-Ras crystal structures and MD simulation snapshots, and assess the extent of sampling close to crystal structures withheld from training. We find that our latent space sampling procedure rapidly generates ensembles with high structural quality and is able to sample within 1 Å of held-out crystal structures, with a consistency higher than that of MD simulation or AlphaFold2 prediction. The sampled structures sufficiently recapitulate the cryptic pockets in the held-out K-Ras structures to allow for small molecule docking.

Asunto(s)

Proteínas , Proteínas/química , Conformación Proteica , Simulación por Computador

15.

Modeling protein-small molecule conformational ensembles with ChemNet.

Anishchenko, Ivan; Kipnis, Yakov; Kalvet, Indrek; Zhou, Guangfeng; Krishna, Rohith; Pellock, Samuel J; Lauko, Anna; Lee, Gyu Rie; An, Linna; Dauparas, Justas; DiMaio, Frank; Baker, David.

bioRxiv ; 2024 Sep 25.

Artículo en Inglés | MEDLINE | ID: mdl-39386615

RESUMEN

Modeling the conformational heterogeneity of protein-small molecule systems is an outstanding challenge. We reasoned that while residue level descriptions of biomolecules are efficient for de novo structure prediction, for probing heterogeneity of interactions with small molecules in the folded state an entirely atomic level description could have advantages in speed and generality. We developed a graph neural network called ChemNet trained to recapitulate correct atomic positions from partially corrupted input structures from the Cambridge Structural Database and the Protein Data Bank; the nodes of the graph are the atoms in the system. ChemNet accurately generates structures of diverse organic small molecules given knowledge of their atom composition and bonding, and given a description of the larger protein context, and builds up structures of small molecules and protein side chains for protein-small molecule docking. Because ChemNet is rapid and stochastic, ensembles of predictions can be readily generated to map conformational heterogeneity. In enzyme design efforts described here and elsewhere, we find that using ChemNet to assess the accuracy and pre-organization of the designed active sites results in higher success rates and higher activities; we obtain a preorganized retroaldolase with a k cat / K M of 11000 M -1 min - 1 , considerably higher than any pre-deep learning design for this reaction. We anticipate that ChemNet will be widely useful for rapidly generating conformational ensembles of small molecule and small molecule-protein systems, and for designing higher activity preorganized enzymes.

16.

Generalized biomolecular modeling and design with RoseTTAFold All-Atom.

Krishna, Rohith; Wang, Jue; Ahern, Woody; Sturmfels, Pascal; Venkatesh, Preetham; Kalvet, Indrek; Lee, Gyu Rie; Morey-Burrows, Felix S; Anishchenko, Ivan; Humphreys, Ian R; McHugh, Ryan; Vafeados, Dionne; Li, Xinting; Sutherland, George A; Hitchcock, Andrew; Hunter, C Neil; Kang, Alex; Brackenbrough, Evans; Bera, Asim K; Baek, Minkyung; DiMaio, Frank; Baker, David.

Science ; 384(6693): eadl2528, 2024 Apr 19.

Artículo en Inglés | MEDLINE | ID: mdl-38452047

RESUMEN

Deep-learning methods have revolutionized protein structure prediction and design but are presently limited to protein-only systems. We describe RoseTTAFold All-Atom (RFAA), which combines a residue-based representation of amino acids and DNA bases with an atomic representation of all other groups to model assemblies that contain proteins, nucleic acids, small molecules, metals, and covalent modifications, given their sequences and chemical structures. By fine-tuning on denoising tasks, we developed RFdiffusion All-Atom (RFdiffusionAA), which builds protein structures around small molecules. Starting from random distributions of amino acid residues surrounding target small molecules, we designed and experimentally validated, through crystallography and binding measurements, proteins that bind the cardiac disease therapeutic digoxigenin, the enzymatic cofactor heme, and the light-harvesting molecule bilin.

Asunto(s)

Aprendizaje Profundo , Ingeniería de Proteínas , Proteínas , Aminoácidos/química , Cristalografía , ADN/química , Modelos Moleculares , Proteínas/química , Ingeniería de Proteínas/métodos

17.

Binding and sensing diverse small molecules using shape-complementary pseudocycles.

An, Linna; Said, Meerit; Tran, Long; Majumder, Sagardip; Goreshnik, Inna; Lee, Gyu Rie; Juergens, David; Dauparas, Justas; Anishchenko, Ivan; Coventry, Brian; Bera, Asim K; Kang, Alex; Levine, Paul M; Alvarez, Valentina; Pillai, Arvind; Norn, Christoffer; Feldman, David; Zorine, Dmitri; Hicks, Derrick R; Li, Xinting; Sanchez, Mariana Garcia; Vafeados, Dionne K; Salveson, Patrick J; Vorobieva, Anastassia A; Baker, David.

Science ; 385(6706): 276-282, 2024 Jul 19.

Artículo en Inglés | MEDLINE | ID: mdl-39024436

RESUMEN

We describe an approach for designing high-affinity small molecule-binding proteins poised for downstream sensing. We use deep learning-generated pseudocycles with repeating structural units surrounding central binding pockets with widely varying shapes that depend on the geometry and number of the repeat units. We dock small molecules of interest into the most shape complementary of these pseudocycles, design the interaction surfaces for high binding affinity, and experimentally screen to identify designs with the highest affinity. We obtain binders to four diverse molecules, including the polar and flexible methotrexate and thyroxine. Taking advantage of the modular repeat structure and central binding pockets, we construct chemically induced dimerization systems and low-noise nanopore sensors by splitting designs into domains that reassemble upon ligand addition.

Asunto(s)

Aprendizaje Profundo , Unión Proteica , Proteínas , Bibliotecas de Moléculas Pequeñas , Sitios de Unión , Ligandos , Metotrexato/química , Simulación del Acoplamiento Molecular , Nanoporos , Multimerización de Proteína , Proteínas/química , Bibliotecas de Moléculas Pequeñas/química , Tiroxina/química

18.

Evaluating GPCR modeling and docking strategies in the era of deep learning-based protein structure prediction.

Lee, Sumin; Kim, Seeun; Lee, Gyu Rie; Kwon, Sohee; Woo, Hyeonuk; Seok, Chaok; Park, Hahnbeom.

Comput Struct Biotechnol J ; 21: 158-167, 2023.

Artículo en Inglés | MEDLINE | ID: mdl-36544468

RESUMEN

While deep learning (DL) has brought a revolution in the protein structure prediction field, still an important question remains how the revolution can be transferred to advances in structure-based drug discovery. Because the lessons from the recent GPCR dock challenge were inconclusive primarily due to the size of the dataset, in this work we further elaborated on 70 diverse GPCR complexes bound to either small molecules or peptides to investigate the best-practice modeling and docking strategies for GPCR drug discovery. From our quantitative analysis, it is shown that substantial improvements in docking and virtual screening have been possible by the advance in DL-based protein structure predictions with respect to the expected results from the combination of best pre-DL tools. The success rate of docking on DL-based model structures approaches that of cross-docking on experimental structures, showing over 30% improvement from the best pre-DL protocols. This amount of performance could be achieved only when two modeling points were considered properly: 1) correct functional-state modeling of receptors and 2) receptor-flexible docking. Best-practice modeling strategies and the model confidence estimation metric suggested in this work may serve as a guideline for future computer-aided GPCR drug discovery scenarios.

19.

Computational design of sequence-specific DNA-binding proteins.

Glasscock, Cameron J; Pecoraro, Robert; McHugh, Ryan; Doyle, Lindsey A; Chen, Wei; Boivin, Olivier; Lonnquist, Beau; Na, Emily; Politanska, Yuliya; Haddox, Hugh K; Cox, David; Norn, Christoffer; Coventry, Brian; Goreshnik, Inna; Vafeados, Dionne; Lee, Gyu Rie; Gordan, Raluca; Stoddard, Barry L; DiMaio, Frank; Baker, David.

bioRxiv ; 2023 Sep 21.

Artículo en Inglés | MEDLINE | ID: mdl-37790440

RESUMEN

Sequence-specific DNA-binding proteins (DBPs) play critical roles in biology and biotechnology, and there has been considerable interest in the engineering of DBPs with new or altered specificities for genome editing and other applications. While there has been some success in reprogramming naturally occurring DBPs using selection methods, the computational design of new DBPs that recognize arbitrary target sites remains an outstanding challenge. We describe a computational method for the design of small DBPs that recognize specific target sequences through interactions with bases in the major groove, and employ this method in conjunction with experimental screening to generate binders for 5 distinct DNA targets. These binders exhibit specificity closely matching the computational models for the target DNA sequences at as many as 6 base positions and affinities as low as 30-100 nM. The crystal structure of a designed DBP-target site complex is in close agreement with the design model, highlighting the accuracy of the design method. The designed DBPs function in both Escherichia coli and mammalian cells to repress and activate transcription of neighboring genes. Our method is a substantial step towards a general route to small and hence readily deliverable sequence-specific DBPs for gene regulation and editing.

20.

Small-molecule binding and sensing with a designed protein family.

Lee, Gyu Rie; Pellock, Samuel J; Norn, Christoffer; Tischer, Doug; Dauparas, Justas; Anischenko, Ivan; Mercer, Jaron A M; Kang, Alex; Bera, Asim; Nguyen, Hannah; Goreshnik, Inna; Vafeados, Dionne; Roullier, Nicole; Han, Hannah L; Coventry, Brian; Haddox, Hugh K; Liu, David R; Yeh, Andy Hsien-Wei; Baker, David.

bioRxiv ; 2023 Nov 02.

Artículo en Inglés | MEDLINE | ID: mdl-37961294

RESUMEN

Despite transformative advances in protein design with deep learning, the design of small-molecule-binding proteins and sensors for arbitrary ligands remains a grand challenge. Here we combine deep learning and physics-based methods to generate a family of proteins with diverse and designable pocket geometries, which we employ to computationally design binders for six chemically and structurally distinct small-molecule targets. Biophysical characterization of the designed binders revealed nanomolar to low micromolar binding affinities and atomic-level design accuracy. The bound ligands are exposed at one edge of the binding pocket, enabling the de novo design of chemically induced dimerization (CID) systems; we take advantage of this to create a biosensor with nanomolar sensitivity for cortisol. Our approach provides a general method to design proteins that bind and sense small molecules for a wide range of analytical, environmental, and biomedical applications.

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA