RESUMEN
Activating precursor B cell receptors of HIV-1 broadly neutralizing antibodies requires specifically designed immunogens. Here, we compared the abilities of three such germline-targeting immunogens against the VRC01-class receptors to activate the targeted B cells in transgenic mice expressing the germline VH of the VRC01 antibody but diverse mouse light chains. Immunogen-specific VRC01-like B cells were isolated at different time points after immunization, their VH and VL genes were sequenced, and the corresponding antibodies characterized. VRC01 B cell sub-populations with distinct cross-reactivity properties were activated by each immunogen, and these differences correlated with distinct biophysical and biochemical features of the germline-targeting immunogens. Our study indicates that the design of effective immunogens to activate B cell receptors leading to protective HIV-1 antibodies will require a better understanding of how the biophysical properties of the epitope and its surrounding surface on the germline-targeting immunogen influence its interaction with the available receptor variants in vivo.
Asunto(s)
Anticuerpos Monoclonales/inmunología , Antígenos/inmunología , Linfocitos B/inmunología , Anticuerpos ampliamente neutralizantes/inmunología , Epítopos de Linfocito B/inmunología , Anticuerpos Anti-VIH/inmunología , VIH-1/inmunología , Receptores de Antígenos de Linfocitos B/inmunología , Secuencia de Aminoácidos , Animales , Anticuerpos Neutralizantes/inmunología , Línea Celular , Femenino , Células Germinativas/inmunología , Células HEK293 , Infecciones por VIH/inmunología , Humanos , Masculino , Ratones TransgénicosRESUMEN
Proteins mediate their functions through chemical interactions; modeling these interactions, which are typically through sidechains, is an important need in protein design. However, constructing an all-atom generative model requires an appropriate scheme for managing the jointly continuous and discrete nature of proteins encoded in the structure and sequence. We describe an all-atom diffusion model of protein structure, Protpardelle, which represents all sidechain states at once as a "superposition" state; superpositions defining a protein are collapsed into individual residue types and conformations during sample generation. When combined with sequence design methods, our model is able to codesign all-atom protein structure and sequence. Generated proteins are of good quality under the typical quality, diversity, and novelty metrics, and sidechains reproduce the chemical features and behavior of natural proteins. Finally, we explore the potential of our model to conduct all-atom protein design and scaffold functional motifs in a backbone- and rotamer-free way.
Asunto(s)
Modelos Moleculares , Conformación Proteica , Proteínas , Proteínas/química , Secuencia de AminoácidosRESUMEN
Transmembrane channels and pores have key roles in fundamental biological processes1 and in biotechnological applications such as DNA nanopore sequencing2-4, resulting in considerable interest in the design of pore-containing proteins. Synthetic amphiphilic peptides have been found to form ion channels5,6, and there have been recent advances in de novo membrane protein design7,8 and in redesigning naturally occurring channel-containing proteins9,10. However, the de novo design of stable, well-defined transmembrane protein pores that are capable of conducting ions selectively or are large enough to enable the passage of small-molecule fluorophores remains an outstanding challenge11,12. Here we report the computational design of protein pores formed by two concentric rings of α-helices that are stable and monodisperse in both their water-soluble and their transmembrane forms. Crystal structures of the water-soluble forms of a 12-helical pore and a 16-helical pore closely match the computational design models. Patch-clamp electrophysiology experiments show that, when expressed in insect cells, the transmembrane form of the 12-helix pore enables the passage of ions across the membrane with high selectivity for potassium over sodium; ion passage is blocked by specific chemical modification at the pore entrance. When incorporated into liposomes using in vitro protein synthesis, the transmembrane form of the 16-helix pore-but not the 12-helix pore-enables the passage of biotinylated Alexa Fluor 488. A cryo-electron microscopy structure of the 16-helix transmembrane pore closely matches the design model. The ability to produce structurally and functionally well-defined transmembrane pores opens the door to the creation of designer channels and pores for a wide variety of applications.
Asunto(s)
Simulación por Computador , Genes Sintéticos/genética , Canales Iónicos/química , Canales Iónicos/genética , Modelos Moleculares , Biología Sintética , Línea Celular , Microscopía por Crioelectrón , Cristalografía por Rayos X , Conductividad Eléctrica , Escherichia coli/genética , Escherichia coli/metabolismo , Hidrazinas , Canales Iónicos/metabolismo , Transporte Iónico , Liposomas/metabolismo , Técnicas de Placa-Clamp , Porinas/química , Porinas/genética , Porinas/metabolismo , Ingeniería de Proteínas , Estructura Secundaria de Proteína , Solubilidad , Agua/químicaRESUMEN
Nanobodies bind a target antigen with a kinetic profile similar to a conventional antibody, but exist as a single heavy chain domain that can be readily multimerized to engage antigen via multiple interactions. Presently, most nanobodies are produced by immunizing camelids; however, platforms for animal-free production are growing in popularity. Here, we describe the development of a fully synthetic nanobody library based on an engineered human VH3-23 variable gene and a multispecific antibody-like format designed for biparatopic target engagement. To validate our library, we selected nanobodies against the SARS-CoV-2 receptor-binding domain and employed an on-yeast epitope binning strategy to rapidly map the specificities of the selected nanobodies. We then generated antibody-like molecules by replacing the VH and VL domains of a conventional antibody with two different nanobodies, designed as a molecular clamp to engage the receptor-binding domain biparatopically. The resulting bispecific tetra-nanobody immunoglobulins neutralized diverse SARS-CoV-2 variants with potencies similar to antibodies isolated from convalescent donors. Subsequent biochemical analyses confirmed the accuracy of the on-yeast epitope binning and structures of both individual nanobodies, and a tetra-nanobody immunoglobulin revealed that the intended mode of interaction had been achieved. This overall workflow is applicable to nearly any protein target and provides a blueprint for a modular workflow for the development of multispecific molecules.
Asunto(s)
COVID-19 , Anticuerpos de Dominio Único , Humanos , Anticuerpos de Dominio Único/química , Saccharomyces cerevisiae/metabolismo , SARS-CoV-2 , Anticuerpos , EpítoposRESUMEN
Affinity maturation of proteinprotein interactions is an important approach in the development of therapeutic proteins such as cytokines. Typical experimental strategies involve targeting the cytokine-receptor interface with combinatorial libraries and then selecting for higher-affinity variants. Mutations to the binding scaffold are usually not considered main drivers for improved affinity. Here we demonstrate that computational design can provide affinity-enhanced variants of interleukin-2 (IL-2) "out of the box" without any requirement for interface engineering. Using a strategy of global IL-2 structural stabilization targeting metastable regions of the three-dimensional structure, rather than the receptor binding interfaces, we computationally designed thermostable IL-2 variants with up to 40-fold higher affinity for IL-2Rß without any library-based optimization. These IL-2 analogs exhibited CD25-independent activities on T and natural killer (NK) cells both in vitro and in vivo, mimicking the properties of the IL-2 superkine "super-2" that was engineered through yeast surface display [A. M. Levin et al., Nature, 484, 529533 (2012)]. Structure-guided stabilization of cytokines is a powerful approach to affinity maturation with applications to many cytokine and proteinprotein interactions.
Asunto(s)
Interleucina-2 , Proteínas , Biología Computacional/métodos , Interleucina-2/genética , Ingeniería de Proteínas/métodos , Proteínas/metabolismo , Saccharomyces cerevisiae/metabolismoRESUMEN
The regular arrangements of ß-strands around a central axis in ß-barrels and of α-helices in coiled coils contrast with the irregular tertiary structures of most globular proteins, and have fascinated structural biologists since they were first discovered. Simple parametric models have been used to design a wide range of α-helical coiled-coil structures, but to date there has been no success with ß-barrels. Here we show that accurate de novo design of ß-barrels requires considerable symmetry-breaking to achieve continuous hydrogen-bond connectivity and eliminate backbone strain. We then build ensembles of ß-barrel backbone models with cavity shapes that match the fluorogenic compound DFHBI, and use a hierarchical grid-based search method to simultaneously optimize the rigid-body placement of DFHBI in these cavities and the identities of the surrounding amino acids to achieve high shape and chemical complementarity. The designs have high structural accuracy and bind and fluorescently activate DFHBI in vitro and in Escherichia coli, yeast and mammalian cells. This de novo design of small-molecule binding activity, using backbones custom-built to bind the ligand, should enable the design of increasingly sophisticated ligand-binding proteins, sensors and catalysts that are not limited by the backbone geometries available in known protein structures.
Asunto(s)
Compuestos de Bencilo/química , Fluorescencia , Imidazolinas/química , Proteínas/química , Animales , Compuestos de Bencilo/análisis , Células COS , Chlorocebus aethiops , Escherichia coli , Proteínas Fluorescentes Verdes/genética , Proteínas Fluorescentes Verdes/metabolismo , Enlace de Hidrógeno , Imidazolinas/análisis , Ligandos , Unión Proteica , Dominios Proteicos , Pliegue de Proteína , Estabilidad Proteica , Estructura Secundaria de Proteína , Reproducibilidad de los Resultados , LevadurasRESUMEN
The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org.
Asunto(s)
Sustancias Macromoleculares/química , Modelos Moleculares , Proteínas/química , Programas Informáticos , Simulación del Acoplamiento Molecular , Peptidomiméticos/química , Conformación ProteicaRESUMEN
Precision tools for spatiotemporal control of cytoskeletal motor function are needed to dissect fundamental biological processes ranging from intracellular transport to cell migration and division. Direct optical control of motor speed and direction is one promising approach, but it remains a challenge to engineer controllable motors with desirable properties such as the speed and processivity required for transport applications in living cells. Here, we develop engineered myosin motors that combine large optical modulation depths with high velocities, and create processive myosin motors with optically controllable directionality. We characterize the performance of the motors using in vitro motility assays, single-molecule tracking and live-cell imaging. Bidirectional processive motors move efficiently toward the tips of cellular protrusions in the presence of blue light, and can transport molecular cargo in cells. Robust gearshifting myosins will further enable programmable transport in contexts ranging from in vitro active matter reconstitutions to microfabricated systems that harness molecular propulsion.
Asunto(s)
Actinina/química , Células Epiteliales/metabolismo , Miosinas/química , Neuronas/metabolismo , Ingeniería de Proteínas/métodos , Espectrina/química , Actinina/genética , Actinina/metabolismo , Animales , Avena , Línea Celular , Chara , Pollos , Clonación Molecular , Dictyostelium , Células Epiteliales/citología , Células Epiteliales/efectos de la radiación , Escherichia coli/genética , Escherichia coli/metabolismo , Expresión Génica , Vectores Genéticos/química , Vectores Genéticos/metabolismo , Hipocampo/citología , Hipocampo/metabolismo , Humanos , Luz , Modelos Moleculares , Movimiento (Física) , Miosinas/genética , Miosinas/metabolismo , Neuronas/citología , Neuronas/efectos de la radiación , Óptica y Fotónica/métodos , Cultivo Primario de Células , Ratas , Proteínas Recombinantes de Fusión/química , Proteínas Recombinantes de Fusión/genética , Proteínas Recombinantes de Fusión/metabolismo , Espectrina/genética , Espectrina/metabolismo , NicotianaRESUMEN
While deep learning models have seen increasing applications in protein science, few have been implemented for protein backbone generation-an important task in structure-based problems such as active site and interface design. We present a new approach to building class-specific backbones, using a variational auto-encoder to directly generate the 3D coordinates of immunoglobulins. Our model is torsion- and distance-aware, learns a high-resolution embedding of the dataset, and generates novel, high-quality structures compatible with existing design tools. We show that the Ig-VAE can be used with Rosetta to create a computational model of a SARS-CoV2-RBD binder via latent space sampling. We further demonstrate that the model's generative prior is a powerful tool for guiding computational protein design, motivating a new paradigm under which backbone design is solved as constrained optimization problem in the latent space of a generative model.
Asunto(s)
COVID-19 , ARN Viral , Humanos , Inmunoglobulinas , Proteínas/química , SARS-CoV-2RESUMEN
RNA hydrolysis presents problems in manufacturing, long-term storage, world-wide delivery and in vivo stability of messenger RNA (mRNA)-based vaccines and therapeutics. A largely unexplored strategy to reduce mRNA hydrolysis is to redesign RNAs to form double-stranded regions, which are protected from in-line cleavage and enzymatic degradation, while coding for the same proteins. The amount of stabilization that this strategy can deliver and the most effective algorithmic approach to achieve stabilization remain poorly understood. Here, we present simple calculations for estimating RNA stability against hydrolysis, and a model that links the average unpaired probability of an mRNA, or AUP, to its overall hydrolysis rate. To characterize the stabilization achievable through structure design, we compare AUP optimization by conventional mRNA design methods to results from more computationally sophisticated algorithms and crowdsourcing through the OpenVaccine challenge on the Eterna platform. We find that rational design on Eterna and the more sophisticated algorithms lead to constructs with low AUP, which we term 'superfolder' mRNAs. These designs exhibit a wide diversity of sequence and structure features that may be desirable for translation, biophysical size, and immunogenicity. Furthermore, their folding is robust to temperature, computer modeling method, choice of flanking untranslated regions, and changes in target protein sequence, as illustrated by rapid redesign of superfolder mRNAs for B.1.351, P.1 and B.1.1.7 variants of the prefusion-stabilized SARS-CoV-2 spike protein. Increases in in vitro mRNA half-life by at least two-fold appear immediately achievable.
Asunto(s)
Algoritmos , ARN Bicatenario/química , ARN Mensajero/química , ARN Viral/química , SARS-CoV-2/genética , Glicoproteína de la Espiga del Coronavirus/genética , Emparejamiento Base , Secuencia de Bases , COVID-19/prevención & control , Humanos , Hidrólisis , Estabilidad del ARN , ARN Bicatenario/genética , ARN Bicatenario/inmunología , ARN Mensajero/genética , ARN Mensajero/inmunología , ARN Viral/genética , ARN Viral/inmunología , SARS-CoV-2/inmunología , Glicoproteína de la Espiga del Coronavirus/inmunología , TermodinámicaRESUMEN
De novo protein design has succeeded in generating a large variety of globular proteins, but the construction of protein scaffolds with cavities that could accommodate large signaling molecules, cofactors, and substrates remains an outstanding challenge. The long, often flexible loops that form such cavities in many natural proteins are difficult to precisely program and thus challenging for computational protein design. Here we describe an alternative approach to this problem. We fused two stable proteins with C2 symmetry-a de novo designed dimeric ferredoxin fold and a de novo designed TIM barrel-such that their symmetry axes are aligned to create scaffolds with large cavities that can serve as binding pockets or enzymatic reaction chambers. The crystal structures of two such designs confirm the presence of a 420 cubic Ångström chamber defined by the top of the designed TIM barrel and the bottom of the ferredoxin dimer. We functionalized the scaffold by installing a metal-binding site consisting of four glutamate residues close to the symmetry axis. The protein binds lanthanide ions with very high affinity as demonstrated by tryptophan-enhanced terbium luminescence. This approach can be extended to other metals and cofactors, making this scaffold a modular platform for the design of binding proteins and biocatalysts.
Asunto(s)
Elementos de la Serie de los Lantanoides/química , Elementos de la Serie de los Lantanoides/metabolismo , Metaloproteínas/química , Metaloproteínas/metabolismo , Ingeniería de Proteínas , Sitios de Unión , Modelos Moleculares , Conformación Molecular , Unión Proteica , Dominios y Motivos de Interacción de Proteínas , Relación Estructura-ActividadRESUMEN
Interleukin-6 (IL-6) family cytokines signal through multimeric receptor complexes, providing unique opportunities to create novel ligand-based therapeutics. The cardiotrophin-like cytokine factor 1 (CLCF1) ligand has been shown to play a role in cancer, osteoporosis, and atherosclerosis. Once bound to ciliary neurotrophic factor receptor (CNTFR), CLCF1 mediates interactions to coreceptors glycoprotein 130 (gp130) and leukemia inhibitory factor receptor (LIFR). By increasing CNTFR-mediated binding to these coreceptors we generated a receptor superagonist which surpassed the potency of natural CNTFR ligands in neuronal signaling. Through additional mutations, we generated a receptor antagonist with increased binding to CNTFR but lack of binding to the coreceptors that inhibited tumor progression in murine xenograft models of nonsmall cell lung cancer. These studies further validate the CLCF1-CNTFR signaling axis as a therapeutic target and highlight an approach of engineering cytokine activity through a small number of mutations.
Asunto(s)
Subunidad alfa del Receptor del Factor Neurotrófico Ciliar/agonistas , Citocinas/metabolismo , Ingeniería de Proteínas/métodos , Animales , Sitios de Unión , Línea Celular Tumoral , Células Cultivadas , Subunidad alfa del Receptor del Factor Neurotrófico Ciliar/antagonistas & inhibidores , Subunidad alfa del Receptor del Factor Neurotrófico Ciliar/metabolismo , Receptor gp130 de Citocinas/metabolismo , Citocinas/química , Citocinas/genética , Humanos , Subunidad alfa del Receptor del Factor Inhibidor de Leucemia/metabolismo , Ligandos , Neuronas/metabolismo , Unión Proteica , Ratas , Transducción de SeñalRESUMEN
The plasticity of naturally occurring protein structures, which can change shape considerably in response to changes in environmental conditions, is critical to biological function. While computational methods have been used for de novo design of proteins that fold to a single state with a deep free-energy minimum [P.-S. Huang, S. E. Boyken, D. Baker, Nature 537, 320-327 (2016)], and to reengineer natural proteins to alter their dynamics [J. A. Davey, A. M. Damry, N. K. Goto, R. A. Chica, Nat. Chem. Biol. 13, 1280-1285 (2017)] or fold [P. A. Alexander, Y. He, Y. Chen, J. Orban, P. N. Bryan, Proc. Natl. Acad. Sci. U.S.A. 106, 21149-21154 (2009)], the de novo design of closely related sequences which adopt well-defined but structurally divergent structures remains an outstanding challenge. We designed closely related sequences (over 94% identity) that can adopt two very different homotrimeric helical bundle conformations-one short (â¼66 Å height) and the other long (â¼100 Å height)-reminiscent of the conformational transition of viral fusion proteins. Crystallographic and NMR spectroscopic characterization shows that both the short- and long-state sequences fold as designed. We sought to design bistable sequences for which both states are accessible, and obtained a single designed protein sequence that populates either the short state or the long state depending on the measurement conditions. The design of sequences which are poised to adopt two very different conformations sets the stage for creating large-scale conformational switches between structurally divergent forms.
Asunto(s)
Biología Computacional/métodos , Proteínas/química , Secuencia de Aminoácidos/genética , Aminoácidos/química , Conformación Molecular , Conformación Proteica , Ingeniería de Proteínas/métodos , Pliegue de ProteínaRESUMEN
There are 20(200) possible amino-acid sequences for a 200-residue protein, of which the natural evolutionary process has sampled only an infinitesimal subset. De novo protein design explores the full sequence space, guided by the physical principles that underlie protein folding. Computational methodology has advanced to the point that a wide range of structures can be designed from scratch with atomic-level accuracy. Almost all protein engineering so far has involved the modification of naturally occurring proteins; it should now be possible to design new functional proteins from the ground up to tackle current challenges in biomedicine and nanotechnology.
Asunto(s)
Diseño de Fármacos , Proteínas/química , Proteínas/metabolismo , Biofisica , Biología Computacional , Nanoestructuras/química , Pliegue de Proteína , Estabilidad ProteicaRESUMEN
The dodecahedron [corrected] is the largest of the Platonic solids, and icosahedral protein structures are widely used in biological systems for packaging and transport. There has been considerable interest in repurposing such structures for applications ranging from targeted delivery to multivalent immunogen presentation. The ability to design proteins that self-assemble into precisely specified, highly ordered icosahedral structures would open the door to a new generation of protein containers with properties custom-tailored to specific applications. Here we describe the computational design of a 25-nanometre icosahedral nanocage that self-assembles from trimeric protein building blocks. The designed protein was produced in Escherichia coli, and found by electron microscopy to assemble into a homogenous population of icosahedral particles nearly identical to the design model. The particles are stable in 6.7 molar guanidine hydrochloride at up to 80 degrees Celsius, and undergo extremely abrupt, but reversible, disassembly between 2 molar and 2.25 molar guanidinium thiocyanate. The dodecahedron [corrected] is robust to genetic fusions: one or two copies of green fluorescent protein (GFP) can be fused to each of the 60 subunits to create highly fluorescent 'standard candles' for use in light microscopy, and a designed protein pentamer can be placed in the centre of each of the 20 pentameric faces to modulate the size of the entrance/exit channels of the cage. Such robust and customizable nanocages should have considerable utility in targeted drug delivery, vaccine design and synthetic biology.
Asunto(s)
Diseño de Fármacos , Multimerización de Proteína , Subunidades de Proteína/química , Simulación por Computador , Microscopía por Crioelectrón , Escherichia coli/metabolismo , Proteínas Fluorescentes Verdes/química , Proteínas Fluorescentes Verdes/genética , Modelos Moleculares , Nanoestructuras/química , Nanoestructuras/ultraestructura , Estabilidad Proteica/efectos de los fármacos , Subunidades de Proteína/genética , Proteínas Recombinantes de Fusión/química , Proteínas Recombinantes de Fusión/genéticaRESUMEN
Naturally occurring, pharmacologically active peptides constrained with covalent crosslinks generally have shapes that have evolved to fit precisely into binding pockets on their targets. Such peptides can have excellent pharmaceutical properties, combining the stability and tissue penetration of small-molecule drugs with the specificity of much larger protein therapeutics. The ability to design constrained peptides with precisely specified tertiary structures would enable the design of shape-complementary inhibitors of arbitrary targets. Here we describe the development of computational methods for accurate de novo design of conformationally restricted peptides, and the use of these methods to design 18-47 residue, disulfide-crosslinked peptides, a subset of which are heterochiral and/or N-C backbone-cyclized. Both genetically encodable and non-canonical peptides are exceptionally stable to thermal and chemical denaturation, and 12 experimentally determined X-ray and NMR structures are nearly identical to the computational design models. The computational design methods and stable scaffolds presented here provide the basis for development of a new generation of peptide-based drugs.
Asunto(s)
Diseño Asistido por Computadora , Diseño de Fármacos , Péptidos/química , Péptidos/síntesis química , Estabilidad Proteica , Secuencias de Aminoácidos , Cristalografía por Rayos X , Ciclización , Disulfuros/química , Calor , Modelos Moleculares , Resonancia Magnética Nuclear Biomolecular , Péptidos/genética , Péptidos Cíclicos/química , Péptidos Cíclicos/genética , Desnaturalización Proteica , Estructura Secundaria de Proteína , Estructura Terciaria de Proteína , EstereoisomerismoRESUMEN
MOTIVATION: Recent advances in computational methods have facilitated large-scale sampling of protein structures, leading to breakthroughs in protein structural prediction and enabling de novo protein design. Establishing methods to identify candidate structures that can lead to native folds or designable structures remains a challenge, since few existing metrics capture high-level structural features such as architectures, folds and conformity to conserved structural motifs. Convolutional Neural Networks (CNNs) have been successfully used in semantic segmentation-a subfield of image classification in which a class label is predicted for every pixel. Here, we apply semantic segmentation to protein structures as a novel strategy for fold identification and structure quality assessment. RESULTS: We train a CNN that assigns each residue in a multi-domain protein to one of 38 architecture classes designated by the CATH database. Our model achieves a high per-residue accuracy of 90.8% on the test set (95.0% average per-class accuracy; 87.8% average per-structure accuracy). We demonstrate that individual class probabilities can be used as a metric that indicates the degree to which a randomly generated structure assumes a specific fold, as well as a metric that highlights non-conformative regions of a protein belonging to a known class. These capabilities yield a powerful tool for guiding structural sampling for both structural prediction and design. AVAILABILITY AND IMPLEMENTATION: The trained classifier network, parser network, and entropy calculation scripts are available for download at https://git.io/fp6bd, with detailed usage instructions provided at the download page. A step-by-step tutorial for setup is provided at https://goo.gl/e8GB2S. All Rosetta commands, RosettaRemodel blueprints, and predictions for all datasets used in the study are available in the Supplementary Information. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Redes Neurales de la Computación , Semántica , Bases de Datos Factuales , Proteínas , Programas InformáticosRESUMEN
A central question in protein evolution is the extent to which naturally occurring proteins sample the space of folded structures accessible to the polypeptide chain. Repeat proteins composed of multiple tandem copies of a modular structure unit are widespread in nature and have critical roles in molecular recognition, signalling, and other essential biological processes. Naturally occurring repeat proteins have been re-engineered for molecular recognition and modular scaffolding applications. Here we use computational protein design to investigate the space of folded structures that can be generated by tandem repeating a simple helix-loop-helix-loop structural motif. Eighty-three designs with sequences unrelated to known repeat proteins were experimentally characterized. Of these, 53 are monomeric and stable at 95 °C, and 43 have solution X-ray scattering spectra consistent with the design models. Crystal structures of 15 designs spanning a broad range of curvatures are in close agreement with the design models with root mean square deviations ranging from 0.7 to 2.5 Å. Our results show that existing repeat proteins occupy only a small fraction of the possible repeat protein sequence and structure space and that it is possible to design novel repeat proteins with precisely specified geometries, opening up a wide array of new possibilities for biomolecular engineering.
Asunto(s)
Secuencias de Aminoácidos , Bioingeniería , Simulación por Computador , Conformación Proteica , Proteínas/química , Secuencia de Aminoácidos , Cristalografía por Rayos X , Modelos Moleculares , Pliegue de Proteína , Estabilidad Proteica , TemperaturaRESUMEN
Synthetic protein switches controlled with user-defined inputs are powerful tools for studying and controlling dynamic cellular processes. To date, these approaches have relied primarily on intermolecular regulation. Here we report a computationally guided framework for engineering intramolecular regulation of protein function. We utilize this framework to develop chemically inducible activator of RAS (CIAR), a single-component RAS rheostat that directly activates endogenous RAS in response to a small molecule. Using CIAR, we show that direct RAS activation elicits markedly different RAS-ERK signaling dynamics from growth factor stimulation, and that these dynamics differ among cell types. We also found that the clinically approved RAF inhibitor vemurafenib potently primes cells to respond to direct wild-type RAS activation. These results demonstrate the utility of CIAR for quantitatively interrogating RAS signaling. Finally, we demonstrate the general utility of our approach in design of intramolecularly regulated protein tools by applying it to the Rho family of guanine nucleotide exchange factors.
Asunto(s)
Quinasas MAP Reguladas por Señal Extracelular/metabolismo , Sistema de Señalización de MAP Quinasas , Ingeniería de Proteínas , Proteínas ras/química , Proteínas ras/metabolismo , Línea Celular , Humanos , Modelos MolecularesRESUMEN
Despite efforts for over 25 years, de novo protein design has not succeeded in achieving the TIM-barrel fold. Here we describe the computational design of four-fold symmetrical (ß/α)8 barrels guided by geometrical and chemical principles. Experimental characterization of 33 designs revealed the importance of side chain-backbone hydrogen bonds for defining the strand register between repeat units. The X-ray crystal structure of a designed thermostable 184-residue protein is nearly identical to that of the designed TIM-barrel model. PSI-BLAST searches do not identify sequence similarities to known TIM-barrel proteins, and sensitive profile-profile searches indicate that the design sequence is distant from other naturally occurring TIM-barrel superfamilies, suggesting that Nature has sampled only a subset of the sequence space available to the TIM-barrel fold. The ability to design TIM barrels de novo opens new possibilities for custom-made enzymes.