RESUMEN
Many growth factors and cytokines signal by binding to the extracellular domains of their receptors and driving association and transphosphorylation of the receptor intracellular tyrosine kinase domains, initiating downstream signaling cascades. To enable systematic exploration of how receptor valency and geometry affect signaling outcomes, we designed cyclic homo-oligomers with up to 8 subunits using repeat protein building blocks that can be modularly extended. By incorporating a de novo-designed fibroblast growth factor receptor (FGFR)-binding module into these scaffolds, we generated a series of synthetic signaling ligands that exhibit potent valency- and geometry-dependent Ca2+ release and mitogen-activated protein kinase (MAPK) pathway activation. The high specificity of the designed agonists reveals distinct roles for two FGFR splice variants in driving arterial endothelium and perivascular cell fates during early vascular development. Our designed modular assemblies should be broadly useful for unraveling the complexities of signaling in key developmental transitions and for developing future therapeutic applications.
Asunto(s)
Diferenciación Celular , Factores de Crecimiento de Fibroblastos , Receptores de Factores de Crecimiento de Fibroblastos , Transducción de Señal , Animales , Humanos , Receptores de Factores de Crecimiento de Fibroblastos/metabolismo , Factores de Crecimiento de Fibroblastos/metabolismo , Ratones , Ligandos , Calcio/metabolismo , Sistema de Señalización de MAP QuinasasRESUMEN
Many peptide hormones form an α-helix on binding their receptors1-4, and sensitive methods for their detection could contribute to better clinical management of disease5. De novo protein design can now generate binders with high affinity and specificity to structured proteins6,7. However, the design of interactions between proteins and short peptides with helical propensity is an unmet challenge. Here we describe parametric generation and deep learning-based methods for designing proteins to address this challenge. We show that by extending RFdiffusion8 to enable binder design to flexible targets, and to refining input structure models by successive noising and denoising (partial diffusion), picomolar-affinity binders can be generated to helical peptide targets by either refining designs generated with other methods, or completely de novo starting from random noise distributions without any subsequent experimental optimization. The RFdiffusion designs enable the enrichment and subsequent detection of parathyroid hormone and glucagon by mass spectrometry, and the construction of bioluminescence-based protein biosensors. The ability to design binders to conformationally variable targets, and to optimize by partial diffusion both natural and designed proteins, should be broadly useful.
Asunto(s)
Diseño Asistido por Computadora , Aprendizaje Profundo , Péptidos , Proteínas , Técnicas Biosensibles , Difusión , Glucagón/química , Glucagón/metabolismo , Mediciones Luminiscentes , Espectrometría de Masas , Hormona Paratiroidea/química , Hormona Paratiroidea/metabolismo , Péptidos/química , Péptidos/metabolismo , Estructura Secundaria de Proteína , Proteínas/química , Proteínas/metabolismo , Especificidad por Sustrato , Modelos MolecularesRESUMEN
Endocytosis and lysosomal trafficking of cell surface receptors can be triggered by endogenous ligands. Therapeutic approaches such as lysosome-targeting chimaeras1,2 (LYTACs) and cytokine receptor-targeting chimeras3 (KineTACs) have used this to target specific proteins for degradation by fusing modified native ligands to target binding proteins. Although powerful, these approaches can be limited by competition with native ligands and requirements for chemical modification that limit genetic encodability and can complicate manufacturing, and, more generally, there may be no native ligands that stimulate endocytosis through a given receptor. Here we describe computational design approaches for endocytosis-triggering binding proteins (EndoTags) that overcome these challenges. We present EndoTags for insulin-like growth factor 2 receptor (IGF2R) and asialoglycoprotein receptor (ASGPR), sortilin and transferrin receptors, and show that fusing these tags to soluble or transmembrane target protein binders leads to lysosomal trafficking and target degradation. As these receptors have different tissue distributions, the different EndoTags could enable targeting of degradation to different tissues. EndoTag fusion to a PD-L1 antibody considerably increases efficacy in a mouse tumour model compared to antibody alone. The modularity and genetic encodability of EndoTags enables AND gate control for higher-specificity targeted degradation, and the localized secretion of degraders from engineered cells. By promoting endocytosis, EndoTag fusion increases signalling through an engineered ligand-receptor system by nearly 100-fold. EndoTags have considerable therapeutic potential as targeted degradation inducers, signalling activators for endocytosis-dependent pathways, and cellular uptake inducers for targeted antibody-drug and antibody-RNA conjugates.
RESUMEN
A challenge for design of protein-small-molecule recognition is that incorporation of cavities with size, shape, and composition suitable for specific recognition can considerably destabilize protein monomers. This challenge can be overcome through binding pockets formed at homo-oligomeric interfaces between folded monomers. Interfaces surrounding the central homo-oligomer symmetry axes necessarily have the same symmetry and so may not be well suited to binding asymmetric molecules. To enable general recognition of arbitrary asymmetric substrates and small molecules, we developed an approach to designing asymmetric interfaces at off-axis sites on homo-oligomers, analogous to those found in native homo-oligomeric proteins such as glutamine synthetase. We symmetrically dock curved helical repeat proteins such that they form pockets at the asymmetric interface of the oligomer with sizes ranging from several angstroms, appropriate for binding a single ion, to up to more than 20 Å across. Of the 133 proteins tested, 84 had soluble expression in E. coli, 47 had correct oligomeric states in solution, 35 had small-angle X-ray scattering (SAXS) data largely consistent with design models, and 8 had negative-stain electron microscopy (nsEM) 2D class averages showing the structures coming together as designed. Both an X-ray crystal structure and a cryogenic electron microscopy (cryoEM) structure are close to the computational design models. The nature of these proteins as homo-oligomers allows them to be readily built into higher-order structures such as nanocages, and the asymmetric pockets of these structures open rich possibilities for small-molecule binder design free from the constraints associated with monomer destabilization.
Asunto(s)
Proteínas , Escherichia coli/genética , Glutamato-Amoníaco Ligasa , Proteínas/química , Dispersión del Ángulo Pequeño , Difracción de Rayos XRESUMEN
Francis Crick's global parameterization of coiled coil geometry has been widely useful for guiding design of new protein structures and functions. However, design guided by similar global parameterization of beta barrel structures has been less successful, likely due to the deviations required from ideal beta barrel geometry to maintain extensive inter-strand hydrogen bonding without introducing considerable backbone strain. Instead, beta barrels and other protein folds have been designed guided by 2D structural blueprints; while this approach has successfully generated new fluorescent proteins, transmembrane nanopores, and other structures, it requires considerable expert knowledge and provides only indirect control over the global barrel shape. Here we show that the simplicity and control over shape and structure provided by global parametric representations can be generalized beyond coiled coils by taking advantage of the rich sequence-structure relationships implicit in RoseTTAFold based inpainting and diffusion design methods. Starting from parametrically generated idealized barrel backbones, both RFjoint inpainting and RFdiffusion readily incorporate the backbone irregularities necessary for proper folding with minimal deviation from the idealized barrel geometries. We show that for beta barrels across a broad range of global beta sheet parameterizations, these methods achieve high in silico and experimental success rates, with atomic accuracy confirmed by an X-ray crystal structure of a novel beta barrel topology, and de novo designed 12, 14, and 16 stranded transmembrane nanopores with conductances ranging from 200 to 500 pS. By combining the simplicity and control of parametric generation with the high success rates of deep learning based protein design methods, our approach makes the design of proteins where global shape confers function, such as beta barrel nanopores, more precisely specifiable and accessible.
RESUMEN
Protein denoising diffusion probabilistic models are used for the de novo generation of protein backbones but are limited in their ability to guide generation of proteins with sequence-specific attributes and functional properties. To overcome this limitation, we developed ProteinGenerator (PG), a sequence space diffusion model based on RoseTTAFold that simultaneously generates protein sequences and structures. Beginning from a noised sequence representation, PG generates sequence and structure pairs by iterative denoising, guided by desired sequence and structural protein attributes. We designed thermostable proteins with varying amino acid compositions and internal sequence repeats and cage bioactive peptides, such as melittin. By averaging sequence logits between diffusion trajectories with distinct structural constraints, we designed multistate parent-child protein triples in which the same sequence folds to different supersecondary structures when intact in the parent versus split into two child domains. PG design trajectories can be guided by experimental sequence-activity data, providing a general approach for integrated computational and experimental optimization of protein function.
RESUMEN
Proteins which bind intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) with high affinity and specificity could have considerable utility for therapeutic and diagnostic applications. However, a general methodology for targeting IDPs/IDRs has yet to be developed. Here, we show that starting only from the target sequence of the input, and freely sampling both target and binding protein conformation, RFdiffusion can generate binders to IDPs and IDRs in a wide range of conformations. We use this approach to generate binders to the IDPs Amylin, C-peptide and VP48 in a range of conformations with Kds in the 3 -100nM range. The Amylin binder inhibits amyloid fibril formation and dissociates existing fibers, and enables enrichment of amylin for mass spectrometry-based detection. For the IDRs G3bp1, common gamma chain (IL2RG) and prion, we diffused binders to beta strand conformations of the targets, obtaining 10 to 100 nM affinity. The IL2RG binder colocalizes with the receptor in cells, enabling new approaches to modulating IL2 signaling. Our approach should be widely useful for creating binders to flexible IDPs/IDRs spanning a wide range of intrinsic conformational preferences.
RESUMEN
A general approach to design proteins that bind tightly and specifically to intrinsically disordered regions (IDRs) of proteins and flexible peptides would have wide application in biological research, therapeutics, and diagnosis. However, the lack of defined structures and the high variability in sequence and conformational preferences has complicated such efforts. We sought to develop a method combining biophysical principles with deep learning to readily generate binders for any disordered sequence. Instead of assuming a fixed regular structure for the target, general recognition is achieved by threading the query sequence through diverse extended binding modes in hundreds of templates with varying pocket depths and spacings, followed by RFdiffusion refinement to optimize the binder-target fit. We tested the method by designing binders to 39 highly diverse unstructured targets. Experimental testing of ~36 designs per target yielded binders with affinities better than 100 nM in 34 cases, and in the pM range in four cases. The co-crystal structure of a designed binder in complex with dynorphin A is closely consistent with the design model. All by all binding experiments for 20 designs binding diverse targets show they are highly specific for the intended targets, with no crosstalk even for the closely related dynorphin A and dynorphin B. Our approach thus could provide a general solution to the intrinsically disordered protein and peptide recognition problem.
RESUMEN
Integrin α5ß1 is crucial for cell attachment and migration in development and tissue regeneration, and α5ß1 binding proteins could have considerable utility in regenerative medicine and next-generation therapeutics. We use computational protein design to create de novo α5ß1-specific modulating miniprotein binders, called NeoNectins, that bind to and stabilize the open state of α5ß1. When immobilized onto titanium surfaces and throughout 3D hydrogels, the NeoNectins outperform native fibronectin and RGD peptide in enhancing cell attachment and spreading, and NeoNectin-grafted titanium implants outperformed fibronectin and RGD-grafted implants in animal models in promoting tissue integration and bone growth. NeoNectins should be broadly applicable for tissue engineering and biomedicine.
RESUMEN
While there has been progress in the de novo design of small globular miniproteins (50-65 residues) to bind to primarily concave regions of a target protein surface, computational design of minibinders to convex binding sites remains an outstanding challenge due to low level of overall shape complementarity. Here, we describe a general approach to generate computationally designed proteins which bind to convex target sites that employ geometrically matching concave scaffolds. We used this approach to design proteins binding to TGFßRII, CTLA-4 and PD-L1 which following experimental optimization have low nanomolar to picomolar affinities and potent biological activity. Co-crystal structures of the TGFßRII and CTLA-4 binders in complex with the receptors are in close agreement with the design models. Our approach provides a general route to generating very high affinity binders to convex protein target sites.
RESUMEN
Snakebite envenoming remains a devastating and neglected tropical disease, claiming over 100,000 lives annually and causing severe complications and long-lasting disabilities for many more1,2. Three-finger toxins (3FTx) are highly toxic components of elapid snake venoms that can cause diverse pathologies, including severe tissue damage3 and inhibition of nicotinic acetylcholine receptors (nAChRs) resulting in life-threatening neurotoxicity4. Currently, the only available treatments for snakebite consist of polyclonal antibodies derived from the plasma of immunized animals, which have high cost and limited efficacy against 3FTxs5,6,7. Here, we use deep learning methods to de novo design proteins to bind short- and long-chain α-neurotoxins and cytotoxins from the 3FTx family. With limited experimental screening, we obtain protein designs with remarkable thermal stability, high binding affinity, and near-atomic level agreement with the computational models. The designed proteins effectively neutralize all three 3FTx sub-families in vitro and protect mice from a lethal neurotoxin challenge. Such potent, stable, and readily manufacturable toxin-neutralizing proteins could provide the basis for safer, cost-effective, and widely accessible next-generation antivenom therapeutics. Beyond snakebite, our computational design methodology should help democratize therapeutic discovery, particularly in resource-limited settings, by substantially reducing costs and resource requirements for development of therapies to neglected tropical diseases.
RESUMEN
In nature, proteins that switch between two conformations in response to environmental stimuli structurally transduce biochemical information in a manner analogous to how transistors control information flow in computing devices. Designing proteins with two distinct but fully structured conformations is a challenge for protein design as it requires sculpting an energy landscape with two distinct minima. Here we describe the design of "hinge" proteins that populate one designed state in the absence of ligand and a second designed state in the presence of ligand. X-ray crystallography, electron microscopy, double electron-electron resonance spectroscopy, and binding measurements demonstrate that despite the significant structural differences the two states are designed with atomic level accuracy and that the conformational and binding equilibria are closely coupled.
Asunto(s)
Ingeniería de Proteínas , Cristalografía por Rayos X , Ligandos , Ingeniería de Proteínas/métodos , Conformación ProteicaRESUMEN
Growth factors and cytokines signal by binding to the extracellular domains of their receptors and drive association and transphosphorylation of the receptor intracellular tyrosine kinase domains, initiating downstream signaling cascades. To enable systematic exploration of how receptor valency and geometry affects signaling outcomes, we designed cyclic homo-oligomers with up to 8 subunits using repeat protein building blocks that can be modularly extended. By incorporating a de novo designed fibroblast growth-factor receptor (FGFR) binding module into these scaffolds, we generated a series of synthetic signaling ligands that exhibit potent valency- and geometry-dependent Ca2+ release and MAPK pathway activation. The high specificity of the designed agonists reveal distinct roles for two FGFR splice variants in driving endothelial and mesenchymal cell fates during early vascular development. The ability to incorporate receptor binding domains and repeat extensions in a modular fashion makes our designed scaffolds broadly useful for probing and manipulating cellular signaling pathways.
RESUMEN
Endocytosis and lysosomal trafficking of cell surface receptors can be triggered by interaction with endogenous ligands. Therapeutic approaches such as LYTAC1,2 and KineTAC3, have taken advantage of this to target specific proteins for degradation by fusing modified native ligands to target binding proteins. While powerful, these approaches can be limited by possible competition with the endogenous ligand(s), the requirement in some cases for chemical modification that limits genetic encodability and can complicate manufacturing, and more generally, there may not be natural ligands which stimulate endocytosis through a given receptor. Here we describe general protein design approaches for designing endocytosis triggering binding proteins (EndoTags) that overcome these challenges. We present EndoTags for the IGF-2R, ASGPR, Sortillin, and Transferrin receptors, and show that fusing these tags to proteins which bind to soluble or transmembrane protein leads to lysosomal trafficking and target degradation; as these receptors have different tissue distributions, the different EndoTags could enable targeting of degradation to different tissues. The modularity and genetic encodability of EndoTags enables AND gate control for higher specificity targeted degradation, and the localized secretion of degraders from engineered cells. The tunability and modularity of our genetically encodable EndoTags should contribute to deciphering the relationship between receptor engagement and cellular trafficking, and they have considerable therapeutic potential as targeted degradation inducers, signaling activators for endocytosis-dependent pathways, and cellular uptake inducers for targeted antibody drug and RNA conjugates.
RESUMEN
Transmembrane ß-barrel proteins (TMBs) are of great interest for single-molecule analytical technologies because they can spontaneously fold and insert into membranes and form stable pores, but the range of pore properties that can be achieved by repurposing natural TMBs is limited. We leverage the power of de novo computational design coupled with a "hypothesis, design, and test" approach to determine TMB design principles, notably, the importance of negative design to slow ß-sheet assembly. We design new eight-stranded TMBs, with no homology to known TMBs, that insert and fold reversibly into synthetic lipid membranes and have nuclear magnetic resonance and x-ray crystal structures very similar to the computational models. These advances should enable the custom design of pores for a wide range of applications.
Asunto(s)
Simulación por Computador , Proteínas de la Membrana/química , Modelos Moleculares , Conformación Proteica en Lámina beta , Ingeniería de Proteínas , Secuencia de Aminoácidos , Cristalografía por Rayos X , Enlace de Hidrógeno , Interacciones Hidrofóbicas e Hidrofílicas , Membrana Dobles de Lípidos , Espectroscopía de Resonancia Magnética , Membranas Artificiales , Micelas , Conformación Proteica , Pliegue de Proteína , Estabilidad ProteicaRESUMEN
The evolution of complex body plans in land plants has been paralleled by gene duplication and divergence within nuclear auxin-signaling networks. A deep mechanistic understanding of auxin signaling proteins therefore may allow rational engineering of novel plant architectures. Toward that end, we analyzed natural variation in the auxin receptor F-box family of wild accessions of the reference plant Arabidopsis thaliana and used this information to populate a structure/function map. We employed a synthetic assay to identify natural hypermorphic F-box variants and then assayed auxin-associated phenotypes in accessions expressing these variants. To more directly measure the impact of the strongest variant in our synthetic assay on auxin sensitivity, we generated transgenic plants expressing this allele. Together, our findings link evolved sequence variation to altered molecular performance and auxin sensitivity. This approach demonstrates the potential for combining synthetic biology approaches with quantitative phenotypes to harness the wealth of available sequence information and guide future engineering efforts of diverse signaling pathways.
Asunto(s)
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Evolución Molecular , Proteínas F-Box/genética , Variación Genética , Proteínas de Plantas/genética , Receptores de Superficie Celular/genética , Alelos , Arabidopsis/metabolismo , Proteínas de Arabidopsis/química , Proteínas de Arabidopsis/metabolismo , Proteínas F-Box/química , Proteínas F-Box/metabolismo , Ácidos Indolacéticos/metabolismo , Proteínas de Plantas/química , Proteínas de Plantas/metabolismo , Receptores de Superficie Celular/química , Receptores de Superficie Celular/metabolismo , Transducción de SeñalRESUMEN
Macromolecular function arises from structure, and many diseases are associated with misfolding of proteins. Molecular simulation methods can augment experimental techniques to understand misfolding and aggregation pathways with atomistic resolution, but the reliability of these predictions is a function of the parameters used for the simulation. There are many biomolecular force fields available, but most are validated using stably folded structures. Here, we present the results of molecular dynamics simulations on the intrinsically disordered amyloid ß-peptide (Aß), whose misfolding and aggregation give rise to the symptoms of Alzheimer's disease. Because of the link between secondary structure changes and pathology, being able to accurately model the structure of Aß would greatly improve our understanding of this disease, and it may facilitate application of modeling approaches to other protein misfolding disorders. To this end, we compared five popular atomistic force fields (AMBER03, CHARMM22 + CMAP, GROMOS96 53A6, GROMOS96 54A7, and OPLS-AA) to determine which could best model the structure of Aß. By comparing secondary structure content, NMR shifts, and radius of gyration to available experimental data, we conclude that AMBER03 and CHARMM22 + CMAP over-stabilize helical structure within Aß, with CHARMM22 + CMAP also producing elongated Aß structures, in conflict with experimental findings. OPLS-AA, GROMOS96 53A6, and GROMOS96 54A7 produce very similar results in terms of helical and ß-strand content, calculated NMR shifts, and radii of gyration that agree well with experimental data.