RESUMO
A wooden house frame consists of many different lumber pieces, but because of the regularity of these building blocks, the structure can be designed using straightforward geometrical principles. The design of multicomponent protein assemblies, in comparison, has been much more complex, largely owing to the irregular shapes of protein structures1. Here we describe extendable linear, curved and angled protein building blocks, as well as inter-block interactions, that conform to specified geometric standards; assemblies designed using these blocks inherit their extendability and regular interaction surfaces, enabling them to be expanded or contracted by varying the number of modules, and reinforced with secondary struts. Using X-ray crystallography and electron microscopy, we validate nanomaterial designs ranging from simple polygonal and circular oligomers that can be concentrically nested, up to large polyhedral nanocages and unbounded straight 'train track' assemblies with reconfigurable sizes and geometries that can be readily blueprinted. Because of the complexity of protein structures and sequence-structure relationships, it has not previously been possible to build up large protein assemblies by deliberate placement of protein backbones onto a blank three-dimensional canvas; the simplicity and geometric regularity of our design platform now enables construction of protein nanomaterials according to 'back of an envelope' architectural blueprints.
Assuntos
Nanoestruturas , Proteínas , Cristalografia por Raios X , Nanoestruturas/química , Proteínas/química , Proteínas/metabolismo , Microscopia Eletrônica , Reprodutibilidade dos TestesRESUMO
There has been considerable recent progress in designing new proteins using deep-learning methods1-9. Despite this progress, a general deep-learning framework for protein design that enables solution of a wide range of design challenges, including de novo binder design and design of higher-order symmetric architectures, has yet to be described. Diffusion models10,11 have had considerable success in image and language generative modelling but limited success when applied to protein modelling, probably due to the complexity of protein backbone geometry and sequence-structure relationships. Here we show that by fine-tuning the RoseTTAFold structure prediction network on protein structure denoising tasks, we obtain a generative model of protein backbones that achieves outstanding performance on unconditional and topology-constrained protein monomer design, protein binder design, symmetric oligomer design, enzyme active site scaffolding and symmetric motif scaffolding for therapeutic and metal-binding protein design. We demonstrate the power and generality of the method, called RoseTTAFold diffusion (RFdiffusion), by experimentally characterizing the structures and functions of hundreds of designed symmetric assemblies, metal-binding proteins and protein binders. The accuracy of RFdiffusion is confirmed by the cryogenic electron microscopy structure of a designed binder in complex with influenza haemagglutinin that is nearly identical to the design model. In a manner analogous to networks that produce images from user-specified inputs, RFdiffusion enables the design of diverse functional proteins from simple molecular specifications.
Assuntos
Aprendizado Profundo , Proteínas , Domínio Catalítico , Microscopia Crioeletrônica , Glicoproteínas de Hemaglutininação de Vírus da Influenza/química , Glicoproteínas de Hemaglutininação de Vírus da Influenza/metabolismo , Glicoproteínas de Hemaglutininação de Vírus da Influenza/ultraestrutura , Ligação Proteica , Proteínas/química , Proteínas/metabolismo , Proteínas/ultraestruturaRESUMO
Protein crystallization plays a central role in structural biology. Despite this, the process of crystallization remains poorly understood and highly empirical, with crystal contacts, lattice packing arrangements and space group preferences being largely unpredictable. Programming protein crystallization through precisely engineered side-chain-side-chain interactions across protein-protein interfaces is an outstanding challenge. Here we develop a general computational approach for designing three-dimensional protein crystals with prespecified lattice architectures at atomic accuracy that hierarchically constrains the overall number of degrees of freedom of the system. We design three pairs of oligomers that can be individually purified, and upon mixing, spontaneously self-assemble into >100 µm three-dimensional crystals. The structures of these crystals are nearly identical to the computational design models, closely corresponding in both overall architecture and the specific protein-protein interactions. The dimensions of the crystal unit cell can be systematically redesigned while retaining the space group symmetry and overall architecture, and the crystals are extremely porous and highly stable. Our approach enables the computational design of protein crystals with high accuracy, and the designed protein crystals, which have both structural and assembly information encoded in their primary sequences, provide a powerful platform for biological materials engineering.
Assuntos
Proteínas , Proteínas/química , CristalizaçãoRESUMO
Protein nanomaterial design is an emerging discipline with applications in medicine and beyond. A long-standing design approach uses genetic fusion to join protein homo-oligomer subunits via α-helical linkers to form more complex symmetric assemblies, but this method is hampered by linker flexibility and a dearth of geometric solutions. Here, we describe a general computational method for rigidly fusing homo-oligomer and spacer building blocks to generate user-defined architectures that generates far more geometric solutions than previous approaches. The fusion junctions are then optimized using Rosetta to minimize flexibility. We apply this method to design and test 92 dihedral symmetric protein assemblies using a set of designed homodimers and repeat protein building blocks. Experimental validation by native mass spectrometry, small-angle X-ray scattering, and negative-stain single-particle electron microscopy confirms the assembly states for 11 designs. Most of these assemblies are constructed from designed ankyrin repeat proteins (DARPins), held in place on one end by α-helical fusion and on the other by a designed homodimer interface, and we explored their use for cryogenic electron microscopy (cryo-EM) structure determination by incorporating DARPin variants selected to bind targets of interest. Although the target resolution was limited by preferred orientation effects and small scaffold size, we found that the dual anchoring strategy reduced the flexibility of the target-DARPIN complex with respect to the overall assembly, suggesting that multipoint anchoring of binding domains could contribute to cryo-EM structure determination of small proteins.
Assuntos
Nanoestruturas/química , Engenharia de Proteínas , Proteínas/química , Repetição de Anquirina , Nanoestruturas/ultraestrutura , Conformação Proteica em alfa-Hélice , Proteínas/genética , Proteínas/ultraestruturaRESUMO
Biological systems have evolved efficient sensing and decision-making mechanisms to maximize fitness in changing molecular environments. Synthetic biologists have exploited these capabilities to engineer control on information and energy processing in living cells. While engineered organisms pose important technological and ethical challenges, de novo assembly of non-living biomolecular devices could offer promising avenues toward various real-world applications. However, assembling biochemical parts into functional information processing systems has remained challenging due to extensive multidimensional parameter spaces that must be sampled comprehensively in order to identify robust, specification compliant molecular implementations. We introduce a systematic methodology based on automated computational design and microfluidics enabling the programming of synthetic cell-like microreactors embedding biochemical logic circuits, or protosensors, to perform accurate biosensing and biocomputing operations in vitro according to temporal logic specifications. We show that proof-of-concept protosensors integrating diagnostic algorithms detect specific patterns of biomarkers in human clinical samples. Protosensors may enable novel approaches to medicine and represent a step toward autonomous micromachines capable of precise interfacing of human physiology or other complex biological environments, ecosystems, or industrial bioprocesses.
Assuntos
Reatores Biológicos , Técnicas Biossensoriais , Redes Reguladoras de Genes/genética , Biologia Sintética , Humanos , MicrofluídicaRESUMO
In this article we present a new kind of computing device that uses biochemical reactions networks as building blocks to implement logic gates. The architecture of a computing machine relies on these generic and composable building blocks, computation units, that can be used in multiple instances to perform complex boolean functions. Standard logical operations are implemented by biochemical networks, encapsulated and insulated within synthetic vesicles called protocells. These protocells are capable of exchanging energy and information with each other through transmembrane electron transfer. In the paradigm of computation we propose, protoputing, a machine can solve only one problem and therefore has to be built specifically. Thus, the programming phase in the standard computing paradigm is represented in our approach by the set of assembly instructions (specific attachments) that directs the wiring of the protocells that constitute the machine itself. To demonstrate the computing power of protocellular machines, we apply it to solve a NP-complete problem, known to be very demanding in computing power, the 3-SAT problem. We show how to program the assembly of a machine that can verify the satisfiability of a given boolean formula. Then we show how to use the massive parallelism of these machines to verify in less than 20 min all the valuations of the input variables and output a fluorescent signal when the formula is satisfiable or no signal at all otherwise.
Assuntos
Células Artificiais , Simulação por Computador , Biologia Sintética/métodos , Algoritmos , Elétrons , Enzimas/fisiologia , Fosforilação Oxidativa , Fotossíntese , TermodinâmicaRESUMO
In pseudocyclic proteins, such as TIM barrels, ß barrels, and some helical transmembrane channels, a single subunit is repeated in a cyclic pattern, giving rise to a central cavity that can serve as a pocket for ligand binding or enzymatic activity. Inspired by these proteins, we devised a deep-learning-based approach to broadly exploring the space of closed repeat proteins starting from only a specification of the repeat number and length. Biophysical data for 38 structurally diverse pseudocyclic designs produced in Escherichia coli are consistent with the design models, and the three crystal structures we were able to obtain are very close to the designed structures. Docking studies suggest the diversity of folds and central pockets provide effective starting points for designing small-molecule binders and enzymes.
Assuntos
Alucinações , Proteínas , Humanos , Proteínas/químicaRESUMO
As a result of evolutionary selection, the subunits of naturally occurring protein assemblies often fit together with substantial shape complementarity to generate architectures optimal for function in a manner not achievable by current design approaches. We describe a "top-down" reinforcement learning-based design approach that solves this problem using Monte Carlo tree search to sample protein conformers in the context of an overall architecture and specified functional constraints. Cryo-electron microscopy structures of the designed disk-shaped nanopores and ultracompact icosahedra are very close to the computational models. The icosohedra enable very-high-density display of immunogens and signaling molecules, which potentiates vaccine response and angiogenesis induction. Our approach enables the top-down design of complex protein nanomaterials with desired system properties and demonstrates the power of reinforcement learning in protein design.
Assuntos
Aprendizado de Máquina , Nanoestruturas , Engenharia de Proteínas , Proteínas , Microscopia Crioeletrônica , Proteínas/químicaRESUMO
A wooden house frame consists of many different lumber pieces, but because of the regularity of these building blocks, the structure can be designed using straightforward geometrical principles. The design of multicomponent protein assemblies in comparison has been much more complex, largely due to the irregular shapes of protein structures 1 . Here we describe extendable linear, curved, and angled protein building blocks, as well as inter-block interactions that conform to specified geometric standards; assemblies designed using these blocks inherit their extendability and regular interaction surfaces, enabling them to be expanded or contracted by varying the number of modules, and reinforced with secondary struts. Using X-ray crystallography and electron microscopy, we validate nanomaterial designs ranging from simple polygonal and circular oligomers that can be concentrically nested, up to large polyhedral nanocages and unbounded straight "train track" assemblies with reconfigurable sizes and geometries that can be readily blueprinted. Because of the complexity of protein structures and sequence-structure relationships, it has not been previously possible to build up large protein assemblies by deliberate placement of protein backbones onto a blank 3D canvas; the simplicity and geometric regularity of our design platform now enables construction of protein nanomaterials according to "back of an envelope" architectural blueprints.
RESUMO
Asymmetric multiprotein complexes that undergo subunit exchange play central roles in biology but present a challenge for design because the components must not only contain interfaces that enable reversible association but also be stable and well behaved in isolation. We use implicit negative design to generate ß sheet-mediated heterodimers that can be assembled into a wide variety of complexes. The designs are stable, folded, and soluble in isolation and rapidly assemble upon mixing, and crystal structures are close to the computational models. We construct linearly arranged hetero-oligomers with up to six different components, branched hetero-oligomers, closed C4-symmetric two-component rings, and hetero-oligomers assembled on a cyclic homo-oligomeric central hub and demonstrate that such complexes can readily reconfigure through subunit exchange. Our approach provides a general route to designing asymmetric reconfigurable protein systems.
Assuntos
Complexos Multiproteicos/química , Engenharia de Proteínas , Proteínas/química , Simulação por Computador , Cristalografia por Raios X , Escherichia coli/genética , Células HeLa , Humanos , Modelos Moleculares , Conformação Proteica , Conformação Proteica em Folha beta , Dobramento de Proteína , Multimerização Proteica , Estrutura Quaternária de Proteína , Subunidades ProteicasRESUMO
A systematic and robust approach to generating complex protein nanomaterials would have broad utility. We develop a hierarchical approach to designing multi-component protein assemblies from two classes of modular building blocks: designed helical repeat proteins (DHRs) and helical bundle oligomers (HBs). We first rigidly fuse DHRs to HBs to generate a large library of oligomeric building blocks. We then generate assemblies with cyclic, dihedral, and point group symmetries from these building blocks using architecture guided rigid helical fusion with new software named WORMS. X-ray crystallography and cryo-electron microscopy characterization show that the hierarchical design approach can accurately generate a wide range of assemblies, including a 43 nm diameter icosahedral nanocage. The computational methods and building block sets described here provide a very general route to de novo designed protein nanomaterials.
Assuntos
Ciência dos Materiais/métodos , Complexos Multiproteicos/ultraestrutura , Nanoestruturas/ultraestrutura , Cristalografia por Raios X , Simulação de Dinâmica Molecular , Proteínas Recombinantes/genética , Proteínas Recombinantes/isolamento & purificação , Proteínas Recombinantes/metabolismo , Proteínas Recombinantes/ultraestrutura , SoftwareRESUMO
Recent advances in synthetic posttranslational protein circuits are substantially impacting the landscape of cellular engineering and offer several advantages compared to traditional gene circuits. However, engineering dynamic phenomena such as oscillations in protein-level circuits remains an outstanding challenge. Few examples of biological posttranslational oscillators are known, necessitating theoretical progress to determine realizable oscillators. We construct mathematical models for two posttranslational oscillators, using few components that interact only through reversible binding and phosphorylation/dephosphorylation reactions. Our designed oscillators rely on the self-assembly of two protein species into multimeric functional enzymes that respectively inhibit and enhance this self-assembly. We limit our analysis to within experimental constraints, finding (i) significant portions of the restricted parameter space yielding oscillations and (ii) that oscillation periods can be tuned by several orders of magnitude using recent advances in computational protein design. Our work paves the way for the rational design and realization of protein-based dynamic systems.
RESUMO
Displaying a strong antiproliferative activity on a wide variety of cancer cells, EAPB0203 and EAPB0503 belong to the imidazo[1,2-a]quinoxalines family of imiquimod structural analogues. EAPB0503 has been shown to inhibit tubulin polymerization. The aim of the present study is to characterize the interaction of EAPB0203 and EAPB0503 with tubulin. We combine experimental approaches at the cellular and the molecular level both in vitro and in silico in order to evaluate the interaction of EAPB0203 and EAPB0503 with tubulin. We examine the influence of EAPB0203 and EAPB0503 on the cell cycle and fate, explore the binding interaction with purified tubulin, and use a computational molecular docking model to determine the binding modes to the microtubule. We then use a drug combination study with other anti-microtubule agents to compare the binding site of EAPB0203 and EAPB0503 to known potent tubulin inhibitors. We demonstrate that EAPB0203 and EAPB0503 are capable of blocking human melanoma cells in G2 and M phases and inducing cell death and apoptosis. Second, we show that EAPB0203 and EAPB0503, but also unexpectedly imiquimod, bind directly to purified tubulin and inhibit tubulin polymerization. As suggested by molecular docking and binding competition studies, we identify the colchicine binding site on ß-tubulin as the interaction pocket. Furthermore, we find that EAPB0203, EAPB0503 and imiquimod display antagonistic cytotoxic effect when combined with colchicine, and disrupt tubulin network in human melanoma cells. We conclude that EAPB0203, EAPB0503, as well as imiquimod, interact with tubulin through the colchicine binding site, and that the cytotoxic activity of EAPB0203, EAPB0503 and imiquimod is correlated to their tubulin inhibiting effect. These compounds appear as interesting anticancer drug candidates as suggested by their activity and mechanism of action, and deserve further investigation for their use in the clinic.
Assuntos
Antineoplásicos/farmacologia , Morte Celular/efeitos dos fármacos , Colchicina/farmacologia , Quinoxalinas/farmacologia , Moduladores de Tubulina/farmacologia , Tubulina (Proteína)/metabolismo , Apoptose/efeitos dos fármacos , Sítios de Ligação , Ciclo Celular/efeitos dos fármacos , Linhagem Celular Tumoral , Proliferação de Células/efeitos dos fármacos , Simulação por Computador , Humanos , Modelos Moleculares , Simulação de Acoplamento Molecular , Ligação ProteicaRESUMO
Whole-cell biosensors have several advantages for the detection of biological substances and have proven to be useful analytical tools. However, several hurdles have limited whole-cell biosensor application in the clinic, primarily their unreliable operation in complex media and low signal-to-noise ratio. We report that bacterial biosensors with genetically encoded digital amplifying genetic switches can detect clinically relevant biomarkers in human urine and serum. These bactosensors perform signal digitization and amplification, multiplexed signal processing with the use of Boolean logic gates, and data storage. In addition, we provide a framework with which to quantify whole-cell biosensor robustness in clinical samples together with a method for easily reprogramming the sensor module for distinct medical detection agendas. Last, we demonstrate that bactosensors can be used to detect pathological glycosuria in urine from diabetic patients. These next-generation whole-cell biosensors with improved computing and amplification capacity could meet clinical requirements and should enable new approaches for medical diagnosis.