RESUMEN
Synthetic genetic circuits offer the potential to wield computational control over biology, but their complexity is limited by the accuracy of mathematical models. Here, we present advances that enable the complete encoding of an electronic chip in the DNA carried by Escherichia coli (E. coli). The chip is a binary-coded digit (BCD) to 7-segment decoder, associated with clocks and calculators, to turn on segments to visualize 0-9. Design automation is used to build seven strains, each of which contains a circuit with up to 12 repressors and two activators (totaling 63 regulators and 76,000 bp DNA). The inputs to each circuit represent the digit to be displayed (encoded in binary by four molecules), and output is the segment state, reported as fluorescence. Implementation requires an advanced gate model that captures dynamics, promoter interference, and a measure of total power usage (RNAP flux). This project is an exemplar of design automation pushing engineering beyond that achievable "by hand", essential for realizing the potential of biology.
Asunto(s)
Escherichia coli/genética , Procesamiento de Señales Asistido por Computador/instrumentación , Biología Sintética/instrumentación , Algoritmos , Inteligencia Artificial , Equipos de Almacenamiento de Computador , Diseño de EquipoRESUMEN
Genetic circuits implement computational operations within a cell. Debugging them is difficult because their function is defined by multiple states (e.g., combinations of inputs) that vary in time. Here, we develop RNA-seq methods that enable the simultaneous measurement of: (i) the states of internal gates, (ii) part performance (promoters, insulators, terminators), and (iii) impact on host gene expression. This is applied to a three-input one-output circuit consisting of three sensors, five NOR/NOT gates, and 46 genetic parts. Transcription profiles are obtained for all eight combinations of inputs, from which biophysical models can extract part activities and the response functions of sensors and gates. Various unexpected failure modes are identified, including cryptic antisense promoters, terminator failure, and a sensor malfunction due to media-induced changes in host gene expression. This can guide the selection of new parts to fix these problems, which we demonstrate by using a bidirectional terminator to disrupt observed antisense transcription. This work introduces RNA-seq as a powerful method for circuit characterization and debugging that overcomes the limitations of fluorescent reporters and scales to large systems composed of many parts.
Asunto(s)
Escherichia coli/genética , Redes Reguladoras de Genes , ARN/genética , Biología Sintética/métodos , Transcripción Genética , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Escherichia coli/efectos de los fármacos , Escherichia coli/metabolismo , Biblioteca de Genes , Elementos Aisladores , Isopropil Tiogalactósido/farmacología , Plásmidos/química , Plásmidos/metabolismo , Regiones Promotoras Genéticas , ARN/metabolismo , Proteínas Represoras/genética , Proteínas Represoras/metabolismo , Regiones Terminadoras Genéticas , TransgenesRESUMEN
We computationally designed a de novo protein-protein interaction between wild-type ubiquitin and a redesigned scaffold. Our strategy was to incorporate zinc at the designed interface to promote affinity and orientation specificity. A large set of monomeric scaffold surfaces were computationally engineered with three-residue zinc coordination sites, and the ubiquitin residue H68 was docked to the open coordination site to complete a tetrahedral zinc site. This single coordination bond was intended as a hotspot and polar interaction for ubiquitin binding, and surrounding residues on the scaffold were optimized primarily as hydrophobic residues using a rotamer-based sequence design protocol in Rosetta. From thousands of independent design simulations, four sequences were selected for experimental characterization. The best performing design, called Spelter, binds tightly to zinc (Kd < 10 nM) and binds ubiquitin with a Kd of 20 µM in the presence of zinc and 68 µM in the absence of zinc. Mutagenesis studies and nuclear magnetic resonance chemical shift perturbation experiments indicate that Spelter interacts with H68 and the target surface on ubiquitin; however, H68 does not form a hotspot as intended. Instead, mutation of H68 to alanine results in tighter binding. Although a 3/1 zinc coordination arrangement at an interface cannot be ruled out as a means to improve affinity, our study led us to conclude that 2/2 coordination arrangements or multiple-zinc designs are more likely to promote high-affinity protein interactions.
Asunto(s)
Unión Proteica , Ubiquitina/química , Zinc/química , Alanina/química , Sitios de Unión , Espectroscopía de Resonancia Magnética , Mutagénesis , Mapas de Interacción de ProteínasRESUMEN
Here we show that a recent computationally designed zinc-mediated protein interface is serendipitously capable of catalyzing carboxyester and phosphoester hydrolysis. Although the original motivation was to design a de novo zinc-mediated protein-protein interaction (called MID1-zinc), we observed in the homodimer crystal structure a small cleft and open zinc coordination site. We investigated if the cleft and zinc site at the designed interface were sufficient for formation of a primitive active site that can perform hydrolysis. MID1-zinc hydrolyzes 4-nitrophenyl acetate with a rate acceleration of 10(5) and a k(cat)/K(M) of 630 M(-1) s(-1) and 4-nitrophenyl phosphate with a rate acceleration of 10(4) and a k(cat)/K(M) of 14 M(-1) s(-1). These rate accelerations by an unoptimized active site highlight the catalytic power of zinc and suggest that the clefts formed by protein-protein interactions are well-suited for creating enzyme active sites. This discovery has implications for protein evolution and engineering: from an evolutionary perspective, three-coordinated zinc at a homodimer interface cleft represents a simple evolutionary path to nascent enzymatic activity; from a protein engineering perspective, future efforts in de novo design of enzyme active sites may benefit from exploring clefts at protein interfaces for active site placement.
Asunto(s)
Dominio Catalítico , Metaloproteínas/química , Zinc/metabolismo , Secuencia de Aminoácidos , Catálisis , Evolución Molecular , Concentración de Iones de Hidrógeno , Cinética , Metaloproteínas/metabolismo , Datos de Secuencia Molecular , Nitrofenoles/metabolismo , Compuestos Organofosforados/metabolismo , Ingeniería de ProteínasRESUMEN
Computationally designing protein-protein interactions with high affinity and desired orientation is a challenging task. Incorporating metal-binding sites at the target interface may be one approach for increasing affinity and specifying the binding mode, thereby improving robustness of designed interactions for use as tools in basic research as well as in applications from biotechnology to medicine. Here we describe a Rosetta-based approach for the rational design of a protein monomer to form a zinc-mediated, symmetric homodimer. Our metal interface design, named MID1 (NESG target ID OR37), forms a tight dimer in the presence of zinc (MID1-zinc) with a dissociation constant <30 nM. Without zinc the dissociation constant is 4 µM. The crystal structure of MID1-zinc shows good overall agreement with the computational model, but only three out of four designed histidines coordinate zinc. However, a histidine-to-glutamate point mutation resulted in four-coordination of zinc, and the resulting metal binding site and dimer orientation closely matches the computational model (Cα rmsd = 1.4 Å).
Asunto(s)
Diseño de Fármacos , Multimerización de Proteína , Proteínas/química , Zinc , Modelos Moleculares , Estructura Cuaternaria de Proteína , Proteínas/metabolismoRESUMEN
The arginine-binding protein from Thermotoga maritima (TmArgBP) is an arginine-binding component of the ATP-binding cassette (ABC) transport system in this hyperthermophilic bacterium. This protein is endowed with an extraordinary stability towards thermal and chemical denaturation. Its structural characterization may provide useful insights for the clarification of structure-stability relationships and for the design of new biosensors. Crystallization trials were set up for both arginine-bound and ligand-free forms of TmArgBP and crystals suitable for crystallographic investigations were obtained for both forms. Ordered crystals of the arginine adduct of TmArgBP could only be obtained by using the detergent LDAO as an additive to the crystallization medium. These crystals were hexagonal, with unit-cell parameters a = 78.2, c = 434.7 Å, and diffracted to 2.7 Å resolution. The crystals of the ligand-free form were orthorhombic, with unit-cell parameters a = 51.8, b = 91.9, c = 117.9 Å, and diffracted to 2.25 Å resolution.
Asunto(s)
Proteínas Bacterianas/química , Proteínas de Homeodominio/química , Thermotoga maritima/química , Arginina/química , Arginina/metabolismo , Proteínas Bacterianas/metabolismo , Proteínas Portadoras , Cristalización , Cristalografía por Rayos X , Proteínas de Homeodominio/metabolismo , Ligandos , Thermotoga maritima/metabolismoRESUMEN
Recent modeling of filopodia--the actin-based cell organelles employed for sensing and motility--reveals that one of the key limiting factors of filopodial length is diffusional transport of G-actin monomers to the polymerizing barbed ends. We have explored the possibility of active transport of G-actin by myosin motors, which would be an expected biological response to overcome the limitation of a diffusion-based process. We found that in a straightforward implementation of active transport the increase in length was unimpressive, < or = 30%, due to sequestering of G-actin by freely diffusing motors. However, artificially removing motor sequestration reactions led to approximately threefold increases in filopodial length, with the transport being mainly limited by the motors failing to detach from the filaments near the tip, clogging the cooperative conveyer belt dynamics. Making motors sterically transparent led to a qualitative change of the dynamics to a different regime of steady growth without a stationary length. Having identified sequestration and clogging as ubiquitous constraints to motor-driven transport, we devised and tested a speculative means to sidestep these limitations in filopodia by employing cross-linking and putative scaffolding roles of Ena/VASP proteins. We conclude that a naïve design of molecular-motor-based active transport would almost always be inefficient--an intricately organized kinetic scheme, with finely tuned rate constants, is required to achieve high-flux transport.
Asunto(s)
Actinas/metabolismo , Moléculas de Adhesión Celular/metabolismo , Proteínas de Microfilamentos/metabolismo , Miosinas/metabolismo , Fosfoproteínas/metabolismo , Seudópodos/metabolismo , Animales , Transporte Biológico Activo , Simulación por Computador , Difusión , Modelos Biológicos , Transporte de ProteínasRESUMEN
We used the molecular modeling program Rosetta to identify clusters of amino acid substitutions in antibody fragments (scFvs and scAbs) that improve global protein stability and resistance to thermal deactivation. Using this methodology, we increased the melting temperature (Tm) and resistance to heat treatment of an antibody fragment that binds to the Clostridium botulinum hemagglutinin protein (anti-HA33). Two designed antibody fragment variants with two amino acid replacement clusters, designed to stabilize local regions, were shown to have both higher Tm compared to the parental scFv and importantly, to retain full antigen binding activity after 2 hours of incubation at 70 °C. The crystal structure of one thermostabilized scFv variants was solved at 1.6 Å and shown to be in close agreement with the RosettaAntibody model prediction.
RESUMEN
Primordial sequence signatures in modern proteins imply ancestral origins tracing back to simple peptides. Although short peptides seldom adopt unique folds, metal ions might have templated their assembly into higher-order structures in early evolution and imparted useful chemical reactivity. Recapitulating such a biogenetic scenario, we have combined design and laboratory evolution to transform a zinc-binding peptide into a globular enzyme capable of accelerating ester cleavage with exacting enantiospecificity and high catalytic efficiency (k cat/K M ~ 106 M-1 s-1). The simultaneous optimization of structure and function in a naïve peptide scaffold not only illustrates a plausible enzyme evolutionary pathway from the distant past to the present but also proffers exciting future opportunities for enzyme design and engineering.
Asunto(s)
Enzimas/química , Metaloproteínas/química , Oligopéptidos/química , Zinc/química , Biocatálisis , Evolución Molecular Dirigida , Enzimas/ultraestructura , Ésteres/química , Evolución Molecular , Hidrólisis , Metaloproteínas/ultraestructuraRESUMEN
Computational design of proteins with altered ligand specificity is an emerging method for the creation of new biosensing systems. In this work, we investigated the outcome of site-directed mutagenesis on the Escherichia coli ribose binding protein (RBP), which is frequently used as a design scaffold for computational searches. A ribose biosensor was first constructed whereby an environmentally sensitive fluorescent probe was covalently attached to RBP at position S265C. This protein conjugate displayed a 54% decrease in emission intensity upon the addition of saturating ribose concentrations and exhibited an apparent dissociation constant (K(d) ) of 3.4 microM. Site-directed mutants within the RBP binding pocket were created and examined for ribose binding ability and overall structural stability. Because as many as 12 mutations are needed to alter ligand specificity in RBP, we measured the effect of single and multiple alanine mutations on stability and signal transduction potential of the ribose biosensor. Single alanine mutations had significant impact on both stability and signaling. Mutations of N190A and F214A each produced melting temperatures >8 degrees C below those observed for the wild-type protein. Residue Q235, located in the hinge region of RBP, appeared to be a hot spot for global protein stability as well. Additional single alanine mutations demonstrated as much as 200-fold increase in apparent K(d) but retained overall protein stability. The data collected from this study may be incorporated into design algorithms to help create more stable biosensors and optimize signal transduction properties for a variety of important analytes.
Asunto(s)
Técnicas Biosensibles , Proteínas de Escherichia coli/química , Proteínas de Unión Periplasmáticas/química , Ribosa/metabolismo , Alanina/genética , Sitios de Unión , Dicroismo Circular , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Colorantes Fluorescentes , Ligandos , Modelos Moleculares , Mutagénesis Sitio-Dirigida , Proteínas de Unión Periplasmáticas/genética , Proteínas de Unión Periplasmáticas/metabolismo , Unión Proteica , Espectrometría de FluorescenciaRESUMEN
DNAplotlib ( www.dnaplotlib.org ) is a computational toolkit for the programmable visualization of highly customizable, standards-compliant genetic designs. Functions are provided to aid with both visualization tasks and to extract and overlay associated experimental data. High-quality output is produced in the form of vector-based PDFs, rasterized images, and animated movies. All aspects of the rendering process can be easily customized or extended by the user to cover new forms of genetic part or regulation. DNAplotlib supports improved communication of genetic design information and offers new avenues for static, interactive and dynamic visualizations that map and explore the links between the structure and function of genetic parts, devices and systems; including metabolic pathways and genetic circuits. DNAplotlib is cross-platform software developed using Python.
Asunto(s)
Biología Computacional/métodos , Programas Informáticos , Biología Sintética/métodos , Redes y Vías Metabólicas/genética , Interfaz Usuario-ComputadorRESUMEN
Structure-based protein design tests our understanding of the minimal determinants of protein structure and function. Previous studies have demonstrated that placing zinc binding amino acids (His, Glu, Asp or Cys) near each other in a folded protein in an arrangement predicted to be tetrahedral is often sufficient to achieve binding to zinc. However, few designs have been characterized with high-resolution structures. Here, we use X-ray crystallography, binding studies and mutation analysis to evaluate three alternative strategies for designing zinc binding sites with the molecular modeling program Rosetta. While several of the designs were observed to bind zinc, crystal structures of two designs reveal binding configurations that differ from the design model. In both cases, the modeling did not accurately capture the presence or absence of second-shell hydrogen bonds critical in determining binding-site structure. Efforts to more explicitly design second-shell hydrogen bonds were largely unsuccessful as evidenced by mutation analysis and low expression of proteins engineered with extensive primary and secondary networks. Our results suggest that improved methods for designing interaction networks will be needed for creating metal binding sites with high accuracy.
Asunto(s)
Modelos Moleculares , Ingeniería de Proteínas/métodos , Proteínas/química , Proteínas/metabolismo , Zinc/metabolismo , Sitios de Unión , Cristalografía por Rayos X , Enlace de Hidrógeno , Mutación Puntual , Conformación Proteica , Proteínas/genéticaRESUMEN
Computation can be performed in living cells by DNA-encoded circuits that process sensory information and control biological functions. Their construction is time-intensive, requiring manual part assembly and balancing of regulator expression. We describe a design environment, Cello, in which a user writes Verilog code that is automatically transformed into a DNA sequence. Algorithms build a circuit diagram, assign and connect gates, and simulate performance. Reliable circuit design requires the insulation of gates from genetic context, so that they function identically when used in different circuits. We used Cello to design 60 circuits forEscherichia coli(880,000 base pairs of DNA), for which each DNA sequence was built as predicted by the software with no additional tuning. Of these, 45 circuits performed correctly in every output state (up to 10 regulators and 55 parts), and across all circuits 92% of the output states functioned as predicted. Design automation simplifies the incorporation of genetic circuits into biotechnology projects that require decision-making, control, sensing, or spatial organization.
Asunto(s)
Biotecnología , ADN/genética , Escherichia coli/genética , Redes Reguladoras de Genes , Algoritmos , Emparejamiento Base , Secuencia de Bases , Lenguajes de Programación , Programas Informáticos , Biología SintéticaRESUMEN
The de novo design of proteins is a rigorous test of our understanding of the key determinants of protein structure. The helix bundle is an interesting de novo design model system due to the diverse topologies that can be generated from a few simple α-helices. Previously, noncomputational studies demonstrated that connecting amphipathic helices together with short loops can sometimes generate helix bundle proteins, regardless of the bundle's exact sequence. However, using such methods, the precise positions of helices and side chains cannot be predetermined. Since protein function depends on exact positioning of residues, we examined if sequence design tools in the program Rosetta could be used to design a four-helix bundle with a predetermined structure. Helix position was specified using a folding procedure that constrained the design model to a defined topology, and iterative rounds of rotamer-based sequence design and backbone refinement were used to identify a low energy sequence for characterization. The designed protein, DND_4HB, unfolds cooperatively (Tm >90°C) and a NMR solution structure shows that it adopts the target helical bundle topology. Helices 2, 3, and 4 agree very closely with the design model (backbone RMSD = 1.11 Å) and >90% of the core side chain χ1 and χ2 angles are correctly predicted. Helix 1 lies in the target groove against the other helices, but is displaced 3 Å along the bundle axis. This result highlights the potential of computational design to create bundles with atomic-level precision, but also points at remaining challenges for achieving specific positioning between amphipathic helices.
Asunto(s)
Ingeniería de Proteínas/métodos , Proteínas Recombinantes/química , Secuencia de Aminoácidos , Clonación Molecular , Modelos Moleculares , Datos de Secuencia Molecular , Resonancia Magnética Nuclear Biomolecular , Conformación Proteica , Pliegue de Proteína , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismoRESUMEN
There has been significant recent progress in the computational design of protein interactions including the creation of novel heterodimers, homodimers, nanohedra, fibril caps and a protein crystal. Essential to these successes has been the use of innovative strategies for finding binding modes that are achievable, that is, identifying binding partners and docked conformations that can be successfully stabilized via sequence optimization and backbone refinement. In many cases this has involved the use of structural motifs commonly found at naturally occurring interfaces including alpha helices inserted into hydrophobic grooves, beta-strand pairing, metal binding, established helix packing motifs, and the use of symmetry to form cooperative interactions. Future challenges include the creation of hydrogen bond networks and antibody-like interactions based on the redesign of protein surface loops.
Asunto(s)
Ingeniería de Proteínas/métodos , Multimerización de Proteína , Proteínas/química , Proteínas/síntesis química , Sitios de Unión , Enlace de Hidrógeno , Interacciones Hidrofóbicas e Hidrofílicas , Modelos Moleculares , Unión Proteica , Conformación Proteica , Electricidad EstáticaRESUMEN
Reengineering protein surfaces to exhibit high net charge, referred to as "supercharging", can improve reversibility of unfolding by preventing aggregation of partially unfolded states. Incorporation of charged side chains should be optimized while considering structural and energetic consequences, as numerous mutations and accumulation of like-charges can also destabilize the native state. A previously demonstrated approach deterministically mutates flexible polar residues (amino acids DERKNQ) with the fewest average neighboring atoms per side chain atom (AvNAPSA). Our approach uses Rosetta-based energy calculations to choose the surface mutations. Both protocols are available for use through the ROSIE web server. The automated Rosetta and AvNAPSA approaches for supercharging choose dissimilar mutations, raising an interesting division in surface charging strategy. Rosetta-supercharged variants of GFP (RscG) ranging from -11 to -61 and +7 to +58 were experimentally tested, and for comparison, we re-tested the previously developed AvNAPSA-supercharged variants of GFP (AscG) with +36 and -30 net charge. Mid-charge variants demonstrated â¼3-fold improvement in refolding with retention of stability. However, as we pushed to higher net charges, expression and soluble yield decreased, indicating that net charge or mutational load may be limiting factors. Interestingly, the two different approaches resulted in GFP variants with similar refolding properties. Our results show that there are multiple sets of residues that can be mutated to successfully supercharge a protein, and combining alternative supercharge protocols with experimental testing can be an effective approach for charge-based improvement to refolding.
Asunto(s)
Aminoácidos/química , Proteínas Fluorescentes Verdes/química , Ingeniería de Proteínas , Programas Informáticos , Secuencia de Aminoácidos , Aminoácidos/genética , Animales , Cnidarios , Proteínas Fluorescentes Verdes/genética , Enlace de Hidrógeno , Modelos Moleculares , Datos de Secuencia Molecular , Mutación , Conformación Proteica , Estabilidad Proteica , Desplegamiento Proteico , Electricidad Estática , TermodinámicaRESUMEN
The Rosetta molecular modeling software package provides experimentally tested and rapidly evolving tools for the 3D structure prediction and high-resolution design of proteins, nucleic acids, and a growing number of non-natural polymers. Despite its free availability to academic users and improving documentation, use of Rosetta has largely remained confined to developers and their immediate collaborators due to the code's difficulty of use, the requirement for large computational resources, and the unavailability of servers for most of the Rosetta applications. Here, we present a unified web framework for Rosetta applications called ROSIE (Rosetta Online Server that Includes Everyone). ROSIE provides (a) a common user interface for Rosetta protocols, (b) a stable application programming interface for developers to add additional protocols, (c) a flexible back-end to allow leveraging of computer cluster resources shared by RosettaCommons member institutions, and (d) centralized administration by the RosettaCommons to ensure continuous maintenance. This paper describes the ROSIE server infrastructure, a step-by-step 'serverification' protocol for use by Rosetta developers, and the deployment of the first nine ROSIE applications by six separate developer teams: Docking, RNA de novo, ERRASER, Antibody, Sequence Tolerance, Supercharge, Beta peptide design, NCBB design, and VIP redesign. As illustrated by the number and diversity of these applications, ROSIE offers a general and speedy paradigm for serverification of Rosetta applications that incurs negligible cost to developers and lowers barriers to Rosetta use for the broader biological community. ROSIE is available at http://rosie.rosettacommons.org.
Asunto(s)
Internet , Modelos Moleculares , Programas Informáticos , Interfaz Usuario-Computador , Simulación de Dinámica MolecularRESUMEN
Mutation of surface residues to charged amino acids increases resistance to aggregation and can enable reversible unfolding. We have developed a protocol using the Rosetta computational design package that "supercharges" proteins while considering the energetic implications of each mutation. Using a homology model, a single-chain variable fragment antibody was designed that has a markedly enhanced resistance to thermal inactivation and displays an unanticipated ≈30-fold improvement in affinity. Such supercharged antibodies should prove useful for assays in resource-limited settings and for developing reagents with improved shelf lives.
Asunto(s)
Anticuerpos de Cadena Única/química , Enlace de Hidrógeno , Ingeniería de Proteínas , Pliegue de Proteína , Estructura Terciaria de Proteína , Anticuerpos de Cadena Única/metabolismo , Programas Informáticos , TemperaturaRESUMEN
Members of the periplasmic binding protein superfamily are involved in the selective passage of ligands through bacterial cell membranes. The hyperthermophilic eubacterium Thermotoga maritima was found to encode a highly stable and specific periplasmic arginine-binding protein (TM0593). Following signal sequence removal and overexpression in Escherichia coli, TM0593 was purified by thermoprecipitation and affinity chromatography. The ultra-stable protein with a monomeric molecular weight of 27.7 kDa was found to exist as both a homodimer and homotrimer at appreciable concentrations even under strongly denaturing conditions, with an estimated transition temperature of 116 degrees C. Its multimeric structure may provide further evidence of the importance of quaternary structure in the movement of nutrients across bacterial membranes. Purified and refolded TM0593 was further characterized by fluorescence spectroscopy, mass spectrometry, and circular dichroism to demonstrate the specificity of the protein for arginine and to elucidate structural changes associated with arginine binding. The protein binds arginine with a dissociation constant of 20 muM as determined by surface plasmon resonance measurements. Due to its high thermodynamic stability, TM0593 may serve as a scaffold for the creation of a robust fluorescent biosensor.