RESUMO
Native cell-free transcription-translation systems offer a rapid route to characterize the regulatory elements (promoters, transcription factors) for gene expression from nonmodel microbial hosts, which can be difficult to assess through traditional in vivo approaches. One such host, Bacillus megaterium, is a giant Gram-positive bacterium with potential biotechnology applications, although many of its regulatory elements remain uncharacterized. Here, we have developed a rapid automated platform for measuring and modeling in vitro cell-free reactions and have applied this to B. megaterium to quantify a range of ribosome binding site variants and previously uncharacterized endogenous constitutive and inducible promoters. To provide quantitative models for cell-free systems, we have also applied a Bayesian approach to infer ordinary differential equation model parameters by simultaneously using time-course data from multiple experimental conditions. Using this modeling framework, we were able to infer previously unknown transcription factor binding affinities and quantify the sharing of cell-free transcription-translation resources (energy, ribosomes, RNA polymerases, nucleotides, and amino acids) using a promoter competition experiment. This allows insights into resource limiting-factors in batch cell-free synthesis mode. Our combined automated and modeling platform allows for the rapid acquisition and model-based analysis of cell-free transcription-translation data from uncharacterized microbial cell hosts, as well as resource competition within cell-free systems, which potentially can be applied to a range of cell-free synthetic biology and biotechnology applications.
Assuntos
Bacillus megaterium , Modelos Biológicos , Biossíntese de Proteínas , Transcrição Gênica , Bacillus megaterium/química , Bacillus megaterium/genética , Bacillus megaterium/metabolismo , Sistema Livre de Células/química , Sistema Livre de Células/metabolismoRESUMO
The ability to design and construct structures with atomic level precision is one of the key goals of nanotechnology. Proteins offer an attractive target for atomic design because they can be synthesized chemically or biologically and can self-assemble. However, the generalized protein folding and design problem is unsolved. One approach to simplifying the problem is to use a repetitive protein as a scaffold. Repeat proteins are intrinsically modular, and their folding and structures are better understood than large globular domains. Here, we have developed a class of synthetic repeat proteins based on the pentapeptide repeat family of beta-solenoid proteins. We have constructed length variants of the basic scaffold and computationally designed de novo loops projecting from the scaffold core. The experimentally solved 3.56-Å resolution crystal structure of one designed loop matches closely the designed hairpin structure, showing the computational design of a backbone extension onto a synthetic protein core without the use of backbone fragments from known structures. Two other loop designs were not clearly resolved in the crystal structures, and one loop appeared to be in an incorrect conformation. We have also shown that the repeat unit can accommodate whole-domain insertions by inserting a domain into one of the designed loops.
Assuntos
Peptídeos/química , Conformação Proteica , Proteínas/química , Sequências Repetitivas de Aminoácidos/genética , Sequência de Aminoácidos/genética , Cristalografia por Raios X , Peptídeos/genética , Engenharia de Proteínas , Dobramento de Proteína , Estrutura Secundária de Proteína , Proteínas/genéticaRESUMO
Plants are a tremendous source of diverse chemicals, including many natural product-derived drugs. It has recently become apparent that the genes for the biosynthesis of numerous different types of plant natural products are organized as metabolic gene clusters, thereby unveiling a highly unusual form of plant genome architecture and offering novel avenues for discovery and exploitation of plant specialized metabolism. Here we show that these clustered pathways are characterized by distinct chromatin signatures of histone 3 lysine trimethylation (H3K27me3) and histone 2 variant H2A.Z, associated with cluster repression and activation, respectively, and represent discrete windows of co-regulation in the genome. We further demonstrate that knowledge of these chromatin signatures along with chromatin mutants can be used to mine genomes for cluster discovery. The roles of H3K27me3 and H2A.Z in repression and activation of single genes in plants are well known. However, our discovery of highly localized operon-like co-regulated regions of chromatin modification is unprecedented in plants. Our findings raise intriguing parallels with groups of physically linked multi-gene complexes in animals and with clustered pathways for specialized metabolism in filamentous fungi.
Assuntos
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Cromatina/química , Regulação da Expressão Gênica de Plantas , Genoma de Planta , Histonas/genética , Família Multigênica , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , Avena/genética , Avena/metabolismo , Cromatina/metabolismo , Mapeamento Cromossômico , Sequenciamento de Nucleotídeos em Larga Escala , Histonas/metabolismo , Redes e Vias Metabólicas , Raízes de Plantas/genética , Raízes de Plantas/metabolismo , Plântula/genética , Plântula/metabolismo , Triterpenos/metabolismo , Zea mays/genética , Zea mays/metabolismoRESUMO
Cell-free transcription-translation is an expanding field in synthetic biology as a rapid prototyping platform for blueprinting the design of synthetic biological devices. Exemplar efforts include translation of prototype designs into medical test kits for on-site identification of viruses (Zika and Ebola), while gene circuit cascades can be tested, debugged and re-designed within rapid turnover times. Coupled with mathematical modelling, this discipline lends itself towards the precision engineering of new synthetic life. The next stages of cell-free look set to unlock new microbial hosts that remain slow to engineer and unsuited to rapid iterative design cycles. It is hoped that the development of such systems will provide new tools to aid the transition from cell-free prototype designs to functioning synthetic genetic circuits and engineered natural product pathways in living cells.
Assuntos
Biologia Sintética/métodos , Vias Biossintéticas , Sistema Livre de Células , Redes Reguladoras de GenesRESUMO
The computational algorithms used in the design of artificial proteins have become increasingly sophisticated in recent years, producing a series of remarkable successes. The most dramatic of these is the de novo design of artificial enzymes. The majority of these designs have reused naturally occurring protein structures as 'scaffolds' onto which novel functionality can be grafted without having to redesign the backbone structure. The incorporation of backbone flexibility into protein design is a much more computationally challenging problem due to the greatly increased search space, but promises to remove the limitations of reusing natural protein scaffolds. In this review, we outline the principles of computational protein design methods and discuss recent efforts to consider backbone plasticity in the design process.
Assuntos
Biologia Computacional/métodos , Conformação Proteica , Engenharia de Proteínas/métodos , Proteínas/química , Algoritmos , Enzimas/química , Enzimas/genética , Enzimas/metabolismo , Modelos Moleculares , Ligação Proteica , Dobramento de Proteína , Proteínas/genética , Proteínas/metabolismoRESUMO
Cell-free transcription-translation systems were originally applied towards in vitro protein production. More recently, synthetic biology is enabling these systems to be used within a systematic design context for prototyping DNA regulatory elements, genetic logic circuits and biosynthetic pathways. The Gram-positive soil bacterium, Bacillus subtilis, is an established model organism of industrial importance. To this end, we developed several B. subtilis-based cell-free systems. Our improved B. subtilis WB800N-based system was capable of producing 0.8µM GFP, which gave a ~72x fold-improvement when compared with a B. subtilis 168 cell-free system. Our improved system was applied towards the prototyping of a B. subtilis promoter library in which we engineered several promoters, derived from the wild-type Pgrac (σA) promoter, that display a range of comparable in vitro and in vivo transcriptional activities. Additionally, we demonstrate the cell-free characterisation of an inducible expression system, and the activity of a model enzyme - renilla luciferase.
Assuntos
Bacillus subtilis/genética , Proteínas de Bactérias/biossíntese , Proteínas de Bactérias/genética , Biossíntese de Proteínas/genética , Elementos Reguladores de Transcrição/genética , Transcrição Gênica/genética , Sistema Livre de Células/fisiologia , Regulação Bacteriana da Expressão Gênica/genética , Melhoramento Genético/métodos , Engenharia Metabólica/métodos , Redes e Vias Metabólicas/genéticaRESUMO
Overlap-directed DNA assembly methods allow multiple DNA parts to be assembled together in one reaction. These methods, which rely on sequence homology between the ends of DNA parts, have become widely adopted in synthetic biology, despite being incompatible with a key principle of engineering: modularity. To answer this, we present MODAL: a Modular Overlap-Directed Assembly with Linkers strategy that brings modularity to overlap-directed methods, allowing assembly of an initial set of DNA parts into a variety of arrangements in one-pot reactions. MODAL is accompanied by a custom software tool that designs overlap linkers to guide assembly, allowing parts to be assembled in any specified order and orientation. The in silico design of synthetic orthogonal overlapping junctions allows for much greater efficiency in DNA assembly for a variety of different methods compared with using non-designed sequence. In tests with three different assembly technologies, the MODAL strategy gives assembly of both yeast and bacterial plasmids, composed of up to five DNA parts in the kilobase range with efficiencies of between 75 and 100%. It also seamlessly allows mutagenesis to be performed on any specified DNA parts during the process, allowing the one-step creation of construct libraries valuable for synthetic biology applications.
Assuntos
DNA/química , Análise de Sequência de DNA , Biologia Sintética/métodos , DNA/síntese química , Escherichia coli/genética , Genes Sintéticos , Plasmídeos/genética , Reação em Cadeia da Polimerase , Saccharomyces cerevisiae/genética , Software , Biologia Sintética/normasRESUMO
Lipoteichoic acid (LTA) is an important cell wall component required for proper cell growth in many Gram-positive bacteria. In Listeria monocytogenes, two enzymes are required for the synthesis of this polyglycerolphosphate polymer. The LTA primase LtaP(Lm) initiates LTA synthesis by transferring the first glycerolphosphate (GroP) subunit onto the glycolipid anchor and the LTA synthase LtaS(Lm) extends the polymer by the repeated addition of GroP subunits to the tip of the growing chain. Here, we present the crystal structures of the enzymatic domains of LtaP(Lm) and LtaS(Lm). Although the enzymes share the same fold, substantial differences in the cavity of the catalytic site and surface charge distribution contribute to enzyme specialization. The eLtaS(Lm) structure was also determined in complex with GroP revealing a second GroP binding site. Mutational analysis confirmed an essential function for this binding site and allowed us to propose a model for the binding of the growing chain.
Assuntos
Proteínas de Bactérias/química , Parede Celular/química , Glicerofosfatos/química , Lipopolissacarídeos/biossíntese , Listeria monocytogenes/química , Ácidos Teicoicos/biossíntese , Sequência de Aminoácidos , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Domínio Catalítico , Parede Celular/enzimologia , Cristalografia por Raios X , Escherichia coli/genética , Escherichia coli/metabolismo , Glicerofosfatos/metabolismo , Listeria monocytogenes/classificação , Listeria monocytogenes/enzimologia , Modelos Moleculares , Dados de Sequência Molecular , Mutagênese Sítio-Dirigida , Filogenia , Ligação Proteica , Dobramento de Proteína , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Proteínas Recombinantes/química , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Eletricidade EstáticaRESUMO
Base excision repair (BER) is a highly conserved DNA repair pathway throughout all kingdoms from bacteria to humans. Whereas several enzymes are required to complete the multistep repair process of damaged bases, apurinic-apyrimidic (AP) endonucleases play an essential role in enabling the repair process by recognizing intermediary abasic sites cleaving the phosphodiester backbone 5' to the abasic site. Despite extensive study, there is no structure of a bacterial AP endonuclease bound to substrate DNA. Furthermore, the structural mechanism for AP-site cleavage is incomplete. Here we report a detailed structural and biochemical study of the AP endonuclease from Neisseria meningitidis that has allowed us to capture structural intermediates providing more complete snapshots of the catalytic mechanism. Our data reveal subtle differences in AP-site recognition and kinetics between the human and bacterial enzymes that may reflect different evolutionary pressures.
Assuntos
Reparo do DNA/genética , DNA Liase (Sítios Apurínicos ou Apirimidínicos)/química , DNA Liase (Sítios Apurínicos ou Apirimidínicos)/metabolismo , DNA/metabolismo , Modelos Moleculares , Neisseria meningitidis/genética , Cristalografia por Raios X , DNA/química , Furanos , Humanos , Estrutura Molecular , Neisseria meningitidis/metabolismo , Conformação Proteica , Dobramento de ProteínaRESUMO
Coarse-grained protein structure models offer increased efficiency in structural modeling, but these must be coupled with fast and accurate methods to revert to a full-atom structure. Here, we present a novel algorithm to reconstruct mainchain models from C traces. This has been parameterized by fitting Gaussian mixture models (GMMs) to short backbone fragments centered on idealized peptide bonds. The method we have developed is statistically significantly more accurate than several competing methods, both in terms of RMSD values and dihedral angle differences. The method produced Ramachandran dihedral angle distributions that are closer to that observed in real proteins and better Phaser molecular replacement log-likelihood gains. Amino acid residue sidechain reconstruction accuracy using SCWRL4 was found to be statistically significantly correlated to backbone reconstruction accuracy. Finally, the PD2 method was found to produce significantly lower energy full-atom models using Rosetta which has implications for multiscale protein modeling using coarse-grained models. A webserver and C++ source code is freely available for noncommercial use from: http://www.sbg.bio.ic.ac.uk/phyre2/PD2_ca2main/.
Assuntos
Algoritmos , Carbono/química , Simulação de Dinâmica Molecular , Proteínas/química , Software , Conformação ProteicaRESUMO
Synthetic networks require complex intertwined genetic regulation often relying on transcriptional activation or repression of target genes. CRISPRi-based transcription factors facilitate the programmable modulation of endogenous or synthetic promoter activity and the process can be optimised by using software to select appropriate gRNAs and limit non-specific gene modulation. Here, we develop a computational software pipeline, gDesigner, that enables the automated selection of orthogonal gRNAs with minimized off-target effects and promoter crosstalk. We next engineered a Lachnospiraceae bacterium Cas12a (dLbCas12a)-based repression system that downregulates target gene expression by means of steric hindrance of the cognate promoter. Finally, we generated a library of orthogonal synthetic dCas12a-repressed promoters and experimentally demonstrated it in HEK293FT, U2OS and H1299 cells lines. Our system expands the toolkit of mammalian synthetic promoters with a new complementary and orthogonal CRISPRi-based system, ultimately enabling the design of synthetic promoter libraries for multiplex gene perturbation that facilitate the understanding of complex cellular phenotypes.
Assuntos
Sistemas CRISPR-Cas , RNA Guia de Cinetoplastídeos , Animais , Expressão Gênica , Mamíferos/genética , Regiões Promotoras Genéticas/genética , RNA Guia de Cinetoplastídeos/genética , Fatores de Transcrição/genéticaRESUMO
In recent years, there have been significant advances in the field of computational protein design including the successful computational design of enzymes based on backbone scaffolds from experimentally solved structures. It is likely that large-scale sampling of protein backbone conformations will become necessary as further progress is made on more complicated systems. Removing the constraint of having to use scaffolds based on known protein backbones is a potential method of solving the problem. With this application in mind, we describe a method to systematically construct a large number of de novo backbone structures from idealized topological forms in a top-down hierarchical approach. The structural properties of these novel backbone scaffolds were analyzed and compared with a set of high-resolution experimental structures from the protein data bank (PDB). It was found that the Ramachandran plot distribution and relative gamma- and beta-turn frequencies were similar to those found in the PDB. The de novo scaffolds were sequence designed with RosettaDesign, and the energy distributions and amino acid compositions were comparable with the results for redesigned experimentally solved backbones.
Assuntos
Engenharia de Proteínas , Estrutura Secundária de Proteína , Proteínas/química , Sequência de Aminoácidos , Ligação de Hidrogênio , Modelos Moleculares , Dados de Sequência Molecular , Conformação Proteica , Dobramento de Proteína , Proteínas/genéticaRESUMO
The field of mammalian synthetic biology is expanding quickly, and technologies for engineering large synthetic gene circuits are increasingly accessible. However, for mammalian cell engineering, traditional tissue culture methods are slow and cumbersome, and are not suited for high-throughput characterization measurements. Here we have utilized mammalian cell-free protein synthesis (CFPS) assays using HeLa cell extracts and liquid handling automation as an alternative to tissue culture and flow cytometry-based measurements. Our CFPS assays take a few hours, and we have established optimized protocols for small-volume reactions using automated acoustic liquid handling technology. As a proof-of-concept, we characterized diverse types of genetic regulation in CFPS, including T7 constitutive promoter variants, internal ribosomal entry sites (IRES) constitutive translation-initiation sequence variants, CRISPR/dCas9-mediated transcription repression, and L7Ae-mediated translation repression. Our data shows simple regulatory elements for use in mammalian cells can be quickly prototyped in a CFPS model system.
Assuntos
Engenharia Celular/métodos , Engenharia Genética/métodos , Biossíntese de Proteínas/genética , Sítios de Ligação , Proteína 9 Associada à CRISPR/química , Proteína 9 Associada à CRISPR/genética , Sistemas CRISPR-Cas , Sistema Livre de Células , Escherichia coli/genética , Redes Reguladoras de Genes , Células HeLa , Humanos , Sítios Internos de Entrada Ribossomal/genética , Plasmídeos/genética , Regiões Promotoras Genéticas , Reprodutibilidade dos Testes , Biologia Sintética/métodos , Transcrição Gênica/genéticaRESUMO
Pseudomonas aeruginosa has three type VI secretion systems (T6SSs), H1-, H2-, and H3-T6SS, each belonging to a distinct group. The two T6SS components, TssB/VipA and TssC/VipB, assemble to form tubules that conserve structural/functional homology with tail sheaths of contractile bacteriophages and pyocins. Here, we used cryoelectron microscopy to solve the structure of the H1-T6SS P. aeruginosa TssB1C1 sheath at 3.3 Å resolution. Our structure allowed us to resolve some features of the T6SS sheath that were not resolved in the Vibrio cholerae VipAB and Francisella tularensis IglAB structures. Comparison with sheath structures from other contractile machines, including T4 phage and R-type pyocins, provides a better understanding of how these systems have conserved similar functions/mechanisms despite evolution. We used the P. aeruginosa R2 pyocin as a structural template to build an atomic model of the TssB1C1 sheath in its extended conformation, allowing us to propose a coiled-spring-like mechanism for T6SS sheath contraction.
Assuntos
Proteínas de Bactérias/metabolismo , Bacteriófago T4/metabolismo , Modelos Moleculares , Pseudomonas aeruginosa/metabolismo , Sistemas de Secreção Tipo VI/metabolismo , Microscopia CrioeletrônicaRESUMO
Recently developed DNA assembly methods have enabled the rapid and simultaneous assembly of multiple parts to create complex synthetic gene circuits. A number of groups have proposed the use of computationally designed orthogonal spacer sequences to guide the ordered assembly of parts using overlap-directed or homologous recombination-based methods. This approach is particularly useful for assembling multiple parts with repetitive elements. Orthogonal spacer sequences (sometimes called UNSs-unique nucleotide sequences) also have a number of other potential uses including in the design of synthetic promoters regulated by novel regulatory elements.
Assuntos
DNA/genética , Redes Reguladoras de Genes , Sequências Repetitivas de Ácido Nucleico , Software , Algoritmos , Animais , Escherichia coli/genética , Recombinação Homóloga , Humanos , Saccharomyces cerevisiae/genética , Biologia SintéticaRESUMO
The thermodynamic and kinetic stabilities of the eye lens family of betagamma-crystallins are important factors in the etiology of senile cataract. They control the chance of proteins unfolding, which can lead to aggregation and loss of transparency. betaB2-Crystallin orthologs are of low stability and comprise two typical betagamma-crystallin domains, although, uniquely, the N-terminal domain has a cysteine in one of the conserved folded beta-hairpins. Using high-temperature (500 K) molecular dynamics simulations with explicit solvent on the N-terminal domain of rodent betaB2-crystallin, we have identified in silico local flexibility in this folded beta-hairpin. We have shown in vitro using two-domain human betaB2-crystallin that replacement of this cysteine with a more usual aromatic residue (phenylalanine) results in a gain in conformational stability and a reduction in the rate of unfolding. We have used principal components analysis to visualize and cluster the coordinates from eight separate simulated unfolding trajectories of both the wild-type and the C50F mutant N-terminal domains. These data, representing fluctuations around the native well, show that although the mutant and wild-type appear to behave similarly over the early time period, the wild type appears to explore a different region of conformational space. It is proposed that the advantage of having this low-stability cysteine may be correlated with a subunit-exchange mechanism that allows betaB2-crystallin to interact with a range of other beta-crystallin subunits.
Assuntos
Cristalinas/química , Cisteína/química , Dicroísmo Circular , Cinética , Modelos Moleculares , Conformação Proteica , Desnaturação ProteicaRESUMO
Synthetic biology is principally concerned with the rational design and engineering of biologically based parts, devices, or systems. However, biological systems are generally complex and unpredictable, and are therefore, intrinsically difficult to engineer. In order to address these fundamental challenges, synthetic biology is aiming to unify a "body of knowledge" from several foundational scientific fields, within the context of a set of engineering principles. This shift in perspective is enabling synthetic biologists to address complexity, such that robust biological systems can be designed, assembled, and tested as part of a biological design cycle. The design cycle takes a forward-design approach in which a biological system is specified, modeled, analyzed, assembled, and its functionality tested. At each stage of the design cycle, an expanding repertoire of tools is being developed. In this review, we highlight several of these tools in terms of their applications and benefits to the synthetic biology community.
RESUMO
R2oDNA Designer is a web application that stochastically generates orthogonal sets of synthetic DNA sequences designed to be biologically neutral. Biologically neutral sequences may be used for directing efficient DNA assembly by overlap-directed methods, as a negative control for functional DNA, as barcodes, or potentially as spacer regions to insulate biological parts from local context. The software creates optimized sequences using a Monte Carlo simulated annealing approach followed by the elimination of sequences homologous to host genomes and commonly used biological parts. An orthogonal set is finally determined by using a network elimination algorithm. Design constraints can be defined using either a web-based graphical user interface (GUI) or uploading a file containing a set of text commands.
Assuntos
DNA/química , DNA/genética , Software , Genoma , Método de Monte Carlo , Processos Estocásticos , Interface Usuário-ComputadorRESUMO
Coarse-grained (CG) methods for sampling protein conformational space have the potential to increase computational efficiency by reducing the degrees of freedom. The gain in computational efficiency of CG methods often comes at the expense of non-protein like local conformational features. This could cause problems when transitioning to full atom models in a hierarchical framework. Here, a CG potential energy function was validated by applying it to the problem of loop prediction. A novel method to sample the conformational space of backbone atoms was benchmarked using a standard test set consisting of 351 distinct loops. This method used a sequence-independent CG potential energy function representing the protein using [Formula: see text]-carbon positions only and sampling conformations with a Monte Carlo simulated annealing based protocol. Backbone atoms were added using a method previously described and then gradient minimised in the Rosetta force field. Despite the CG potential energy function being sequence-independent, the method performed similarly to methods that explicitly use either fragments of known protein backbones with similar sequences or residue-specific [Formula: see text]/[Formula: see text]-maps to restrict the search space. The method was also able to predict with sub-Angstrom accuracy two out of seven loops from recently solved crystal structures of proteins with low sequence and structure similarity to previously deposited structures in the PDB. The ability to sample realistic loop conformations directly from a potential energy function enables the incorporation of additional geometric restraints and the use of more advanced sampling methods in a way that is not possible to do easily with fragment replacement methods and also enable multi-scale simulations for protein design and protein structure prediction. These restraints could be derived from experimental data or could be design restraints in the case of computational protein design. C++ source code is available for download from http://www.sbg.bio.ic.ac.uk/phyre2/PD2/.
Assuntos
Modelos Químicos , Proteínas/química , Cristalografia por Raios X , Método de Monte Carlo , Conformação ProteicaRESUMO
A general understanding of the complex phenomenon of protein evolution requires the accurate description of the constraints that define the sub-space of proteins with mutations that do not appreciably reduce the fitness of the organism. Such constraints can have multiple origins, in this work we present a model for constrained evolutionary trajectories represented by a markovian process throughout a set of protein-like structures artificially constructed to be topological intermediates between the structure of two natural occurring proteins. The number and type of intermediate steps defines how constrained the total evolutionary process is. By using a coarse-grained representation for the protein structures, we derive an analytic formulation of the transition rates between each of the intermediate structures. The results indicate that compact structures with a high number of hydrogen bonds are more probable and have a higher likelihood to arise during evolution. Knowledge of the transition rates allows for the study of complex evolutionary pathways represented by trajectories through a set of intermediate structures.