Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 59
Filtrar
1.
Nucleic Acids Res ; 2024 Jun 12.
Artigo em Inglês | MEDLINE | ID: mdl-38864396

RESUMO

Model-guided DNA sequence design can accelerate the reprogramming of living cells. It allows us to engineer more complex biological systems by removing the need to physically assemble and test each potential design. While mechanistic models of gene expression have seen some success in supporting this goal, data-centric, deep learning-based approaches often provide more accurate predictions. This accuracy, however, comes at a cost - a lack of generalization across genetic and experimental contexts that has limited their wider use outside the context in which they were trained. Here, we address this issue by demonstrating how a simple transfer learning procedure can effectively tune a pre-trained deep learning model to predict protein translation rate from 5' untranslated region (5'UTR) sequence for diverse contexts in Escherichia coli using a small number of new measurements. This allows for important model features learnt from expensive massively parallel reporter assays to be easily transferred to new settings. By releasing our trained deep learning model and complementary calibration procedure, this study acts as a starting point for continually refined model-based sequence design that builds on previous knowledge and future experimental efforts.

2.
Bioinformatics ; 39(5)2023 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-37084251

RESUMO

MOTIVATION: The ability to measure the phenotype of millions of different genetic designs using Massively Parallel Reporter Assays (MPRAs) has revolutionized our understanding of genotype-to-phenotype relationships and opened avenues for data-centric approaches to biological design. However, our knowledge of how best to design these costly experiments and the effect that our choices have on the quality of the data produced is lacking. RESULTS: In this article, we tackle the issues of data quality and experimental design by developing FORECAST, a Python package that supports the accurate simulation of cell-sorting and sequencing-based MPRAs and robust maximum likelihood-based inference of genetic design function from MPRA data. We use FORECAST's capabilities to reveal rules for MPRA experimental design that help ensure accurate genotype-to-phenotype links and show how the simulation of MPRA experiments can help us better understand the limits of prediction accuracy when this data are used for training deep learning-based classifiers. As the scale and scope of MPRAs grows, tools like FORECAST will help ensure we make informed decisions during their development and the most of the data produced. AVAILABILITY AND IMPLEMENTATION: The FORECAST package is available at: https://gitlab.com/Pierre-Aurelien/forecast. Code for the deep learning analysis performed in this study is available at: https://gitlab.com/Pierre-Aurelien/rebeca.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Software , Funções Verossimilhança , Análise de Sequência de DNA , Bioensaio
3.
PLoS Comput Biol ; 19(4): e1010988, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-37079494

RESUMO

Mechanistic models have been used for centuries to describe complex interconnected processes, including biological ones. As the scope of these models has widened, so have their computational demands. This complexity can limit their suitability when running many simulations or when real-time results are required. Surrogate machine learning (ML) models can be used to approximate the behaviour of complex mechanistic models, and once built, their computational demands are several orders of magnitude lower. This paper provides an overview of the relevant literature, both from an applicability and a theoretical perspective. For the latter, the paper focuses on the design and training of the underlying ML models. Application-wise, we show how ML surrogates have been used to approximate different mechanistic models. We present a perspective on how these approaches can be applied to models representing biological processes with potential industrial applications (e.g., metabolism and whole-cell modelling) and show why surrogate ML models may hold the key to making the simulation of complex biological systems possible using a typical desktop computer.


Assuntos
Aprendizado de Máquina , Modelos Biológicos , Simulação por Computador
4.
PLoS Comput Biol ; 19(12): e1011652, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38060459

RESUMO

Information is the cornerstone of research, from experimental (meta)data and computational processes to complex inventories of reagents and equipment. These 10 simple rules discuss best practices for leveraging laboratory information management systems to transform this large information load into useful scientific findings.

5.
Nucleic Acids Res ; 49(13): 7775-7790, 2021 07 21.
Artigo em Inglês | MEDLINE | ID: mdl-34197613

RESUMO

CRISPR Cas12a is an RNA-programmable endonuclease particularly suitable for gene regulation. This is due to its preference for T-rich PAMs that allows it to more easily target AT-rich promoter sequences, and built-in RNase activity which can process a single CRISPR RNA array encoding multiple spacers into individual guide RNAs (gRNAs), thereby simplifying multiplexed gene regulation. Here, we develop a flexible dCas12a-based CRISPRi system for Saccharomyces cerevisiae and systematically evaluate its design features. This includes the role of the NLS position, use of repression domains, and the position of the gRNA target. Our optimal system is comprised of dCas12a E925A with a single C-terminal NLS and a Mxi1 or a MIG1 repression domain, which enables up to 97% downregulation of a reporter gene. We also extend this system to allow for inducible regulation via an RNAP II-controlled promoter, demonstrate position-dependent effects in crRNA arrays, and use multiplexed regulation to stringently control a heterologous ß-carotene pathway. Together these findings offer valuable insights into the design constraints of dCas12a-based CRISPRi and enable new avenues for flexible and efficient gene regulation in S. cerevisiae.


Assuntos
Proteínas Associadas a CRISPR/química , Sistemas CRISPR-Cas , Endodesoxirribonucleases/química , Regulação da Expressão Gênica , Saccharomyces cerevisiae/genética , Proteínas Associadas a CRISPR/genética , Proteínas Associadas a CRISPR/metabolismo , Regulação para Baixo , Endodesoxirribonucleases/genética , Endodesoxirribonucleases/metabolismo , Proteínas de Fluorescência Verde/genética , Sinais de Localização Nuclear , Regiões Promotoras Genéticas , Domínios Proteicos , RNA/metabolismo , RNA Polimerase II/metabolismo , beta Caroteno/biossíntese
6.
Ecol Lett ; 25(12): 2753-2775, 2022 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-36264848

RESUMO

High-resolution monitoring is fundamental to understand ecosystems dynamics in an era of global change and biodiversity declines. While real-time and automated monitoring of abiotic components has been possible for some time, monitoring biotic components-for example, individual behaviours and traits, and species abundance and distribution-is far more challenging. Recent technological advancements offer potential solutions to achieve this through: (i) increasingly affordable high-throughput recording hardware, which can collect rich multidimensional data, and (ii) increasingly accessible artificial intelligence approaches, which can extract ecological knowledge from large datasets. However, automating the monitoring of facets of ecological communities via such technologies has primarily been achieved at low spatiotemporal resolutions within limited steps of the monitoring workflow. Here, we review existing technologies for data recording and processing that enable automated monitoring of ecological communities. We then present novel frameworks that combine such technologies, forming fully automated pipelines to detect, track, classify and count multiple species, and record behavioural and morphological traits, at resolutions which have previously been impossible to achieve. Based on these rapidly developing technologies, we illustrate a solution to one of the greatest challenges in ecology: the ability to rapidly generate high-resolution, multidimensional and standardised data across complex ecologies.


Assuntos
Inteligência Artificial , Ecossistema , Biodiversidade , Biota
7.
Nat Methods ; 15(5): 387-393, 2018 05.
Artigo em Inglês | MEDLINE | ID: mdl-29578536

RESUMO

Cells use feedback regulation to ensure robust growth despite fluctuating demands for resources and differing environmental conditions. However, the expression of foreign proteins from engineered constructs is an unnatural burden that cells are not adapted for. Here we combined RNA-seq with an in vivo assay to identify the major transcriptional changes that occur in Escherichia coli when inducible synthetic constructs are expressed. We observed that native promoters related to the heat-shock response activated expression rapidly in response to synthetic expression, regardless of the construct. Using these promoters, we built a dCas9-based feedback-regulation system that automatically adjusts the expression of a synthetic construct in response to burden. Cells equipped with this general-use controller maintained their capacity for native gene expression to ensure robust growth and thus outperformed unregulated cells in terms of protein yield in batch production. This engineered feedback is to our knowledge the first example of a universal, burden-based biomolecular control system and is modular, tunable and portable.


Assuntos
Escherichia coli/fisiologia , Regulação Bacteriana da Expressão Gênica/fisiologia , Biologia Sintética , Escherichia coli/genética , Sequenciamento de Nucleotídeos em Larga Escala , Plasmídeos , Regiões Promotoras Genéticas , Análise de Sequência de RNA , Transcrição Gênica
8.
Mol Syst Biol ; 16(8): e9584, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-32812710

RESUMO

Genetic circuits have many applications, from guiding living therapeutics to ordering process in a bioreactor, but to be useful they have to be genetically stable and not hinder the host. Encoding circuits in the genome reduces burden, but this decreases performance and can interfere with native transcription. We have designed genomic landing pads in Escherichia coli at high-expression sites, flanked by ultrastrong double terminators. DNA payloads >8 kb are targeted to the landing pads using phage integrases. One landing pad is dedicated to carrying a sensor array, and two are used to carry genetic circuits. NOT/NOR gates based on repressors are optimized for the genome and characterized in the landing pads. These data are used, in conjunction with design automation software (Cello 2.0), to design circuits that perform quantitatively as predicted. These circuits require fourfold less RNA polymerase than when carried on a plasmid and are stable for weeks in a recA+ strain without selection. This approach enables the design of synthetic regulatory networks to guide cells in environments or for applications where plasmid use is infeasible.


Assuntos
Escherichia coli/genética , Redes Reguladoras de Genes , Engenharia Genética/métodos , RNA Polimerases Dirigidas por DNA/metabolismo , Proteínas de Escherichia coli/genética , Plasmídeos/genética , Software , Biologia Sintética
9.
Mol Syst Biol ; 15(5): e8719, 2019 05 03.
Artigo em Inglês | MEDLINE | ID: mdl-31053575

RESUMO

Translation of mRNAs into proteins is a key cellular process. Ribosome binding sites and stop codons provide signals to initiate and terminate translation, while stable secondary mRNA structures can induce translational recoding events. Fluorescent proteins are commonly used to characterize such elements but require the modification of a part's natural context and allow only a few parameters to be monitored concurrently. Here, we combine Ribo-seq with quantitative RNA-seq to measure at nucleotide resolution and in absolute units the performance of elements controlling transcriptional and translational processes during protein synthesis. We simultaneously measure 779 translation initiation rates and 750 translation termination efficiencies across the Escherichia coli transcriptome, in addition to translational frameshifting induced at a stable RNA pseudoknot structure. By analyzing the transcriptional and translational response, we discover that sequestered ribosomes at the pseudoknot contribute to a σ32-mediated stress response, codon-specific pausing, and a drop in translation initiation rates across the cell. Our work demonstrates the power of integrating global approaches toward a comprehensive and quantitative understanding of gene regulation and burden in living cells.


Assuntos
Regulação Bacteriana da Expressão Gênica , RNA Mensageiro/metabolismo , Ribossomos/metabolismo , Análise de Sequência de DNA , Análise de Sequência de RNA , Códon , Escherichia coli/genética , Escherichia coli/metabolismo , Perfilação da Expressão Gênica , Genoma Bacteriano , Conformação de Ácido Nucleico , Regiões Promotoras Genéticas , Biossíntese de Proteínas , RNA Mensageiro/genética , Transcriptoma
10.
Nucleic Acids Res ; 45(3): 1553-1565, 2017 02 17.
Artigo em Inglês | MEDLINE | ID: mdl-28007941

RESUMO

Genetic designs can consist of dozens of genes and hundreds of genetic parts. After evaluating a design, it is desirable to implement changes without the cost and burden of starting the construction process from scratch. Here, we report a two-step process where a large design space is divided into deep pools of composite parts, from which individuals are retrieved and assembled to build a final construct. The pools are built via multiplexed assembly and sequenced using next-generation sequencing. Each pool consists of ∼20 Mb of up to 5000 unique and sequence-verified composite parts that are barcoded for retrieval by PCR. This approach is applied to a 16-gene nitrogen fixation pathway, which is broken into pools containing a total of 55 848 composite parts (71.0 Mb). The pools encompass an enormous design space (1043 possible 23 kb constructs), from which an algorithm-guided 192-member 4.5 Mb library is built. Next, all 1030 possible genetic circuits based on 10 repressors (NOR/NOT gates) are encoded in pools where each repressor is fused to all permutations of input promoters. These demonstrate that multiplexing can be applied to encompass entire design spaces from which individuals can be accessed and evaluated.


Assuntos
Algoritmos , Redes Reguladoras de Genes , Engenharia Genética/métodos , Escherichia coli/genética , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Klebsiella/genética , Klebsiella/metabolismo , Fixação de Nitrogênio/genética , Nitrogenase/genética , Nitrogenase/metabolismo , Regiões Promotoras Genéticas
11.
Mol Syst Biol ; 13(11): 952, 2017 11 09.
Artigo em Inglês | MEDLINE | ID: mdl-29122925

RESUMO

Genetic circuits implement computational operations within a cell. Debugging them is difficult because their function is defined by multiple states (e.g., combinations of inputs) that vary in time. Here, we develop RNA-seq methods that enable the simultaneous measurement of: (i) the states of internal gates, (ii) part performance (promoters, insulators, terminators), and (iii) impact on host gene expression. This is applied to a three-input one-output circuit consisting of three sensors, five NOR/NOT gates, and 46 genetic parts. Transcription profiles are obtained for all eight combinations of inputs, from which biophysical models can extract part activities and the response functions of sensors and gates. Various unexpected failure modes are identified, including cryptic antisense promoters, terminator failure, and a sensor malfunction due to media-induced changes in host gene expression. This can guide the selection of new parts to fix these problems, which we demonstrate by using a bidirectional terminator to disrupt observed antisense transcription. This work introduces RNA-seq as a powerful method for circuit characterization and debugging that overcomes the limitations of fluorescent reporters and scales to large systems composed of many parts.


Assuntos
Escherichia coli/genética , Redes Reguladoras de Genes , RNA/genética , Biologia Sintética/métodos , Transcrição Gênica , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Escherichia coli/efeitos dos fármacos , Escherichia coli/metabolismo , Biblioteca Gênica , Elementos Isolantes , Isopropiltiogalactosídeo/farmacologia , Plasmídeos/química , Plasmídeos/metabolismo , Regiões Promotoras Genéticas , RNA/metabolismo , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo , Regiões Terminadoras Genéticas , Transgenes
12.
Biochem Soc Trans ; 45(3): 793-803, 2017 06 15.
Artigo em Inglês | MEDLINE | ID: mdl-28620041

RESUMO

A synthetic biology workflow is composed of data repositories that provide information about genetic parts, sequence-level design tools to compose these parts into circuits, visualization tools to depict these designs, genetic design tools to select parts to create systems, and modeling and simulation tools to evaluate alternative design choices. Data standards enable the ready exchange of information within such a workflow, allowing repositories and tools to be connected from a diversity of sources. The present paper describes one such workflow that utilizes, among others, the Synthetic Biology Open Language (SBOL) to describe genetic designs, the Systems Biology Markup Language to model these designs, and SBOL Visual to visualize these designs. We describe how a standard-enabled workflow can be used to produce types of design information, including multiple repositories and software tools exchanging information using a variety of data standards. Recently, the ACS Synthetic Biology journal has recommended the use of SBOL in their publications.


Assuntos
Biologia Sintética/métodos , Fluxo de Trabalho , Modelos Biológicos , Software
13.
Nucleic Acids Res ; 43(6): 3022-32, 2015 Mar 31.
Artigo em Inglês | MEDLINE | ID: mdl-25765653

RESUMO

Translation of protein from mRNA is a complex multi-step process that occurs at a non-uniform rate. Variability in ribosome speed along an mRNA enables refinement of the proteome and plays a critical role in protein biogenesis. Detailed single protein studies have found both tRNA abundance and mRNA secondary structure as key modulators of translation elongation rate, but recent genome-wide ribosome profiling experiments have not observed significant influence of either on translation efficiency. Here we provide evidence that this results from an inherent trade-off between these factors. We find codons pairing to high-abundance tRNAs are preferentially used in regions of high secondary structure content, while codons read by significantly less abundant tRNAs are located in lowly structured regions. By considering long stretches of high and low mRNA secondary structure in Saccharomyces cerevisiae and Escherichia coli and comparing them to randomized-gene models and experimental expression data, we were able to distinguish clear selective pressures and increased protein expression for specific codon choices. The trade-off between secondary structure and tRNA-concentration based codon choice allows for compensation of their independent effects on translation, helping to smooth overall translational speed and reducing the chance of potentially detrimental points of excessively slow or fast ribosome movement.


Assuntos
RNA Mensageiro/química , RNA Mensageiro/metabolismo , RNA de Transferência/metabolismo , Códon/genética , Escherichia coli K12/genética , Escherichia coli K12/metabolismo , Proteínas de Escherichia coli/biossíntese , Proteínas de Escherichia coli/genética , Cinética , Conformação de Ácido Nucleico , Elongação Traducional da Cadeia Peptídica , RNA Bacteriano/química , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , RNA Fúngico/química , RNA Fúngico/genética , RNA Fúngico/metabolismo , RNA Mensageiro/genética , RNA de Transferência/genética , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/biossíntese , Proteínas de Saccharomyces cerevisiae/genética
14.
Nucleic Acids Res ; 41(17): 8021-33, 2013 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-23842674

RESUMO

The synthesis of protein from messenger RNA during translation is a highly dynamic process that plays a key role in controlling the efficiency and fidelity of genome-wide protein expression. The availability of aminoacylated transfer RNA (tRNA) is a major factor influencing the speed of ribosomal movement, which depending on codon choices, varies considerably along a transcript. Furthermore, it has been shown experimentally that tRNA availability can vary significantly under different growth and stress conditions, offering the cell a way to adapt translational dynamics across the genome. Existing models of translation have neglected fluctuations of tRNA pools, instead assuming fixed tRNA availabilities over time. This has lead to an incomplete understanding of this process. Here, we show for the entire Escherichia coli genome how and to what extent translational speed profiles, which capture local aspects of translational elongation, respond to measured shifts in tRNA availability. We find that translational profiles across the genome are affected to differing degrees, with genes that are essential or related to fundamental processes such as translation, being more robust than those linked to regulation. Furthermore, we reveal how fluctuating tRNA availability influences profiles of specific sequences known to play a significant role in translational control of gene expression.


Assuntos
Escherichia coli/genética , Genoma Bacteriano , Biossíntese de Proteínas , RNA de Transferência/metabolismo , Regiões 5' não Traduzidas , Códon
16.
Sci Adv ; 10(3): eadi3621, 2024 Jan 19.
Artigo em Inglês | MEDLINE | ID: mdl-38241375

RESUMO

Design in synthetic biology is typically goal oriented, aiming to repurpose or optimize existing biological functions, augmenting biology with new-to-nature capabilities, or creating life-like systems from scratch. While the field has seen many advances, bottlenecks in the complexity of the systems built are emerging and designs that function in the lab often fail when used in real-world contexts. Here, we propose an open-ended approach to biological design, with the novelty of designed biology being at least as important as how well it fulfils its goal. Rather than solely focusing on optimization toward a single best design, designing with novelty in mind may allow us to move beyond the diminishing returns we see in performance for most engineered biology. Research from the artificial life community has demonstrated that embracing novelty can automatically generate innovative and unexpected solutions to challenging problems beyond local optima. Synthetic biology offers the ideal playground to explore more creative approaches to biological design.


Assuntos
Evolução Biológica , Biologia Sintética , Estudos Longitudinais
17.
Nat Commun ; 15(1): 3640, 2024 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-38684714

RESUMO

Careful consideration of how we approach design is crucial to all areas of biotechnology. However, choosing or developing an effective design methodology is not always easy as biology, unlike most areas of engineering, is able to adapt and evolve. Here, we put forward that design and evolution follow a similar cyclic process and therefore all design methods, including traditional design, directed evolution, and even random trial and error, exist within an evolutionary design spectrum. This contrasts with conventional views that often place these methods at odds and provides a valuable framework for unifying engineering approaches for challenging biological design problems.


Assuntos
Evolução Molecular Direcionada , Bioengenharia/métodos , Evolução Biológica , Biotecnologia/métodos , Evolução Molecular Direcionada/métodos , Biologia Sintética/métodos
18.
J R Soc Interface ; 21(213): 20240078, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38593842

RESUMO

Biofilms are responsible for most chronic infections and are highly resistant to antibiotic treatments. Previous studies have demonstrated that periodic dosing of antibiotics can help sensitize persistent subpopulations and reduce the overall dosage required for treatment. Because the dynamics and mechanisms of biofilm growth and the formation of persister cells are diverse and are affected by environmental conditions, it remains a challenge to design optimal periodic dosing regimens. Here, we develop a computational agent-based model to streamline this process and determine key parameters for effective treatment. We used our model to test a broad range of persistence switching dynamics and found that if periodic antibiotic dosing was tuned to biofilm dynamics, the dose required for effective treatment could be reduced by nearly 77%. The biofilm architecture and its response to antibiotics were found to depend on the dynamics of persister cells. Despite some differences in the response of biofilm governed by different persister switching rates, we found that a general optimized periodic treatment was still effective in significantly reducing the required antibiotic dose. As persistence becomes better quantified and understood, our model has the potential to act as a foundation for more effective strategies to target bacterial infections.


Assuntos
Bactérias , Infecções Bacterianas , Humanos , Antibacterianos/farmacologia , Antibacterianos/uso terapêutico , Biofilmes
19.
Biodes Res ; 6: 0037, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38919711

RESUMO

Living cells are exquisitely tuned to sense and respond to changes in their environment. Repurposing these systems to create engineered biosensors has seen growing interest in the field of synthetic biology and provides a foundation for many innovative applications spanning environmental monitoring to improved biobased production. In this review, we present a detailed overview of currently available biosensors and the methods that have supported their development, scale-up, and deployment. We focus on genetic sensors in living cells whose outputs affect gene expression. We find that emerging high-throughput experimental assays and evolutionary approaches combined with advanced bioinformatics and machine learning are establishing pipelines to produce genetic sensors for virtually any small molecule, protein, or nucleic acid. However, more complex sensing tasks based on classifying compositions of many stimuli and the reliable deployment of these systems into real-world settings remain challenges. We suggest that recent advances in our ability to precisely modify nonmodel organisms and the integration of proven control engineering principles (e.g., feedback) into the broader design of genetic sensing systems will be necessary to overcome these hurdles and realize the immense potential of the field.

20.
J Microbio Robot ; 20(1): 2, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38616892

RESUMO

The ability to optically interact with cells on both an individual and collective level has applications from wound healing to cancer treatment. Building systems that can facilitate both localised light illumination and visualisation of cells can, however, be challenging and costly. This work takes the Dynamic Optical MicroEnvironment (DOME), an existing platform for the closed-loop optical control of microscale agents, and adapts the design to support live-cell imaging. Through modifications made to the imaging and projection systems within the DOME, a significantly higher resolution, alternative imaging channels and the ability to customise light wavelengths are achieved (Bio-DOME). This is accompanied by an interactive calibration procedure that is robust to changes in the hardware configuration and provides fluorescence imaging (Fluoro-DOME). These alterations to the fundamental design allow for long-term use of the DOME in an environment of higher temperature and humidity. Thus, long-term imaging of living cells in a wound, with closed-loop control of real-time frontier illumination via projected light patterns, is facilitated. Supplementary Information: The online version contains supplementary material available at 10.1007/s12213-024-00165-0.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa