Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros

Base de dados
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Curr Opin Biotechnol ; 81: 102941, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37087839

RESUMO

Advances in high-throughput DNA synthesis and sequencing have fuelled the use of massively parallel reporter assays for strain characterization. These experiments produce large datasets that map DNA sequences to protein expression levels, and have sparked increased interest in data-driven methods for sequence-to-expression modeling. Here, we highlight progress in deep learning models of protein expression and their potential for optimizing strains engineered to produce recombinant proteins. We discuss recent works that built highly accurate models as well as the challenges that hinder wider adoption by end users. There is a need to better align this technology with the requirements and capabilities encountered in strain engineering, particularly the cost of data acquisition and the need for interpretable models that generalize beyond the training data. Overcoming these barriers will help to incentivize academic and industrial laboratories to tap into a new era of data-centric strain engineering.


Assuntos
Bioengenharia , Aprendizado Profundo , Proteínas , Proteínas Recombinantes
2.
Nat Commun ; 13(1): 7755, 2022 12 15.
Artigo em Inglês | MEDLINE | ID: mdl-36517468

RESUMO

Synthetic biology often involves engineering microbial strains to express high-value proteins. Thanks to progress in rapid DNA synthesis and sequencing, deep learning has emerged as a promising approach to build sequence-to-expression models for strain optimization. But such models need large and costly training data that create steep entry barriers for many laboratories. Here we study the relation between accuracy and data efficiency in an atlas of machine learning models trained on datasets of varied size and sequence diversity. We show that deep learning can achieve good prediction accuracy with much smaller datasets than previously thought. We demonstrate that controlled sequence diversity leads to substantial gains in data efficiency and employed Explainable AI to show that convolutional neural networks can finely discriminate between input DNA sequences. Our results provide guidelines for designing genotype-phenotype screens that balance cost and quality of training data, thus helping promote the wider adoption of deep learning in the biotechnology sector.


Assuntos
Aprendizado Profundo , Redes Neurais de Computação , Aprendizado de Máquina , Proteínas
3.
Methods Mol Biol ; 2229: 267-291, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33405227

RESUMO

Heterologous gene expression draws resources from host cells. These resources include vital components to sustain growth and replication, and the resulting cellular burden is a widely recognized bottleneck in the design of robust circuits. In this tutorial we discuss the use of computational models that integrate gene circuits and the physiology of host cells. Through various use cases, we illustrate the power of host-circuit models to predict the impact of design parameters on both burden and circuit functionality. Our approach relies on a new generation of computational models for microbial growth that can flexibly accommodate resource bottlenecks encountered in gene circuit design. Adoption of this modeling paradigm can facilitate fast and robust design cycles in synthetic biology.


Assuntos
Bactérias/genética , Redes Reguladoras de Genes , Biologia de Sistemas/métodos , Fenômenos Fisiológicos Bacterianos , Proteínas de Bactérias/genética , Expressão Gênica , Genes Sintéticos , Modelos Biológicos , Biologia Sintética
4.
ACS Synth Biol ; 8(6): 1231-1240, 2019 06 21.
Artigo em Inglês | MEDLINE | ID: mdl-31181895

RESUMO

Synthetic gene circuits perturb the physiology of their cellular host. The extra load on endogenous processes shifts the equilibrium of resource allocation in the host, leading to slow growth and reduced biosynthesis. Here we built integrated host-circuit models to quantify growth defects caused by synthetic gene circuits. Simulations reveal a complex relation between circuit output and cellular capacity for gene expression. For weak induction of heterologous genes, protein output can be increased at the expense of growth defects. Yet for stronger induction, cellular capacity reaches a tipping point, beyond which both gene expression and growth rate drop sharply. Extensive simulations across various growth conditions and large regions of the design space suggest that the critical capacity is a result of ribosomal scarcity. We studied the impact of growth defects on various gene circuits and transcriptional logic gates, which highlights the extent to which cellular burden can limit, shape, and even break down circuit function. Our approach offers a comprehensive framework to assess the impact of host-circuit interactions in silico, with wide-ranging implications for the design and optimization of bacterial gene circuits.


Assuntos
Redes Reguladoras de Genes/genética , Genes Sintéticos/genética , Modelos Genéticos , Biologia Sintética/métodos , Simulação por Computador , Genes Bacterianos/genética , Ribossomos/genética , Ribossomos/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA