Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
mBio ; 15(10): e0087324, 2024 Oct 16.
Artigo em Inglês | MEDLINE | ID: mdl-39207109

RESUMO

Microorganisms with simplified genomes represent interesting cell chassis for systems and synthetic biology. However, genome reduction can lead to undesired traits, such as decreased growth rate and metabolic imbalances. To investigate the impact of genome reduction on Escherichia coli strain DGF-298, a strain in which ~ 36% of the genome has been removed, we reconstructed a strain-specific metabolic model (iAC1061), investigated the regulation of gene expression using iModulon-based transcriptome analysis, and performed adaptive laboratory evolution to let the strain correct potential imbalances that arose during its simplification. The model notably predicted that the removal of all three key pathways for glycolaldehyde disposal in this microorganism would lead to a metabolic bottleneck through folate starvation. Glycolaldehyde is also known to cause self-generation of reactive oxygen species, as evidenced by the increased expression of oxidative stress resistance genes in the SoxS iModulon. The reintroduction of the aldA gene, responsible for one native glycolaldehyde disposal route, alleviated the constitutive oxidative stress response. Our results suggest that systems-level approaches and adaptive laboratory evolution have additive benefits when trying to repair and optimize genome-engineered strains. IMPORTANCE: Genomic streamlining can be employed in model organisms to reduce complexity and enhance strain predictability. One of the most striking examples is the bacterial strain Escherichia coli DGF-298, notable for having over one-third of its genome deleted. However, such extensive genome modifications raise the question of how similar this simplified cell remains when compared with its parent, and what are the possible unintended consequences of this simplification. In this study, we used metabolic modeling along with iModulon-based transcriptomic analysis in different growth conditions to assess the impact of genome reduction on metabolism and gene regulation. We observed little impact of genomic reduction on the regulatory network of E. coli DGF-298 and identified a potential metabolic bottleneck leading to the constitutive activity of the SoxS iModulon. We then leveraged the model's predictions to successfully restore SoxS activity to the basal level.


Assuntos
Escherichia coli , Genoma Bacteriano , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Estresse Oxidativo , Regulação Bacteriana da Expressão Gênica , Redes e Vias Metabólicas/genética , Perfilação da Expressão Gênica
2.
Synth Syst Biotechnol ; 7(3): 900-910, 2022 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-35647330

RESUMO

In silico genome mining provides easy access to secondary metabolite biosynthetic gene clusters (BGCs) encoding the biosynthesis of many bioactive compounds, which are the basis for many important drugs used in human medicine. However, the association between BGCs and other functions encoded in the genomes of producers have remained elusive. Here, we present a systems biology workflow that integrates genome mining with a detailed pangenome analysis for detecting genes associated with a particular BGC. We analyzed 3,889 enterobacterial genomes and found 13,266 BGCs, represented by 252 distinct BGC families and 347 additional singletons. A pangenome analysis revealed 88 genes putatively associated with a specific BGC coding for the colon cancer-related colibactin that code for diverse metabolic and regulatory functions. The presented workflow opens up the possibility to discover novel secondary metabolites, better understand their physiological roles, and provides a guide to identify and analyze BGC associated gene sets.

3.
Mol Syst Biol ; 17(7): e10099, 2021 07.
Artigo em Inglês | MEDLINE | ID: mdl-34288418

RESUMO

Mesoplasma florum, a fast-growing near-minimal organism, is a compelling model to explore rational genome designs. Using sequence and structural homology, the set of metabolic functions its genome encodes was identified, allowing the reconstruction of a metabolic network representing ˜ 30% of its protein-coding genes. Growth medium simplification enabled substrate uptake and product secretion rate quantification which, along with experimental biomass composition, were integrated as species-specific constraints to produce the functional iJL208 genome-scale model (GEM) of metabolism. Genome-wide expression and essentiality datasets as well as growth data on various carbohydrates were used to validate and refine iJL208. Discrepancies between model predictions and observations were mechanistically explained using protein structures and network analysis. iJL208 was also used to propose an in silico reduced genome. Comparing this prediction to the minimal cell JCVI-syn3.0 and its parent JCVI-syn1.0 revealed key features of a minimal gene set. iJL208 is a stepping-stone toward model-driven whole-genome engineering.


Assuntos
Genoma , Redes e Vias Metabólicas , Genoma/genética , Genômica , Redes e Vias Metabólicas/genética , Modelos Biológicos
4.
PLoS Comput Biol ; 17(6): e1007817, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-34161321

RESUMO

Sustaining a robust metabolic network requires a balanced and fully functioning proteome. In addition to amino acids, many enzymes require cofactors (coenzymes and engrafted prosthetic groups) to function properly. Extensively validated resource allocation models, such as genome-scale models of metabolism and gene expression (ME-models), have the ability to compute an optimal proteome composition underlying a metabolic phenotype, including the provision of all required cofactors. Here we apply the ME-model for Escherichia coli K-12 MG1655 to computationally examine how environmental conditions change the proteome and its accompanying cofactor usage. We found that: (1) The cofactor requirements computed by the ME-model mostly agree with the standard biomass objective function used in models of metabolism alone (M-models); (2) ME-model computations reveal non-intuitive variability in cofactor use under different growth conditions; (3) An analysis of ME-model predicted protein use in aerobic and anaerobic conditions suggests an enrichment in the use of peroxyl scavenging acids in the proteins used to sustain aerobic growth; (4) The ME-model could describe how limitation in key protein components affect the metabolic state of E. coli. Genome-scale models have thus reached a level of sophistication where they reveal intricate properties of functional proteomes and how they support different E. coli lifestyles.


Assuntos
Biologia Computacional/métodos , Escherichia coli K12/crescimento & desenvolvimento , Nutrientes/metabolismo , Proteoma , Escherichia coli K12/metabolismo , Modelos Biológicos
5.
Proc Natl Acad Sci U S A ; 117(37): 23182-23190, 2020 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-32873645

RESUMO

Enzyme turnover numbers (kcats) are essential for a quantitative understanding of cells. Because kcats are traditionally measured in low-throughput assays, they can be inconsistent, labor-intensive to obtain, and can miss in vivo effects. We use a data-driven approach to estimate in vivo kcats using metabolic specialist Escherichia coli strains that resulted from gene knockouts in central metabolism followed by metabolic optimization via laboratory evolution. By combining absolute proteomics with fluxomics data, we find that in vivo kcats are robust against genetic perturbations, suggesting that metabolic adaptation to gene loss is mostly achieved through other mechanisms, like gene-regulatory changes. Combining machine learning and genome-scale metabolic models, we show that the obtained in vivo kcats predict unseen proteomics data with much higher precision than in vitro kcats. The results demonstrate that in vivo kcats can solve the problem of inconsistent and low-coverage parameterizations of genome-scale cellular models.


Assuntos
Escherichia coli/metabolismo , Escherichia coli/genética , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Técnicas de Inativação de Genes/métodos , Genoma/genética , Cinética , Aprendizado de Máquina , Modelos Biológicos , Proteômica/métodos
6.
Nat Rev Microbiol ; 18(12): 731-743, 2020 12.
Artigo em Inglês | MEDLINE | ID: mdl-32958892

RESUMO

Escherichia coli is considered to be the best-known microorganism given the large number of published studies detailing its genes, its genome and the biochemical functions of its molecular components. This vast literature has been systematically assembled into a reconstruction of the biochemical reaction networks that underlie E. coli's functions, a process which is now being applied to an increasing number of microorganisms. Genome-scale reconstructed networks are organized and systematized knowledge bases that have multiple uses, including conversion into computational models that interpret and predict phenotypic states and the consequences of environmental and genetic perturbations. These genome-scale models (GEMs) now enable us to develop pan-genome analyses that provide mechanistic insights, detail the selection pressures on proteome allocation and address stress phenotypes. In this Review, we first discuss the overall development of GEMs and their applications. Next, we review the evolution of the most complete GEM that has been developed to date: the E. coli GEM. Finally, we explore three emerging areas in genome-scale modelling of microbial phenotypes: collections of strain-specific models, metabolic and macromolecular expression models, and simulation of stress responses.


Assuntos
Escherichia coli/genética , Redes Reguladoras de Genes , Genoma Bacteriano , Genômica/métodos , Redes e Vias Metabólicas/genética , Modelos Genéticos , Actinobacteria/classificação , Actinobacteria/genética , Actinobacteria/crescimento & desenvolvimento , Actinobacteria/metabolismo , Simulação por Computador , Cianobactérias/classificação , Cianobactérias/genética , Cianobactérias/crescimento & desenvolvimento , Cianobactérias/metabolismo , Escherichia coli/crescimento & desenvolvimento , Escherichia coli/metabolismo , Firmicutes/classificação , Firmicutes/genética , Firmicutes/crescimento & desenvolvimento , Firmicutes/metabolismo , Genômica/instrumentação , Fenótipo , Proteobactérias/classificação , Proteobactérias/genética , Proteobactérias/crescimento & desenvolvimento , Proteobactérias/metabolismo , Estresse Fisiológico/genética , Thermotoga/classificação , Thermotoga/genética , Thermotoga/crescimento & desenvolvimento , Thermotoga/metabolismo , Sequenciamento Completo do Genoma
7.
PLoS Comput Biol ; 15(12): e1007525, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31809503

RESUMO

Response to acid stress is critical for Escherichia coli to successfully complete its life-cycle by passing through the stomach to colonize the digestive tract. To develop a fundamental understanding of this response, we established a molecular mechanistic description of acid stress mitigation responses in E. coli and integrated them with a genome-scale model of its metabolism and macromolecular expression (ME-model). We considered three known mechanisms of acid stress mitigation: 1) change in membrane lipid fatty acid composition, 2) change in periplasmic protein stability over external pH and periplasmic chaperone protection mechanisms, and 3) change in the activities of membrane proteins. After integrating these mechanisms into an established ME-model, we could simulate their responses in the context of other cellular processes. We validated these simulations using RNA sequencing data obtained from five E. coli strains grown under external pH ranging from 5.5 to 7.0. We found: i) that for the differentially expressed genes accounted for in the ME-model, 80% of the upregulated genes were correctly predicted by the ME-model, and ii) that these genes are mainly involved in translation processes (45% of genes), membrane proteins and related processes (18% of genes), amino acid metabolism (12% of genes), and cofactor and prosthetic group biosynthesis (8% of genes). We also demonstrated several intervention strategies on acid tolerance that can be simulated by the ME-model. We thus established a quantitative framework that describes, on a genome-scale, the acid stress mitigation response of E. coli that has both scientific and practical uses.


Assuntos
Escherichia coli/genética , Escherichia coli/metabolismo , Modelos Biológicos , Ácidos , Biologia Computacional , Simulação por Computador , Escherichia coli/crescimento & desenvolvimento , Proteínas de Escherichia coli/metabolismo , Ácidos Graxos/metabolismo , Regulação Bacteriana da Expressão Gênica , Genoma Bacteriano , Concentração de Íons de Hidrogênio , Lipídeos de Membrana/metabolismo , Modelos Genéticos , Estabilidade Proteica , Análise de Sequência de RNA/estatística & dados numéricos , Estresse Fisiológico
8.
Proc Natl Acad Sci U S A ; 116(28): 14368-14373, 2019 07 09.
Artigo em Inglês | MEDLINE | ID: mdl-31270234

RESUMO

Catalysis using iron-sulfur clusters and transition metals can be traced back to the last universal common ancestor. The damage to metalloproteins caused by reactive oxygen species (ROS) can prevent cell growth and survival when unmanaged, thus eliciting an essential stress response that is universal and fundamental in biology. Here we develop a computable multiscale description of the ROS stress response in Escherichia coli, called OxidizeME. We use OxidizeME to explain four key responses to oxidative stress: 1) ROS-induced auxotrophy for branched-chain, aromatic, and sulfurous amino acids; 2) nutrient-dependent sensitivity of growth rate to ROS; 3) ROS-specific differential gene expression separate from global growth-associated differential expression; and 4) coordinated expression of iron-sulfur cluster (ISC) and sulfur assimilation (SUF) systems for iron-sulfur cluster biosynthesis. These results show that we can now develop fundamental and quantitative genotype-phenotype relationships for stress responses on a genome-wide basis.


Assuntos
Proteínas Ferro-Enxofre/genética , Ferro/metabolismo , Metaloproteínas/genética , Espécies Reativas de Oxigênio/metabolismo , Catálise , Proliferação de Células/genética , Escherichia coli/genética , Escherichia coli/metabolismo , Regulação da Expressão Gênica/genética , Peróxido de Hidrogênio/metabolismo , Óperon/genética , Estresse Oxidativo/genética , Enxofre/metabolismo
9.
PLoS Comput Biol ; 15(4): e1006971, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-31009451

RESUMO

Genome-scale metabolic models (GEMs) are mathematically structured knowledge bases of metabolism that provide phenotypic predictions from genomic information. GEM-guided predictions of growth phenotypes rely on the accurate definition of a biomass objective function (BOF) that is designed to include key cellular biomass components such as the major macromolecules (DNA, RNA, proteins), lipids, coenzymes, inorganic ions and species-specific components. Despite its importance, no standardized computational platform is currently available to generate species-specific biomass objective functions in a data-driven, unbiased fashion. To fill this gap in the metabolic modeling software ecosystem, we implemented BOFdat, a Python package for the definition of a Biomass Objective Function from experimental data. BOFdat has a modular implementation that divides the BOF definition process into three independent modules defined here as steps: 1) the coefficients for major macromolecules are calculated, 2) coenzymes and inorganic ions are identified and their stoichiometric coefficients estimated, 3) the remaining species-specific metabolic biomass precursors are algorithmically extracted in an unbiased way from experimental data. We used BOFdat to reconstruct the BOF of the Escherichia coli model iML1515, a gold standard in the field. The BOF generated by BOFdat resulted in the most concordant biomass composition, growth rate, and gene essentiality prediction accuracy when compared to other methods. Installation instructions for BOFdat are available in the documentation and the source code is available on GitHub (https://github.com/jclachance/BOFdat).


Assuntos
Biomassa , Genômica/métodos , Redes e Vias Metabólicas , Modelos Biológicos , Software , Escherichia coli/genética , Escherichia coli/metabolismo , Genoma Bacteriano
10.
PLoS Comput Biol ; 15(3): e1006213, 2019 03.
Artigo em Inglês | MEDLINE | ID: mdl-30822347

RESUMO

Understanding the fundamental characteristics of microbial communities could have far reaching implications for human health and applied biotechnology. Despite this, much is still unknown regarding the genetic basis and evolutionary strategies underlying the formation of viable synthetic communities. By pairing auxotrophic mutants in co-culture, it has been demonstrated that viable nascent E. coli communities can be established where the mutant strains are metabolically coupled. A novel algorithm, OptAux, was constructed to design 61 unique multi-knockout E. coli auxotrophic strains that require significant metabolite uptake to grow. These predicted knockouts included a diverse set of novel non-specific auxotrophs that result from inhibition of major biosynthetic subsystems. Three OptAux predicted non-specific auxotrophic strains-with diverse metabolic deficiencies-were co-cultured with an L-histidine auxotroph and optimized via adaptive laboratory evolution (ALE). Time-course sequencing revealed the genetic changes employed by each strain to achieve higher community growth rates and provided insight into mechanisms for adapting to the syntrophic niche. A community model of metabolism and gene expression was utilized to predict the relative community composition and fundamental characteristics of the evolved communities. This work presents new insight into the genetic strategies underlying viable nascent community formation and a cutting-edge computational method to elucidate metabolic changes that empower the creation of cooperative communities.


Assuntos
Adaptação Fisiológica , Escherichia coli/fisiologia , Modelos Biológicos , Algoritmos , Evolução Biológica , Técnicas de Cocultura , Escherichia coli/genética , Genes Bacterianos , Mutação
11.
BMC Syst Biol ; 13(1): 2, 2019 01 09.
Artigo em Inglês | MEDLINE | ID: mdl-30626386

RESUMO

BACKGROUND: Genome-scale models of metabolism and macromolecular expression (ME models) enable systems-level computation of proteome allocation coupled to metabolic phenotype. RESULTS: We develop DynamicME, an algorithm enabling time-course simulation of cell metabolism and protein expression. DynamicME correctly predicted the substrate utilization hierarchy on a mixed carbon substrate medium. We also found good agreement between predicted and measured time-course expression profiles. ME models involve considerably more parameters than metabolic models (M models). We thus generate an ensemble of models (each model having its rate constants perturbed), and then analyze the models by identifying archetypal time-course metabolite concentration profiles. Furthermore, we use a metaheuristic optimization method to calibrate ME model parameters using time-course measurements such as from a (fed-) batch culture. Finally, we show that constraints on protein concentration dynamics ("inertia") alter the metabolic response to environmental fluctuations, including increased substrate-level phosphorylation and lowered oxidative phosphorylation. CONCLUSIONS: Overall, DynamicME provides a novel method for understanding proteome allocation and metabolism under complex and transient environments, and to utilize time-course cell culture data for model-based interpretation or model refinement.


Assuntos
Perfilação da Expressão Gênica , Metabolômica , Modelos Biológicos , Proteínas/genética , Proteínas/metabolismo , Algoritmos , Calibragem , Escherichia coli/citologia , Escherichia coli/genética , Escherichia coli/metabolismo , Genômica
12.
Nat Commun ; 9(1): 5252, 2018 12 07.
Artigo em Inglês | MEDLINE | ID: mdl-30531987

RESUMO

Knowing the catalytic turnover numbers of enzymes is essential for understanding the growth rate, proteome composition, and physiology of organisms, but experimental data on enzyme turnover numbers is sparse and noisy. Here, we demonstrate that machine learning can successfully predict catalytic turnover numbers in Escherichia coli based on integrated data on enzyme biochemistry, protein structure, and network context. We identify a diverse set of features that are consistently predictive for both in vivo and in vitro enzyme turnover rates, revealing novel protein structural correlates of catalytic turnover. We use our predictions to parameterize two mechanistic genome-scale modelling frameworks for proteome-limited metabolism, leading to significantly higher accuracy in the prediction of quantitative proteome data than previous approaches. The presented machine learning models thus provide a valuable tool for understanding metabolism and the proteome at the genome scale, and elucidate structural, biochemical, and network properties that underlie enzyme kinetics.


Assuntos
Proteínas de Escherichia coli/metabolismo , Escherichia coli/enzimologia , Aprendizado de Máquina , Redes e Vias Metabólicas , Algoritmos , Biocatálise , Escherichia coli/genética , Proteínas de Escherichia coli/genética , Cinética , Modelos Biológicos , Proteoma/genética , Proteoma/metabolismo
13.
PLoS Comput Biol ; 14(7): e1006302, 2018 07.
Artigo em Inglês | MEDLINE | ID: mdl-29975681

RESUMO

Genome-scale models of metabolism and macromolecular expression (ME-models) explicitly compute the optimal proteome composition of a growing cell. ME-models expand upon the well-established genome-scale models of metabolism (M-models), and they enable a new fundamental understanding of cellular growth. ME-models have increased predictive capabilities and accuracy due to their inclusion of the biosynthetic costs for the machinery of life, but they come with a significant increase in model size and complexity. This challenge results in models which are both difficult to compute and challenging to understand conceptually. As a result, ME-models exist for only two organisms (Escherichia coli and Thermotoga maritima) and are still used by relatively few researchers. To address these challenges, we have developed a new software framework called COBRAme for building and simulating ME-models. It is coded in Python and built on COBRApy, a popular platform for using M-models. COBRAme streamlines computation and analysis of ME-models. It provides tools to simplify constructing and editing ME-models to enable ME-model reconstructions for new organisms. We used COBRAme to reconstruct a condensed E. coli ME-model called iJL1678b-ME. This reformulated model gives functionally identical solutions to previous E. coli ME-models while using 1/6 the number of free variables and solving in less than 10 minutes, a marked improvement over the 6 hour solve time of previous ME-model formulations. Errors in previous ME-models were also corrected leading to 52 additional genes that must be expressed in iJL1678b-ME to grow aerobically in glucose minimal in silico media. This manuscript outlines the architecture of COBRAme and demonstrates how ME-models can be created, modified, and shared most efficiently using the new software framework.


Assuntos
Simulação por Computador , Expressão Gênica , Metabolismo/genética , Modelos Genéticos , Design de Software , Algoritmos , Genoma
15.
Proc Natl Acad Sci U S A ; 114(38): 10286-10291, 2017 09 19.
Artigo em Inglês | MEDLINE | ID: mdl-28874552

RESUMO

Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN-probably the best characterized TRN-several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism's TRN from disparate data types.


Assuntos
Escherichia coli/metabolismo , Regulação Bacteriana da Expressão Gênica , Redes Reguladoras de Genes , Fatores de Transcrição/metabolismo , Escherichia coli/genética , Transcriptoma
16.
Appl Environ Microbiol ; 83(13)2017 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-28455337

RESUMO

Adaptive laboratory evolution (ALE) experiments are often designed to maintain a static culturing environment to minimize confounding variables that could influence the adaptive process, but dynamic nutrient conditions occur frequently in natural and bioprocessing settings. To study the nature of carbon substrate fitness tradeoffs, we evolved batch cultures of Escherichia coli via serial propagation into tubes alternating between glucose and either xylose, glycerol, or acetate. Genome sequencing of evolved cultures revealed several genetic changes preferentially selected for under dynamic conditions and different adaptation strategies depending on the substrates being switched between; in some environments, a persistent "generalist" strain developed, while in another, two "specialist" subpopulations arose that alternated dominance. Diauxic lag phenotype varied across the generalists and specialists, in one case being completely abolished, while gene expression data distinguished the transcriptional strategies implemented by strains in pursuit of growth optimality. Genome-scale metabolic modeling techniques were then used to help explain the inherent substrate differences giving rise to the observed distinct adaptive strategies. This study gives insight into the population dynamics of adaptation in an alternating environment and into the underlying metabolic and genetic mechanisms. Furthermore, ALE-generated optimized strains have phenotypes with potential industrial bioprocessing applications.IMPORTANCE Evolution and natural selection inexorably lead to an organism's improved fitness in a given environment, whether in a laboratory or natural setting. However, despite the frequent natural occurrence of complex and dynamic growth environments, laboratory evolution experiments typically maintain simple, static culturing environments so as to reduce selection pressure complexity. In this study, we investigated the adaptive strategies underlying evolution to fluctuating environments by evolving Escherichia coli to conditions of frequently switching growth substrate. Characterization of evolved strains via a number of different data types revealed the various genetic and phenotypic changes implemented in pursuit of growth optimality and how these differed across the different growth substrates and switching protocols. This work not only helps to establish general principles of adaptation to complex environments but also suggests strategies for experimental design to achieve desired evolutionary outcomes.


Assuntos
Escherichia coli/genética , Meios de Cultura/metabolismo , Meio Ambiente , Escherichia coli/metabolismo , Aptidão Genética , Genoma Bacteriano , Glucose/metabolismo , Glicerol/metabolismo , Fenótipo , Xilose/metabolismo
17.
Sci Rep ; 6: 36734, 2016 11 18.
Artigo em Inglês | MEDLINE | ID: mdl-27857205

RESUMO

Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the "generalist" (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and "hedging" against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.


Assuntos
Proteoma/genética , Simulação por Computador , Escherichia coli/genética , Escherichia coli/metabolismo , Genoma Bacteriano , Modelos Genéticos , Proteoma/metabolismo , Proteômica
18.
BMC Bioinformatics ; 17(1): 391, 2016 Sep 22.
Artigo em Inglês | MEDLINE | ID: mdl-27659412

RESUMO

BACKGROUND: Genome-scale models of metabolism and macromolecular expression (ME) significantly expand the scope and predictive capabilities of constraint-based modeling. ME models present considerable computational challenges: they are much (>30 times) larger than corresponding metabolic reconstructions (M models), are multiscale, and growth maximization is a nonlinear programming (NLP) problem, mainly due to macromolecule dilution constraints. RESULTS: Here, we address these computational challenges. We develop a fast and numerically reliable solution method for growth maximization in ME models using a quad-precision NLP solver (Quad MINOS). Our method was up to 45 % faster than binary search for six significant digits in growth rate. We also develop a fast, quad-precision flux variability analysis that is accelerated (up to 60× speedup) via solver warm-starts. Finally, we employ the tools developed to investigate growth-coupled succinate overproduction, accounting for proteome constraints. CONCLUSIONS: Just as genome-scale metabolic reconstructions have become an invaluable tool for computational and systems biologists, we anticipate that these fast and numerically reliable ME solution methods will accelerate the wide-spread adoption of ME models for researchers in these fields.

19.
Proc Natl Acad Sci U S A ; 112(34): 10810-5, 2015 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-26261351

RESUMO

Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.


Assuntos
Regulação Bacteriana da Expressão Gênica , Genes Bacterianos , Ensaios de Triagem em Larga Escala , Metaboloma , Proteoma , Biologia de Sistemas , Buchnera/genética , Buchnera/metabolismo , Simulação por Computador , Conjuntos de Dados como Assunto , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Modelos Biológicos , Família Multigênica , Mycoplasma genitalium/genética , Mycoplasma genitalium/metabolismo , Transcriptoma
20.
Curr Opin Biotechnol ; 35: 23-9, 2015 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-25575024

RESUMO

Constraint-based reconstruction and analysis (COBRA) methods have become widely used tools for metabolic engineering in both academic and industrial laboratories. By employing a genome-scale in silico representation of the metabolic network of a host organism, COBRA methods can be used to predict optimal genetic modifications that improve the rate and yield of chemical production. A new generation of COBRA models and methods is now being developed--encompassing many biological processes and simulation strategies-and next-generation models enable new types of predictions. Here, three key examples of applying COBRA methods to strain optimization are presented and discussed. Then, an outlook is provided on the next generation of COBRA models and the new types of predictions they will enable for systems metabolic engineering.


Assuntos
Genoma , Engenharia Metabólica , Biomassa , Deleção de Genes , Humanos , Redes e Vias Metabólicas , Modelos Biológicos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA