RESUMO
Clostridium thermocellum is a promising candidate for consolidated bioprocessing because it can directly ferment cellulose to ethanol. Despite significant efforts, achieved yields and titers fall below industrially relevant targets. This implies that there still exist unknown enzymatic, regulatory, and/or possibly thermodynamic bottlenecks that can throttle back metabolic flow. By (i) elucidating internal metabolic fluxes in wild-type C. thermocellum grown on cellobiose via 13C-metabolic flux analysis (13C-MFA), (ii) parameterizing a core kinetic model, and (iii) subsequently deploying an ensemble-docking workflow for discovering substrate-level regulations, this paper aims to reveal some of these factors and expand our knowledgebase governing C. thermocellum metabolism. Generated 13C labeling data were used with 13C-MFA to generate a wild-type flux distribution for the metabolic network. Notably, flux elucidation through MFA alluded to serine generation via the mercaptopyruvate pathway. Using the elucidated flux distributions in conjunction with batch fermentation process yield data for various mutant strains, we constructed a kinetic model of C. thermocellum core metabolism (i.e. k-ctherm138). Subsequently, we used the parameterized kinetic model to explore the effect of removing substrate-level regulations on ethanol yield and titer. Upon exploring all possible simultaneous (up to four) regulation removals we identified combinations that lead to many-fold model predicted improvement in ethanol titer. In addition, by coupling a systematic method for identifying putative competitive inhibitory mechanisms using K-FIT kinetic parameterization with the ensemble-docking workflow, we flagged 67 putative substrate-level inhibition mechanisms across central carbon metabolism supported by both kinetic formalism and docking analysis.
Assuntos
Clostridium thermocellum , Celobiose/metabolismo , Clostridium thermocellum/genética , Clostridium thermocellum/metabolismo , Etanol/metabolismo , Fermentação , CinéticaRESUMO
Kinetic models predict the metabolic flows by directly linking metabolite concentrations and enzyme levels to reaction fluxes. Robust parameterization of organism-level kinetic models that faithfully reproduce the effect of different genetic or environmental perturbations remains an open challenge due to the intractability of existing algorithms. This paper introduces Kinetics-based Fluxomics Integration Tool (K-FIT), a robust kinetic parameterization workflow that leverages a novel decomposition approach to identify steady-state fluxes in response to genetic perturbations followed by a gradient-based update of kinetic parameters until predictions simultaneously agree with the fluxomic data in all perturbed metabolic networks. The applicability of K-FIT to large-scale models is demonstrated by parameterizing an expanded kinetic model for E. coli (307 reactions and 258 metabolites) using fluxomic data from six mutants. The achieved thousand-fold speed-up afforded by K-FIT over meta-heuristic approaches is transformational enabling follow-up robustness of inference analyses and optimal design of experiments to inform metabolic engineering strategies.
Assuntos
Algoritmos , Escherichia coli , Modelos Biológicos , Escherichia coli/genética , Escherichia coli/metabolismo , CinéticaRESUMO
Background: Genome-scale metabolic network models and constraint-based modeling techniques have become important tools for analyzing cellular metabolism. Thermodynamically infeasible cycles (TICs) causing unbounded metabolic flux ranges are often encountered. TICs satisfy the mass balance and directionality constraints but violate the second law of thermodynamics. Current practices involve implementing additional constraints to ensure not only optimal but also loopless flux distributions. However, the mixed integer linear programming problems required to solve become computationally intractable for genome-scale metabolic models. Results: We aimed to identify the fewest needed constraints sufficient for optimality under the loopless requirement. We found that loopless constraints are required only for the reactions that share elementary flux modes representing TICs with reactions that are part of the objective function. We put forth the concept of localized loopless constraints (LLCs) to enforce this minimal required set of loopless constraints. By combining with a novel procedure for minimal null-space calculation, the computational time for loopless flux variability analysis (ll-FVA) is reduced by a factor of 10-150 compared to the original loopless constraints and by 4-20 times compared to the current fastest method Fast-SNP with the percent improvement increasing with model size. Importantly, LLCs offer a scalable strategy for loopless flux calculations for multi-compartment/multi-organism models of large sizes, for example, shortening the CPU time for ll-FVA from 35 h to less than 2 h for a model with more than104 reactions. Availability and implementation: Matlab functions are available in the Supplementary Material or at https://github.com/maranasgroup/lll-FVA. Supplementary information: Supplementary data are available at Bioinformatics online.
Assuntos
Biologia Computacional , Genoma , Redes e Vias Metabólicas , Modelos Biológicos , Biologia Computacional/métodos , Genoma/genética , Programação Linear , TermodinâmicaRESUMO
Clostridium thermocellum is a candidate for consolidated bioprocessing by carrying out both cellulose solubilization and fermentation. However, despite significant efforts the maximum ethanol titer achieved to date remains below industrially required targets. Several studies have analyzed the impact of increasing ethanol concentration on C. thermocellum's membrane properties, cofactor pool ratios, and altered enzyme regulation. In this study, we explore the extent to which thermodynamic equilibrium limits maximum ethanol titer. We used the max-min driving force (MDF) algorithm (Noor et al., 2014) to identify the range of allowable metabolite concentrations that maintain a negative free energy change for all reaction steps in the pathway from cellobiose to ethanol. To this end, we used a time-series metabolite concentration dataset to flag five reactions (phosphofructokinase (PFK), fructose bisphosphate aldolase (FBA), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), aldehyde dehydrogenase (ALDH) and alcohol dehydrogenase (ADH)) which become thermodynamic bottlenecks under high external ethanol concentrations. Thermodynamic analysis was also deployed in a prospective mode to evaluate genetic interventions which can improve pathway thermodynamics by generating minimal set of reactions or elementary flux modes (EFMs) which possess unique genetic variations while ensuring mass and redox balance with ethanol production. MDF evaluation of all generated (336) EFMs indicated that, i) pyruvate phosphate dikinase (PPDK) has a higher pathway MDF than the malate shunt alternative due to limiting CO2 concentrations under physiological conditions, and ii) NADPH-dependent glyceraldehyde-3-phosphate dehydrogenase (GAPN) can alleviate thermodynamic bottlenecks at high ethanol concentrations due to cofactor modification and reduction in ATP generation. The combination of ATP linked phosphofructokinase (PFK-ATP) and NADPH linked alcohol dehydrogenase (ADH-NADPH) with NADPH linked aldehyde dehydrogenase (ALDH-NADPH) or ferredoxin: NADP â+ âoxidoreductase (NADPH-FNOR) emerges as the best intervention strategy for ethanol production that balances MDF improvements with ATP generation, and appears to functionally reproduce the pathway employed by the ethanologen Thermoanaerobacterium saccharolyticum. Expanding the list of measured intracellular metabolites and improving the quantification accuracy of measurements was found to improve the fidelity of pathway thermodynamics analysis in C. thermocellum. This study demonstrates even before addressing an organism's enzyme kinetics and allosteric regulations, pathway thermodynamics can flag pathway bottlenecks and identify testable strategies for enhancing pathway thermodynamic feasibility and function.
Assuntos
Proteínas de Bactérias/metabolismo , Celobiose/metabolismo , Clostridium thermocellum/metabolismo , Etanol/metabolismo , Modelos Biológicos , TermodinâmicaRESUMO
Computational pathway design tools often face the challenges of balancing the stoichiometry of co-metabolites and cofactors, and dealing with reaction rule utilization in a single workflow. To this end, we provide an overview of two complementary stoichiometry-based pathway design tools optStoic and novoStoic developed in our group to tackle these challenges. optStoic is designed to determine the stoichiometry of overall conversion first which optimizes a performance criterion (e.g. high carbon/energy efficiency) and ensures a comprehensive search of co-metabolites and cofactors. The procedure then identifies the minimum number of intervening reactions to connect the source and sink metabolites. We also further the pathway design procedure by expanding the search space to include both known and hypothetical reactions, represented by reaction rules, in a new tool termed novoStoic. Reaction rules are derived based on a mixed-integer linear programming (MILP) compatible reaction operator, which allow us to explore natural promiscuous enzymes, engineer candidate enzymes that are not already promiscuous as well as design de novo enzymes. The identified biochemical reaction rules then guide novoStoic to design routes that expand the currently known biotransformation space using a single MILP modeling procedure. We demonstrate the use of the two computational tools in pathway elucidation by designing novel synthetic routes for isobutanol.
Assuntos
Técnicas de Química Combinatória , Algoritmos , Redes e Vias MetabólicasRESUMO
Solving environmental and social challenges such as climate change requires a shift from our current non-renewable manufacturing model to a sustainable bioeconomy. To lower carbon emissions in the production of fuels and chemicals, plant biomass feedstocks can replace petroleum using microorganisms as biocatalysts. The anaerobic thermophile Clostridium thermocellum is a promising bacterium for bioconversion due to its capability to efficiently degrade lignocellulosic biomass. However, the complex metabolism of C. thermocellum is not fully understood, hindering metabolic engineering to achieve high titers, rates, and yields of targeted molecules. In this study, we developed an updated genome-scale metabolic model of C. thermocellum that accounts for recent metabolic findings, has improved prediction accuracy, and is standard-conformant to ensure easy reproducibility. We illustrated two applications of the developed model. We first formulated a multi-omics integration protocol and used it to understand redox metabolism and potential bottlenecks in biofuel (e.g., ethanol) production in C. thermocellum. Second, we used the metabolic model to design modular cells for efficient production of alcohols and esters with broad applications as flavors, fragrances, solvents, and fuels. The proposed designs not only feature intuitive push-and-pull metabolic engineering strategies, but also present novel manipulations around important central metabolic branch-points. We anticipate the developed genome-scale metabolic model will provide a useful tool for system analysis of C. thermocellum metabolism to fundamentally understand its physiology and guide metabolic engineering strategies to rapidly generate modular production strains for effective biosynthesis of biofuels and biochemicals from lignocellulosic biomass.
RESUMO
Metabolic pathways reflect an organism's chemical repertoire and hence their elucidation and design have been a primary goal in metabolic engineering. Various computational methods have been developed to design novel metabolic pathways while taking into account several prerequisites such as pathway stoichiometry, thermodynamics, host compatibility, and enzyme availability. The choice of the method is often determined by the nature of the metabolites of interest and preferred host organism, along with computational complexity and availability of software tools. In this paper, we review different computational approaches used to design metabolic pathways based on the reaction network representation of the database (i.e., graph or stoichiometric matrix) and the search algorithm (i.e., graph search, flux balance analysis, or retrosynthetic search). We also put forth a systematic workflow that can be implemented in projects requiring pathway design and highlight current limitations and obstacles in computational pathway design.
RESUMO
BACKGROUND: Clostridium thermocellum is a Gram-positive anaerobe with the ability to hydrolyze and metabolize cellulose into biofuels such as ethanol, making it an attractive candidate for consolidated bioprocessing (CBP). At present, metabolic engineering in C. thermocellum is hindered due to the incomplete description of its metabolic repertoire and regulation within a predictive metabolic model. Genome-scale metabolic (GSM) models augmented with kinetic models of metabolism have been shown to be effective at recapitulating perturbed metabolic phenotypes. RESULTS: In this effort, we first update a second-generation genome-scale metabolic model (iCth446) for C. thermocellum by correcting cofactor dependencies, restoring elemental and charge balances, and updating GAM and NGAM values to improve phenotype predictions. The iCth446 model is next used as a scaffold to develop a core kinetic model (k-ctherm118) of the C. thermocellum central metabolism using the Ensemble Modeling (EM) paradigm. Model parameterization is carried out by simultaneously imposing fermentation yield data in lactate, malate, acetate, and hydrogen production pathways for 19 measured metabolites spanning a library of 19 distinct single and multiple gene knockout mutants along with 18 intracellular metabolite concentration data for a Δgldh mutant and ten experimentally measured Michaelis-Menten kinetic parameters. CONCLUSIONS: The k-ctherm118 model captures significant metabolic changes caused by (1) nitrogen limitation leading to increased yields for lactate, pyruvate, and amino acids, and (2) ethanol stress causing an increase in intracellular sugar phosphate concentrations (~1.5-fold) due to upregulation of cofactor pools. Robustness analysis of k-ctherm118 alludes to the presence of a secondary activity of ketol-acid reductoisomerase and possible regulation by valine and/or leucine pool levels. In addition, cross-validation and robustness analysis allude to missing elements in k-ctherm118 and suggest additional experiments to improve kinetic model prediction fidelity. Overall, the study quantitatively assesses the advantages of EM-based kinetic modeling towards improved prediction of C. thermocellum metabolism and develops a predictive kinetic model which can be used to design biofuel-overproducing strains.
RESUMO
Anaerobic Clostridium spp. is an important bioproduction microbial genus that can produce solvents and utilize a broad spectrum of substrates including cellulose and syngas. Genome-scale metabolic (GSM) models are increasingly being put forth for various clostridial strains to explore their respective metabolic capabilities and suitability for various bioconversions. In this study, we have selected representative GSM models for six different clostridia (Clostridium acetobutylicum, C. beijerinckii, C. butyricum, C. cellulolyticum, C. ljungdahlii and C. thermocellum) and performed a detailed model comparison contrasting their metabolic repertoire. We also discuss various applications of these GSM models to guide metabolic engineering interventions as well as assessing cellular physiology.
Assuntos
Clostridium/genética , Clostridium/metabolismo , Celulose/metabolismo , Clostridium/classificação , Clostridium acetobutylicum/metabolismo , Fermentação , Genoma Bacteriano , Engenharia Metabólica , Modelos BiológicosRESUMO
BACKGROUND: Clostridia are anaerobic Gram-positive Firmicutes containing broad and flexible systems for substrate utilization, which have been used successfully to produce a range of industrial compounds. In particular, Clostridium acetobutylicum has been used to produce butanol on an industrial scale through acetone-butanol-ethanol (ABE) fermentation. A genome-scale metabolic (GSM) model is a powerful tool for understanding the metabolic capacities of an organism and developing metabolic engineering strategies for strain development. The integration of stress-related specific transcriptomics information with the GSM model provides opportunities for elucidating the focal points of regulation. RESULTS: We describe here the construction and validation of a GSM model for C. acetobutylicum ATCC 824, iCac802. iCac802 spans 802 genes and includes 1,137 metabolites and 1,462 reactions, along with gene-protein-reaction associations. Both (13)C-MFA and gene deletion data in the ABE fermentation pathway were used to test the predicted flux ranges allowed by the model. We also describe the CoreReg method, introduced in this paper, to integrate transcriptomic data and identify core sets of reactions that, when their flux was selectively restricted, reproduced flux and biomass-formation ranges seen under all regulatory constraints. CoreReg was used in response to butanol and butyrate stress to tighten bounds for 50 reactions within the iCac802 model. These bounds affected the flux of tens of reactions in core metabolism. The model, incorporating the regulatory restrictions from CoreReg under chemical stress, exhibited an approximate 70% reduction in biomass yield for most stress conditions. CONCLUSIONS: The regulation placed on the model for the two stresses using CoreReg identified differences in the respective responses, including distinct core sets and the restriction of biomass production similar to experimental observations. Given the core sets predicted by the CoreReg method, remedial actions can be taken to counteract the effect of stress on metabolism. For less well-known systems, plausible regulatory loops can be suggested around the affected metabolic reactions, and the hypotheses can be tested experimentally.