Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 54
Filtrar
1.
BMC Bioinformatics ; 24(1): 106, 2023 Mar 22.
Artigo em Inglês | MEDLINE | ID: mdl-36949401

RESUMO

BACKGROUND: Biochemical reaction prediction tools leverage enzymatic promiscuity rules to generate reaction networks containing novel compounds and reactions. The resulting reaction networks can be used for multiple applications such as designing novel biosynthetic pathways and annotating untargeted metabolomics data. It is vital for these tools to provide a robust, user-friendly method to generate networks for a given application. However, existing tools lack the flexibility to easily generate networks that are tailor-fit for a user's application due to lack of exhaustive reaction rules, restriction to pre-computed networks, and difficulty in using the software due to lack of documentation. RESULTS: Here we present Pickaxe, an open-source, flexible software that provides a user-friendly method to generate novel reaction networks. This software iteratively applies reaction rules to a set of metabolites to generate novel reactions. Users can select rules from the prepackaged JN1224min ruleset, derived from MetaCyc, or define their own custom rules. Additionally, filters are provided which allow for the pruning of a network on-the-fly based on compound and reaction properties. The filters include chemical similarity to target molecules, metabolomics, thermodynamics, and reaction feasibility filters. Example applications are given to highlight the capabilities of Pickaxe: the expansion of common biological databases with novel reactions, the generation of industrially useful chemicals from a yeast metabolome database, and the annotation of untargeted metabolomics peaks from an E. coli dataset. CONCLUSION: Pickaxe predicts novel metabolic reactions and compounds, which can be used for a variety of applications. This software is open-source and available as part of the MINE Database python package ( https://pypi.org/project/minedatabase/ ) or on GitHub ( https://github.com/tyo-nu/MINE-Database ). Documentation and examples can be found on Read the Docs ( https://mine-database.readthedocs.io/en/latest/ ). Through its documentation, pre-packaged features, and customizable nature, Pickaxe allows users to generate novel reaction networks tailored to their application.


Assuntos
Fenômenos Bioquímicos , Escherichia coli , Escherichia coli/genética , Software , Metabolômica , Metaboloma
2.
Bioinformatics ; 38(13): 3484-3487, 2022 06 27.
Artigo em Inglês | MEDLINE | ID: mdl-35595247

RESUMO

SUMMARY: Although advances in untargeted metabolomics have made it possible to gather data on thousands of cellular metabolites in parallel, identification of novel metabolites from these datasets remains challenging. To address this need, Metabolic in silico Network Expansions (MINEs) were developed. A MINE is an expansion of known biochemistry which can be used as a list of potential structures for unannotated metabolomics peaks. Here, we present MINE 2.0, which utilizes a new set of biochemical transformation rules that covers 93% of MetaCyc reactions (compared to 25% in MINE 1.0). This results in a 17-fold increase in database size and a 40% increase in MINE database compounds matching unannotated peaks from an untargeted metabolomics dataset. MINE 2.0 is thus a significant improvement to this community resource. AVAILABILITY AND IMPLEMENTATION: The MINE 2.0 website can be accessed at https://minedatabase.ci.northwestern.edu. The MINE 2.0 web API documentation can be accessed at https://mine-api.readthedocs.io/en/latest/. The data and code underlying this article are available in the MINE-2.0-Paper repository at https://github.com/tyo-nu/MINE-2.0-Paper. MINE 2.0 source code can be accessed at https://github.com/tyo-nu/MINE-Database (MINE construction), https://github.com/tyo-nu/MINE-Server (backend web API) and https://github.com/tyo-nu/MINE-app (web app). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Metabolômica , Software , Bases de Dados Factuais , Bioquímica , Documentação
3.
Metab Eng ; 76: 133-145, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-36724840

RESUMO

Cell-free systems are useful tools for prototyping metabolic pathways and optimizing the production of various bioproducts. Mechanistically-based kinetic models are uniquely suited to analyze dynamic experimental data collected from cell-free systems and provide vital qualitative insight. However, to date, dynamic kinetic models have not been applied with rigorous biological constraints or trained on adequate experimental data to the degree that they would give high confidence in predictions and broadly demonstrate the potential for widespread use of such kinetic models. In this work, we construct a large-scale dynamic model of cell-free metabolism with the goal of understanding and optimizing butanol production in a cell-free system. Using a combination of parameterization methods, the resultant model captures experimental metabolite measurements across two experimental conditions for nine metabolites at timepoints between 0 and 24 h. We present analysis of the model predictions, provide recommendations for butanol optimization, and identify the aldehyde/alcohol dehydrogenase as the primary bottleneck in butanol production. Sensitivity analysis further reveals the extent to which various parameters are constrained, and our approach for probing valid parameter ranges can be applied to other modeling efforts.


Assuntos
1-Butanol , Butanóis , Butanóis/metabolismo , Etanol/metabolismo , Modelos Biológicos , Cinética
4.
Nucleic Acids Res ; 48(9): 5169-5182, 2020 05 21.
Artigo em Inglês | MEDLINE | ID: mdl-32246719

RESUMO

One primary objective of synthetic biology is to improve the sustainability of chemical manufacturing. Naturally occurring biological systems can utilize a variety of carbon sources, including waste streams that pose challenges to traditional chemical processing, such as lignin biomass, providing opportunity for remediation and valorization of these materials. Success, however, depends on identifying micro-organisms that are both metabolically versatile and engineerable. Identifying organisms with this combination of traits has been a historic hindrance. Here, we leverage the facile genetics of the metabolically versatile bacterium Acinetobacter baylyi ADP1 to create easy and rapid molecular cloning workflows, including a Cas9-based single-step marker-less and scar-less genomic integration method. In addition, we create a promoter library, ribosomal binding site (RBS) variants and test an unprecedented number of rationally integrated bacterial chromosomal protein expression sites and variants. At last, we demonstrate the utility of these tools by examining ADP1's catabolic repression regulation, creating a strain with improved potential for lignin bioprocessing. Taken together, this work highlights ADP1 as an ideal host for a variety of sustainability and synthetic biology applications.


Assuntos
Acinetobacter/genética , Engenharia Metabólica , Acinetobacter/metabolismo , Clonagem Molecular/métodos , Genoma Bacteriano , Genômica , Lignina/metabolismo , Regiões Promotoras Genéticas , Ribossomos/metabolismo
5.
J Am Chem Soc ; 143(40): 16630-16640, 2021 10 13.
Artigo em Inglês | MEDLINE | ID: mdl-34591459

RESUMO

Employing DNA as a high-density data storage medium has paved the way for next-generation digital storage and biosensing technologies. However, the multipart architecture of current DNA-based recording techniques renders them inherently slow and incapable of recording fluctuating signals with subhour frequencies. To address this limitation, we developed a simplified system employing a single enzyme, terminal deoxynucleotidyl transferase (TdT), to transduce environmental signals into DNA. TdT adds nucleotides to the 3'-ends of single-stranded DNA (ssDNA) in a template-independent manner, selecting bases according to inherent preferences and environmental conditions. By characterizing TdT nucleotide selectivity under different conditions, we show that TdT can encode various physiologically relevant signals such as Co2+, Ca2+, and Zn2+ concentrations and temperature changes in vitro. Further, by considering the average rate of nucleotide incorporation, we show that the resulting ssDNA functions as a molecular ticker tape. With this method we accurately encode a temporal record of fluctuations in Co2+ concentration to within 1 min over a 60 min period. Finally, we engineer TdT to allosterically turn off in the presence of a physiologically relevant concentration of calcium. We use this engineered TdT in concert with a reference TdT to develop a two-polymerase system capable of recording a single-step change in the Ca2+ signal to within 1 min over a 60 min period. This work expands the repertoire of DNA-based recording techniques by developing a novel DNA synthesis-based system that can record temporal environmental signals into DNA with a resolution of minutes.


Assuntos
DNA Nucleotidilexotransferase
6.
Metab Eng ; 65: 79-87, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-33662575

RESUMO

Enzyme substrate promiscuity has significant implications for metabolic engineering. The ability to predict the space of possible enzymatic side reactions is crucial for elucidating underground metabolic networks in microorganisms, as well as harnessing novel biosynthetic capabilities of enzymes to produce desired chemicals. Reaction rule-based cheminformatics platforms have been implemented to computationally enumerate possible promiscuous reactions, relying on existing knowledge of enzymatic transformations to inform novel reactions. However, past versions of curated reaction rules have been limited by a lack of comprehensiveness in representing all possible transformations, as well as the need to prune rules to enhance computational efficiency in pathway expansion. To this end, we curated a set of 1224 most generalized reaction rules, automatically abstracted from atom-mapped MetaCyc reactions and verified to uniquely cover all common enzymatic transformations. We developed a framework to systematically identify and correct redundancies and errors in the curation process, resulting in a minimal, yet comprehensive, rule set. These reaction rules were capable of reproducing more than 85% of all reactions in the KEGG and BRENDA databases, for which a large fraction of reactions is not present in MetaCyc. Our rules exceed all previously published rule sets for which reproduction was possible in this coverage analysis, which allows for the exploration of a larger space of known enzymatic transformations. By leveraging the entire knowledge of possible metabolic reactions through generalized enzymatic reaction rules, we are able to better utilize underground metabolic pathways and accelerate novel biosynthetic pathway design to enable bioproduction towards a wider range of new molecules.


Assuntos
Vias Biossintéticas , Redes e Vias Metabólicas , Vias Biossintéticas/genética , Bases de Dados Factuais , Engenharia Metabólica , Redes e Vias Metabólicas/genética
7.
PLoS Comput Biol ; 15(11): e1007424, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31682600

RESUMO

Modern biological tools generate a wealth of data on metabolite and protein concentrations that can be used to help inform new strain designs. However, learning from these data to predict how a cell will respond to genetic changes, a key need for engineering, remains challenging. A promising technique for leveraging omics measurements in metabolic modeling involves the construction of kinetic descriptions of the enzymatic reactions that occur within a cell. Parameterizing these models from biological data can be computationally difficult, since methods must also quantify the uncertainty in model parameters resulting from the observed data. While the field of Bayesian inference offers a wide range of methods for efficiently estimating distributions in parameter uncertainty, such techniques are poorly suited to traditional kinetic models due to their complex rate laws and resulting nonlinear dynamics. In this paper, we employ linear-logarithmic kinetics to simplify the calculation of steady-state flux distributions and enable efficient sampling and inference methods. We demonstrate that detailed information on the posterior distribution of parameters can be obtained efficiently at a variety of problem scales, including nearly genome-scale kinetic models trained on multiomics datasets. These results allow modern Bayesian machine learning tools to be leveraged in understanding biological data and in developing new, efficient strain designs.


Assuntos
Enzimas/metabolismo , Metabolismo/fisiologia , Metabolômica/métodos , Algoritmos , Teorema de Bayes , Genômica/métodos , Cinética , Aprendizado de Máquina , Engenharia Metabólica/estatística & dados numéricos , Modelos Biológicos
8.
Nucleic Acids Res ; 46(13): e78, 2018 07 27.
Artigo em Inglês | MEDLINE | ID: mdl-29718339

RESUMO

DNA polymerase fidelity is affected by both intrinsic properties and environmental conditions. Current strategies for measuring DNA polymerase error rate in vitro are constrained by low error subtype sensitivity, poor scalability, and lack of flexibility in types of sequence contexts that can be tested. We have developed the Magnification via Nucleotide Imbalance Fidelity (MagNIFi) assay, a scalable next-generation sequencing assay that uses a biased deoxynucleotide pool to quantitatively shift error rates into a range where errors are frequent and hence measurement is robust, while still allowing for accurate mapping to error rates under typical conditions. This assay is compatible with a wide range of fidelity-modulating conditions, and enables high-throughput analysis of sequence context effects on base substitution and single nucleotide deletion fidelity using a built-in template library. We validate this assay by comparing to previously established fidelity metrics, and use it to investigate neighboring sequence-mediated effects on fidelity for several DNA polymerases. Through these demonstrations, we establish the MagNIFi assay for robust, high-throughput analysis of DNA polymerase fidelity.


Assuntos
DNA Polimerase Dirigida por DNA/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Desoxirribonucleotídeos/metabolismo
9.
Nat Methods ; 13(11): 928-930, 2016 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-27723752

RESUMO

Deep mutational scanning is a foundational tool for addressing the functional consequences of large numbers of mutants, but a more efficient and accessible method for construction of user-defined mutagenesis libraries is needed. Here we present nicking mutagenesis, a robust, single-day, one-pot saturation mutagenesis method performed on routinely prepped plasmid dsDNA. The method can be used to produce comprehensive or single- or multi-site saturation mutagenesis libraries.


Assuntos
DNA/genética , Mutagênese Sítio-Dirigida/métodos , Plasmídeos/genética , Amidoidrolases/genética , Quebras de DNA de Cadeia Simples , Enzimas de Restrição do DNA/genética , Escherichia coli/enzimologia , Escherichia coli/genética , Biblioteca Gênica , Genes Bacterianos , Mutação , Pseudomonas aeruginosa/enzimologia , Pseudomonas aeruginosa/genética , Análise de Sequência de DNA , beta-Lactamases/genética
10.
Appl Microbiol Biotechnol ; 103(23-24): 9697-9709, 2019 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-31686141

RESUMO

Directed evolution is frequently applied to identify genetic variants with improvements in a single or multiple properties. When used to improve multiple properties simultaneously, a common strategy is to apply alternating rounds of selection criteria to enrich for variants with each desirable trait. In particular, counterselection, or selection against undesired traits rather than for desired ones, has been successfully employed in many studies. Although the sequence and stringency of alternating selective pressures for different traits are known to be highly consequential for the outcome of the screen, the effects of these parameters have not been systematically evaluated. We developed a method for producing a statistical modeling framework to elucidate these effects. The model uses single-cell fluorescence intensity distributions to estimate the proportions of phenotypic populations within a library and then predicts the changes in these proportions depending on specified positive selective or counterselective pressures. We validated the approach using recently described systems for metabolite-responsive bacterial transcription factors and yeast G-protein-coupled receptors. Finally, we applied the model to identify biological sources that exert undesirable selective pressure on libraries during sorting. Notably, these pressures produce substantial artifacts that, if unaddressed, can lead to failure of the screen. This method for model generation can be applied to FACS-based directed evolution experiments to create a quantitative framework that identifies subtle population effects. Such models can guide the choice of experimental design parameters to better enrich for true positive genetic variants and improve the chance of successful directed evolution.


Assuntos
Técnicas Biossensoriais , Evolução Molecular Direcionada/métodos , Leveduras/genética , Citometria de Fluxo , Biblioteca Gênica , Modelos Estatísticos , Fenótipo
11.
Bioinformatics ; 33(6): 909-916, 2017 03 15.
Artigo em Inglês | MEDLINE | ID: mdl-27998936

RESUMO

Motivation: High throughput screening by fluorescence activated cell sorting (FACS) is a common task in protein engineering and directed evolution. It can also be a rate-limiting step if high false positive or negative rates necessitate multiple rounds of enrichment. Current FACS software requires the user to define sorting gates by intuition and is practically limited to two dimensions. In cases when multiple rounds of enrichment are required, the software cannot forecast the enrichment effort required. Results: We have developed CellSort, a support vector machine (SVM) algorithm that identifies optimal sorting gates based on machine learning using positive and negative control populations. CellSort can take advantage of more than two dimensions to enhance the ability to distinguish between populations. We also present a Bayesian approach to predict the number of sorting rounds required to enrich a population from a given library size. This Bayesian approach allowed us to determine strategies for biasing the sorting gates in order to reduce the required number of enrichment rounds. This algorithm should be generally useful for improve sorting outcomes and reducing effort when using FACS. Availability and Implementation: Source code available at http://tyolab.northwestern.edu/tools/ . k-tyo@northwestern.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Separação Celular/métodos , Citometria de Fluxo/métodos , Software , Máquina de Vetores de Suporte , Algoritmos , Teorema de Bayes , Leveduras
12.
PLoS Comput Biol ; 13(5): e1005483, 2017 05.
Artigo em Inglês | MEDLINE | ID: mdl-28459860

RESUMO

Using a DNA polymerase to record intracellular calcium levels has been proposed as a novel neural recording technique, promising massive-scale, single-cell resolution monitoring of large portions of the brain. This technique relies on local storage of neural activity in strands of DNA, followed by offline analysis of that DNA. In simple implementations of this scheme, the time when each nucleotide was written cannot be determined directly by post-hoc DNA sequencing; the timing data must be estimated instead. Here, we use a Dynamic Time Warping-based algorithm to perform this estimation, exploiting correlations between neural activity and observed experimental variables to translate DNA-based signals to an estimate of neural activity over time. This algorithm improves the parallelizability of traditional Dynamic Time Warping, allowing several-fold increases in computation speed. The algorithm also provides a solution to several critical problems with the molecular recording paradigm: determining recording start times and coping with DNA polymerase pausing. The algorithm can generally locate DNA-based records to within <10% of a recording window, allowing for the estimation of unobserved incorporation times and latent neural tunings. We apply our technique to an in silico motor control neuroscience experiment, using the algorithm to estimate both timings of DNA-based data and the directional tuning of motor cortical cells during a center-out reaching task. We also use this algorithm to explore the impact of polymerase characteristics on system performance, determining the precision of a molecular recorder as a function of its kinetic and error-generating properties. We find useful ranges of properties for DNA polymerase-based recorders, providing guidance for future protein engineering attempts. This work demonstrates a useful general extension to dynamic alignment algorithms, as well as direct applications of that extension toward the development of molecular recorders, providing a necessary stepping stone for future biological work.


Assuntos
Algoritmos , Biologia Computacional/métodos , DNA , Nucleotídeos , Cálcio/análise , Cálcio/metabolismo , Simulação por Computador , DNA/análise , DNA/química , DNA/metabolismo , DNA Polimerase Dirigida por DNA/metabolismo , Modelos Biológicos , Neurônios/metabolismo , Neurociências , Nucleotídeos/análise , Nucleotídeos/metabolismo , Análise de Célula Única , Fatores de Tempo
13.
J Infect Dis ; 215(suppl_1): S37-S43, 2017 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-28375518

RESUMO

Klebsiella pneumoniae has a reputation for causing a wide range of infectious conditions, with numerous highly virulent and antibiotic-resistant strains. Metabolic models have the potential to provide insights into the growth behavior, nutrient requirements, essential genes, and candidate drug targets in these strains. Here we develop a metabolic model for KPPR1, a highly virulent strain of K. pneumoniae. We apply a combination of Biolog phenotype data and fitness data to validate and refine our KPPR1 model. The final model displays a predictive accuracy of 75% in identifying potential carbon and nitrogen sources for K. pneumoniae and of 99% in predicting nonessential genes in rich media. We demonstrate how this model is useful in studying the differences in the metabolic capabilities of the low-virulence MGH 78578 strain and the highly virulent KPPR1 strain. For example, we demonstrate that these strains differ in carbohydrate metabolism, including the ability to metabolize dulcitol as a primary carbon source. Our model makes numerous other predictions for follow-up verification and analysis.


Assuntos
Genoma Bacteriano , Klebsiella pneumoniae/metabolismo , Modelos Biológicos , Metabolismo dos Carboidratos , Meios de Cultura , Farmacorresistência Bacteriana Múltipla/genética , Genes Essenciais , Klebsiella pneumoniae/efeitos dos fármacos , Klebsiella pneumoniae/genética , Fenótipo , Análise de Sequência de DNA
14.
Biophys J ; 113(5): 1150-1162, 2017 Sep 05.
Artigo em Inglês | MEDLINE | ID: mdl-28877496

RESUMO

Developing reliable, predictive kinetic models of metabolism is a difficult, yet necessary, priority toward understanding and deliberately altering cellular behavior. Constraint-based modeling has enabled the fields of metabolic engineering and systems biology to make great strides in interrogating cellular metabolism but does not provide sufficient insight into regulation or kinetic limitations of metabolic pathways. Moreover, the growth-optimized assumptions that constraint-based models often rely on do not hold when studying stationary or persistor cell populations. However, developing kinetic models provides many unique challenges, as many of the kinetic parameters and rate laws governing individual enzymes are unknown. Ensemble modeling (EM) was developed to circumnavigate this challenge and effectively sample the large kinetic parameter solution space using consistent experimental datasets. Unfortunately, EM, in its base form, requires long solve times to complete and often leads to unstable kinetic model predictions. Furthermore, these limitations scale prohibitively with increasing model size. As larger metabolic models are developed with increasing genetic information and experimental validation, the demand to incorporate kinetic information increases. Therefore, in this work, we have begun to tackle the challenges of EM by introducing additional steps to the existing method framework specifically through reducing computation time and optimizing parameter sampling. We first reduce the structural complexity of the network by removing dependent species, and second, we sample locally stable parameter sets to reflect realistic biological states of cells. Lastly, we presort the screening data to eliminate the most incorrect predictions in the earliest screening stages, saving further calculations in later stages. Our complementary improvements to this EM framework are easily incorporated into concurrent EM efforts and broaden the application opportunities and accessibility of kinetic modeling across the field.


Assuntos
Fenômenos Fisiológicos Celulares , Metabolismo Energético , Modelos Biológicos , Escherichia coli , Cinética
15.
Metab Eng ; 44: 171-181, 2017 11.
Artigo em Inglês | MEDLINE | ID: mdl-29030274

RESUMO

Enzymatic substrate promiscuity is more ubiquitous than previously thought, with significant consequences for understanding metabolism and its application to biocatalysis. This realization has given rise to the need for efficient characterization of enzyme promiscuity. Enzyme promiscuity is currently characterized with a limited number of human-selected compounds that may not be representative of the enzyme's versatility. While testing large numbers of compounds may be impractical, computational approaches can exploit existing data to determine the most informative substrates to test next, thereby more thoroughly exploring an enzyme's versatility. To demonstrate this, we used existing studies and tested compounds for four different enzymes, developed support vector machine (SVM) models using these datasets, and selected additional compounds for experiments using an active learning approach. SVMs trained on a chemically diverse set of compounds were discovered to achieve maximum accuracies of ~80% using ~33% fewer compounds than datasets based on all compounds tested in existing studies. Active learning-selected compounds for testing resolved apparent conflicts in the existing training data, while adding diversity to the dataset. The application of these algorithms to wide arrays of metabolic enzymes would result in a library of SVMs that can predict high-probability promiscuous enzymatic reactions and could prove a valuable resource for the design of novel metabolic pathways.


Assuntos
Escherichia coli , Metaboloma , Modelos Biológicos , Máquina de Vetores de Suporte , Escherichia coli/genética , Escherichia coli/metabolismo
16.
Appl Environ Microbiol ; 83(18)2017 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-28710267

RESUMO

The present study aimed to increase the processivity of Sulfolobus solfataricus DNA polymerase Dpo4. Protein engineering and bioinformatics were used to compile a library of potential Dpo4 mutation sites. Ten potential mutants were identified and constructed. A primer extension assay was used to evaluate the processivity of Dpo4 mutants. Thumb (A181D) and finger (E63K) domain mutants showed a processivity of 20 and 19 nucleotides (nt), respectively. A little finger domain mutant (I248Y) exhibited a processivity of 17 nt, only 1 nt more than wild-type Dpo4. Furthermore, the A181D mutant showed lower fidelity and higher nucleotide incorporation efficiency (4.74 × 10-4 s-1 µM-1) than E63K and I248Y mutants. When tasked with bypassing damage, the A181D mutant exhibited a 3.81-fold and 2.62-fold higher catalytic efficiency (kcat/Km ) at incorporating dCTP and dATP, respectively, than wild-type Dpo4. It also showed a 55% and 91.5% higher catalytic efficiency when moving beyond the damaged 8-oxoG:C and 8-oxoG:A base pairs, respectively, compared to wild-type Dpo4. Protein engineering and bioinformatics methods can effectively increase the processivity and translesion synthesis ability of Dpo4.IMPORTANCE DNA polymerases with poor fidelity can be exploited to store data and record changes in response to the intracellular environment. Sulfolobus solfataricus Dpo4 is such an enzyme, although its use is hindered by its low processivity. In this work, we used a bioinformatics and protein engineering approach to generate Dpo4 mutants with improved processivity. We identified the Dpo4 thumb domain as the most relevant in controlling processivity.


Assuntos
Proteínas Arqueais/genética , Proteínas Arqueais/metabolismo , DNA Polimerase Dirigida por DNA/genética , DNA Polimerase Dirigida por DNA/metabolismo , Nucleotídeos/metabolismo , Sulfolobus solfataricus/enzimologia , Proteínas Arqueais/química , Replicação do DNA , DNA Arqueal/genética , DNA Arqueal/metabolismo , DNA Polimerase Dirigida por DNA/química , Cinética , Mutação , Domínios Proteicos , Engenharia de Proteínas , Sulfolobus solfataricus/química , Sulfolobus solfataricus/genética , Sulfolobus solfataricus/metabolismo
17.
Bioinformatics ; 31(7): 1016-24, 2015 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-25417203

RESUMO

MOTIVATION: The urgent need for efficient and sustainable biological production of fuels and high-value chemicals has elicited a wave of in silico techniques for identifying promising novel pathways to these compounds in large putative metabolic networks. To date, these approaches have primarily used general graph search algorithms, which are prohibitively slow as putative metabolic networks may exceed 1 million compounds. To alleviate this limitation, we report two methods--SimIndex (SI) and SimZyme--which use chemical similarity of 2D chemical fingerprints to efficiently navigate large metabolic networks and propose enzymatic connections between the constituent nodes. We also report a Byers-Waterman type pathway search algorithm for further paring down pertinent networks. RESULTS: Benchmarking tests run with SI show it can reduce the number of nodes visited in searching a putative network by 100-fold with a computational time improvement of up to 10(5)-fold. Subsequent Byers-Waterman search application further reduces the number of nodes searched by up to 100-fold, while SimZyme demonstrates ∼ 90% accuracy in matching query substrates with enzymes. Using these modules, we have designed and annotated an alternative to the methylerythritol phosphate pathway to produce isopentenyl pyrophosphate with more favorable thermodynamics than the native pathway. These algorithms will have a significant impact on our ability to use large metabolic networks that lack annotation of promiscuous reactions. AVAILABILITY AND IMPLEMENTATION: Python files will be available for download at http://tyolab.northwestern.edu/tools/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Biologia Computacional/métodos , Hemiterpenos/metabolismo , Redes e Vias Metabólicas , Metabolômica/métodos , Compostos Organofosforados/metabolismo , Preparações Farmacêuticas/química , Software , Bases de Dados de Compostos Químicos , Anotação de Sequência Molecular
18.
Metab Eng ; 33: 138-147, 2016 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-26655066

RESUMO

There have been many achievements in applying biochemical synthetic routes to the synthesis of commodity chemicals. However, most of these endeavors have focused on optimizing and increasing the yields of naturally existing pathways. We sought to evaluate the potential for biosynthesis beyond the limits of known biochemistry towards the production of small molecule drugs that do not exist in nature. Because of the potential for improved yields compared to total synthesis, and therefore lower manufacturing costs, we focused on drugs for diseases endemic to many resource poor regions, like tuberculosis and HIV. Using generalized biochemical reaction rules, we were able to design biochemical pathways for the production of eight small molecule drugs or drug precursors and identify potential enzyme-substrate pairs for nearly every predicted reaction. All pathways begin from native metabolites, abrogating the need for specialized precursors. The simulated pathways showed several trends with the sequential ordering of reactions as well as the types of chemistries used. For some compounds, the main obstacles to finding feasible biochemical pathways were the lack of appropriate, natural starting compounds and a low diversity of biochemical coupling reactions necessary to synthesize molecules with larger molecular size.


Assuntos
Escherichia coli/metabolismo , Análise do Fluxo Metabólico/métodos , Modelos Biológicos , Complexos Multienzimáticos/metabolismo , Peptídeos/metabolismo , Transdução de Sinais/fisiologia , Vias Biossintéticas/fisiologia , Simulação por Computador , Escherichia coli/genética , Complexos Multienzimáticos/genética , Peptídeos/genética , Preparações Farmacêuticas , Software
19.
Biotechnol Bioeng ; 113(5): 944-52, 2016 May.
Artigo em Inglês | MEDLINE | ID: mdl-26479709

RESUMO

Chemicals with aldehyde moieties are useful in the synthesis of polymerization reagents, pharmaceuticals, pesticides, flavors, and fragrances because of their high reactivity. However, chemical synthesis of aldehydes from carboxylic acids has unfavorable thermodynamics and limited specificity. Enzymatically catalyzed reductive bioaldehyde synthesis is an attractive route that overcomes unfavorable thermodynamics by ATP hydrolysis in ambient, aqueous conditions. Carboxylic acid reductases (Cars) are particularly attractive, as only one enzyme is required. We sought to increase the knowledge base of permitted substrates for four Cars. Additionally, the Lys2 enzyme family was found to be mechanistically the same as Cars and two isozymes were also tested. Our results show that Cars prefer molecules where the carboxylic acid is the only polar/charged group. Using this data and other published data, we develop a support vector classifier (SVC) for predicting Car reactivity and make predictions on all carboxylic acid metabolites in iAF1260 and Model SEED.


Assuntos
Aldeídos/metabolismo , Ácidos Carboxílicos/metabolismo , Mycobacterium/enzimologia , Nocardia/enzimologia , Oxirredutases/metabolismo , Simulação por Computador , Microbiologia Industrial/métodos , Modelos Biológicos , NADP/metabolismo , Oxirredução , Especificidade por Substrato , Máquina de Vetores de Suporte , Termodinâmica
20.
Metab Eng ; 28: 180-189, 2015 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-25598516

RESUMO

Poly-3-hydroxybutyrate (PHB) synthesis in Escherichia coli elicits regulatory responses that affect product yield and productivity. We used controlled, steady-state cultures (chemostats) of a genetically stable strain to determine growth-independent metabolic flux regulation. We measured flux and steady-state intracellular metabolite concentrations across different dilution rates (0.05, 0.15, 0.3h(-1)), limitations (glucose, gluconate and nitrogen), and operon copy counts of the PHB pathway (0, 6, 17, and 29). As PHB flux increases, specific substrate consumption and lactate secretion increase while formate and acetate secretion decreases in N-limited, glucose-fed conditions. To understand the regulatory mechanisms that resulted in these macroscopic changes, we used a flux balance analysis model to analyze intracellular redox conditions. Our model shows that under N-limited conditions, synthesis of PHB creates excess reducing equivalents. Cells, under these conditions, secrete more reduced metabolites in order to recycle reducing equivalents. By switching to a more oxidized substrate (gluconate) that decreased excess reducing equivalents, PHB flux yield increased 1.6 fold compared to glucose-fed fermentations. High flux of PHB (~1.2 mmol/g DCWh) was maintained under these steady-state, oxidized conditions. These results imply redox imbalance is a driving force in industrial production of PHB, and substrates that are more oxidized than glucose can increase productivity.


Assuntos
Carbono/metabolismo , Escherichia coli/crescimento & desenvolvimento , Glucose/metabolismo , Hidroxibutiratos/metabolismo , Poliésteres/metabolismo , Escherichia coli/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA