Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters










Database
Language
Publication year range
1.
Bioinformatics ; 39(8)2023 08 01.
Article in English | MEDLINE | ID: mdl-37589572

ABSTRACT

MOTIVATION: The importance and rate of development of genome-scale metabolic models have been growing for the last few years, increasing the demand for software solutions that automate several steps of this process. However, since TRIAGE's release, software development for the automatic integration of transport reactions into models has stalled. RESULTS: Here, we present the Transport Systems Tracker (TranSyT). Unlike other transport systems annotation software, TranSyT does not rely on manual curation to expand its internal database, which is derived from highly curated records retrieved from the Transporters Classification Database and complemented with information from other data sources. TranSyT compiles information regarding transporter families and proteins, and derives reactions into its internal database, making it available for rapid annotation of complete genomes. All transport reactions have GPR associations and can be exported with identifiers from four different metabolite databases. TranSyT is currently available as a plugin for merlin v4.0 and an app for KBase. AVAILABILITY AND IMPLEMENTATION: TranSyT web service: https://transyt.bio.di.uminho.pt/; GitHub for the tool: https://github.com/BioSystemsUM/transyt; GitHub with examples and instructions to run TranSyT: https://github.com/ecunha1996/transyt_paper.


Subject(s)
Software , Databases, Factual
2.
bioRxiv ; 2023 Jul 26.
Article in English | MEDLINE | ID: mdl-37502915

ABSTRACT

Predicting elemental cycles and maintaining water quality under increasing anthropogenic influence requires understanding the spatial drivers of river microbiomes. However, the unifying microbial processes governing river biogeochemistry are hindered by a lack of genome-resolved functional insights and sampling across multiple rivers. Here we employed a community science effort to accelerate the sampling, sequencing, and genome-resolved analyses of river microbiomes to create the Genome Resolved Open Watersheds database (GROWdb). This resource profiled the identity, distribution, function, and expression of thousands of microbial genomes across rivers covering 90% of United States watersheds. Specifically, GROWdb encompasses 1,469 microbial species from 27 phyla, including novel lineages from 10 families and 128 genera, and defines the core river microbiome for the first time at genome level. GROWdb analyses coupled to extensive geospatial information revealed local and regional drivers of microbial community structuring, while also presenting a myriad of foundational hypotheses about ecosystem function. Building upon the previously conceived River Continuum Concept 1 , we layer on microbial functional trait expression, which suggests the structure and function of river microbiomes is predictable. We make GROWdb available through various collaborative cyberinfrastructures 2, 3 so that it can be widely accessed across disciplines for watershed predictive modeling and microbiome-based management practices.

3.
Metab Eng ; 69: 302-312, 2022 01.
Article in English | MEDLINE | ID: mdl-34958914

ABSTRACT

Spontaneous reactions between metabolites are often neglected in favor of emphasizing enzyme-catalyzed chemistry because spontaneous reaction rates are assumed to be insignificant under physiological conditions. However, synthetic biology and engineering efforts can raise natural metabolites' levels or introduce unnatural ones, so that previously innocuous or nonexistent spontaneous reactions become an issue. Problems arise when spontaneous reaction rates exceed the capacity of a platform organism to dispose of toxic or chemically active reaction products. While various reliable sources list competing or toxic enzymatic pathways' side-reactions, no corresponding compilation of spontaneous side-reactions exists, nor is it possible to predict their occurrence. We addressed this deficiency by creating the Chemical Damage (CD)-MINE resource. First, we used literature data to construct a comprehensive database of metabolite reactions that occur spontaneously in physiological conditions. We then leveraged this data to construct 148 reaction rules describing the known spontaneous chemistry in a substrate-generic way. We applied these rules to all compounds in the ModelSEED database, predicting 180,891 spontaneous reactions. The resulting (CD)-MINE is available at https://minedatabase.mcs.anl.gov/cdmine/#/home and through developer tools. We also demonstrate how damage-prone intermediates and end products are widely distributed among metabolic pathways, and how predicting spontaneous chemical damage helps rationalize toxicity and carbon loss using examples from published pathways to commercial products. We explain how analyzing damage-prone areas in metabolism helps design effective engineering strategies. Finally, we use the CD-MINE toolset to predict the formation of the novel damage product N-carbamoyl proline, and present mass spectrometric evidence for its presence in Escherichia coli.


Subject(s)
Metabolic Networks and Pathways , Cell Cycle Proteins , Databases, Factual , Escherichia coli , Metabolic Networks and Pathways/genetics , Metabolome , Synthetic Biology
4.
Nucleic Acids Res ; 49(D1): D575-D588, 2021 01 08.
Article in English | MEDLINE | ID: mdl-32986834

ABSTRACT

For over 10 years, ModelSEED has been a primary resource for the construction of draft genome-scale metabolic models based on annotated microbial or plant genomes. Now being released, the biochemistry database serves as the foundation of biochemical data underlying ModelSEED and KBase. The biochemistry database embodies several properties that, taken together, distinguish it from other published biochemistry resources by: (i) including compartmentalization, transport reactions, charged molecules and proton balancing on reactions; (ii) being extensible by the user community, with all data stored in GitHub; and (iii) design as a biochemical 'Rosetta Stone' to facilitate comparison and integration of annotations from many different tools and databases. The database was constructed by combining chemical data from many resources, applying standard transformations, identifying redundancies and computing thermodynamic properties. The ModelSEED biochemistry is continually tested using flux balance analysis to ensure the biochemical network is modeling-ready and capable of simulating diverse phenotypes. Ontologies can be designed to aid in comparing and reconciling metabolic reconstructions that differ in how they represent various metabolic pathways. ModelSEED now includes 33,978 compounds and 36,645 reactions, available as a set of extensible files on GitHub, and available to search at https://modelseed.org/biochem and KBase.


Subject(s)
Bacteria/metabolism , Databases, Factual , Fungi/metabolism , Metabolic Networks and Pathways , Molecular Sequence Annotation , Plants/metabolism , Bacteria/genetics , Genome, Bacterial , Thermodynamics
8.
Biotechnol Biofuels ; 12: 230, 2019.
Article in English | MEDLINE | ID: mdl-31583016

ABSTRACT

BACKGROUND: One of the European Union directives indicates that 10% of all fuels must be bio-synthesized by 2020. In this regard, biobutanol-natively produced by clostridial strains-poses as a promising alternative biofuel. One possible approach to overcome the difficulties of the industrial exploration of the native producers is the expression of more suitable pathways in robust microorganisms such as Escherichia coli. The enumeration of novel pathways is a powerful tool, allowing to identify non-obvious combinations of enzymes to produce a target compound. RESULTS: This work describes the in silico driven design of E. coli strains able to produce butanol via 2-oxoglutarate by a novel pathway. This butanol pathway was generated by a hypergraph algorithm and selected from an initial set of 105,954 different routes by successively applying different filters, such as stoichiometric feasibility, size and novelty. The implementation of this pathway involved seven catalytic steps and required the insertion of nine heterologous genes from various sources in E. coli distributed in three plasmids. Expressing butanol genes in E. coli K12 and cultivation in High-Density Medium formulation seem to favor butanol accumulation via the 2-oxoglutarate pathway. The maximum butanol titer obtained was 85 ± 1 mg L-1 by cultivating the cells in bioreactors. CONCLUSIONS: In this work, we were able to successfully translate the computational analysis into in vivo applications, designing novel strains of E. coli able to produce n-butanol via an innovative pathway. Our results demonstrate that enumeration algorithms can broad the spectrum of butanol producing pathways. This validation encourages further research to other target compounds.

9.
J Integr Bioinform ; 16(1)2018 Dec 21.
Article in English | MEDLINE | ID: mdl-30808160

ABSTRACT

Metabolism has been a major field of study in the last years, mainly due to its importance in understanding cell physiology and certain disease phenotypes due to its deregulation. Genome-scale metabolic models (GSMMs) have been established as important tools to help achieve a better understanding of human metabolism. Towards this aim, advances in systems biology and bioinformatics have allowed the reconstruction of several human GSMMs, although some limitations and challenges remain, such as the lack of external identifiers for both metabolites and reactions. A pipeline was developed to integrate multiple GSMMs, starting by retrieving information from the main human GSMMs and evaluating the presence of external database identifiers and annotations for both metabolites and reactions. Information from metabolites was included into a graph database with omics data repositories, allowing clustering of metabolites through their similarity regarding database cross-referencing. Metabolite annotation of several older GSMMs was enriched, allowing the identification and integration of common entities. Using this information, as well as other metrics, we successfully integrated reactions from these models. These methods can be leveraged towards the creation of a unified consensus model of human metabolism.


Subject(s)
Computational Biology/methods , Genome, Human , Metabolic Networks and Pathways , Models, Statistical , Databases, Factual , Humans , Molecular Sequence Annotation , Transcription, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL
...