RESUMO
MOTIVATION: The importance and rate of development of genome-scale metabolic models have been growing for the last few years, increasing the demand for software solutions that automate several steps of this process. However, since TRIAGE's release, software development for the automatic integration of transport reactions into models has stalled. RESULTS: Here, we present the Transport Systems Tracker (TranSyT). Unlike other transport systems annotation software, TranSyT does not rely on manual curation to expand its internal database, which is derived from highly curated records retrieved from the Transporters Classification Database and complemented with information from other data sources. TranSyT compiles information regarding transporter families and proteins, and derives reactions into its internal database, making it available for rapid annotation of complete genomes. All transport reactions have GPR associations and can be exported with identifiers from four different metabolite databases. TranSyT is currently available as a plugin for merlin v4.0 and an app for KBase. AVAILABILITY AND IMPLEMENTATION: TranSyT web service: https://transyt.bio.di.uminho.pt/; GitHub for the tool: https://github.com/BioSystemsUM/transyt; GitHub with examples and instructions to run TranSyT: https://github.com/ecunha1996/transyt_paper.
Assuntos
Software , Bases de Dados FactuaisRESUMO
Microbial genome annotation is the process of identifying structural and functional elements in DNA sequences and subsequently attaching biological information to those elements. DRAM is a tool developed to annotate bacterial, archaeal, and viral genomes derived from pure cultures or metagenomes. DRAM goes beyond traditional annotation tools by distilling multiple gene annotations to genome level summaries of functional potential. Despite these benefits, a downside of DRAM is the requirement of large computational resources, which limits its accessibility. Further, it did not integrate with downstream metabolic modeling tools that require genome annotation. To alleviate these constraints, DRAM and the viral counterpart, DRAM-v, are now available and integrated with the freely accessible KBase cyberinfrastructure. With kb_DRAM users can generate DRAM annotations and functional summaries from microbial or viral genomes in a point-and-click interface, as well as generate genome-scale metabolic models from DRAM annotations. AVAILABILITY AND IMPLEMENTATION: For kb_DRAM users, the kb_DRAM apps on KBase can be found in the catalog at https://narrative.kbase.us/#catalog/modules/kb_DRAM. For kb_DRAM users, a tutorial workflow with all documentation is available at https://narrative.kbase.us/narrative/129480. For kb_DRAM developers, software is available at https://github.com/shafferm/kb_DRAM.
Assuntos
Bactérias , Software , Anotação de Sequência Molecular , Bactérias/genética , Archaea/genética , MetabolômicaRESUMO
Metabolic engineering uses enzymes as parts to build biosystems for specified tasks. Although a part's working life and failure modes are key engineering performance indicators, this is not yet so in metabolic engineering because it is not known how long enzymes remain functional in vivo or whether cumulative deterioration (wear-out), sudden random failure, or other causes drive replacement. Consequently, enzymes cannot be engineered to extend life and cut the high energy costs of replacement. Guided by catalyst engineering, we adopted catalytic cycles until replacement (CCR) as a metric for enzyme functional life span in vivo. CCR is the number of catalytic cycles that an enzyme mediates in vivo before failure or replacement, i.e., metabolic flux rate/protein turnover rate. We used estimated fluxes and measured protein turnover rates to calculate CCRs for â¼100-200 enzymes each from Lactococcus lactis, yeast, and Arabidopsis CCRs in these organisms had similar ranges (<103 to >107) but different median values (3-4 × 104 in L. lactis and yeast versus 4 × 105 in Arabidopsis). In all organisms, enzymes whose substrates, products, or mechanisms can attack reactive amino acid residues had significantly lower median CCR values than other enzymes. Taken with literature on mechanism-based inactivation, the latter finding supports the proposal that 1) random active-site damage by reaction chemistry is an important cause of enzyme failure, and 2) reactive noncatalytic residues in the active-site region are likely contributors to damage susceptibility. Enzyme engineering to raise CCRs and lower replacement costs may thus be both beneficial and feasible.
Assuntos
Arabidopsis/enzimologia , Biocatálise , Enzimas/química , Lactococcus lactis/enzimologia , Engenharia Metabólica , Saccharomyces cerevisiae/enzimologiaRESUMO
MOTIVATION: Nutrient and contaminant behavior in the subsurface are governed by multiple coupled hydrobiogeochemical processes which occur across different temporal and spatial scales. Accurate description of macroscopic system behavior requires accounting for the effects of microscopic and especially microbial processes. Microbial processes mediate precipitation and dissolution and change aqueous geochemistry, all of which impacts macroscopic system behavior. As 'omics data describing microbial processes is increasingly affordable and available, novel methods for using this data quickly and effectively for improved ecosystem models are needed. RESULTS: We propose a workflow ('Omics to Reactive Transport-ORT) for utilizing metagenomic and environmental data to describe the effect of microbiological processes in macroscopic reactive transport models. This workflow utilizes and couples two open-source software packages: KBase (a software platform for systems biology) and PFLOTRAN (a reactive transport modeling code). We describe the architecture of ORT and demonstrate an implementation using metagenomic and geochemical data from a river system. Our demonstration uses microbiological drivers of nitrification and denitrification to predict nitrogen cycling patterns which agree with those provided with generalized stoichiometries. While our example uses data from a single measurement, our workflow can be applied to spatiotemporal metagenomic datasets to allow for iterative coupling between KBase and PFLOTRAN. AVAILABILITY AND IMPLEMENTATION: Interactive models available at https://pflotranmodeling.paf.subsurfaceinsights.com/pflotran-simple-model/. Microbiological data available at NCBI via BioProject ID PRJNA576070. ORT Python code available at https://github.com/subsurfaceinsights/ort-kbase-to-pflotran. KBase narrative available at https://narrative.kbase.us/narrative/71260 or static narrative (no login required) at https://kbase.us/n/71260/258. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Ecossistema , Software , Fluxo de Trabalho , Metagenômica , Biologia de SistemasRESUMO
For over 10 years, ModelSEED has been a primary resource for the construction of draft genome-scale metabolic models based on annotated microbial or plant genomes. Now being released, the biochemistry database serves as the foundation of biochemical data underlying ModelSEED and KBase. The biochemistry database embodies several properties that, taken together, distinguish it from other published biochemistry resources by: (i) including compartmentalization, transport reactions, charged molecules and proton balancing on reactions; (ii) being extensible by the user community, with all data stored in GitHub; and (iii) design as a biochemical 'Rosetta Stone' to facilitate comparison and integration of annotations from many different tools and databases. The database was constructed by combining chemical data from many resources, applying standard transformations, identifying redundancies and computing thermodynamic properties. The ModelSEED biochemistry is continually tested using flux balance analysis to ensure the biochemical network is modeling-ready and capable of simulating diverse phenotypes. Ontologies can be designed to aid in comparing and reconciling metabolic reconstructions that differ in how they represent various metabolic pathways. ModelSEED now includes 33,978 compounds and 36,645 reactions, available as a set of extensible files on GitHub, and available to search at https://modelseed.org/biochem and KBase.
Assuntos
Bactérias/metabolismo , Bases de Dados Factuais , Fungos/metabolismo , Redes e Vias Metabólicas , Anotação de Sequência Molecular , Plantas/metabolismo , Bactérias/genética , Genoma Bacteriano , TermodinâmicaRESUMO
Spontaneous reactions between metabolites are often neglected in favor of emphasizing enzyme-catalyzed chemistry because spontaneous reaction rates are assumed to be insignificant under physiological conditions. However, synthetic biology and engineering efforts can raise natural metabolites' levels or introduce unnatural ones, so that previously innocuous or nonexistent spontaneous reactions become an issue. Problems arise when spontaneous reaction rates exceed the capacity of a platform organism to dispose of toxic or chemically active reaction products. While various reliable sources list competing or toxic enzymatic pathways' side-reactions, no corresponding compilation of spontaneous side-reactions exists, nor is it possible to predict their occurrence. We addressed this deficiency by creating the Chemical Damage (CD)-MINE resource. First, we used literature data to construct a comprehensive database of metabolite reactions that occur spontaneously in physiological conditions. We then leveraged this data to construct 148 reaction rules describing the known spontaneous chemistry in a substrate-generic way. We applied these rules to all compounds in the ModelSEED database, predicting 180,891 spontaneous reactions. The resulting (CD)-MINE is available at https://minedatabase.mcs.anl.gov/cdmine/#/home and through developer tools. We also demonstrate how damage-prone intermediates and end products are widely distributed among metabolic pathways, and how predicting spontaneous chemical damage helps rationalize toxicity and carbon loss using examples from published pathways to commercial products. We explain how analyzing damage-prone areas in metabolism helps design effective engineering strategies. Finally, we use the CD-MINE toolset to predict the formation of the novel damage product N-carbamoyl proline, and present mass spectrometric evidence for its presence in Escherichia coli.
Assuntos
Redes e Vias Metabólicas , Proteínas de Ciclo Celular , Bases de Dados Factuais , Escherichia coli , Redes e Vias Metabólicas/genética , Metaboloma , Biologia SintéticaRESUMO
Enzymes have in vivo life spans. Analysis of life spans, i.e., lifetime totals of catalytic turnovers, suggests that nonsurvivable collateral chemical damage from the very reactions that enzymes catalyze is a common but underdiagnosed cause of enzyme death. Analysis also implies that many enzymes are moderately deficient in that their active-site regions are not naturally as hardened against such collateral damage as they could be, leaving room for improvement by rational design or directed evolution. Enzyme life span might also be improved by engineering systems that repair otherwise fatal active-site damage, of which a handful are known and more are inferred to exist. Unfortunately, the data needed to design and execute such improvements are lacking: there are too few measurements of in vivo life span, and existing information about the extent, nature, and mechanisms of active-site damage and repair during normal enzyme operation is too scarce, anecdotal, and speculative to act on. Fortunately, advances in proteomics, metabolomics, cheminformatics, comparative genomics, and structural biochemistry now empower a systematic, data-driven approach for identifying, predicting, and validating instances of active-site damage and its repair. These capabilities would be practically useful in enzyme redesign and improvement of in-use stability and could change our thinking about which enzymes die young in vivo, and why.
Assuntos
Biocatálise , Estabilidade Enzimática , Domínio Catalítico , Biologia de SistemasRESUMO
For many decades comparative analyses of protein sequences and structures have been used to investigate fundamental principles of molecular evolution. In contrast, relatively little is known about the long-term evolution of species' phenotypic and genetic properties. This represents an important gap in our understanding of evolution, as exactly these proprieties play key roles in natural selection and adaptation to diverse environments. Here we perform a comparative analysis of bacterial growth and gene deletion phenotypes using hundreds of genome-scale metabolic models. Overall, bacterial phenotypic evolution can be described by a two-stage process with a rapid initial phenotypic diversification followed by a slow long-term exponential divergence. The observed average divergence trend, with approximately similar fractions of phenotypic properties changing per unit time, continues for billions of years. We experimentally confirm the predicted divergence trend using the phenotypic profiles of 40 diverse bacterial species across more than 60 growth conditions. Our analysis suggests that, at long evolutionary distances, gene essentiality is significantly more conserved than the ability to utilize different nutrients, while synthetic lethality is significantly less conserved. We also find that although a rapid phenotypic evolution is sometimes observed within the same species, a transition from high to low phenotypic similarity occurs primarily at the genus level.
Assuntos
Bactérias/classificação , Bactérias/metabolismo , Evolução Biológica , Fenótipo , Bactérias/genética , Bactérias/crescimento & desenvolvimento , Deleção de Genes , Genoma Bacteriano/genética , Seleção GenéticaRESUMO
Genome-scale metabolic reconstructions help us to understand and engineer metabolism. Next-generation sequencing technologies are delivering genomes and transcriptomes for an ever-widening range of plants. While such omic data can, in principle, be used to compare metabolic reconstructions in different species, organs and environmental conditions, these comparisons require a standardized framework for the reconstruction of metabolic networks from transcript data. We previously introduced PlantSEED as a framework covering primary metabolism for 10 species. We have now expanded PlantSEED to include 39 species and provide tools that enable automated annotation and metabolic reconstruction from transcriptome data. The algorithm for automated annotation in PlantSEED propagates annotations using a set of signature k-mers (short amino acid sequences characteristic of particular proteins) that identify metabolic enzymes with an accuracy of about 97%. PlantSEED reconstructions are built from a curated template that includes consistent compartmentalization for more than 100 primary metabolic subsystems. Together, the annotation and reconstruction algorithms produce reconstructions without gaps and with more accurate compartmentalization than existing resources. These tools are available via the PlantSEED web interface at http://modelseed.org, which enables users to upload, annotate and reconstruct from private transcript data and simulate metabolic activity under various conditions using flux balance analysis. We demonstrate the ability to compare these metabolic reconstructions with a case study involving growth on several nitrogen sources in roots of four species.
Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Metabolômica/métodos , Plantas/metabolismo , Algoritmos , Genoma de Planta/genética , Sequenciamento de Nucleotídeos em Larga Escala , Redes e Vias Metabólicas , Plantas/genética , TranscriptomaRESUMO
The pantothenate (vitamin B5) synthesis pathway in plants is not fully defined because the subcellular site of its ketopantoate â pantoate reduction step is unclear. However, the pathway is known to be split between cytosol, mitochondria, and potentially plastids, and inferred to involve mitochondrial or plastidial transport of ketopantoate or pantoate. No proteins that mediate these transport steps have been identified. Comparative genomic and transcriptomic analyses identified Arabidopsis thaliana BASS1 (At1g78560) and its maize (Zea mays) ortholog as candidates for such a transport role. BASS1 proteins belong to the bile acid : sodium symporter family and share similarity with the Salmonella enterica PanS pantoate/ketopantoate transporter and with predicted bacterial transporters whose genes cluster on the chromosome with pantothenate synthesis genes. Furthermore, Arabidopsis BASS1 is co-expressed with genes related to metabolism of coenzyme A, the cofactor derived from pantothenate. Expression of Arabidopsis or maize BASS1 promoted the growth of a S. enterica panB panS mutant strain when pantoate, but not ketopantoate, was supplied, and increased the rate of [3H]pantoate uptake. Subcellular localization of green fluorescent protein fusions in Nicotiana tabacum BY-2 cells demonstrated that Arabidopsis BASS1 is targeted solely to the plastid inner envelope. Two independent Arabidopsis BASS1 knockout mutants accumulated pantoate â¼10-fold in leaves and had smaller seeds. Taken together, these data indicate that BASS1 is a physiologically significant plastidial pantoate transporter and that the pantoate reduction step in pantothenate biosynthesis could be at least partly localized in plastids.
Assuntos
Proteínas de Membrana Transportadoras/genética , Redes e Vias Metabólicas/genética , Ácido Pantotênico/genética , Proteínas de Plantas/genética , Plastídeos/genética , Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Proteínas de Cloroplastos/metabolismo , Citosol/enzimologia , Regulação da Expressão Gênica de Plantas , Técnicas de Inativação de Genes , Proteínas de Fluorescência Verde/genética , Mitocôndrias/genética , Proteínas Mitocondriais , Transportadores de Ácidos Monocarboxílicos , Transportadores de Ânions Orgânicos Dependentes de Sódio/metabolismo , Ácido Pantotênico/biossíntese , Salmonella enterica/genética , Zea mays/genéticaRESUMO
The Pathosystems Resource Integration Center (PATRIC) is the bacterial Bioinformatics Resource Center (https://www.patricbrc.org). Recent changes to PATRIC include a redesign of the web interface and some new services that provide users with a platform that takes them from raw reads to an integrated analysis experience. The redesigned interface allows researchers direct access to tools and data, and the emphasis has changed to user-created genome-groups, with detailed summaries and views of the data that researchers have selected. Perhaps the biggest change has been the enhanced capability for researchers to analyze their private data and compare it to the available public data. Researchers can assemble their raw sequence reads and annotate the contigs using RASTtk. PATRIC also provides services for RNA-Seq, variation, model reconstruction and differential expression analysis, all delivered through an updated private workspace. Private data can be compared by 'virtual integration' to any of PATRIC's public data. The number of genomes available for comparison in PATRIC has expanded to over 80 000, with a special emphasis on genomes with antimicrobial resistance data. PATRIC uses this data to improve both subsystem annotation and k-mer classification, and tags new genomes as having signatures that indicate susceptibility or resistance to specific antibiotics.
Assuntos
Bactérias/genética , Biologia Computacional/métodos , Bases de Dados Genéticas , Genoma Bacteriano , Genômica/métodos , Antibacterianos/farmacologia , Bactérias/efeitos dos fármacos , Bactérias/metabolismo , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Farmacorresistência Bacteriana , Anotação de Sequência Molecular , Proteoma , Proteômica/métodos , Software , NavegadorRESUMO
In the era of next-generation sequencing and ubiquitous assembly and binning of metagenomes, new putative genome sequences are being produced from isolate and microbiome samples at ever-increasing rates. Genome-scale metabolic models have enormous utility for supporting the analysis and predictive characterization of these genomes based on sequence data. As a result, tools for rapid automated reconstruction of metabolic models are becoming critically important for supporting the analysis of new genome sequences. Many tools and algorithms have now emerged to support rapid model reconstruction and analysis. Here, we are comparing and contrasting the capabilities and output of a variety of these tools, including ModelSEED, Raven Toolbox, PathwayTools, SuBliMinal Toolbox and merlin.
Assuntos
Metagenoma , Modelos Biológicos , Automação , Bases de Dados Genéticas , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Redes e Vias Metabólicas , Microbiota , Design de SoftwareRESUMO
Klebsiella pneumoniae has a reputation for causing a wide range of infectious conditions, with numerous highly virulent and antibiotic-resistant strains. Metabolic models have the potential to provide insights into the growth behavior, nutrient requirements, essential genes, and candidate drug targets in these strains. Here we develop a metabolic model for KPPR1, a highly virulent strain of K. pneumoniae. We apply a combination of Biolog phenotype data and fitness data to validate and refine our KPPR1 model. The final model displays a predictive accuracy of 75% in identifying potential carbon and nitrogen sources for K. pneumoniae and of 99% in predicting nonessential genes in rich media. We demonstrate how this model is useful in studying the differences in the metabolic capabilities of the low-virulence MGH 78578 strain and the highly virulent KPPR1 strain. For example, we demonstrate that these strains differ in carbohydrate metabolism, including the ability to metabolize dulcitol as a primary carbon source. Our model makes numerous other predictions for follow-up verification and analysis.
Assuntos
Genoma Bacteriano , Klebsiella pneumoniae/metabolismo , Modelos Biológicos , Metabolismo dos Carboidratos , Meios de Cultura , Farmacorresistência Bacteriana Múltipla/genética , Genes Essenciais , Klebsiella pneumoniae/efeitos dos fármacos , Klebsiella pneumoniae/genética , Fenótipo , Análise de Sequência de DNARESUMO
The necessarily sharp focus of metabolic engineering and metabolic synthetic biology on pathways and their fluxes has tended to divert attention from the damaging enzymatic and chemical side-reactions that pathway metabolites can undergo. Although historically overlooked and underappreciated, such metabolite damage reactions are now known to occur throughout metabolism and to generate (formerly enigmatic) peaks detected in metabolomics datasets. It is also now known that metabolite damage is often countered by dedicated repair enzymes that undo or prevent it. Metabolite damage and repair are highly relevant to engineered pathway design: metabolite damage reactions can reduce flux rates and product yields, and repair enzymes can provide robust, host-independent solutions. Herein, after introducing the core principles of metabolite damage and repair, we use case histories to document how damage and repair processes affect efficient operation of engineered pathways - particularly those that are heterologous, non-natural, or cell-free. We then review how metabolite damage reactions can be predicted, how repair reactions can be prospected, and how metabolite damage and repair can be built into genome-scale metabolic models. Lastly, we propose a versatile 'plug and play' set of well-characterized metabolite repair enzymes to solve metabolite damage problems known or likely to occur in metabolic engineering and synthetic biology projects.
Assuntos
Engenharia Metabólica/métodos , MetabolomaRESUMO
Recent insights suggest that non-specific and/or promiscuous enzymes are common and active across life. Understanding the role of such enzymes is an important open question in biology. Here we develop a genome-wide method, PROPER, that uses a permissive PSI-BLAST approach to predict promiscuous activities of metabolic genes. Enzyme promiscuity is typically studied experimentally using multicopy suppression, in which over-expression of a promiscuous 'replacer' gene rescues lethality caused by inactivation of a 'target' gene. We use PROPER to predict multicopy suppression in Escherichia coli, achieving highly significant overlap with published cases (hypergeometric p = 4.4e-13). We then validate three novel predicted target-replacer gene pairs in new multicopy suppression experiments. We next go beyond PROPER and develop a network-based approach, GEM-PROPER, that integrates PROPER with genome-scale metabolic modeling to predict promiscuous replacements via alternative metabolic pathways. GEM-PROPER predicts a new indirect replacer (thiG) for an essential enzyme (pdxB) in production of pyridoxal 5'-phosphate (the active form of Vitamin B6), which we validate experimentally via multicopy suppression. We perform a structural analysis of thiG to determine its potential promiscuous active site, which we validate experimentally by inactivating the pertaining residues and showing a loss of replacer activity. Thus, this study is a successful example where a computational investigation leads to a network-based identification of an indirect promiscuous replacement of a key metabolic enzyme, which would have been extremely difficult to identify directly.
Assuntos
Biologia Computacional/métodos , Escherichia coli/enzimologia , Escherichia coli/metabolismo , Fosfato de Piridoxal/metabolismo , Desidrogenases de Carboidrato/genética , Desidrogenases de Carboidrato/metabolismo , Escherichia coli/genética , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Modelos MolecularesRESUMO
We investigate the evolution of co-occurring analogous enzymes involved in L-tryptophan and L-histidine biosynthesis in Actinobacteria Phylogenetic analysis of trpF homologues, a missing gene in certain clades of this lineage whose absence is complemented by a dual-substrate HisA homologue, termed PriA, found that they fall into three categories: (i) trpF-1, an L-tryptophan biosynthetic gene horizontally acquired by certain Corynebacterium species; (ii) trpF-2, a paralogue known to be involved in synthesizing a pyrrolopyrrole moiety and (iii) trpF-3, a variable non-conserved orthologue of trpF-1 We previously investigated the effect of trpF-1 upon the evolution of PriA substrate specificity, but nothing is known about the relationship between trpF-3 and priA After in vitro steady-state enzyme kinetics we found that trpF-3 encodes a phosphoribosyl anthranilate isomerase. However, mutation of this gene in Streptomyces sviceus did not lead to auxothrophy, as expected from the biosynthetic role of trpF-1 Biochemical characterization of a dozen co-occurring TrpF-2 or TrpF-3, with PriA homologues, explained the prototrophic phenotype, and unveiled an enzyme activity trade-off between TrpF and PriA. X-ray structural analysis suggests that the function of these PriA homologues is mediated by non-conserved mutations in the flexible L5 loop, which may be responsible for different substrate affinities. Thus, the PriA homologues that co-occur with TrpF-3 represent a novel enzyme family, termed PriB, which evolved in response to PRA isomerase activity. The characterization of co-occurring enzymes provides insights into the influence of functional redundancy on the evolution of enzyme function, which could be useful for enzyme functional annotation.
Assuntos
Proteínas de Bactérias/genética , Evolução Molecular , Isomerases/genética , Streptomyces , Estrutura Secundária de Proteína , Streptomyces/enzimologia , Streptomyces/genéticaRESUMO
The increasing number of sequenced plant genomes is placing new demands on the methods applied to analyze, annotate, and model these genomes. Today's annotation pipelines result in inconsistent gene assignments that complicate comparative analyses and prevent efficient construction of metabolic models. To overcome these problems, we have developed the PlantSEED, an integrated, metabolism-centric database to support subsystems-based annotation and metabolic model reconstruction for plant genomes. PlantSEED combines SEED subsystems technology, first developed for microbial genomes, with refined protein families and biochemical data to assign fully consistent functional annotations to orthologous genes, particularly those encoding primary metabolic pathways. Seamless integration with its parent, the prokaryotic SEED database, makes PlantSEED a unique environment for cross-kingdom comparative analysis of plant and bacterial genomes. The consistent annotations imposed by PlantSEED permit rapid reconstruction and modeling of primary metabolism for all plant genomes in the database. This feature opens the unique possibility of model-based assessment of the completeness and accuracy of gene annotation and thus allows computational identification of genes and pathways that are restricted to certain genomes or need better curation. We demonstrate the PlantSEED system by producing consistent annotations for 10 reference genomes. We also produce a functioning metabolic model for each genome, gapfilling to identify missing annotations and proposing gene candidates for missing annotations. Models are built around an extended biomass composition representing the most comprehensive published to date. To our knowledge, our models are the first to be published for seven of the genomes analyzed.
Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Genoma de Planta/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Anotação de Sequência Molecular/métodos , Plantas/genética , Software , Redes e Vias Metabólicas/genética , Modelos Biológicos , Plantas/metabolismo , Biologia de Sistemas/métodosRESUMO
Metabolic network modeling of microbial communities provides an in-depth understanding of community-wide metabolic and regulatory processes. Compared to single organism analyses, community metabolic network modeling is more complex because it needs to account for interspecies interactions. To date, most approaches focus on reconstruction of high-quality individual networks so that, when combined, they can predict community behaviors as a result of interspecies interactions. However, this conventional method becomes ineffective for communities whose members are not well characterized and cannot be experimentally interrogated in isolation. Here, we tested a new approach that uses community-level data as a critical input for the network reconstruction process. This method focuses on directly predicting interspecies metabolic interactions in a community, when axenic information is insufficient. We validated our method through the case study of a bacterial photoautotroph-heterotroph consortium that was used to provide data needed for a community-level metabolic network reconstruction. Resulting simulations provided experimentally validated predictions of how a photoautotrophic cyanobacterium supports the growth of an obligate heterotrophic species by providing organic carbon and nitrogen sources. J. Cell. Physiol. 231: 2339-2345, 2016. © 2016 Wiley Periodicals, Inc.