RESUMO
Comparative analysis of genome-scale metabolic networks (GSMNs) may yield important information on the biology, evolution, and adaptation of species. However, it is impeded by the high heterogeneity of the quality and completeness of structural and functional genome annotations, which may bias the results of such comparisons. To address this issue, we developed AuCoMe, a pipeline to automatically reconstruct homogeneous GSMNs from a heterogeneous set of annotated genomes without discarding available manual annotations. We tested AuCoMe with three data sets, one bacterial, one fungal, and one algal, and showed that it successfully reduces technical biases while capturing the metabolic specificities of each organism. Our results also point out shared and divergent metabolic traits among evolutionarily distant algae, underlining the potential of AuCoMe to accelerate the broad exploration of metabolic evolution across the tree of life.
RESUMO
Fibrobacter succinogenes is a cellulolytic bacterium that plays an essential role in the degradation of plant fibers in the rumen ecosystem. It converts cellulose polymers into intracellular glycogen and the fermentation metabolites succinate, acetate, and formate. We developed dynamic models of F. succinogenes S85 metabolism on glucose, cellobiose, and cellulose on the basis of a network reconstruction done with the automatic reconstruction of metabolic model workspace. The reconstruction was based on genome annotation, five template-based orthology methods, gap filling, and manual curation. The metabolic network of F. succinogenes S85 comprises 1,565 reactions with 77% linked to 1,317 genes, 1,586 unique metabolites, and 931 pathways. The network was reduced using the NetRed algorithm and analyzed for the computation of elementary flux modes. A yield analysis was further performed to select a minimal set of macroscopic reactions for each substrate. The accuracy of the models was acceptable in simulating F. succinogenes carbohydrate metabolism with an average coefficient of variation of the root mean squared error of 19%. The resulting models are useful resources for investigating the metabolic capabilities of F. succinogenes S85, including the dynamics of metabolite production. Such an approach is a key step toward the integration of omics microbial information into predictive models of rumen metabolism. IMPORTANCE F. succinogenes S85 is a cellulose-degrading and succinate-producing bacterium. Such functions are central for the rumen ecosystem and are of special interest for several industrial applications. This work illustrates how information of the genome of F. succinogenes can be translated to develop predictive dynamic models of rumen fermentation processes. We expect this approach can be applied to other rumen microbes for producing a model of rumen microbiome that can be used for studying microbial manipulation strategies aimed at enhancing feed utilization and mitigating enteric emissions.
Assuntos
Fibrobacter , Genoma Bacteriano , Modelos Biológicos , Rúmen , Fibrobacter/genética , Genoma Bacteriano/genética , Metaboloma/genética , Rúmen/metabolismo , Rúmen/microbiologia , Animais , BovinosRESUMO
Sterols are biologically important molecules that serve as membrane fluidity regulators and precursors of signaling molecules, either endogenous or involved in biotic interactions. There is currently no model of their biosynthesis pathways in brown algae. Here, we benefit from the availability of genome data and gas chromatography-mass spectrometry (GC-MS) sterol profiling using a database of internal standards to build such a model. We expand the set of identified sterols in 11 species of red, brown, and green macroalgae and integrate these new data with genomic data. Our analyses suggest that some metabolic reactions may be conserved despite the loss of canonical eukaryotic enzymes, like the sterol side-chain reductase (SSR). Our findings are consistent with the principle of metabolic pathway drift through enzymatic replacement and show that cholesterol synthesis from cycloartenol may be a widespread but variable pathway among chlorophyllian eukaryotes. Among the factors contributing to this variability, one could be the recruitment of cholesterol biosynthetic intermediates to make signaling molecules, such as the mozukulins. These compounds were found in some brown algae belonging to Ectocarpales, and we here provide a first mozukulin biosynthetic model. Our results demonstrate that integrative approaches can already be used to infer experimentally testable models, which will be useful to further investigate the biological roles of those newly identified algal pathways.
RESUMO
Genome-scale metabolic models have become the tool of choice for the global analysis of microorganism metabolism, and their reconstruction has attained high standards of quality and reliability. Improvements in this area have been accompanied by the development of some major platforms and databases, and an explosion of individual bioinformatics methods. Consequently, many recent models result from "à la carte" pipelines, combining the use of platforms, individual tools and biological expertise to enhance the quality of the reconstruction. Although very useful, introducing heterogeneous tools, that hardly interact with each other, causes loss of traceability and reproducibility in the reconstruction process. This represents a real obstacle, especially when considering less studied species whose metabolic reconstruction can greatly benefit from the comparison to good quality models of related organisms. This work proposes an adaptable workspace, AuReMe, for sustainable reconstructions or improvements of genome-scale metabolic models involving personalized pipelines. At each step, relevant information related to the modifications brought to the model by a method is stored. This ensures that the process is reproducible and documented regardless of the combination of tools used. Additionally, the workspace establishes a way to browse metabolic models and their metadata through the automatic generation of ad-hoc local wikis dedicated to monitoring and facilitating the process of reconstruction. AuReMe supports exploration and semantic query based on RDF databases. We illustrate how this workspace allowed handling, in an integrated way, the metabolic reconstructions of non-model organisms such as an extremophile bacterium or eukaryote algae. Among relevant applications, the latter reconstruction led to putative evolutionary insights of a metabolic pathway.
Assuntos
Bases de Dados Factuais , Genômica , Armazenamento e Recuperação da Informação , Internet , Redes e Vias Metabólicas/genética , Antioxidantes/metabolismo , Genômica/métodos , Genômica/normas , Armazenamento e Recuperação da Informação/métodos , Armazenamento e Recuperação da Informação/normas , Microalgas/genética , Microalgas/metabolismo , Modelos Teóricos , Reprodutibilidade dos TestesRESUMO
BACKGROUND: The emergence of functions in biological systems is a long-standing issue that can now be addressed at the cell level with the emergence of high throughput technologies for genome sequencing and phenotyping. The reconstruction of complete metabolic networks for various organisms is a key outcome of the analysis of these data, giving access to a global view of cell functioning. The analysis of metabolic networks may be carried out by simply considering the architecture of the reaction network or by taking into account the stoichiometry of reactions. In both approaches, this analysis is generally centered on the outcome of the network and considers all metabolic compounds to be equivalent in this respect. As in the case of genes and reactions, about which the concept of essentiality has been developed, it seems, however, that some metabolites play crucial roles in system responses, due to the cell structure or the internal wiring of the metabolic network. RESULTS: We propose a classification of metabolic compounds according to their capacity to influence the activation of targeted functions (generally the growth phenotype) in a cell. We generalize the concept of essentiality to metabolites and introduce the concept of the phenotypic essential metabolite (PEM) which influences the growth phenotype according to sustainability, producibility or optimal-efficiency criteria. We have developed and made available a tool, Conquests, which implements a method combining graph-based and flux-based analysis, two approaches that are usually considered separately. The identification of PEMs is made effective by using a logical programming approach. CONCLUSION: The exhaustive study of phenotypic essential metabolites in six genome-scale metabolic models suggests that the combination and the comparison of graph, stoichiometry and optimal flux-based criteria allows some features of the metabolic network functionality to be deciphered by focusing on a small number of compounds. By considering the best combination of both graph-based and flux-based techniques, the Conquests python package advocates for a broader use of these compounds both to facilitate network curation and to promote a precise understanding of metabolic phenotype.
RESUMO
Increasing amounts of sequence data are becoming available for a wide range of non-model organisms. Investigating and modelling the metabolic behaviour of those organisms is highly relevant to understand their biology and ecology. As sequences are often incomplete and poorly annotated, draft networks of their metabolism largely suffer from incompleteness. Appropriate gap-filling methods to identify and add missing reactions are therefore required to address this issue. However, current tools rely on phenotypic or taxonomic information, or are very sensitive to the stoichiometric balance of metabolic reactions, especially concerning the co-factors. This type of information is often not available or at least prone to errors for newly-explored organisms. Here we introduce Meneco, a tool dedicated to the topological gap-filling of genome-scale draft metabolic networks. Meneco reformulates gap-filling as a qualitative combinatorial optimization problem, omitting constraints raised by the stoichiometry of a metabolic network considered in other methods, and solves this problem using Answer Set Programming. Run on several artificial test sets gathering 10,800 degraded Escherichia coli networks Meneco was able to efficiently identify essential reactions missing in networks at high degradation rates, outperforming the stoichiometry-based tools in scalability. To demonstrate the utility of Meneco we applied it to two case studies. Its application to recent metabolic networks reconstructed for the brown algal model Ectocarpus siliculosus and an associated bacterium Candidatus Phaeomarinobacter ectocarpi revealed several candidate metabolic pathways for algal-bacterial interactions. Then Meneco was used to reconstruct, from transcriptomic and metabolomic data, the first metabolic network for the microalga Euglena mutabilis. These two case studies show that Meneco is a versatile tool to complete draft genome-scale metabolic networks produced from heterogeneous data, and to suggest relevant reactions that explain the metabolic capacity of a biological system.