Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Bioinformatics ; 39(4)2023 04 03.
Artigo em Inglês | MEDLINE | ID: mdl-36857575

RESUMO

Microbial genome annotation is the process of identifying structural and functional elements in DNA sequences and subsequently attaching biological information to those elements. DRAM is a tool developed to annotate bacterial, archaeal, and viral genomes derived from pure cultures or metagenomes. DRAM goes beyond traditional annotation tools by distilling multiple gene annotations to genome level summaries of functional potential. Despite these benefits, a downside of DRAM is the requirement of large computational resources, which limits its accessibility. Further, it did not integrate with downstream metabolic modeling tools that require genome annotation. To alleviate these constraints, DRAM and the viral counterpart, DRAM-v, are now available and integrated with the freely accessible KBase cyberinfrastructure. With kb_DRAM users can generate DRAM annotations and functional summaries from microbial or viral genomes in a point-and-click interface, as well as generate genome-scale metabolic models from DRAM annotations. AVAILABILITY AND IMPLEMENTATION: For kb_DRAM users, the kb_DRAM apps on KBase can be found in the catalog at https://narrative.kbase.us/#catalog/modules/kb_DRAM. For kb_DRAM users, a tutorial workflow with all documentation is available at https://narrative.kbase.us/narrative/129480. For kb_DRAM developers, software is available at https://github.com/shafferm/kb_DRAM.


Assuntos
Bactérias , Software , Anotação de Sequência Molecular , Bactérias/genética , Archaea/genética , Metabolômica
2.
Nucleic Acids Res ; 49(D1): D575-D588, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-32986834

RESUMO

For over 10 years, ModelSEED has been a primary resource for the construction of draft genome-scale metabolic models based on annotated microbial or plant genomes. Now being released, the biochemistry database serves as the foundation of biochemical data underlying ModelSEED and KBase. The biochemistry database embodies several properties that, taken together, distinguish it from other published biochemistry resources by: (i) including compartmentalization, transport reactions, charged molecules and proton balancing on reactions; (ii) being extensible by the user community, with all data stored in GitHub; and (iii) design as a biochemical 'Rosetta Stone' to facilitate comparison and integration of annotations from many different tools and databases. The database was constructed by combining chemical data from many resources, applying standard transformations, identifying redundancies and computing thermodynamic properties. The ModelSEED biochemistry is continually tested using flux balance analysis to ensure the biochemical network is modeling-ready and capable of simulating diverse phenotypes. Ontologies can be designed to aid in comparing and reconciling metabolic reconstructions that differ in how they represent various metabolic pathways. ModelSEED now includes 33,978 compounds and 36,645 reactions, available as a set of extensible files on GitHub, and available to search at https://modelseed.org/biochem and KBase.


Assuntos
Bactérias/metabolismo , Bases de Dados Factuais , Fungos/metabolismo , Redes e Vias Metabólicas , Anotação de Sequência Molecular , Plantas/metabolismo , Bactérias/genética , Genoma Bacteriano , Termodinâmica
4.
BMC Genomics ; 17: 568, 2016 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-27502787

RESUMO

BACKGROUND: Automatically generated bacterial metabolic models, and even some curated models, lack accuracy in predicting energy yields due to poor representation of key pathways in energy biosynthesis and the electron transport chain (ETC). Further compounding the problem, complex interlinking pathways in genome-scale metabolic models, and the need for extensive gapfilling to support complex biomass reactions, often results in predicting unrealistic yields or unrealistic physiological flux profiles. RESULTS: To overcome this challenge, we developed methods and tools ( http://coremodels.mcs.anl.gov ) to build high quality core metabolic models (CMM) representing accurate energy biosynthesis based on a well studied, phylogenetically diverse set of model organisms. We compare these models to explore the variability of core pathways across all microbial life, and by analyzing the ability of our core models to synthesize ATP and essential biomass precursors, we evaluate the extent to which the core metabolic pathways and functional ETCs are known for all microbes. 6,600 (80 %) of our models were found to have some type of aerobic ETC, whereas 5,100 (62 %) have an anaerobic ETC, and 1,279 (15 %) do not have any ETC. Using our manually curated ETC and energy biosynthesis pathways with no gapfilling at all, we predict accurate ATP yields for nearly 5586 (70 %) of the models under aerobic and anaerobic growth conditions. This study revealed gaps in our knowledge of the central pathways that result in 2,495 (30 %) CMMs being unable to produce ATP under any of the tested conditions. We then established a methodology for the systematic identification and correction of inconsistent annotations using core metabolic models coupled with phylogenetic analysis. CONCLUSIONS: We predict accurate energy yields based on our improved annotations in energy biosynthesis pathways and the implementation of diverse ETC reactions across the microbial tree of life. We highlighted missing annotations that were essential to energy biosynthesis in our models. We examine the diversity of these pathways across all microbial life and enable the scientific community to explore the analyses generated from this large-scale analysis of over 8000 microbial genomes.


Assuntos
Metabolismo Energético , Redes e Vias Metabólicas , Modelos Biológicos , Trifosfato de Adenosina/biossíntese , Bactérias/classificação , Bactérias/genética , Bactérias/metabolismo , Biomassa , Biologia Computacional/métodos , Complexo de Proteínas da Cadeia de Transporte de Elétrons/metabolismo , Genômica/métodos , Anotação de Sequência Molecular , Filogenia
5.
Nat Biotechnol ; 41(9): 1320-1331, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-36658342

RESUMO

The human microbiome influences the efficacy and safety of a wide variety of commonly prescribed drugs. Designing precision medicine approaches that incorporate microbial metabolism would require strain- and molecule-resolved, scalable computational modeling. Here, we extend our previous resource of genome-scale metabolic reconstructions of human gut microorganisms with a greatly expanded version. AGORA2 (assembly of gut organisms through reconstruction and analysis, version 2) accounts for 7,302 strains, includes strain-resolved drug degradation and biotransformation capabilities for 98 drugs, and was extensively curated based on comparative genomics and literature searches. The microbial reconstructions performed very well against three independently assembled experimental datasets with an accuracy of 0.72 to 0.84, surpassing other reconstruction resources and predicted known microbial drug transformations with an accuracy of 0.81. We demonstrate that AGORA2 enables personalized, strain-resolved modeling by predicting the drug conversion potential of the gut microbiomes from 616 patients with colorectal cancer and controls, which greatly varied between individuals and correlated with age, sex, body mass index and disease stages. AGORA2 serves as a knowledge base for the human microbiome and paves the way to personalized, predictive analysis of host-microbiome metabolic interactions.


Assuntos
Microbioma Gastrointestinal , Microbiota , Humanos , Medicina de Precisão , Genoma , Genômica , Microbioma Gastrointestinal/genética
6.
bioRxiv ; 2023 Jul 26.
Artigo em Inglês | MEDLINE | ID: mdl-37502915

RESUMO

Predicting elemental cycles and maintaining water quality under increasing anthropogenic influence requires understanding the spatial drivers of river microbiomes. However, the unifying microbial processes governing river biogeochemistry are hindered by a lack of genome-resolved functional insights and sampling across multiple rivers. Here we employed a community science effort to accelerate the sampling, sequencing, and genome-resolved analyses of river microbiomes to create the Genome Resolved Open Watersheds database (GROWdb). This resource profiled the identity, distribution, function, and expression of thousands of microbial genomes across rivers covering 90% of United States watersheds. Specifically, GROWdb encompasses 1,469 microbial species from 27 phyla, including novel lineages from 10 families and 128 genera, and defines the core river microbiome for the first time at genome level. GROWdb analyses coupled to extensive geospatial information revealed local and regional drivers of microbial community structuring, while also presenting a myriad of foundational hypotheses about ecosystem function. Building upon the previously conceived River Continuum Concept 1 , we layer on microbial functional trait expression, which suggests the structure and function of river microbiomes is predictable. We make GROWdb available through various collaborative cyberinfrastructures 2, 3 so that it can be widely accessed across disciplines for watershed predictive modeling and microbiome-based management practices.

7.
Methods Mol Biol ; 2349: 291-320, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-34719000

RESUMO

The DOE Systems Biology Knowledgebase (KBase) platform offers a range of powerful tools for the reconstruction, refinement, and analysis of genome-scale metabolic models built from microbial isolate genomes. In this chapter, we describe and demonstrate these tools in action with an analysis of isoprene production in the Bacillus subtilis DSM genome. Two different methods are applied to build initial metabolic models for the DSM genome, then the models are gapfilled in three different growth conditions. Next, flux balance analysis (FBA) and flux variability analysis (FVA) techniques are applied to both study the growth of these models in minimal media and classify reactions within each model based on essentiality and functionality. The models are applied with the FBA method to predict essential genes, which are then compared to an updated list of essential genes obtained for B. subtilis 168, a very similar strain to the DSM isolate. The models are also applied to simulate Biolog growth conditions, and these results are compared with Biolog data collected for B. subtilis 168. Finally, the DSM metabolic models are applied to explore the pathways and genes responsible for producing isoprene in this strain. These studies demonstrate the accuracy and utility of models generated from the KBase pipelines, as well as exploring the tools available for analyzing these models.


Assuntos
Genes Essenciais , Biologia de Sistemas , Bacillus subtilis/genética , Genoma Microbiano , Bases de Conhecimento , Redes e Vias Metabólicas/genética , Modelos Biológicos
8.
J Bacteriol ; 193(16): 4199-213, 2011 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-21705586

RESUMO

Bacillus megaterium is deep-rooted in the Bacillus phylogeny, making it an evolutionarily key species and of particular importance in understanding genome evolution, dynamics, and plasticity in the bacilli. B. megaterium is a commercially available, nonpathogenic host for the biotechnological production of several substances, including vitamin B(12), penicillin acylase, and amylases. Here, we report the analysis of the first complete genome sequences of two important B. megaterium strains, the plasmidless strain DSM319 and QM B1551, which harbors seven indigenous plasmids. The 5.1-Mbp chromosome carries approximately 5,300 genes, while QM B1551 plasmids represent a combined 417 kb and 523 genes, one of the largest plasmid arrays sequenced in a single bacterial strain. We have documented extensive gene transfer between the plasmids and the chromosome. Each strain carries roughly 300 strain-specific chromosomal genes that account for differences in their experimentally confirmed phenotypes. B. megaterium is able to synthesize vitamin B(12) through an oxygen-independent adenosylcobalamin pathway, which together with other key energetic and metabolic pathways has now been fully reconstructed. Other novel genes include a second ftsZ gene, which may be responsible for the large cell size of members of this species, as well as genes for gas vesicles, a second ß-galactosidase gene, and most but not all of the genes needed for genetic competence. Comprehensive analyses of the global Bacillus gene pool showed that only an asymmetric region around the origin of replication was syntenic across the genus. This appears to be a characteristic feature of the Bacillus spp. genome architecture and may be key to their sporulating lifestyle.


Assuntos
Bacillus megaterium/classificação , Bacillus megaterium/genética , Genoma Bacteriano , Bacillus megaterium/metabolismo , Cromossomos Bacterianos , Flagelos/genética , Flagelos/metabolismo , Regulação Bacteriana da Expressão Gênica/fisiologia , Variação Genética , Dados de Sequência Molecular , Filogenia , Plasmídeos , Especificidade da Espécie
9.
Nat Biotechnol ; 39(4): 499-509, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33169036

RESUMO

The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has enabled insights into the ecology and evolution of environmental and host-associated microbiomes. Here we applied this approach to >10,000 metagenomes collected from diverse habitats covering all of Earth's continents and oceans, including metagenomes from human and animal hosts, engineered environments, and natural and agricultural soils, to capture extant microbial, metabolic and functional potential. This comprehensive catalog includes 52,515 metagenome-assembled genomes representing 12,556 novel candidate species-level operational taxonomic units spanning 135 phyla. The catalog expands the known phylogenetic diversity of bacteria and archaea by 44% and is broadly available for streamlined comparative analyses, interactive exploration, metabolic modeling and bulk download. We demonstrate the utility of this collection for understanding secondary-metabolite biosynthetic potential and for resolving thousands of new host linkages to uncultivated viruses. This resource underscores the value of genome-centric approaches for revealing genomic properties of uncultivated microorganisms that affect ecosystem processes.


Assuntos
Archaea/genética , Bactérias/genética , Metabolômica/métodos , Metagenoma , Metagenômica/métodos , Vírus/genética , Microbiologia do Ar , Animais , Archaea/classificação , Archaea/isolamento & purificação , Bactérias/classificação , Bactérias/isolamento & purificação , Catálogos como Assunto , Ecossistema , Humanos , Filogenia , Microbiologia do Solo , Vírus/isolamento & purificação , Microbiologia da Água
10.
Microbiol Resour Announc ; 9(34)2020 Aug 20.
Artigo em Inglês | MEDLINE | ID: mdl-32816982

RESUMO

Here, we report the draft genome sequence of Arthrobacter sp. strain ATCC 49987, consisting of three contigs with a total length of 4.4 Mbp. Based on the genome sequence, we suggest reclassification of Arthrobacter sp. strain ATCC 49987 as Pseudarthrobacter sp. strain ATCC 49987.

11.
Microbiol Resour Announc ; 9(38)2020 Sep 17.
Artigo em Inglês | MEDLINE | ID: mdl-32943555

RESUMO

We present here the draft genome sequence of a pyridine-degrading bacterium, Micrococcus luteus ATCC 49442, which was reclassified as Pseudarthrobacter sp. strain ATCC 49442 based on its draft genome sequence. Its genome length is 4.98 Mbp, with 64.81% GC content.

13.
Microbiol Resour Announc ; 8(18)2019 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-31048380

RESUMO

We present here the draft genome sequence of a carbazole-degrading Enterobacter species. The draft genome sequence will provide insight into various genes involved in the degradation of carbazole and other related aromatic compounds.

14.
Microbiol Resour Announc ; 8(25)2019 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-31221646

RESUMO

We report here the 4.9-Mb genome sequence of a quinoline-degrading bacterium, Rhodococcus sp. strain ATCC 49988. The draft genome data will enable the identification of genes and future genetic modification to enhance traits relevant to heteroaromatic compound degradation.

15.
Methods Mol Biol ; 1716: 111-129, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29222751

RESUMO

Genome-scale metabolic models (GEMs) generated from automated reconstruction pipelines often lack accuracy due to the need for extensive gapfilling and the inference of periphery metabolic pathways based on lower-confidence annotations. The central carbon pathways and electron transport chains are among the most well-understood regions of microbial metabolism, and these pathways contribute significantly toward defining cellular behavior and growth conditions. Thus, it is often useful to construct a simplified core metabolic model (CMM) that is comprised of only the high-confidence central pathways. In this chapter, we discuss methods for producing core metabolic models (CMM) based on genome annotations. With its reduced scope compared to GEMs, CMM reconstruction focuses on accurate representation of the central metabolic pathways related to energy biosynthesis and accurate energy yield predictions. We demonstrate the reconstruction and analysis of CMMs using the DOE Systems Biology Knowledgebase (KBase). The complete workflow is available at http://kbase.us/core-models/.


Assuntos
Redes e Vias Metabólicas , Biologia de Sistemas/métodos , Carbono/metabolismo , Genoma Microbiano , Modelos Biológicos , Anotação de Sequência Molecular , Fluxo de Trabalho
16.
Sci Rep ; 7(1): 8391, 2017 08 21.
Artigo em Inglês | MEDLINE | ID: mdl-28827682

RESUMO

Microbial electrosynthesis is a renewable energy and chemical production platform that relies on microbial cells to capture electrons from a cathode and fix carbon. Yet despite the promise of this technology, the metabolic capacity of the microbes that inhabit the electrode surface and catalyze electron transfer in these systems remains largely unknown. We assembled thirteen draft genomes from a microbial electrosynthesis system producing primarily acetate from carbon dioxide, and their transcriptional activity was mapped to genomes from cells on the electrode surface and in the supernatant. This allowed us to create a metabolic model of the predominant community members belonging to Acetobacterium, Sulfurospirillum, and Desulfovibrio. According to the model, the Acetobacterium was the primary carbon fixer, and a keystone member of the community. Transcripts of soluble hydrogenases and ferredoxins from Acetobacterium and hydrogenases, formate dehydrogenase, and cytochromes of Desulfovibrio were found in high abundance near the electrode surface. Cytochrome c oxidases of facultative members of the community were highly expressed in the supernatant despite completely sealed reactors and constant flushing with anaerobic gases. These molecular discoveries and metabolic modeling now serve as a foundation for future examination and development of electrosynthetic microbial communities.


Assuntos
Acetobacterium/metabolismo , Fontes de Energia Bioelétrica/microbiologia , Campylobacteraceae/metabolismo , Desulfovibrio/metabolismo , Eletricidade , Redes e Vias Metabólicas/genética , Acetatos/metabolismo , Acetobacterium/genética , Campylobacteraceae/genética , Dióxido de Carbono/metabolismo , Desulfovibrio/genética , Eletrodos/microbiologia , Transporte de Elétrons , Perfilação da Expressão Gênica , Genoma Bacteriano
17.
Elife ; 62017 03 31.
Artigo em Inglês | MEDLINE | ID: mdl-28362260

RESUMO

The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Our results show how gene loss can drive the evolution of substrate specificity from retained enzymes.


Assuntos
Actinomycetaceae/enzimologia , Actinomycetaceae/metabolismo , Adaptação Biológica , Aldose-Cetose Isomerases/genética , Aldose-Cetose Isomerases/metabolismo , Deleção de Genes , Actinomycetaceae/genética , Evolução Molecular , Mutação , Especificidade por Substrato
18.
Front Microbiol ; 7: 1819, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27933038

RESUMO

Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA