RESUMO
Plasmids are extrachromosomal genetic elements that often encode fitness-enhancing features. However, many bacteria carry "cryptic" plasmids that do not confer clear beneficial functions. We identified one such cryptic plasmid, pBI143, which is ubiquitous across industrialized gut microbiomes and is 14 times as numerous as crAssphage, currently established as the most abundant extrachromosomal genetic element in the human gut. The majority of mutations in pBI143 accumulate in specific positions across thousands of metagenomes, indicating strong purifying selection. pBI143 is monoclonal in most individuals, likely due to the priority effect of the version first acquired, often from one's mother. pBI143 can transfer between Bacteroidales, and although it does not appear to impact bacterial host fitness in vivo, it can transiently acquire additional genetic content. We identified important practical applications of pBI143, including its use in identifying human fecal contamination and its potential as an alternative approach to track human colonic inflammatory states.
Assuntos
Bactérias , Trato Gastrointestinal , Metagenoma , Plasmídeos , Humanos , Bactérias/genética , Bacteroidetes/genética , Fezes/microbiologia , Plasmídeos/genéticaRESUMO
Microbes drive most ecosystems and are modulated by viruses that impact their lifespan, gene flow, and metabolic outputs. However, ecosystem-level impacts of viral community diversity remain difficult to assess due to classification issues and few reference genomes. Here, we establish an â¼12-fold expanded global ocean DNA virome dataset of 195,728 viral populations, now including the Arctic Ocean, and validate that these populations form discrete genotypic clusters. Meta-community analyses revealed five ecological zones throughout the global ocean, including two distinct Arctic regions. Across the zones, local and global patterns and drivers in viral community diversity were established for both macrodiversity (inter-population diversity) and microdiversity (intra-population genetic variation). These patterns sometimes, but not always, paralleled those from macro-organisms and revealed temperate and tropical surface waters and the Arctic as biodiversity hotspots and mechanistic hypotheses to explain them. Such further understanding of ocean viruses is critical for broader inclusion in ecosystem models.
Assuntos
Organismos Aquáticos/genética , Biodiversidade , Vírus de DNA/genética , DNA Viral/genética , Metagenoma , Microbiologia da ÁguaRESUMO
Ocean microbial communities strongly influence the biogeochemistry, food webs, and climate of our planet. Despite recent advances in understanding their taxonomic and genomic compositions, little is known about how their transcriptomes vary globally. Here, we present a dataset of 187 metatranscriptomes and 370 metagenomes from 126 globally distributed sampling stations and establish a resource of 47 million genes to study community-level transcriptomes across depth layers from pole-to-pole. We examine gene expression changes and community turnover as the underlying mechanisms shaping community transcriptomes along these axes of environmental variation and show how their individual contributions differ for multiple biogeochemically relevant processes. Furthermore, we find the relative contribution of gene expression changes to be significantly lower in polar than in non-polar waters and hypothesize that in polar regions, alterations in community activity in response to ocean warming will be driven more strongly by changes in organismal composition than by gene regulatory mechanisms. VIDEO ABSTRACT.
Assuntos
Regulação da Expressão Gênica , Metagenoma , Oceanos e Mares , Transcriptoma/genética , Geografia , Microbiota/genética , Anotação de Sequência Molecular , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Água do Mar/microbiologia , TemperaturaRESUMO
The ocean is home to myriad small planktonic organisms that underpin the functioning of marine ecosystems. However, their spatial patterns of diversity and the underlying drivers remain poorly known, precluding projections of their responses to global changes. Here we investigate the latitudinal gradients and global predictors of plankton diversity across archaea, bacteria, eukaryotes, and major virus clades using both molecular and imaging data from Tara Oceans. We show a decline of diversity for most planktonic groups toward the poles, mainly driven by decreasing ocean temperatures. Projections into the future suggest that severe warming of the surface ocean by the end of the 21st century could lead to tropicalization of the diversity of most planktonic groups in temperate and polar regions. These changes may have multiple consequences for marine ecosystem functioning and services and are expected to be particularly significant in key areas for carbon sequestration, fisheries, and marine conservation. VIDEO ABSTRACT.
Assuntos
Biodiversidade , Plâncton/fisiologia , Água do Mar/microbiologia , Geografia , Modelos Teóricos , Oceanos e Mares , FilogeniaRESUMO
Endosymbioses have profoundly impacted the evolution of life and continue to shape the ecology of a wide range of species. They give rise to new combinations of biochemical capabilities that promote innovation and diversification1,2. Despite the many examples of known endosymbioses across the tree of life, their de novo emergence is rare and challenging to uncover in retrospect3-5. Here we implant bacteria into the filamentous fungus Rhizopus microsporus to follow the fate of artificially induced endosymbioses. Whereas Escherichia coli implanted into the cytosol induced septum formation, effectively halting endosymbiogenesis, Mycetohabitans rhizoxinica was transmitted vertically to the progeny at a low frequency. Continuous positive selection on endosymbiosis mitigated initial fitness constraints by several orders of magnitude upon adaptive evolution. Phenotypic changes were underscored by the accumulation of mutations in the host as the system stabilized. The bacterium produced rhizoxin congeners in its new host, demonstrating the transfer of a metabolic function through induced endosymbiosis. Single-cell implantation thus provides a powerful experimental approach to study critical events at the onset of endosymbiogenesis and opens opportunities for synthetic approaches towards designing endosymbioses with desired traits.
RESUMO
Many of the Earth's microbes remain uncultured and understudied, limiting our understanding of the functional and evolutionary aspects of their genetic material, which remain largely overlooked in most metagenomic studies1. Here we analysed 149,842 environmental genomes from multiple habitats2-6 and compiled a curated catalogue of 404,085 functionally and evolutionarily significant novel (FESNov) gene families exclusive to uncultivated prokaryotic taxa. All FESNov families span multiple species, exhibit strong signals of purifying selection and qualify as new orthologous groups, thus nearly tripling the number of bacterial and archaeal gene families described to date. The FESNov catalogue is enriched in clade-specific traits, including 1,034 novel families that can distinguish entire uncultivated phyla, classes and orders, probably representing synapomorphies that facilitated their evolutionary divergence. Using genomic context analysis and structural alignments we predicted functional associations for 32.4% of FESNov families, including 4,349 high-confidence associations with important biological processes. These predictions provide a valuable hypothesis-driven framework that we used for experimental validatation of a new gene family involved in cell motility and a novel set of antimicrobial peptides. We also demonstrate that the relative abundance profiles of novel families can discriminate between environments and clinical conditions, leading to the discovery of potentially new biomarkers associated with colorectal cancer. We expect this work to enhance future metagenomics studies and expand our knowledge of the genetic repertory of uncultivated organisms.
Assuntos
Archaea , Bactérias , Ecossistema , Evolução Molecular , Genes Arqueais , Genes Bacterianos , Genômica , Conhecimento , Peptídeos Antimicrobianos/genética , Archaea/classificação , Archaea/genética , Bactérias/classificação , Bactérias/genética , Biomarcadores , Movimento Celular/genética , Neoplasias Colorretais/genética , Genômica/métodos , Genômica/tendências , Metagenômica/tendências , Família Multigênica , Filogenia , Reprodutibilidade dos TestesRESUMO
Microbial genes encode the majority of the functional repertoire of life on earth. However, despite increasing efforts in metagenomic sequencing of various habitats1-3, little is known about the distribution of genes across the global biosphere, with implications for human and planetary health. Here we constructed a non-redundant gene catalogue of 303 million species-level genes (clustered at 95% nucleotide identity) from 13,174 publicly available metagenomes across 14 major habitats and use it to show that most genes are specific to a single habitat. The small fraction of genes found in multiple habitats is enriched in antibiotic-resistance genes and markers for mobile genetic elements. By further clustering these species-level genes into 32 million protein families, we observed that a small fraction of these families contain the majority of the genes (0.6% of families account for 50% of the genes). The majority of species-level genes and protein families are rare. Furthermore, species-level genes, and in particular the rare ones, show low rates of positive (adaptive) selection, supporting a model in which most genetic variability observed within each protein family is neutral or nearly neutral.
Assuntos
Metagenoma , Metagenômica , Antibacterianos/farmacologia , Resistência Microbiana a Medicamentos , Ecossistema , Humanos , Metagenoma/genéticaRESUMO
Natural microbial communities are phylogenetically and metabolically diverse. In addition to underexplored organismal groups1, this diversity encompasses a rich discovery potential for ecologically and biotechnologically relevant enzymes and biochemical compounds2,3. However, studying this diversity to identify genomic pathways for the synthesis of such compounds4 and assigning them to their respective hosts remains challenging. The biosynthetic potential of microorganisms in the open ocean remains largely uncharted owing to limitations in the analysis of genome-resolved data at the global scale. Here we investigated the diversity and novelty of biosynthetic gene clusters in the ocean by integrating around 10,000 microbial genomes from cultivated and single cells with more than 25,000 newly reconstructed draft genomes from more than 1,000 seawater samples. These efforts revealed approximately 40,000 putative mostly new biosynthetic gene clusters, several of which were found in previously unsuspected phylogenetic groups. Among these groups, we identified a lineage rich in biosynthetic gene clusters ('Candidatus Eudoremicrobiaceae') that belongs to an uncultivated bacterial phylum and includes some of the most biosynthetically diverse microorganisms in this environment. From these, we characterized the phospeptin and pythonamide pathways, revealing cases of unusual bioactive compound structure and enzymology, respectively. Together, this research demonstrates how microbiomics-driven strategies can enable the investigation of previously undescribed enzymes and natural products in underexplored microbial groups and environments.
Assuntos
Vias Biossintéticas , Microbiota , Oceanos e Mares , Bactérias/classificação , Bactérias/genética , Vias Biossintéticas/genética , Genômica , Microbiota/genética , Família Multigênica/genética , FilogeniaRESUMO
Salmonella Typhimurium elicits gut inflammation by the costly expression of HilD-controlled virulence factors. This inflammation alleviates colonization resistance (CR) mediated by the microbiota and thereby promotes pathogen blooms. However, the inflamed gut-milieu can also select for hilD mutants, which cannot elicit or maintain inflammation, therefore causing a loss of the pathogen's virulence. This raises the question of which conditions support the maintenance of virulence in S. Typhimurium. Indeed, it remains unclear why the wild-type hilD allele is dominant among natural isolates. Here, we show that microbiota transfer from uninfected or recovered hosts leads to rapid clearance of hilD mutants that feature attenuated virulence, and thereby contributes to the preservation of the virulent S. Typhimurium genotype. Using mouse models featuring a range of microbiota compositions and antibiotic- or inflammation-inflicted microbiota disruptions, we found that irreversible disruption of the microbiota leads to the accumulation of hilD mutants. In contrast, in models with a transient microbiota disruption, selection for hilD mutants was prevented by the regrowing microbiota community dominated by Lachnospirales and Oscillospirales. Strikingly, even after an irreversible microbiota disruption, microbiota transfer from uninfected donors prevented the rise of hilD mutants. Our results establish that robust S. Typhimurium gut colonization hinges on optimizing its manipulation of the host: A transient and tempered microbiota perturbation is favorable for the pathogen to both flourish in the inflamed gut and also minimize loss of virulence. Moreover, besides conferring CR, the microbiota may have the additional consequence of maintaining costly enteropathogen virulence mechanisms.
Assuntos
Microbiota , Salmonella typhimurium , Animais , Camundongos , Virulência/genética , Salmonella typhimurium/genética , Fatores de Virulência/genética , InflamaçãoRESUMO
Meta'omic data on microbial diversity and function accrue exponentially in public repositories, but derived information is often siloed according to data type, study or sampled microbial environment. Here we present SPIRE, a Searchable Planetary-scale mIcrobiome REsource that integrates various consistently processed metagenome-derived microbial data modalities across habitats, geography and phylogeny. SPIRE encompasses 99 146 metagenomic samples from 739 studies covering a wide array of microbial environments and augmented with manually-curated contextual data. Across a total metagenomic assembly of 16 Tbp, SPIRE comprises 35 billion predicted protein sequences and 1.16 million newly constructed metagenome-assembled genomes (MAGs) of medium or high quality. Beyond mapping to the high-quality genome reference provided by proGenomes3 (http://progenomes.embl.de), these novel MAGs form 92 134 novel species-level clusters, the majority of which are unclassified at species level using current tools. SPIRE enables taxonomic profiling of these species clusters via an updated, custom mOTUs database (https://motu-tool.org/) and includes several layers of functional annotation, as well as crosslinks to several (micro-)biological databases. The resource is accessible, searchable and browsable via http://spire.embl.de.
Assuntos
Bases de Dados Factuais , Metagenoma , Microbiota , Metagenômica , Microbiota/genéticaRESUMO
MOTIVATION: DNA barcoding has become a powerful tool for assessing the fitness of strains in a variety of studies, including random transposon mutagenesis screens, attenuation of site-directed mutants, and population dynamics of isogenic strain pools. However, the statistical analysis, visualization, and contextualization of the data resulting from such experiments can be complex and require bioinformatic skills. RESULTS: Here, we developed mBARq, a user-friendly tool designed to simplify these steps for diverse experimental setups. The tool is seamlessly integrated with an intuitive web app for interactive data exploration via the STRING and KEGG databases to accelerate scientific discovery. AVAILABILITY AND IMPLEMENTATION: The tool is implemented in Python. The source code is freely available (https://github.com/MicrobiologyETHZ/mbarq) and the web app can be accessed at: https://microbiomics.io/tools/mbarq-app.
Assuntos
Código de Barras de DNA Taxonômico , Software , DNA , Biologia ComputacionalRESUMO
The phenotype of an organism results from its genotype and the influence of the environment throughout development. Even when using animals of the same genotype, independent studies may test animals of different phenotypes, resulting in poor replicability due to genotype-by-environment interactions. Thus, genetically defined strains of mice may respond differently to experimental treatments depending on their rearing environment. However, the extent of such phenotypic plasticity and its implications for the replicability of research findings have remained unknown. Here, we examined the extent to which common environmental differences between animal facilities modulate the phenotype of genetically homogeneous (inbred) mice. We conducted a comprehensive multicentre study, whereby inbred C57BL/6J mice from a single breeding cohort were allocated to and reared in 5 different animal facilities throughout early life and adolescence, before being transported to a single test laboratory. We found persistent effects of the rearing facility on the composition and heterogeneity of the gut microbial community. These effects were paralleled by persistent differences in body weight and in the behavioural phenotype of the mice. Furthermore, we show that environmental variation among animal facilities is strong enough to influence epigenetic patterns in neurons at the level of chromatin organisation. We detected changes in chromatin organisation in the regulatory regions of genes involved in nucleosome assembly, neuronal differentiation, synaptic plasticity, and regulation of behaviour. Our findings demonstrate that common environmental differences between animal facilities may produce facility-specific phenotypes, from the molecular to the behavioural level. Furthermore, they highlight an important limitation of inferences from single-laboratory studies and thus argue that study designs should take environmental background into account to increase the robustness and replicability of findings.
Assuntos
Cromatina , Meio Ambiente , Camundongos , Animais , Camundongos Endogâmicos C57BL , Fenótipo , GenótipoRESUMO
The capacity of the intestinal microbiota to degrade otherwise indigestible diet components is known to greatly improve the recovery of energy from food. This has led to the hypothesis that increased digestive efficiency may underlie the contribution of the microbiota to obesity. OligoMM12-colonized gnotobiotic mice have a consistently higher fat mass than germ-free (GF) or fully colonized counterparts. We therefore investigated their food intake, digestion efficiency, energy expenditure, and respiratory quotient using a novel isolator-housed metabolic cage system, which allows long-term measurements without contamination risk. This demonstrated that microbiota-released calories are perfectly balanced by decreased food intake in fully colonized versus gnotobiotic OligoMM12 and GF mice fed a standard chow diet, i.e., microbiota-released calories can in fact be well integrated into appetite control. We also observed no significant difference in energy expenditure after normalization by lean mass between the different microbiota groups, suggesting that cumulative small differences in energy balance, or altered energy storage, must underlie fat accumulation in OligoMM12 mice. Consistent with altered energy storage, major differences were observed in the type of respiratory substrates used in metabolism over the circadian cycle: In GF mice, the respiratory exchange ratio (RER) was consistently lower than that of fully colonized mice at all times of day, indicative of more reliance on fat and less on glucose metabolism. Intriguingly, the RER of OligoMM12-colonized gnotobiotic mice phenocopied fully colonized mice during the dark (active/eating) phase but phenocopied GF mice during the light (fasting/resting) phase. Further, OligoMM12-colonized mice showed a GF-like drop in liver glycogen storage during the light phase and both liver and plasma metabolomes of OligoMM12 mice clustered closely with GF mice. This implies the existence of microbiota functions that are required to maintain normal host metabolism during the resting/fasting phase of circadian cycle and which are absent in the OligoMM12 consortium.
Assuntos
Glicogênio Hepático , Microbiota , Animais , Vida Livre de Germes , Glucose , Camundongos , Obesidade/metabolismoRESUMO
The interpretation of genomic, transcriptomic and other microbial 'omics data is highly dependent on the availability of well-annotated genomes. As the number of publicly available microbial genomes continues to increase exponentially, the need for quality control and consistent annotation is becoming critical. We present proGenomes3, a database of 907 388 high-quality genomes containing 4 billion genes that passed stringent criteria and have been consistently annotated using multiple functional and taxonomic databases including mobile genetic elements and biosynthetic gene clusters. proGenomes3 encompasses 41 171 species-level clusters, defined based on universal single copy marker genes, for which pan-genomes and contextual habitat annotations are provided. The database is available at http://progenomes.embl.de/.
Assuntos
Genoma , Células Procarióticas , Bases de Dados Genéticas , Genômica , Anotação de Sequência Molecular , Bactérias/classificação , Bactérias/genéticaRESUMO
Soils harbour some of the most diverse microbiomes on Earth and are essential for both nutrient cycling and carbon storage. To understand soil functioning, it is necessary to model the global distribution patterns and functional gene repertoires of soil microorganisms, as well as the biotic and environmental associations between the diversity and structure of both bacterial and fungal soil communities1-4. Here we show, by leveraging metagenomics and metabarcoding of global topsoil samples (189 sites, 7,560 subsamples), that bacterial, but not fungal, genetic diversity is highest in temperate habitats and that microbial gene composition varies more strongly with environmental variables than with geographic distance. We demonstrate that fungi and bacteria show global niche differentiation that is associated with contrasting diversity responses to precipitation and soil pH. Furthermore, we provide evidence for strong bacterial-fungal antagonism, inferred from antibiotic-resistance genes, in topsoil and ocean habitats, indicating the substantial role of biotic interactions in shaping microbial communities. Our results suggest that both competition and environmental filtering affect the abundance, composition and encoded gene functions of bacterial and fungal communities, indicating that the relative contributions of these microorganisms to global nutrient cycling varies spatially.
Assuntos
Bactérias/isolamento & purificação , Biodiversidade , Planeta Terra , Fungos/isolamento & purificação , Microbiota/fisiologia , Microbiologia do Solo , Bactérias/genética , Código de Barras de DNA Taxonômico , Resistência Microbiana a Medicamentos/genética , Fungos/genética , Concentração de Íons de Hidrogênio , Metagenômica , Microbiota/genética , Oceanos e Mares , Chuva , Água do Mar/microbiologiaRESUMO
Testing hypothesis about the biogeography of genes using large data resources such as Tara Oceans marine metagenomes and metatranscriptomes requires significant hardware resources and programming skills. The new release of the 'Ocean Gene Atlas' (OGA2) is a freely available intuitive online service to mine large and complex marine environmental genomic databases. OGA2 datasets available have been extended and now include, from the Tara Oceans portfolio: (i) eukaryotic Metagenome-Assembled-Genomes (MAGs) and Single-cell Assembled Genomes (SAGs) (10.2E+6 coding genes), (ii) version 2 of Ocean Microbial Reference Gene Catalogue (46.8E+6 non-redundant genes), (iii) 924 MetaGenomic Transcriptomes (7E+6 unigenes), (iv) 530 MAGs from an Arctic MAG catalogue (1E+6 genes) and (v) 1888 Bacterial and Archaeal Genomes (4.5E+6 genes), and an additional dataset from the Malaspina 2010 global circumnavigation: (vi) 317 Malaspina Deep Metagenome Assembled Genomes (0.9E+6 genes). Novel analyses enabled by OGA2 include phylogenetic tree inference to visualize user queries within their context of sequence homologues from both the marine environmental dataset and the RefSeq database. An Application Programming Interface (API) now allows users to query OGA2 using command-line tools, hence providing local workflow integration. Finally, gene abundance can be interactively filtered directly on map displays using any of the available environmental variables. Ocean Gene Atlas v2.0 is freely-available at: https://tara-oceans.mio.osupytheas.fr/ocean-gene-atlas/.
Assuntos
Bactérias , Eucariotos , Biologia Marinha , Plâncton , Bactérias/genética , Eucariotos/genética , Metagenoma , Filogenia , Plâncton/genéticaRESUMO
SUMMARY: Taxonomic analysis of microbial communities is well supported at the level of species and strains. However, species can contain significant phenotypic diversity and strains are rarely widely shared across global populations. Stratifying the diversity between species and strains can identify 'subspecies', which are a useful intermediary. High-throughput identification and profiling of subspecies is not yet supported in the microbiome field. Here, we use an operational definition of subspecies based on single nucleotide variant (SNV) patterns within species to identify and profile subspecies in metagenomes, along with their distinctive SNVs and genes. We incorporate this method into metaSNV v2, which extends existing SNV-calling software to support further SNV interpretation for population genetics. These new features support microbiome analyses to link SNV profiles with host phenotype or environment and niche-specificity. We demonstrate subspecies identification in marine and fecal metagenomes. In the latter, we analyze 70 species in 7524 adult and infant subjects, supporting a common subspecies population structure in the human gut microbiome and illustrating some limits in subspecies calling. AVAILABILITY AND IMPLEMENTATION: Source code, documentation, tutorials and test data are available at https://github.com/metasnv-tool/metaSNV and https://metasnv.embl.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Microbioma Gastrointestinal , Microbiota , Humanos , Metagenoma , Software , FenótipoRESUMO
A Western lifestyle with high salt consumption can lead to hypertension and cardiovascular disease. High salt may additionally drive autoimmunity by inducing T helper 17 (TH17) cells, which can also contribute to hypertension. Induction of TH17 cells depends on gut microbiota; however, the effect of salt on the gut microbiome is unknown. Here we show that high salt intake affects the gut microbiome in mice, particularly by depleting Lactobacillus murinus. Consequently, treatment of mice with L. murinus prevented salt-induced aggravation of actively induced experimental autoimmune encephalomyelitis and salt-sensitive hypertension by modulating TH17 cells. In line with these findings, a moderate high-salt challenge in a pilot study in humans reduced intestinal survival of Lactobacillus spp., increased TH17 cells and increased blood pressure. Our results connect high salt intake to the gut-immune axis and highlight the gut microbiome as a potential therapeutic target to counteract salt-sensitive conditions.
Assuntos
Microbioma Gastrointestinal/efeitos dos fármacos , Lactobacillus/efeitos dos fármacos , Lactobacillus/isolamento & purificação , Cloreto de Sódio/farmacologia , Células Th17/efeitos dos fármacos , Células Th17/imunologia , Animais , Autoimunidade/efeitos dos fármacos , Pressão Sanguínea/efeitos dos fármacos , Modelos Animais de Doenças , Encefalomielite Autoimune Experimental/induzido quimicamente , Encefalomielite Autoimune Experimental/microbiologia , Encefalomielite Autoimune Experimental/patologia , Encefalomielite Autoimune Experimental/terapia , Fezes/microbiologia , Humanos , Hipertensão/induzido quimicamente , Ácidos Indolacéticos/metabolismo , Indóis/metabolismo , Intestinos/citologia , Intestinos/efeitos dos fármacos , Intestinos/imunologia , Intestinos/microbiologia , Lactobacillus/imunologia , Ativação Linfocitária/efeitos dos fármacos , Contagem de Linfócitos , Masculino , Camundongos , Projetos Piloto , Cloreto de Sódio/administração & dosagem , Simbiose , Células Th17/citologia , Triptofano/metabolismoRESUMO
Bacterial specialized metabolites are increasingly recognized as important factors in animal-microbiome interactions: for example, by providing the host with chemical defenses. Even in chemically rich animals, such compounds have been found to originate from individual members of more diverse microbiomes. Here, we identified a remarkable case of a moderately complex microbiome in the sponge host Mycale hentscheli in which multiple symbionts jointly generate chemical diversity. In addition to bacterial pathways for three distinct polyketide families comprising microtubule-inhibiting peloruside drug candidates, mycalamide-type contact poisons, and the eukaryotic translation-inhibiting pateamines, we identified extensive biosynthetic potential distributed among a broad phylogenetic range of bacteria. Biochemical data on one of the orphan pathways suggest a previously unknown member of the rare polytheonamide-type cytotoxin family as its product. Other than supporting a scenario of cooperative symbiosis based on bacterial metabolites, the data provide a rationale for the chemical variability of M. hentscheli and could pave the way toward biotechnological peloruside production. Most bacterial lineages in the compositionally unusual sponge microbiome were not known to synthesize bioactive metabolites, supporting the concept that microbial dark matter harbors diverse producer taxa with as yet unrecognized drug discovery potential.
Assuntos
Bactérias/metabolismo , Microbiota/fisiologia , Poríferos/microbiologia , Animais , Citotoxinas/metabolismo , Genoma Bacteriano , SimbioseRESUMO
Profiling the taxonomic composition of microbial communities commonly involves the classification of ribosomal RNA gene fragments. As a trade-off to maintain high classification accuracy, existing tools are typically limited to the genus level. Here, we present mTAGs, a taxonomic profiling tool that implements the alignment of metagenomic sequencing reads to degenerate consensus reference sequences of small subunit ribosomal RNA genes. It uses DNA fragments, that is, paired-end sequencing reads, as count units and provides relative abundance profiles at multiple taxonomic ranks, including operational taxonomic units based on a 97% sequence identity cutoff. At the genus rank, mTAGs outperformed other tools across several metrics, such as the F1 score by >11% across data from different environments, and achieved competitive (F1 score) or better results (Bray-Curtis dissimilarity) at the sub-genus level. AVAILABILITY AND IMPLEMENTATION: The software tool mTAGs is implemented in Python. The source code and binaries are freely available (https://github.com/SushiLab/mTAGs). The data underlying this article are available in Zenodo, at https://doi.org/10.5281/zenodo.4352762. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.