Results 1 - 20 of 50
1.
Nat Microbiol ; 9(7): 1873-1883, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38902374

ABSTRACT

Historically neglected by microbial ecologists, soil viruses are now thought to be critical to global biogeochemical cycles. However, our understanding of their global distribution, activities and interactions with the soil microbiome remains limited. Here we present the Global Soil Virus Atlas, a comprehensive dataset compiled from 2,953 previously sequenced soil metagenomes and composed of 616,935 uncultivated viral genomes and 38,508 unique viral operational taxonomic units. Rarefaction curves from the Global Soil Virus Atlas indicate that most soil viral diversity remains unexplored, further underscored by high spatial turnover and low rates of shared viral operational taxonomic units across samples. By examining genes associated with biogeochemical functions, we also demonstrate the viral potential to impact soil carbon and nutrient cycling. This study represents an extensive characterization of soil viral diversity and provides a foundation for developing testable hypotheses regarding the role of the virosphere in the soil microbiome and global biogeochemistry.


Subjects
Biodiversity , Viral Genome , Metagenome , Microbiota , Soil Microbiology , Soil , Viruses , Viruses/genetics , Viruses/classification , Viruses/isolation & purification , Soil/chemistry , Viral Genome/genetics , Microbiota/genetics , Carbon/metabolism , Metagenomics , Phylogeny , Virome/genetics , Bacteria/genetics , Bacteria/classification , Bacteria/isolation & purification
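The rarefaction analysis mentioned in record 1 can be illustrated with a minimal sketch. It uses the standard analytical rarefaction formula for the expected number of distinct taxa in a random subsample; the function name and the toy counts are hypothetical and are not taken from the Global Soil Virus Atlas.

```python
from math import comb


def expected_richness(counts, n):
    """Expected number of distinct vOTUs in a random subsample of n sequences.

    counts: number of observations per vOTU (hypothetical toy data).
    Analytical rarefaction: E[S_n] = sum_i (1 - C(N - N_i, n) / C(N, n)),
    where N is the total number of observations and N_i the count of vOTU i.
    """
    total = sum(counts)
    if not 0 <= n <= total:
        raise ValueError("n must be between 0 and the total number of observations")
    denom = comb(total, n)
    # comb() returns 0 when n exceeds N - N_i, so that vOTU is certain to appear.
    return sum(1 - comb(total - c, n) / denom for c in counts)


# Toy community with three vOTUs (assumed values, for illustration only).
toy_counts = [5, 3, 2]
curve = [expected_richness(toy_counts, n) for n in range(sum(toy_counts) + 1)]
```

A curve that is still rising at the largest subsample size suggests that additional sampling would keep revealing new vOTUs, which is the pattern the abstract reports for soil viruses.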
2.
Nature ; 622(7983): 594-602, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37821698

ABSTRACT

Metagenomes encode an enormous diversity of proteins, reflecting a multiplicity of functions and activities [1,2]. Exploration of this vast sequence space has been limited to a comparative analysis against reference microbial genomes and protein families derived from those genomes. Here, to examine the scale of yet untapped functional diversity beyond what is currently possible through the lens of reference genomes, we develop a computational approach to generate reference-free protein families from the sequence space in metagenomes. We analyse 26,931 metagenomes and identify 1.17 billion protein sequences longer than 35 amino acids with no similarity to any sequences from 102,491 reference genomes or the Pfam database [3]. Using massively parallel graph-based clustering, we group these proteins into 106,198 novel sequence clusters with more than 100 members, doubling the number of protein families obtained from the reference genomes clustered using the same approach. We annotate these families on the basis of their taxonomic, habitat, geographical and gene neighbourhood distributions and, where sufficient sequence diversity is available, predict protein three-dimensional models, revealing novel structures. Overall, our results uncover an enormously diverse functional space, highlighting the importance of further exploring the microbial functional dark matter.


Subjects
Metagenome , Metagenomics , Microbiology , Proteins , Cluster Analysis , Metagenome/genetics , Metagenomics/methods , Proteins/chemistry , Proteins/classification , Proteins/genetics , Protein Databases , Protein Conformation
3.
mSystems ; 8(5): e0019723, 2023 Oct 26.
Article in English | MEDLINE | ID: mdl-37702511

ABSTRACT

IMPORTANCE: The findings of this study are significant, as N4-like viruses represent a unique viral lineage with a distinct replication mechanism and a conserved core genome. This work has resulted in a comprehensive global map of the entire N4-like viral lineage, including information on their distribution in different biomes, evolutionary divergence, genomic diversity, and the potential for viral-mediated host metabolic reprogramming. As such, this work significantly contributes to our understanding of the ecological function and viral-host interactions of bacteriophages.


Subjects
Bacteriophages , Viruses , Viral Genome/genetics , Phylogeny , Viruses/genetics , Bacteriophages/genetics , Genomics
4.
Front Bioeng Biotechnol ; 11: 1182500, 2023.
Article in English | MEDLINE | ID: mdl-37064232

ABSTRACT

[This corrects the article DOI: 10.3389/fbioe.2020.00034.].

5.
Front Bioinform ; 3: 1157956, 2023.
Article in English | MEDLINE | ID: mdl-36959975

ABSTRACT

Metagenomics has enabled accessing the genetic repertoire of natural microbial communities. Metagenome shotgun sequencing has become the method of choice for studying and classifying microorganisms from various environments. To this end, several methods have been developed to process and analyze the sequence data from raw reads to end-products such as predicted protein sequences or families. In this article, we provide a thorough review to simplify such processes and discuss the alternative methodologies that can be followed in order to explore biodiversity at the protein family level. We provide details for analysis tools and we comment on their scalability as well as their advantages and disadvantages. Finally, we report the available data repositories and recommend various approaches for protein family annotation related to phylogenetic distribution, structure prediction and metadata enrichment.

6.
Nat Commun ; 13(1): 5485, 2022 09 19.
Article in English | MEDLINE | ID: mdl-36123347

ABSTRACT

Metagenomics is unearthing the previously hidden world of soil viruses. Many soil viral sequences in metagenomes contain putative auxiliary metabolic genes (AMGs) that are not associated with viral replication. Here, we establish that AMGs on soil viruses actually produce functional, active proteins. We focus on AMGs that potentially encode chitosanase enzymes that metabolize chitin, a common carbon polymer. We express and functionally screen several chitosanase genes identified from environmental metagenomes. One expressed protein showing endo-chitosanase activity (V-Csn) is crystallized and structurally characterized at ultra-high resolution, thus representing the structure of a soil viral AMG product. This structure provides details about the active site and, together with structure models determined using AlphaFold, facilitates understanding of substrate specificity and enzyme mechanism. Our findings support the hypothesis that soil viruses contribute auxiliary functions to their hosts.


Subjects
Soil , Viruses , Carbon , Chitin , Glycoside Hydrolases/metabolism , Viral Proteins/genetics , Viruses/genetics
7.
Cell ; 185(21): 4023-4037.e18, 2022 10 13.
Article in English | MEDLINE | ID: mdl-36174579

ABSTRACT

High-throughput RNA sequencing offers broad opportunities to explore the Earth RNA virome. Mining 5,150 diverse metatranscriptomes uncovered >2.5 million RNA virus contigs. Analysis of >330,000 RNA-dependent RNA polymerases (RdRPs) shows that this expansion corresponds to a 5-fold increase of the known RNA virus diversity. Gene content analysis revealed multiple protein domains previously not found in RNA viruses and implicated in virus-host interactions. Extended RdRP phylogeny supports the monophyly of the five established phyla and reveals two putative additional bacteriophage phyla and numerous putative additional classes and orders. The dramatically expanded phylum Lenarviricota, consisting of bacterial and related eukaryotic viruses, now accounts for a third of the RNA virome. Identification of CRISPR spacer matches and bacteriolytic proteins suggests that subsets of picobirnaviruses and partitiviruses, previously associated with eukaryotes, infect prokaryotic hosts.


Subjects
Bacteriophages , RNA Viruses , Bacteriophages/genetics , DNA-Directed RNA Polymerases/genetics , Viral Genome , Phylogeny , RNA , RNA Viruses/genetics , RNA-Dependent RNA Polymerase/genetics , Virome
8.
iScience ; 25(8): 104680, 2022 Aug 19.
Article in English | MEDLINE | ID: mdl-35942087

ABSTRACT

The hadal ocean biosphere, that is, the deepest part of the world's oceans, harbors a unique microbial community, suggesting a co-occurring virioplankton assemblage that has yet to be uncovered. Herein, we reveal the unique virioplankton assemblages of the Challenger Deep, comprising 95,813 non-redundant viral contigs from the surface to the hadal zone. Almost all of the dominant viral contigs in the hadal zone were unclassified, potentially related to Alteromonadales and Oceanospirillales. In total, 2,586 viral auxiliary metabolic genes from 132 different KEGG orthologous groups were mainly related to carbon, nitrogen, sulfur, and arsenic metabolism. Lysogenic viral production and integrase genes were augmented in the hadal zone, suggesting the prevalence of a lysogenic viral life strategy. Abundant rve genes in the hadal zone, which function as transposases in the caudoviruses, further suggest the prevalence of viral-mediated horizontal gene transfer. This study provides fundamental insights into the virioplankton assemblages of the hadal zone, reinforcing the necessity of incorporating virioplankton into the hadal biogeochemical cycles.

9.
Nat Commun ; 13(1): 2389, 2022 05 02.
Article in English | MEDLINE | ID: mdl-35501347

ABSTRACT

Recent advances in environmental genomics have provided unprecedented opportunities for the investigation of viruses in natural settings. Yet, our knowledge of viral biogeographic patterns and the corresponding drivers is still limited. Here, we perform metagenomic deep sequencing on 90 acid mine drainage (AMD) sediments sampled across Southern China and examine the biogeography of viruses in this extreme environment. The results demonstrate that prokaryotic communities dictate viral taxonomic and functional diversity, abundance and structure, whereas other factors especially latitude and mean annual temperature also impact viral populations and functions. In silico predictions highlight lineage-specific virus-host abundance ratios and richness-dependent virus-host interaction structure. Further functional analyses reveal important roles of environmental conditions and horizontal gene transfers in shaping viral auxiliary metabolic genes potentially involved in phosphorus assimilation. Our findings underscore the importance of both abiotic and biotic factors in predicting the taxonomic and functional biogeographic dynamics of viruses in the AMD sediments.


Subjects
Biodiversity , Viruses , Acids , Metagenome/genetics , Mining , Viruses/genetics
10.
Microbiome ; 9(1): 231, 2021 11 26.
Article in English | MEDLINE | ID: mdl-34823595

ABSTRACT

BACKGROUND: In Antarctica, summer sunlight enables phototrophic microorganisms to drive primary production, thereby "feeding" ecosystems to enable their persistence through the long, dark winter months. In Ace Lake, a stratified marine-derived system in the Vestfold Hills of East Antarctica, a Chlorobium species of green sulphur bacteria (GSB) is the dominant phototroph, although its seasonal abundance changes more than 100-fold. Here, we analysed 413 Gb of Antarctic metagenome data including 59 Chlorobium metagenome-assembled genomes (MAGs) from Ace Lake and nearby stratified marine basins to determine how genome variation and population structure across a 7-year period impacted ecosystem function. RESULTS: A single species, Candidatus Chlorobium antarcticum (most similar to Chlorobium phaeovibrioides DSM265) prevails in all three aquatic systems and harbours very little genomic variation (≥ 99% average nucleotide identity). A notable feature of variation that did exist related to the genomic capacity to biosynthesize cobalamin. The abundance of phylotypes with this capacity changed seasonally ~ 2-fold, consistent with the population balancing the value of a bolstered photosynthetic capacity in summer against an energetic cost in winter. The very high GSB concentration (> 10⁸ cells ml⁻¹ in Ace Lake) and seasonal cycle of cell lysis likely make Ca. Chlorobium antarcticum a major provider of cobalamin to the food web. Analysis of Ca. Chlorobium antarcticum viruses revealed the species to be infected by generalist (rather than specialist) viruses with a broad host range (e.g., infecting Gammaproteobacteria) that were present in diverse Antarctic lakes. The marked seasonal decrease in Ca. Chlorobium antarcticum abundance may restrict specialist viruses from establishing effective lifecycles, whereas generalist viruses may augment their proliferation using other hosts. CONCLUSION: The factors shaping Antarctic microbial communities are gradually being defined. In addition to the cold, the annual variation in sunlight hours dictates which phototrophic species can grow and the extent to which they contribute to ecosystem processes. The Chlorobium population studied was inferred to provide cobalamin, in addition to carbon, nitrogen, hydrogen, and sulphur cycling, as critical ecosystem services. The specific Antarctic environmental factors and major ecosystem benefits afforded by this GSB likely explain why such a coherent population structure has developed in this Chlorobium species.


Subjects
Chlorobium , Microbiota , Antarctic Regions , Chlorobium/genetics , Ecosystem , Lakes/microbiology , Metagenome
11.
Nat Commun ; 12(1): 5398, 2021 09 13.
Article in English | MEDLINE | ID: mdl-34518545

ABSTRACT

As one of the largest biotechnological applications, activated sludge (AS) systems in wastewater treatment plants (WWTPs) harbor enormous numbers of viruses, with 10-1,000-fold higher concentrations than in natural environments. However, the compositional variation and host connections of AS viruses remain poorly explored. Here, we report a catalogue of ~50,000 prokaryotic viruses from six WWTPs, increasing the number of described viral species of AS by 23-fold and showing very high viral diversity, most of which is unknown (98.4-99.6% of total viral contigs). Most viral genera are represented in more than one AS system, with 53 identified across all. Viral infection widely spans 8 archaeal and 58 bacterial phyla, linking viruses with aerobic/anaerobic heterotrophs and other functional microorganisms controlling nitrogen/phosphorus removal. Notably, Mycobacterium, notorious for causing AS foaming, is associated with 402 viral genera. Our findings expand the current AS virus catalogue and provide a reference for phage treatment to control undesired microorganisms in WWTPs.


Subjects
Carbon Cycle , Prokaryotic Cells/virology , Sewage/virology , Virome/genetics , Viruses/genetics , Water Purification/methods , Archaea/classification , Archaea/genetics , Archaea/virology , Bacteria/classification , Bacteria/genetics , Bacteria/virology , Energy Metabolism/genetics , Viral Genes/genetics , Genetic Variation , Host-Pathogen Interactions , Open Reading Frames/genetics , Prokaryotic Cells/metabolism , DNA Sequence Analysis/methods , Sewage/microbiology , Viruses/classification , Viruses/metabolism
12.
Nat Microbiol ; 6(7): 960-970, 2021 07.
Article in English | MEDLINE | ID: mdl-34168315

ABSTRACT

Bacteriophages have important roles in the ecology of the human gut microbiome but are under-represented in reference databases. To address this problem, we assembled the Metagenomic Gut Virus catalogue that comprises 189,680 viral genomes from 11,810 publicly available human stool metagenomes. Over 75% of genomes represent double-stranded DNA phages that infect members of the Bacteroidia and Clostridia classes. Based on sequence clustering we identified 54,118 candidate viral species, 92% of which were not found in existing databases. The Metagenomic Gut Virus catalogue improves detection of viruses in stool metagenomes and accounts for nearly 40% of CRISPR spacers found in human gut Bacteria and Archaea. We also produced a catalogue of 459,375 viral protein clusters to explore the functional potential of the gut virome. This revealed tens of thousands of diversity-generating retroelements, which use error-prone reverse transcription to mutate target genes and may be involved in the molecular arms race between phages and their bacterial hosts.


Subjects
DNA Viruses/genetics , Gastrointestinal Microbiome/genetics , Viral Genome/genetics , Archaea/virology , Bacteria/virology , Bacteriophages/genetics , Catalogs as Topic , DNA Viruses/classification , Viral DNA/genetics , Feces/microbiology , Genetic Variation , Humans , Metagenomics , Phylogeny , Viral Proteins/genetics
13.
Cell Host Microbe ; 29(5): 726-739.e5, 2021 05 12.
Article in English | MEDLINE | ID: mdl-33957082

ABSTRACT

Rheumatoid arthritis (RA) is an autoimmune disease characterized in seropositive individuals by the presence of anti-cyclic citrullinated protein (CCP) antibodies. RA is linked to the intestinal microbiota, yet the association of microbes with CCP serology and their contribution to RA is unclear. We describe intestinal phage communities of individuals at risk for developing RA, with or without anti-CCP antibodies, whose first-degree relatives have been diagnosed with RA. We show that at-risk individuals harbor intestinal phage compositions that diverge based on CCP serology, are dominated by Streptococcaceae, Bacteroidaceae, and Lachnospiraceae phages, and may originate from disparate ecosystems. These phages encode unique repertoires of auxiliary metabolic genes, which associate with anti-CCP status, suggesting that these phages directly influence the metabolic and immunomodulatory capability of the microbiota. This work sets the stage for the use of phages as preclinical biomarkers and provides insight into a possible microbial-based causation of RA disease development.


Subjects
Rheumatoid Arthritis/virology , Bacteriophages/isolation & purification , Intestines/virology , Adult , Aged , Anti-Citrullinated Protein Antibodies/blood , Rheumatoid Arthritis/blood , Rheumatoid Arthritis/metabolism , Bacteriophages/classification , Bacteriophages/genetics , Female , Humans , Male , Microbiota , Middle Aged , Phylogeny , Risk Factors
15.
PLoS Comput Biol ; 17(3): e1008841, 2021 03.
Article in English | MEDLINE | ID: mdl-33770071

ABSTRACT

Understanding CRISPR-Cas systems (the adaptive defence mechanism that about half of bacterial species and most archaea use to neutralise viral attacks) is important for explaining the biodiversity observed in the microbial world as well as for editing animal and plant genomes effectively. The CRISPR-Cas system learns from previous viral infections and integrates small pieces from phage genomes called spacers into the microbial genome. The resulting library of spacers collected in CRISPR arrays is then compared with the DNA of potential invaders. One of the most intriguing and least well understood questions about CRISPR-Cas systems is the distribution of spacers across the microbial population. Here, using empirical data, we show that the global distribution of spacer numbers in CRISPR arrays across multiple biomes worldwide typically exhibits scale-invariant power law behaviour, and the standard deviation is greater than the sample mean. We develop a mathematical model of spacer loss and acquisition dynamics which fits observed data from almost four thousand metagenomes well. In analogy to the classical 'rich-get-richer' mechanism of power law emergence, the rate of spacer acquisition is proportional to the CRISPR array size, which allows a small proportion of CRISPRs within the population to possess a significant number of spacers. Our study provides an alternative explanation for the rarity of all-resistant super microbes in nature and why proliferation of phages can be highly successful despite the effectiveness of CRISPR-Cas systems.


Subjects
CRISPR-Cas Systems/genetics , Clustered Regularly Interspaced Short Palindromic Repeats , Metagenome/genetics , Genetic Models , Archaea/genetics , Bacteria/genetics , Bacteriophages/genetics , Clustered Regularly Interspaced Short Palindromic Repeats/genetics , Clustered Regularly Interspaced Short Palindromic Repeats/immunology , Intergenic DNA/genetics , Viral DNA/genetics , Metagenomics
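The 'rich-get-richer' acquisition rule described in record 15 can be sketched as a toy simulation. This is not the authors' model: it omits spacer loss entirely, and the function name, parameters and default values are hypothetical. It only illustrates how acquisition proportional to array size lets some arrays pull ahead of others.

```python
import random


def simulate_spacer_counts(n_arrays=200, n_steps=2000, seed=0):
    """Toy preferential-acquisition model (assumption: no spacer loss).

    Every array starts with one spacer. At each step a single spacer is
    added to an array chosen with probability proportional to its current
    spacer count.
    """
    rng = random.Random(seed)
    sizes = [1] * n_arrays
    indices = range(n_arrays)
    for _ in range(n_steps):
        # random.choices draws with the given relative weights.
        chosen = rng.choices(indices, weights=sizes, k=1)[0]
        sizes[chosen] += 1
    return sizes


sizes = simulate_spacer_counts()
mean_size = sum(sizes) / len(sizes)
largest = max(sizes)
```

Comparing `largest` with `mean_size` gives a rough sense of how uneven the simulated arrays become; reproducing the power law reported in the abstract would additionally require the loss dynamics that this sketch leaves out.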
16.
mSystems ; 6(2)2021 Mar 16.
Article in English | MEDLINE | ID: mdl-33727395

ABSTRACT

Viruses are ubiquitous and abundant in the oceans, and viral metagenomes (viromes) have been investigated extensively via several large-scale ocean sequencing projects. However, there have not been any systematic viromic studies in estuaries. Here, we investigated the viromes of the Delaware Bay and Chesapeake Bay, two Mid-Atlantic estuaries. Deep sequencing generated a total of 48,190 assembled viral sequences (>5 kb) and 26,487 viral populations (9,204 virus clusters and 17,845 singletons), including 319 circular viral contigs between 7.5 kb and 161.8 kb. Unknown viruses represented the vast majority of the dominant populations, while the composition of known viruses, such as pelagiphage and cyanophage, appeared to be relatively consistent across a wide range of salinity gradients and in different seasons. A difference between estuarine and ocean viromes was reflected by the proportions of Myoviridae, Podoviridae, Siphoviridae, Phycodnaviridae, and a few well-studied virus representatives. The difference in viral community between the Delaware Bay and Chesapeake Bay is significantly more pronounced than the difference caused by temperature or salinity, indicating strong local profiles caused by the unique ecology of each estuary. Interestingly, a viral contig similar to phages infecting Acinetobacter baumannii ("Iraqibacter") was found to be highly abundant in the Delaware Bay but not in the Chesapeake Bay, the source of which is yet to be identified. Highly abundant viruses in both estuaries have close hits to viral sequences derived from the marine single-cell genomes or long-read single-molecule sequencing, suggesting that important viruses are still waiting to be discovered in the estuarine environment. IMPORTANCE: This is the first systematic study about spatial and temporal variation of virioplankton communities in estuaries using deep metagenomics sequencing. It is among the highest-quality viromic data sets to date, showing remarkably consistent sequencing depth and quality across samples. Our results indicate that there exists a large pool of abundant and diverse viruses in estuaries that have not yet been cultivated, their genomes only available thanks to single-cell genomics or single-molecule sequencing, demonstrating the importance of these methods for viral discovery. The spatiotemporal pattern of these abundant uncultivated viruses is more variable than that of cultured viruses. Despite strong environmental gradients, season and location had surprisingly little impact on the viral community within an estuary, but we saw a significant distinction between the two estuaries and also between estuarine and open ocean viromes.

17.
Front Microbiol ; 12: 641440, 2021.
Article in English | MEDLINE | ID: mdl-33776973

ABSTRACT

Prokaryotic tolerance to inorganic arsenic is a widespread trait habitually determined by operons encoding an As(III)-responsive repressor (ArsR), an As(V)-reductase (ArsC), and an As(III)-export pump (ArsB), often accompanied by other complementary genes. Enigmatically, the genomes of many environmental bacteria typically contain two or more copies of this basic genetic device arsRBC. To shed some light on the logic of such apparently unnecessary duplication(s), we have inspected the regulation, together and separately, of the two ars clusters borne by the soil bacterium Pseudomonas putida strain KT2440, in particular the cross talk between the two repressors ArsR1/ArsR2 and the respective promoters. DNase I footprinting and gel retardation analyses of Pars1 and Pars2 with their matching regulators revealed non-identical binding sequences and interaction patterns for each of the systems. However, in vitro transcription experiments showed that the repressors could downregulate each other's promoters, albeit within a different set of parameters. The regulatory frame that emerges from these data corresponds to a particular type of bifan motif where all key interactions have a negative sign. The distinct regulatory architecture that stems from coexistence of various ArsR variants in the same cells could confer an adaptive advantage that favors the maintenance of the two proteins as separate repressors.

18.
Bioinformatics ; 37(13): 1805-1813, 2021 Jul 27.
Article in English | MEDLINE | ID: mdl-33471063

ABSTRACT

MOTIVATION: Two key steps in the analysis of uncultured viruses recovered from metagenomes are the taxonomic classification of the viral sequences and the identification of putative host(s). Both steps rely mainly on the assignment of viral proteins to orthologs in cultivated viruses. Viral Protein Families (VPFs) can be used for the robust identification of new viral sequences in large metagenomics datasets. Despite the importance of VPF information for viral discovery, VPFs have not yet been explored for determining viral taxonomy and host targets. RESULTS: In this work, we classified the set of VPFs from the IMG/VR database and developed VPF-Class. VPF-Class is a tool that automates the taxonomic classification and host prediction of viral contigs based on the assignment of their proteins to a set of classified VPFs. Applying VPF-Class to 731K uncultivated virus contigs from the IMG/VR database, we were able to classify 363K contigs at the genus level and predict the host of over 461K contigs. In the RefSeq database, VPF-Class reported an accuracy of nearly 100% in classifying dsDNA, ssDNA and retroviruses at the genus level, considering a membership ratio and a confidence score of 0.2. The accuracy in host prediction was 86.4%, also at the genus level, considering a membership ratio of 0.3 and a confidence score of 0.5. In the prophages dataset, the accuracy in host prediction was 86%, considering a membership ratio of 0.6 and a confidence score of 0.8. Moreover, from the Global Ocean Virome dataset, over 817K viral contigs out of 1 million were classified. AVAILABILITY AND IMPLEMENTATION: The implementation of VPF-Class can be downloaded from https://github.com/biocom-uib/vpf-tools. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

19.
Nucleic Acids Res ; 49(D1): D764-D775, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33137183

ABSTRACT

Viruses are integral components of all ecosystems and microbiomes on Earth. Through pervasive infections of their cellular hosts, viruses can reshape microbial community structure and drive global nutrient cycling. Over the past decade, viral sequences identified from genomes and metagenomes have provided an unprecedented view of viral genome diversity in nature. Since 2016, the IMG/VR database has provided access to the largest collection of viral sequences obtained from (meta)genomes. Here, we present the third version of IMG/VR, composed of 18 373 cultivated and 2 314 329 uncultivated viral genomes (UViGs), nearly tripling the total number of sequences compared to the previous version. These clustered into 935 362 viral Operational Taxonomic Units (vOTUs), including 188 930 with two or more members. UViGs in IMG/VR are now reported as single viral contigs, integrated proviruses or genome bins, and are annotated with a new standardized pipeline including genome quality estimation using CheckV, taxonomic classification reflecting the latest ICTV update, and expanded host taxonomy prediction. The new IMG/VR interface enables users to efficiently browse, search, and select UViGs based on genome features and/or sequence similarity. IMG/VR v3 is available at https://img.jgi.doe.gov/vr, and the underlying data are available to download at https://genome.jgi.doe.gov/portal/IMG_VR.


Subjects
Genetic Databases , Ecosystem , Molecular Evolution , Viral Genome , Viruses/genetics , Base Sequence , Cluster Analysis , Geography , Molecular Sequence Annotation , Nucleic Acid Sequence Homology , User-Computer Interface
20.
Nat Biotechnol ; 39(4): 499-509, 2021 04.
Article in English | MEDLINE | ID: mdl-33169036

ABSTRACT

The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has enabled insights into the ecology and evolution of environmental and host-associated microbiomes. Here we applied this approach to >10,000 metagenomes collected from diverse habitats covering all of Earth's continents and oceans, including metagenomes from human and animal hosts, engineered environments, and natural and agricultural soils, to capture extant microbial, metabolic and functional potential. This comprehensive catalog includes 52,515 metagenome-assembled genomes representing 12,556 novel candidate species-level operational taxonomic units spanning 135 phyla. The catalog expands the known phylogenetic diversity of bacteria and archaea by 44% and is broadly available for streamlined comparative analyses, interactive exploration, metabolic modeling and bulk download. We demonstrate the utility of this collection for understanding secondary-metabolite biosynthetic potential and for resolving thousands of new host linkages to uncultivated viruses. This resource underscores the value of genome-centric approaches for revealing genomic properties of uncultivated microorganisms that affect ecosystem processes.


Subjects
Archaea/genetics , Bacteria/genetics , Metabolomics/methods , Metagenome , Metagenomics/methods , Viruses/genetics , Air Microbiology , Animals , Archaea/classification , Archaea/isolation & purification , Bacteria/classification , Bacteria/isolation & purification , Catalogs as Topic , Ecosystem , Humans , Phylogeny , Soil Microbiology , Viruses/isolation & purification , Water Microbiology