ABSTRACT
Plasmids are mobile genetic elements found in many clades of Archaea and Bacteria. They drive horizontal gene transfer, impacting ecological and evolutionary processes within microbial communities, and hold substantial importance in human health and biotechnology. To support plasmid research and provide scientists with data on an unprecedented diversity of plasmid sequences, we introduce the IMG/PR database, a new resource encompassing 699 973 plasmid sequences derived from genomes, metagenomes and metatranscriptomes. IMG/PR is the first database to provide data on plasmids that were systematically identified from diverse microbiome samples. IMG/PR plasmids are associated with rich metadata that includes geographical and ecosystem information, host taxonomy, similarity to other plasmids, functional annotation, and the presence of genes involved in conjugation and antibiotic resistance. The database offers diverse methods for exploring its extensive plasmid collection, enabling users to navigate plasmids through metadata-centric queries, plasmid comparisons and BLAST searches. The web interface for IMG/PR is accessible at https://img.jgi.doe.gov/pr. Plasmid metadata and sequences can be downloaded from https://genome.jgi.doe.gov/portal/IMG_PR.
Subjects
Metagenome, Microbiota, Humans, Metadata, Software, Genetic Databases, Plasmids/genetics
ABSTRACT
Multiomics approaches need to be applied in the central Arctic Ocean to benchmark biodiversity change and to identify novel species and their genes. As part of MOSAiC, EcoOmics will therefore be essential for conservation and sustainable bioprospecting in one of the least explored ecosystems on Earth.
Subjects
Benchmarking, Ecosystem, Arctic Regions, Biodiversity, Oceans and Seas
ABSTRACT
The Integrated Microbial Genomes & Microbiomes system (IMG/M: https://img.jgi.doe.gov/m/) at the Department of Energy (DOE) Joint Genome Institute (JGI) continues to provide support for users to perform comparative analysis of isolate and single cell genomes, metagenomes, and metatranscriptomes. In addition to datasets produced by the JGI, IMG v.7 also includes datasets imported from public sources such as NCBI Genbank, SRA, and the DOE National Microbiome Data Collaborative (NMDC), or submitted by external users. In the past couple of years, we have continued our effort to help the user community by improving the annotation pipeline, upgrading the contents with new reference database versions, and adding new analysis functionalities such as advanced scaffold search, Average Nucleotide Identity (ANI) for high-quality metagenome bins, new cassette search, improved gene neighborhood display, and improvements to metatranscriptome data display and analysis. We also extended the collaboration and integration efforts with other DOE-funded projects such as NMDC and DOE Biology Knowledgebase (KBase).
Subjects
Data Management, Genomics, Bacterial Genome, Software, Archaeal Genome, Genetic Databases, Metagenome
ABSTRACT
Viruses are widely recognized as critical members of all microbiomes. Metagenomics enables large-scale exploration of the global virosphere, progressively revealing the extensive genomic diversity of viruses on Earth and highlighting the myriad ways in which viruses impact biological processes. IMG/VR provides access to the largest collection of viral sequences obtained from (meta)genomes, along with functional annotation and rich metadata. A web interface enables users to efficiently browse and search viruses based on genome features and/or sequence similarity. Here, we present the fourth version of IMG/VR, composed of >15 million virus genomes and genome fragments, a ≈6-fold increase in size compared to the previous version. These clustered into 8.7 million viral operational taxonomic units, including 231 408 with at least one high-quality representative. Viral sequences in IMG/VR are now systematically identified from genomes, metagenomes, and metatranscriptomes using a new detection approach (geNomad), and IMG standard annotations are complemented with genome quality estimation using CheckV, taxonomic classification reflecting the latest taxonomic standards, and microbial host taxonomy prediction. IMG/VR v4 is available at https://img.jgi.doe.gov/vr, and the underlying data are available to download at https://genome.jgi.doe.gov/portal/IMG_VR.
Subjects
Genetic Databases, Viral Genome, Metadata, Metagenomics, Software
ABSTRACT
The Integrated Microbial Genomes & Microbiomes system (IMG/M: https://img.jgi.doe.gov/m/) contains annotated isolate genome and metagenome datasets sequenced at the DOE's Joint Genome Institute (JGI), submitted by external users, or imported from public sources such as NCBI. IMG v 6.0 includes advanced search functions and a new tool for statistical analysis of mixed sets of genomes and metagenome bins. The new IMG web user interface also has a new Help page with additional documentation and webinar tutorials to help users better understand how to use various IMG functions and tools for their research. New datasets have been processed with the prokaryotic annotation pipeline v.5, which includes extended protein family assignments.
Subjects
Data Analysis, Data Management, Genetic Databases, Archaeal Genome, Microbial Genome, Metagenome, 16S Ribosomal RNA/genetics, Search Engine
ABSTRACT
Microbial secondary metabolism is a reservoir of bioactive compounds of immense biotechnological and biomedical potential. The biosynthetic machinery responsible for the production of these secondary metabolites (SMs) (also called natural products) is often encoded by collocated groups of genes called biosynthetic gene clusters (BGCs). High-throughput genome sequencing of both isolates and metagenomic samples combined with the development of specialized computational workflows is enabling systematic identification of BGCs and the discovery of novel SMs. In order to advance exploration of microbial secondary metabolism and its diversity, we developed the largest publicly available database of predicted BGCs combined with experimentally verified BGCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc-public). Here we describe the first major content update of the IMG-ABC knowledgebase, since its initial release in 2015, refreshing the BGC prediction pipeline with the latest version of antiSMASH (v5) as well as presenting the data in the context of underlying environmental metadata sourced from GOLD (https://gold.jgi.doe.gov/). This update has greatly improved the quality and expanded the types of predicted BGCs compared to the previous version.
Subjects
Biosynthetic Pathways/genetics, Genetic Databases, Microbial Genome, Multigene Family, Secondary Metabolism/genetics, Bacteriocins/biosynthesis, Bacteriocins/genetics, Knowledge Bases, Metadata, Metagenome, User-Computer Interface
ABSTRACT
Burkholderia cenocepacia TAtl-371 was isolated from the rhizosphere of a tomato plant growing in Atlatlahucan, Morelos, Mexico. This strain exhibited a broad antimicrobial spectrum against bacteria, yeast, and fungi. Here, we report and describe the improved, high-quality permanent draft genome of B. cenocepacia TAtl-371, which was sequenced using a combination of PacBio RS and PacBio RS II sequencing methods. The 7,496,106 bp genome of the TAtl-371 strain is arranged in three scaffolds, contains 6722 protein-coding genes, and 99 RNA only-encoding genes. Genome analysis revealed genes related to biosynthesis of antimicrobials such as non-ribosomal peptides, siderophores, chitinases, and bacteriocins. Moreover, analysis of bacterial growth on different carbon and nitrogen sources shows that the strain retains its antimicrobial ability.
Subjects
Antibiosis, Burkholderia cenocepacia/genetics, Burkholderia cepacia complex, Carbon/metabolism, Bacterial Genome, Nitrogen/metabolism, Bacteriocins/genetics, Burkholderia cenocepacia/isolation & purification, Chitinases/genetics, Solanum lycopersicum/microbiology, Mexico, Rhizosphere, DNA Sequence Analysis, Siderophores/genetics, Soil Microbiology
ABSTRACT
In May and June of 2021, marine microbial samples were collected for DNA sequencing in East Sound, WA, USA every 4 hours for 22 days. This high temporal resolution sampling effort captured the last 3 days of a Rhizosolenia sp. bloom, the initiation and complete bloom cycle of Chaetoceros socialis (8 days), and the following bacterial bloom (2 days). Metagenomes were completed on the time series, and the dataset includes 128 size-fractionated microbial samples (0.22-1.2 µm), providing gene abundances for the dominant members of bacteria, archaea, and viruses. This dataset also has time-matched nutrient analyses, flow cytometry data, and physical parameters of the environment at a single point of sampling within a coastal ecosystem that experiences regular bloom events, facilitating a range of modeling efforts that can be leveraged to understand microbial community structure and their influences on the growth, maintenance, and senescence of phytoplankton blooms.
ABSTRACT
We present eight metatranscriptomic datasets of light algal and cyanolichen biological soil crusts from the Mojave Desert in response to wetting. These data will help us understand gene expression patterns in desert biocrust microbial communities after they have been reactivated by the addition of water.
ABSTRACT
We present six whole community shotgun metagenomic sequencing data sets of two types of biological soil crusts sampled at the ecotone of the Mojave Desert and Colorado Desert in California. These data will help us understand the diversity and function of biocrust microbial communities, which are essential for desert ecosystems.
ABSTRACT
Novel bacterial isolates with the capabilities of lignin depolymerization, catabolism, or both, could be pertinent to lignocellulosic biofuel applications. In this study, we aimed to identify anaerobic bacteria that could address the economic challenges faced with microbial-mediated biotechnologies, such as the need for aeration and mixing. Using a consortium seeded from temperate forest soil and enriched under anoxic conditions with organosolv lignin as the sole carbon source, we successfully isolated a novel bacterium, designated 159R. Based on the 16S rRNA gene, the isolate belongs to the genus Sodalis in the family Bruguierivoracaceae. Whole-genome sequencing revealed a genome size of 6.38 Mbp and a GC content of 55 mol%. To resolve the phylogenetic position of 159R, its phylogeny was reconstructed using (i) 16S rRNA genes of its closest relatives, (ii) multilocus sequence analysis (MLSA) of 100 genes, (iii) 49 clusters of orthologous groups (COG) domains, and (iv) 400 conserved proteins. Isolate 159R was closely related to the deadwood-associated Sodalis guild rather than the tsetse fly and other insect endosymbiont guilds. Estimated genome-sequence-based digital DNA-DNA hybridization (dDDH), genome percentage of conserved proteins (POCP), and an alignment analysis between 159R and the Sodalis clade species further supported that isolate 159R was part of the Sodalis genus and a strain of Sodalis ligni. We proposed the name Sodalis ligni str. 159R (=DSM 110549 = ATCC TSD-177).
IMPORTANCE Currently, in the paper industry, paper mill pulping relies on unsustainable and costly processes to remove lignin from lignocellulosic material. A greener approach is biopulping, which uses microbes and their enzymes to break down lignin. However, there are limitations to biopulping that prevent it from outcompeting other pulping processes, such as requiring constant aeration and mixing. Anaerobic bacteria are a promising alternative source for consolidated depolymerization of lignin and its conversion to valuable by-products. We presented Sodalis ligni str. 159R and its characteristics as another example of potential mechanisms that can be developed for lignocellulosic applications.
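The abstract above lists the percentage of conserved proteins (POCP) among the evidence for genus placement. As a minimal sketch of how that index is commonly computed, POCP = (C1 + C2) / (T1 + T2) × 100, where C1 and C2 are the numbers of conserved proteins found in each genome when compared against the other and T1 and T2 are the total protein counts. The function name and the example counts below are hypothetical and are not taken from the study; how "conserved" is decided (for example, by BLASTP thresholds) is assumed to have been settled upstream.

```python
def pocp(conserved_a: int, conserved_b: int, total_a: int, total_b: int) -> float:
    """Return the percentage of conserved proteins between two genomes.

    conserved_a / conserved_b: proteins of genome A (B) with a conserved
    counterpart in genome B (A). total_a / total_b: total proteins per genome.
    """
    if total_a <= 0 or total_b <= 0:
        raise ValueError("total protein counts must be positive")
    if not (0 <= conserved_a <= total_a and 0 <= conserved_b <= total_b):
        raise ValueError("conserved counts must lie between 0 and the totals")
    return 100.0 * (conserved_a + conserved_b) / (total_a + total_b)


# Hypothetical counts for illustration only (not values from the paper).
example_pocp = pocp(conserved_a=3200, conserved_b=3100, total_a=5800, total_b=5200)
print(f"POCP = {example_pocp:.2f}%")
```

Because the index is symmetric in the two genomes, swapping the A and B arguments gives the same value.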
Subjects
Enterobacteriaceae, Lignin, Anaerobiosis, Animals, Bacterial Typing Techniques, Bacterial DNA/genetics, Bacterial DNA/metabolism, Enterobacteriaceae/genetics, Lignin/metabolism, Phylogeny, 16S Ribosomal RNA/genetics, DNA Sequence Analysis, Symbiosis
ABSTRACT
We present 49 metagenome assemblies of the microbiome associated with Sphagnum (peat moss) collected from ambient, artificially warmed, and geothermally warmed conditions across Europe. These data will enable further research regarding the impact of climate change on plant-microbe symbiosis, ecology, and ecosystem functioning of northern peatland ecosystems.
ABSTRACT
The phylum Actinobacteria includes important human pathogens like Mycobacterium tuberculosis and Corynebacterium diphtheriae and renowned producers of secondary metabolites of commercial interest, yet only a small part of its diversity is represented by sequenced genomes. Here, we present 824 actinobacterial isolate genomes in the context of a phylum-wide analysis of 6,700 genomes including public isolates and metagenome-assembled genomes (MAGs). We estimate that only 30%-50% of projected actinobacterial phylogenetic diversity possesses genomic representation via isolates and MAGs. A comparison of gene functions reveals novel determinants of host-microbe interaction as well as environment-specific adaptations such as potential antimicrobial peptides. We identify plasmids and prophages across isolates and uncover extensive prophage diversity structured mainly by host taxonomy. Analysis of >80,000 biosynthetic gene clusters reveals that horizontal gene transfer and gene loss shape secondary metabolite repertoire across taxa. Our observations illustrate the essential role of and need for high-quality isolate genome sequences.
ABSTRACT
Cyanobacteria are ubiquitous microorganisms with crucial ecosystem functions, yet most knowledge of their biology relates to aquatic taxa. We have constructed metagenomes for 50 taxonomically well-characterized terrestrial cyanobacterial cultures. These data will support phylogenomic studies of evolutionary relationships and gene content among these unique algae and their aquatic relatives.
ABSTRACT
Thermoflexus hugenholtzii JAD2T, the only cultured representative of the Chloroflexota order Thermoflexales, is abundant in Great Boiling Spring (GBS), NV, United States, and close relatives inhabit geothermal systems globally. However, no defined medium exists for T. hugenholtzii JAD2T and no single carbon source is known to support its growth, leaving key knowledge gaps in its metabolism and nutritional needs. Here, we report comparative genomic analysis of the draft genome of T. hugenholtzii JAD2T and eight closely related metagenome-assembled genomes (MAGs) from geothermal sites in China, Japan, and the United States, representing "Candidatus Thermoflexus japonica," "Candidatus Thermoflexus tengchongensis," and "Candidatus Thermoflexus sinensis." Genomics was integrated with targeted exometabolomics and 13C metabolic probing of T. hugenholtzii. The Thermoflexus genomes each code for complete central carbon metabolic pathways and an unusually high abundance and diversity of peptidases, particularly Metallo- and Serine peptidase families, along with ABC transporters for peptides and some amino acids. The T. hugenholtzii JAD2T exometabolome provided evidence of extracellular proteolytic activity based on the accumulation of free amino acids. However, several neutral and polar amino acids appear not to be utilized, based on their accumulation in the medium and the lack of annotated transporters. Adenine and adenosine were scavenged, and thymine and nicotinic acid were released, suggesting interdependency with other organisms in situ. Metabolic probing of T. hugenholtzii JAD2T using 13C-labeled compounds provided evidence of oxidation of glucose, pyruvate, cysteine, and citrate, and functioning glycolytic, tricarboxylic acid (TCA), and oxidative pentose-phosphate pathways (PPPs). However, differential use of position-specific 13C-labeled compounds showed that glycolysis and the TCA cycle were uncoupled. 
Thus, despite the high abundance of Thermoflexus in sediments of some geothermal systems, they appear to be highly focused on chemoorganotrophy, particularly protein degradation, and may interact extensively with other microorganisms in situ.
ABSTRACT
Eukaryotic phytoplankton are responsible for at least 20% of annual global carbon fixation. Their diversity and activity are shaped by interactions with prokaryotes as part of complex microbiomes. Although differences in their local species diversity have been estimated, we still have a limited understanding of environmental conditions responsible for compositional differences between local species communities on a large scale from pole to pole. Here, we show, based on pole-to-pole phytoplankton metatranscriptomes and microbial rDNA sequencing, that environmental differences between polar and non-polar upper oceans most strongly impact the large-scale spatial pattern of biodiversity and gene activity in algal microbiomes. The geographic differentiation of co-occurring microbes in algal microbiomes can be well explained by the latitudinal temperature gradient and associated breakpoints in their beta diversity, with an average breakpoint at 14 ± 4.3 °C, separating cold and warm upper oceans. As global warming impacts upper ocean temperatures, we project that breakpoints of beta diversity move markedly polewards. Hence, abrupt regime shifts in algal microbiomes could be caused by anthropogenic climate change.
Subjects
Genetic Variation, Microalgae/genetics, Microbiota/genetics, Phytoplankton/genetics, Transcriptome/genetics, Antarctic Regions, Arctic Regions, Biodiversity, Carbon Cycle, Climate Change, Gene Ontology, Geography, Global Warming, Microalgae/classification, Microalgae/growth & development, Oceans and Seas, Phytoplankton/classification, Phytoplankton/growth & development, 16S Ribosomal RNA/genetics, 18S Ribosomal RNA/genetics, DNA Sequence Analysis/methods, Species Specificity, Temperature
ABSTRACT
We report here the draft genome sequence of Yokenella regensburgei strain WCD67, isolated from the boxelder bug (Boisea trivittata). The genome is 5,277,883 bp in size, has a GC content of 54.12%, and has 5,416 genes. A total of 17 mobile elements were discovered, 6 of which were predicted to be phages.
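The GC content reported above is the share of guanine and cytosine among the unambiguous bases of the assembly. A minimal sketch of that calculation follows; the function name and the toy sequence are illustrative assumptions, and the snippet does not reproduce the authors' pipeline. Ambiguous symbols such as N are excluded from the denominator here, which is one common convention but not the only one.

```python
def gc_content(sequence: str) -> float:
    """Return GC content as a percentage of unambiguous (A, C, G, T) bases."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * gc / (gc + at)


# Toy sequence for illustration only (not from the WCD67 genome).
toy_sequence = "ATGCGCGGCCATTAGC"
toy_gc = gc_content(toy_sequence)
print(f"GC content: {toy_gc:.2f}%")
```

For a multi-scaffold draft genome, the same function can be applied to the concatenation of all scaffolds so that each base is weighted equally.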
ABSTRACT
We report eight genomes from representatives of the phylum Acidobacteria subdivisions 1 and 3, isolated from soils. The genome sizes range from 4.9 to 6.7 Mb. Genomic analysis reveals putative genes for low- and high-affinity respiratory oxygen reductases, high-affinity hydrogenases, and the capacity to use a diverse collection of carbohydrates.
ABSTRACT
The addition of glucose to soil has long been used to study the metabolic activity of microbes in soil; however, the response of the microbial ecophysiology remains poorly characterized. To address this, we sequenced the metagenomes and metatranscriptomes of glucose-amended soil microbial communities in a laboratory incubation.
ABSTRACT
Hydrologic changes modify microbial community structure and ecosystem functions, especially in wetland systems. Here, we present 24 metagenomes from a coastal freshwater wetland experiment in which we manipulated hydrologic conditions and plant presence. These wetland soil metagenomes will deepen our understanding of how hydrology and vegetation influence microbial functional diversity.