Results 1 - 20 of 179
3.
Nat Biotechnol ; 39(4): 499-509, 2021 04.
Article in English | MEDLINE | ID: mdl-33169036

ABSTRACT

The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has enabled insights into the ecology and evolution of environmental and host-associated microbiomes. Here we applied this approach to >10,000 metagenomes collected from diverse habitats covering all of Earth's continents and oceans, including metagenomes from human and animal hosts, engineered environments, and natural and agricultural soils, to capture extant microbial, metabolic and functional potential. This comprehensive catalog includes 52,515 metagenome-assembled genomes representing 12,556 novel candidate species-level operational taxonomic units spanning 135 phyla. The catalog expands the known phylogenetic diversity of bacteria and archaea by 44% and is broadly available for streamlined comparative analyses, interactive exploration, metabolic modeling and bulk download. We demonstrate the utility of this collection for understanding secondary-metabolite biosynthetic potential and for resolving thousands of new host linkages to uncultivated viruses. This resource underscores the value of genome-centric approaches for revealing genomic properties of uncultivated microorganisms that affect ecosystem processes.


Subject(s)
Archaea/genetics , Bacteria/genetics , Metabolomics/methods , Metagenome , Metagenomics/methods , Viruses/genetics , Air Microbiology , Animals , Archaea/classification , Archaea/isolation & purification , Bacteria/classification , Bacteria/isolation & purification , Catalogs as Topic , Ecosystem , Humans , Phylogeny , Soil Microbiology , Viruses/isolation & purification , Water Microbiology
4.
Nucleic Acids Res ; 49(D1): D764-D775, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33137183

ABSTRACT

Viruses are integral components of all ecosystems and microbiomes on Earth. Through pervasive infections of their cellular hosts, viruses can reshape microbial community structure and drive global nutrient cycling. Over the past decade, viral sequences identified from genomes and metagenomes have provided an unprecedented view of viral genome diversity in nature. Since 2016, the IMG/VR database has provided access to the largest collection of viral sequences obtained from (meta)genomes. Here, we present the third version of IMG/VR, composed of 18 373 cultivated and 2 314 329 uncultivated viral genomes (UViGs), nearly tripling the total number of sequences compared to the previous version. These clustered into 935 362 viral Operational Taxonomic Units (vOTUs), including 188 930 with two or more members. UViGs in IMG/VR are now reported as single viral contigs, integrated proviruses or genome bins, and are annotated with a new standardized pipeline including genome quality estimation using CheckV, taxonomic classification reflecting the latest ICTV update, and expanded host taxonomy prediction. The new IMG/VR interface enables users to efficiently browse, search, and select UViGs based on genome features and/or sequence similarity. IMG/VR v3 is available at https://img.jgi.doe.gov/vr, and the underlying data are available to download at https://genome.jgi.doe.gov/portal/IMG_VR.


Subject(s)
Genetic Databases , Ecosystem , Molecular Evolution , Viral Genome , Viruses/genetics , Base Sequence , Cluster Analysis , Geography , Molecular Sequence Annotation , Nucleic Acid Sequence Homology , User-Computer Interface
5.
Nucleic Acids Res ; 47(D1): D666-D677, 2019 01 08.
Article in English | MEDLINE | ID: mdl-30289528

ABSTRACT

The Integrated Microbial Genomes & Microbiomes system v.5.0 (IMG/M: https://img.jgi.doe.gov/m/) contains annotated datasets categorized into: archaea, bacteria, eukarya, plasmids, viruses, genome fragments, metagenomes, cell enrichments, single particle sorts, and metatranscriptomes. Source datasets include those generated by the DOE's Joint Genome Institute (JGI), submitted by external scientists, or collected from public sequence data archives such as NCBI. All submissions are typically processed through the IMG annotation pipeline and then loaded into the IMG data warehouse. IMG's web user interface provides a variety of analytical and visualization tools for comparative analysis of isolate genomes and metagenomes in IMG. IMG/M allows open access to all public genomes in the IMG data warehouse, while its expert review (ER) system (IMG/MER: https://img.jgi.doe.gov/mer/) allows registered users to access their private genomes and to store their private datasets in workspace for sharing and for further analysis. IMG/M data content has grown by 60% since the last report published in the 2017 NAR Database Issue. IMG/M v.5.0 has a new and more powerful genome search feature, new statistical tools, and supports metagenome binning.


Subject(s)
Data Management/methods , Genetic Databases , Genomics/methods , Metagenome , Microbiota , Software , Molecular Sequence Annotation/methods , Sequence Alignment/methods
6.
Nucleic Acids Res ; 47(D1): D678-D686, 2019 01 08.
Article in English | MEDLINE | ID: mdl-30407573

ABSTRACT

The Integrated Microbial Genome/Virus (IMG/VR) system v.2.0 (https://img.jgi.doe.gov/vr/) is the largest publicly available data management and analysis platform dedicated to viral genomics. Since the last report published in the 2016 NAR Database Issue, the data has tripled in size and currently contains genomes of 8389 cultivated reference viruses, 12 498 previously published curated prophages derived from cultivated microbial isolates, and 735 112 viral genomic fragments computationally predicted from assembled shotgun metagenomes. Nearly 60% of the viral genomes and genome fragments are clustered into 110 384 viral Operational Taxonomic Units (vOTUs) with two or more members. To improve data quality and predictions of host specificity, IMG/VR v.2.0 now separates prokaryotic and eukaryotic viruses, utilizes known prophage sequences to improve taxonomic assignments, and provides viral genome quality scores based on the estimated genome completeness. New features also include enhanced BLAST search capabilities for external queries. Finally, geographic map visualization to locate user-selected viral genomes or genome fragments has been implemented and download options have been extended. All of these features make IMG/VR v.2.0 a key resource for the study of viruses.


Subject(s)
Data Management/methods , Viral Genome , Genomics/methods , Software
7.
Genome Announc ; 5(11)2017 Mar 16.
Article in English | MEDLINE | ID: mdl-28302769

ABSTRACT

Nitrosomonas cryotolerans ATCC 49181 is a cold-tolerant marine ammonia-oxidizing bacterium isolated from seawater collected in the Gulf of Alaska. The high-quality complete genome contains a 2.87-Mbp chromosome and a 56.6-kbp plasmid. Chemolithoautotrophic modules encoding ammonia oxidation and CO2 fixation were identified.

8.
Nucleic Acids Res ; 45(D1): D457-D465, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27799466

ABSTRACT

Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from >6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs are grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparing with external sequences, thus serving as an essential resource in the viral genomics community.


Subject(s)
DNA Viruses/genetics , Genetic Databases , Viral Genome , Genomics/methods , Metagenomics/methods , Retroviridae/genetics , Software , Environmental Microbiology , Host-Pathogen Interactions , Metagenome , DNA Sequence Analysis
9.
Nucleic Acids Res ; 45(D1): D507-D516, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27738135

ABSTRACT

The Integrated Microbial Genomes with Microbiome Samples (IMG/M: https://img.jgi.doe.gov/m/) system contains annotated DNA and RNA sequence data of (i) archaeal, bacterial, eukaryotic and viral genomes from cultured organisms, (ii) single cell genomes (SCG) and genomes from metagenomes (GFM) from uncultured archaea, bacteria and viruses and (iii) metagenomes from environmental, host associated and engineered microbiome samples. Sequence data are generated by DOE's Joint Genome Institute (JGI), submitted by individual scientists, or collected from public sequence data archives. Structural and functional annotation is carried out by JGI's genome and metagenome annotation pipelines. A variety of analytical and visualization tools provide support for examining and comparing IMG/M's datasets. IMG/M allows open access interactive analysis of publicly available datasets, while manual curation, submission and access to private datasets and computationally intensive workspace-based analysis require login/password access to its expert review (ER) companion system (IMG/M ER: https://img.jgi.doe.gov/mer/). Since the last report published in the 2014 NAR Database Issue, IMG/M's dataset content has tripled in terms of number of datasets and overall protein coding genes, while its analysis tools have been extended to cope with the rapid growth in the number and size of datasets handled by the system.


Subject(s)
Computational Biology/methods , Metagenome , Metagenomics/methods , Microbiota/genetics , Software , Web Browser
10.
Nucleic Acids Res ; 45(D1): D560-D565, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27903896

ABSTRACT

Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publicly available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery.


Subject(s)
Bacteria/genetics , Bacteria/metabolism , Bacterial Genome , Genomics/methods , Metabolomics/methods , Computational Biology/methods , Software , Web Browser
11.
Stand Genomic Sci ; 11: 46, 2016.
Article in English | MEDLINE | ID: mdl-27471578

ABSTRACT

Nitrosospira briensis C-128 is an ammonia-oxidizing bacterium isolated from an acid agricultural soil. N. briensis C-128 was sequenced with PacBio RS technologies at the DOE-Joint Genome Institute through their Community Science Program (2010). The high-quality finished genome contains one chromosome of 3.21 Mb and no plasmids. We identified 3073 gene models, 3018 of which are protein coding. The two-way average nucleotide identity between the chromosomes of Nitrosospira multiformis ATCC 25196 and Nitrosospira briensis C-128 was found to be 77.2 %. Multiple copies of modules encoding chemolithotrophic metabolism were identified in their genomic context. The gene inventory supports chemolithotrophic metabolism with implications for function in soil environments.

12.
BMC Genomics ; 17: 307, 2016 Apr 26.
Article in English | MEDLINE | ID: mdl-27118214

ABSTRACT

BACKGROUND: The exponential growth of genomic data from next generation technologies renders traditional manual expert curation effort unsustainable. Many genomic systems have included community annotation tools to address the problem. Most of these systems adopted a "Wiki-based" approach to take advantage of existing wiki technologies, but encountered obstacles in issues such as usability, authorship recognition, information reliability and incentive for community participation. RESULTS: Here, we present a different approach, relying on a tightly integrated method rather than a "Wiki-based" method, to support community annotation and user collaboration in the Integrated Microbial Genomes (IMG) system. The IMG approach allows users to use the existing IMG data warehouse and analysis tools to add gene, pathway and biosynthetic cluster annotations, to analyze/reorganize contigs, genes and functions using workspace datasets, and to share private user annotations and workspace datasets with collaborators. We show that the annotation effort using IMG can be part of the research process to overcome the user incentive and authorship recognition problems, thus fostering collaboration among domain experts. The usability and reliability issues are addressed by the integration of curated information and analysis tools in IMG, together with DOE Joint Genome Institute (JGI) expert review. CONCLUSION: By incorporating annotation operations into IMG, we provide an integrated environment for users to perform deeper and extended data analysis and annotation in a single system that can lead to publications and community knowledge sharing as shown in the case studies.


Subject(s)
Computational Biology/methods , Microbial Genome , Genomics/methods , Molecular Sequence Annotation/methods , Software , Cooperative Behavior , Data Accuracy , Information Dissemination , Internet , User-Computer Interface
13.
mBio ; 6(4): e00932, 2015 Jul 14.
Article in English | MEDLINE | ID: mdl-26173699

ABSTRACT

In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of "big" genomic data for discovering small molecules. IMG-ABC relies on IMG's comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC's focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMPORTANCE: IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG's extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to expand, with the goal of becoming an essential component of any bioinformatic exploration of the secondary metabolism world.


Subject(s)
Biosynthetic Pathways/genetics , Computational Biology/methods , Knowledge Bases , Multigene Family , Secondary Metabolism/genetics
14.
Stand Genomic Sci ; 10: 8, 2015.
Article in English | MEDLINE | ID: mdl-26203325

ABSTRACT

Pontibacter roseus is a member of the genus Pontibacter, family Cytophagaceae, class Cytophagia. While the type species of the genus, Pontibacter actiniarum, was isolated in 2005 from a marine environment, subsequent species of the same genus have been found in different types of habitats, including seawater, sediment, desert soil, rhizosphere, contaminated sites, solar saltern and muddy water. Here we describe the features of Pontibacter roseus strain SRC-1(T) along with its complete genome sequence and annotation from a culture of DSM 17521(T). The 4,581,480 bp long draft genome consists of 12 scaffolds with 4,003 protein-coding and 50 RNA genes and is a part of the Genomic Encyclopedia of Type Strains: KMG-I project.

15.
Genome Announc ; 3(4)2015 Jul 23.
Article in English | MEDLINE | ID: mdl-26205857

ABSTRACT

Clostridium clariflavum strain 4-2a, a novel strain isolated from a thermophilic biocompost pile, has demonstrated an extensive capability to utilize both cellulose and hemicellulose under thermophilic anaerobic conditions. Here, we report the draft genome of this strain.

16.
Genome Announc ; 2(5)2014 Oct 16.
Article in English | MEDLINE | ID: mdl-25323723

ABSTRACT

High-quality draft genome sequences were determined for 10 Exiguobacterium strains in order to provide insight into their evolutionary strategies for speciation and environmental adaptation. The selected genomes include psychrotrophic and thermophilic species from a range of habitats, which will allow for a comparison of metabolic pathways and stress response genes.

17.
Stand Genomic Sci ; 9(3): 602-13, 2014 Jun 15.
Article in English | MEDLINE | ID: mdl-25197447

ABSTRACT

Ensifer meliloti strain RRI128 is an aerobic, motile, Gram-negative, non-spore-forming rod. RRI128 was isolated from a nodule recovered from the roots of barrel medic (Medicago truncatula) grown in the greenhouse and inoculated with soil collected from Victoria, Australia. The strain is used in commercial inoculants in Australia. RRI128 nodulates and forms an effective symbiosis with a diverse range of lucerne cultivars (Medicago sativa) and several species of annual medic (M. truncatula, Medicago littoralis and Medicago tornata), but forms an ineffective symbiosis with Medicago polymorpha. Here we describe the features of E. meliloti strain RRI128, together with genome sequence information and annotation. The 6,900,273 bp draft genome is arranged into 156 scaffolds of 157 contigs, contains 6,683 protein-coding genes and 87 RNA-only encoding genes, and is one of 100 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

18.
Stand Genomic Sci ; 9(3): 1076-88, 2014 Jun 15.
Article in English | MEDLINE | ID: mdl-25197484

ABSTRACT

Methanoplanus limicola Wildgruber et al. 1984 is a mesophilic methanogen that was isolated from a swamp composed of drilling waste near Naples, Italy, shortly after the Archaea were recognized as a separate domain of life. Methanoplanus is the type genus in the family Methanoplanaceae, a taxon that fell into disuse since modern 16S rRNA gene sequence-based taxonomy was established. Methanoplanus is now placed within the Methanomicrobiaceae, a family that is so far poorly characterized at the genome level. The only other type strain of the genus with a sequenced genome, Methanoplanus petrolearius SEBR 4847(T), turned out to be misclassified and required reclassification to Methanolacinia. Both Methanoplanus and Methanolacinia needed taxonomic emendations due to a significant deviation of the G+C content of their genomes from previously published (pre-genome-sequence era) values. Until now genome sequences were published for only four of the 33 species with validly published names in the Methanomicrobiaceae. Here we describe the features of M. limicola, together with the improved-high-quality draft genome sequence and annotation of the type strain, M3(T). The 3,200,946 bp long chromosome (permanent draft sequence) with its 3,064 protein-coding and 65 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

19.
Stand Genomic Sci ; 9(3): 1089-104, 2014 Jun 15.
Article in English | MEDLINE | ID: mdl-25197485

ABSTRACT

Clostridium indolis DSM 755(T) is a bacterium commonly found in soils and the feces of birds and mammals. Despite its prevalence, little is known about the ecology or physiology of this species. However, close relatives, C. saccharolyticum and C. hathewayi, have demonstrated interesting metabolic potentials related to plant degradation and human health. The genome of C. indolis DSM 755(T) reveals an abundance of genes in functional groups associated with the transport and utilization of carbohydrates, as well as citrate, lactate, and aromatics. Ecologically relevant gene clusters related to nitrogen fixation and a unique type of bacterial microcompartment, the CoAT BMC, are also detected. Our genome analysis suggests hypotheses to be tested in future culture based work to better understand the physiology of this poorly described species.

20.
Stand Genomic Sci ; 9(3): 1105-17, 2014 Jun 15.
Article in English | MEDLINE | ID: mdl-25197486

ABSTRACT

Thermotoga thermarum Windberger et al. 1989 is a member of the genomically well-characterized genus Thermotoga in the phylum 'Thermotogae'. T. thermarum is of interest for its origin from a continental solfataric spring, in contrast to the predominantly marine oil reservoirs of other members of the genus. The genome of strain LA3(T) also provides fresh data for the phylogenomic positioning of the (hyper-)thermophilic bacteria. T. thermarum strain LA3(T) is the fourth sequenced genome of a type strain from the genus Thermotoga, and the sixth in the family Thermotogaceae to be formally described in a publication. Phylogenetic analyses do not reveal significant discrepancies between the current classification of the group, 16S rRNA gene data and whole-genome sequences. Nevertheless, T. thermarum significantly differs from other Thermotoga species regarding its iron-sulfur cluster synthesis, as it contains only a minimal set of the necessary proteins. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,039,943 bp long chromosome with its 2,015 protein-coding and 51 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.
