Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
PLoS Comput Biol ; 19(8): e1011422, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37639475

RESUMO

The study of viral communities has revealed the enormous diversity and impact these biological entities have on various ecosystems. These observations have sparked widespread interest in developing computational strategies that support the comprehensive characterisation of viral communities based on sequencing data. Here we introduce VIRify, a new computational pipeline designed to provide a user-friendly and accurate functional and taxonomic characterisation of viral communities. VIRify identifies viral contigs and prophages from metagenomic assemblies and annotates them using a collection of viral profile hidden Markov models (HMMs). These include our manually-curated profile HMMs, which serve as specific taxonomic markers for a wide range of prokaryotic and eukaryotic viral taxa and are thus used to reliably classify viral contigs. We tested VIRify on assemblies from two microbial mock communities, a large metagenomics study, and a collection of publicly available viral genomic sequences from the human gut. The results showed that VIRify could identify sequences from both prokaryotic and eukaryotic viruses, and provided taxonomic classifications from the genus to the family rank with an average accuracy of 86.6%. In addition, VIRify allowed the detection and taxonomic classification of a range of prokaryotic and eukaryotic viruses present in 243 marine metagenomic assemblies. Finally, the use of VIRify led to a large expansion in the number of taxonomically classified human gut viral sequences and the improvement of outdated and shallow taxonomic classifications. Overall, we demonstrate that VIRify is a novel and powerful resource that offers an enhanced capability to detect a broad range of viral contigs and taxonomically classify them.


Assuntos
Eucariotos , Microbiota , Humanos , Células Eucarióticas , Genoma Viral/genética , Metagenoma/genética
2.
J Mol Biol ; 435(14): 168016, 2023 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-36806692

RESUMO

An increasingly common output arising from the analysis of shotgun metagenomic datasets is the generation of metagenome-assembled genomes (MAGs), with tens of thousands of MAGs now described in the literature. However, the discovery and comparison of these MAG collections is hampered by the lack of uniformity in their generation, annotation and storage. To address this, we have developed MGnify Genomes, a growing collection of biome-specific non-redundant microbial genome catalogues generated using MAGs and publicly available isolate genomes. Genomes within a biome-specific catalogue are organised into species clusters. For species that contain multiple conspecific genomes, the highest quality genome is selected as the representative, always prioritising an isolate genome over a MAG. The species representative sequences and annotations can be visualised on the MGnify website and the full catalogue and associated analysis outputs can be downloaded from MGnify servers. A suite of online search tools is provided allowing users to compare their own sequences, ranging from a gene to sets of genomes, against the catalogues. Seven biomes are available currently, comprising over 300,000 genomes that represent 11,048 non-redundant species, and include 36 taxonomic classes not currently represented by cultured genomes. MGnify Genomes is available at https://www.ebi.ac.uk/metagenomics/browse/genomes/.


Assuntos
Genoma Microbiano , Metagenoma , Metagenoma/genética , Metagenômica
3.
Nucleic Acids Res ; 51(D1): D753-D759, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36477304

RESUMO

The MGnify platform (https://www.ebi.ac.uk/metagenomics) facilitates the assembly, analysis and archiving of microbiome-derived nucleic acid sequences. The platform provides access to taxonomic assignments and functional annotations for nearly half a million analyses covering metabarcoding, metatranscriptomic, and metagenomic datasets, which are derived from a wide range of different environments. Over the past 3 years, MGnify has not only grown in terms of the number of datasets contained but also increased the breadth of analyses provided, such as the analysis of long-read sequences. The MGnify protein database now exceeds 2.4 billion non-redundant sequences predicted from metagenomic assemblies. This collection is now organised into a relational database making it possible to understand the genomic context of the protein through navigation back to the source assembly and sample metadata, marking a major improvement. To extend beyond the functional annotations already provided in MGnify, we have applied deep learning-based annotation methods. The technology underlying MGnify's Application Programming Interface (API) and website has been upgraded, and we have enabled the ability to perform downstream analysis of the MGnify data through the introduction of a coupled Jupyter Lab environment.


Assuntos
Microbiota , Análise de Sequência , Genômica/métodos , Metagenoma , Metagenômica/métodos , Microbiota/genética , Software , Análise de Sequência/métodos
4.
Gigascience ; 122022 12 28.
Artigo em Inglês | MEDLINE | ID: mdl-37850871

RESUMO

BACKGROUND: Genomic Observatories (GOs) are sites of long-term scientific study that undertake regular assessments of the genomic biodiversity. The European Marine Omics Biodiversity Observation Network (EMO BON) is a network of GOs that conduct regular biological community samplings to generate environmental and metagenomic data of microbial communities from designated marine stations around Europe. The development of an effective workflow is essential for the analysis of the EMO BON metagenomic data in a timely and reproducible manner. FINDINGS: Based on the established MGnify resource, we developed metaGOflow. metaGOflow supports the fast inference of taxonomic profiles from GO-derived data based on ribosomal RNA genes and their functional annotation using the raw reads. Thanks to the Research Object Crate packaging, relevant metadata about the sample under study, and the details of the bioinformatics analysis it has been subjected to, are inherited to the data product while its modular implementation allows running the workflow partially. The analysis of 2 EMO BON samples and 1 Tara Oceans sample was performed as a use case. CONCLUSIONS: metaGOflow is an efficient and robust workflow that scales to the needs of projects producing big metagenomic data such as EMO BON. It highlights how containerization technologies along with modern workflow languages and metadata package approaches can support the needs of researchers when dealing with ever-increasing volumes of biological data. Despite being initially oriented to address the needs of EMO BON, metaGOflow is a flexible and easy-to-use workflow that can be broadly used for one-sample-at-a-time analysis of shotgun metagenomics data.


Assuntos
Genômica , Software , Fluxo de Trabalho , Metagenômica , Biologia Computacional , Metagenoma
5.
Nat Biotechnol ; 39(1): 105-114, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-32690973

RESUMO

Comprehensive, high-quality reference genomes are required for functional characterization and taxonomic assignment of the human gut microbiota. We present the Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes. These genomes encode >170 million protein sequences, which we collated in the Unified Human Gastrointestinal Protein (UHGP) catalog. The UHGP more than doubles the number of gut proteins in comparison to those present in the Integrated Gene Catalog. More than 70% of the UHGG species lack cultured representatives, and 40% of the UHGP lack functional annotations. Intraspecies genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which are specific to individual human populations. The UHGG and UHGP collections will enable studies linking genotypes to phenotypes in the human gut microbiome.


Assuntos
Bases de Dados Genéticas , Microbioma Gastrointestinal/genética , Genoma Bacteriano/genética , Metagenoma/genética , Bactérias/classificação , Bactérias/genética , Humanos , Metagenômica , Fenótipo , Filogenia
6.
Nucleic Acids Res ; 48(D1): D570-D578, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31696235

RESUMO

MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach to data analysis has been unveiled (version 5.0), replacing the previous single pipeline with multiple analysis pipelines that are tailored according to the input data, and that are formally described using the Common Workflow Language, enabling greater provenance, reusability, and reproducibility. MGnify's new analysis pipelines offer additional approaches for taxonomic assertions based on ribosomal internal transcribed spacer regions (ITS1/2) and expanded protein functional annotations. Biochemical pathways and systems predictions have also been added for assembled contigs. MGnify's growing focus on the assembly of metagenomic data has also seen the number of datasets it has assembled and analysed increase six-fold. The non-redundant protein database constructed from the proteins encoded by these assemblies now exceeds 1 billion sequences. Meanwhile, a newly developed contig viewer provides fine-grained visualisation of the assembled contigs and their enriched annotations.


Assuntos
Metagenoma , Microbiota , Filogenia , Software , Archaea/classificação , Archaea/genética , Bactérias/classificação , Bactérias/genética , DNA Espaçador Ribossômico/genética , Bases de Dados Genéticas , Metagenômica/métodos
7.
Microbiol Resour Announc ; 8(43)2019 Oct 24.
Artigo em Inglês | MEDLINE | ID: mdl-31649079

RESUMO

Kosakonia radicincitas UYSO10 is an endophytic bacterium that was isolated from stem tissues of Saccharum officinarum plants cultivated in Uruguay. UYSO10 is a diazotrophic indoleacetic acid-producing bacterium with growth-promoting effects on sugarcane. Here, we report the draft genome sequence, in which genes that are probably involved in the plant-bacterium interaction were identified.

8.
Microbiol Resour Announc ; 8(38)2019 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-31537663

RESUMO

Streptomyces spp. produce many and diverse bioactive metabolites. Plant growth-promoting (PGP) activity by Streptomyces spp. has been reported repeatedly; however, the mechanisms are largely unknown. We report the sequencing of the genome of a PGP endophytic Streptomyces sp. strain, which will contribute to the understanding of the underlying mechanisms for growth promotion.

9.
Microbiol Resour Announc ; 8(16)2019 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-31000547

RESUMO

Here, we present the draft genome sequence of strain UYCP14C, a rhizobium isolated from Calliandra parvifolia nodules. The assembled genome size was around 9.8 million bp, containing 9,031 predicted protein-coding sequences, including several symbiotic and nitrogen fixation genes. UYCP14C appears to be a novel species of the plant growth-promoting Paraburkholderia genus.

10.
Int J Genomics ; 2018: 7403670, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30363992

RESUMO

Sweet sorghum (Sorghum bicolor) is a multipurpose crop used as a feedstock to produce bioethanol, sugar, energy, and animal feed. However, it requires high levels of N fertilizer application to achieve the optimal growth, which causes environmental degradation. Bacterial endophytes, which live inside plant tissues, play a key role in the health and productivity of their host. This particular community may be influenced by different agronomical practices. The aim of the work was to evaluate the effects of N fertilization on the structure, diversity, abundance, and composition of endophytic and diazotrophic bacterial community associated with field-grown sweet sorghum. PCR-DGGE, quantitative PCR, and high-throughput sequencing were performed based on the amplification of rrs and nifH genes. The level of N fertilization affected the structure and abundance but not the diversity of the endophytic bacterial communities associated with sweet sorghum plants. This effect was pronounced in the roots of both bacterial communities analyzed and may depend on the physiological state of the plants. Specific bacterial classes and genera increased or decreased when the fertilizer was applied. The data obtained here contribute to a better understanding on the effects of agronomical practices on the microbiota associated with this important crop, with the aim to improve its sustainability.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...