RESUMO
Evaluating metagenomic software is key for optimizing metagenome interpretation and focus of the Initiative for the Critical Assessment of Metagenome Interpretation (CAMI). The CAMI II challenge engaged the community to assess methods on realistic and complex datasets with long- and short-read sequences, created computationally from around 1,700 new and known genomes, as well as 600 new plasmids and viruses. Here we analyze 5,002 results by 76 program versions. Substantial improvements were seen in assembly, some due to long-read data. Related strains still were challenging for assembly and genome recovery through binning, as was assembly quality for the latter. Profilers markedly matured, with taxon profilers and binners excelling at higher bacterial ranks, but underperforming for viruses and Archaea. Clinical pathogen detection results revealed a need to improve reproducibility. Runtime and memory usage analyses identified efficient programs, including top performers with other metrics. The results identify challenges and guide researchers in selecting methods for analyses.
Assuntos
Metagenoma , Metagenômica , Archaea/genética , Metagenômica/métodos , Reprodutibilidade dos Testes , Análise de Sequência de DNA , SoftwareRESUMO
We introduce the marine databases; MarRef, MarDB and MarCat (https://mmp.sfb.uit.no/databases/), which are publicly available resources that promote marine research and innovation. These data resources, which have been implemented in the Marine Metagenomics Portal (MMP) (https://mmp.sfb.uit.no/), are collections of richly annotated and manually curated contextual (metadata) and sequence databases representing three tiers of accuracy. While MarRef is a database for completely sequenced marine prokaryotic genomes, which represent a marine prokaryote reference genome database, MarDB includes all incomplete sequenced prokaryotic genomes regardless level of completeness. The last database, MarCat, represents a gene (protein) catalog of uncultivable (and cultivable) marine genes and proteins derived from marine metagenomics samples. The first versions of MarRef and MarDB contain 612 and 3726 records, respectively. Each record is built up of 106 metadata fields including attributes for sampling, sequencing, assembly and annotation in addition to the organism and taxonomic information. Currently, MarCat contains 1227 records with 55 metadata fields. Ontologies and controlled vocabularies are used in the contextual databases to enhance consistency. The user-friendly web interface lets the visitors browse, filter and search in the contextual databases and perform BLAST searches against the corresponding sequence databases. All contextual and sequence databases are freely accessible and downloadable from https://s1.sfb.uit.no/public/mar/.
Assuntos
Organismos Aquáticos/genética , Bases de Dados Genéticas , Bases de Dados de Ácidos Nucleicos , Bases de Dados de Proteínas , Genes Arqueais , Genes Bacterianos , Metagenômica , Animais , Organismos Aquáticos/microbiologia , Proteínas Arqueais/genética , Proteínas de Bactérias/genética , Sequência de Bases , DNA Arqueal/genética , DNA Bacteriano/genética , Água Doce/microbiologia , Genoma , Internet , Anotação de Sequência Molecular , Água do Mar/microbiologia , Microbiologia do Solo , Interface Usuário-Computador , Microbiologia da Água , NavegadorRESUMO
BACKGROUND: Winter-ulcer Moritella viscosa infections continue to be a significant burden in Atlantic salmon (Salmo salar L.) farming. M. viscosa comprises two main clusters that differ in genetic variation and phenotypes including virulence. Horizontal gene transfer through acquisition and loss of mobile genetic elements (MGEs) is a major driving force of bacterial diversification. To gain insight into genomic traits that could affect sublineage evolution within this bacterium we examined the genome sequences of twelve M. viscosa strains. Matches between M. viscosa clustered, regularly interspaced, short palindromic, repeats and associated cas genes (CRISPR-Cas) were analysed to correlate CRISPR-Cas with adaptive immunity against MGEs. RESULTS: The comparative genomic analysis of M. viscosa isolates from across the North Atlantic region and from different fish species support delineation of M. viscosa into four phylogenetic lineages. The results showed that M. viscosa carries two distinct variants of the CRISPR-Cas subtype I-F systems and that CRISPR features follow the phylogenetic lineages. A subset of the spacer content match prophage and plasmid genes dispersed among the M. viscosa strains. Further analysis revealed that prophage and plasmid-like element distribution were reflected in the content of the CRISPR-spacer profiles. CONCLUSIONS: Our data suggests that CRISPR-Cas mediated interactions with MGEs impact genome properties among M. viscosa, and that patterns in spacer and MGE distributions are linked to strain relationships.
Assuntos
Sistemas CRISPR-Cas/genética , Peixes/microbiologia , Genômica , Moritella/genética , Animais , Evolução Molecular , Moritella/fisiologia , Moritella/virologia , Plasmídeos/genética , Prófagos/fisiologiaRESUMO
The COVID-19 pandemic has exemplified the importance of interoperable and equitable data sharing for global surveillance and to support research. While many challenges could be overcome, at least in some countries, many hurdles within the organizational, scientific, technical and cultural realms still remain to be tackled to be prepared for future threats. We propose to (i) continue supporting global efforts that have proven to be efficient and trustworthy toward addressing challenges in pathogen molecular data sharing; (ii) establish a distributed network of Pathogen Data Platforms to (a) ensure high quality data, metadata standardization and data analysis, (b) perform data brokering on behalf of data providers both for research and surveillance, (c) foster capacity building and continuous improvements, also for pandemic preparedness; (iii) establish an International One Health Pathogens Portal, connecting pathogen data isolated from various sources (human, animal, food, environment), in a truly One Health approach and following FAIR principles. To address these challenging endeavors, we have started an ELIXIR Focus Group where we invite all interested experts to join in a concerted, expert-driven effort toward sustaining and ensuring high-quality data for global surveillance and research.
Assuntos
COVID-19 , Animais , Humanos , COVID-19/epidemiologia , Pandemias , Fortalecimento Institucional , Disseminação de InformaçãoRESUMO
Genus Aliivibrio is known to harbor species exhibiting bioluminescence as well as pathogenic behavior affecting the fish farming industry. Current phylogenetic understanding of Aliivibrio has largely remained dormant after reclassification disentangled it from the Vibrio genus in 2007. There is growing evidence of wider diversity, but until now the lack of genomes and selective use of type strains have limited the ability to compare and classify strains firmly. In this study, a total of 143 bacterial strains, including 51 novel sequenced strains, were used to strengthen phylogenetic relationships in Aliivibrio by exploring intra-species and inter-species relations. Multilocus sequence analysis (MLSA), applying the six housekeeping genes 16S ribosomal RNA (rRNA), gapA, gyrB, pyrH, recA, and rpoA, inferred 12 clades and a singular branch in Aliivibrio. Along with four new phylogenetic clades, the MLSA resolved prior inconsistencies circumscribing Aliivibrio wodanis and formed a unique clade we propose as the novel species Aliivibrio sp. "friggae." Furthermore, phylogenetic assessment of individual marker genes showed gyrB, pyrH, and recA superior to the 16S rRNA gene, resolving accurately for most species clades in Aliivibrio. In this study, we provide a robust phylogenetic groundwork for Aliivibrio as a reference point to classification of species.
RESUMO
Understanding fish-microbial relationships may be of great value for fish producers as fish growth, development and welfare are influenced by the microbial community associated with the rearing systems and fish surfaces. Accurate methods to generate and analyze these microbial communities would be an important tool to help improve understanding of microbial effects in the industry. In this study, we performed taxonomic classification and determination of operational taxonomic units on Atlantic salmon microbiota by taking advantage of full-length 16S rRNA gene sequences. Skin mucus was dominated by the genera Flavobacterium and Psychrobacter. Intestinal samples were dominated by the genera Carnobacterium, Aeromonas, Mycoplasma and by sequences assigned to the order Clostridiales. Applying Sanger sequencing on the full-length bacterial 16S rRNA gene from the pool of 46 isolates obtained in this study showed a clear assignment of the PacBio full-length bacterial 16S rRNA gene sequences down to the genus level. One of the bottlenecks in comparing microbial profiles is that different studies use different 16S rRNA gene regions. Comparisons of sequence assignments between full-length and in silico derived variable 16S rRNA gene regions showed different microbial profiles with variable effects between phylogenetic groups and taxonomic ranks.
Assuntos
Bactérias/classificação , Bactérias/genética , Biota , Metagenômica/métodos , RNA Ribossômico 16S/genética , Salmo salar/microbiologia , Animais , Análise por Conglomerados , DNA Bacteriano/química , DNA Bacteriano/genética , DNA Ribossômico/química , DNA Ribossômico/genética , Genes de RNAr , Intestinos/microbiologia , Filogenia , Análise de Sequência de DNA , Pele/microbiologiaRESUMO
BACKGROUND: The population of Atlantic cod (Gadus morhua), also known as Northeast Arctic cod, migrating Atlantic cod, or simply "skrei," lives mainly in the Barents Sea and Svalbard waters and migrates in annual cycles to the Norwegian coast in order to spawn eggs during late winter. It is the world's largest population of Atlantic cod, and the population is distinct from the Norwegian coastal cod (or "fjord" cod). Despite the biological, economic, and cultural importance of migrating Atlantic cod, current knowledge on the associated microbiota is very limited. Using shotgun metagenomics and metaproteomics approaches, we present here the gut microbiota, metagenome-assembled genomes (MAGs) of the most abundant bacterial species, DNA-based functional profile, and the metaproteome of Atlantic cod specimens caught at a spawning area in an open ocean outside of Tromsø, Norway. RESULTS: Our analyses identified 268 bacterial families in DNA isolated from feces of 6 individual migrating Atlantic cod. The most abundant family was Vibrionaceae (52%; 83% if unclassified reads are excluded), with Photobacterium (genus) representing the vast majority. The recovery of metagenome-assembled genomes provided further details and suggests that several closely related Photobacterium strains from the Photobacterium phosphoreum clade are the most abundant. A genomic-based functional profiling showed that the most abundant functional subsystems are "Carbohydrates"; "Amino Acids and Derivatives"; "Protein Metabolism"; "Cofactors, Vitamins, Prosthetic, Groups, and Pigments"; and "DNA Metabolism," which is in agreement with other studies of gut microbiomes of marine organisms. Finally, the MS-based metaproteomic dataset revealed that the functional category "Protein Metabolism" is highly overrepresented (3×) when compared to the genome-based functional profile, which shows that ribosomal proteins are rich in the bacterial cytosol. CONCLUSION: We present here the first study of bacterial diversity of the gut of migrating Atlantic cod using shotgun sequencing and metagenome-assembled genomes (MAGs). The most abundant bacteria belong to the Photobacterium genus (Vibrionaceae family). We also constructed functional profiles of the gut microbiome. These may be used in future studies as a platform for mining of commercially interesting cold-active enzymes.