Your browser doesn't support javascript.
loading
MetaCon: unsupervised clustering of metagenomic contigs with probabilistic k-mers statistics and coverage.
Qian, Jia; Comin, Matteo.
Afiliação
  • Qian J; Department of Information Engineering, University of Padova, Via Giovanni Gradenigo 6, Padova, Italy.
  • Comin M; Department of Information Engineering, University of Padova, Via Giovanni Gradenigo 6, Padova, Italy. comin@dei.unipd.it.
BMC Bioinformatics ; 20(Suppl 9): 367, 2019 Nov 22.
Article em En | MEDLINE | ID: mdl-31757198
ABSTRACT
MOTIVATION Sequencing technologies allow the sequencing of microbial communities directly from the environment without prior culturing. Because assembly typically produces only genome fragments, also known as contigs, it is crucial to group them into putative species for further taxonomic profiling and down-streaming functional analysis. Taxonomic analysis of microbial communities requires contig clustering, a process referred to as binning, that is still one of the most challenging tasks when analyzing metagenomic data. The major problems are the lack of taxonomically related genomes in existing reference databases, the uneven abundance ratio of species, sequencing errors, and the limitations due to binning contig of different lengths.

RESULTS:

In this context we present MetaCon a novel tool for unsupervised metagenomic contig binning based on probabilistic k-mers statistics and coverage. MetaCon uses a signature based on k-mers statistics that accounts for the different probability of appearance of a k-mer in different species, also contigs of different length are clustered in two separate phases. The effectiveness of MetaCon is demonstrated in both simulated and real datasets in comparison with state-of-art binning approaches such as CONCOCT, MaxBin and MetaBAT.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Probabilidade / Estatística como Assunto / Mapeamento de Sequências Contíguas / Metagenoma / Metagenômica Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Itália

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Probabilidade / Estatística como Assunto / Mapeamento de Sequências Contíguas / Metagenoma / Metagenômica Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Itália