Your browser doesn't support javascript.
loading
Benchmarking short-, long- and hybrid-read assemblers for metagenome sequencing of complex microbial communities.
Goussarov, Gleb; Mysara, Mohamed; Cleenwerck, Ilse; Claesen, Jürgen; Leys, Natalie; Vandamme, Peter; Van Houdt, Rob.
Afiliação
  • Goussarov G; Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium.
  • Mysara M; Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of Sciences, Ghent University, Ghent, Belgium.
  • Cleenwerck I; Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium.
  • Claesen J; Bioinformatics group, Information Technology & Computer Science, Nile University, Giza, Egypt.
  • Leys N; Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of Sciences, Ghent University, Ghent, Belgium.
  • Vandamme P; Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium.
  • Van Houdt R; Present address: Department of Epidemiology & Biostatistics, Amsterdam UMC, VU University, Amsterdam, Netherlands.
Microbiology (Reading) ; 170(6)2024 Jun.
Article em En | MEDLINE | ID: mdl-38916949
ABSTRACT
Metagenome community analyses, driven by the continued development in sequencing technology, is rapidly providing insights in many aspects of microbiology and becoming a cornerstone tool. Illumina, Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PacBio) are the leading technologies, each with their own advantages and drawbacks. Illumina provides accurate reads at a low cost, but their length is too short to close bacterial genomes. Long reads overcome this limitation, but these technologies produce reads with lower accuracy (ONT) or with lower throughput (PacBio high-fidelity reads). In a critical first analysis step, reads are assembled to reconstruct genomes or individual genes within the community. However, to date, the performance of existing assemblers has never been challenged with a complex mock metagenome. Here, we evaluate the performance of current assemblers that use short, long or both read types on a complex mock metagenome consisting of 227 bacterial strains with varying degrees of relatedness. We show that many of the current assemblers are not suited to handle such a complex metagenome. In addition, hybrid assemblies do not fulfil their potential. We conclude that ONT reads assembled with CANU and Illumina reads assembled with SPAdes offer the best value for reconstructing genomes and individual genes of complex metagenomes, respectively.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Bactérias / Análise de Sequência de DNA / Benchmarking / Metagenoma / Metagenômica / Sequenciamento de Nucleotídeos em Larga Escala Idioma: En Revista: Microbiology (Reading) Assunto da revista: MICROBIOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Bélgica

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Bactérias / Análise de Sequência de DNA / Benchmarking / Metagenoma / Metagenômica / Sequenciamento de Nucleotídeos em Larga Escala Idioma: En Revista: Microbiology (Reading) Assunto da revista: MICROBIOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Bélgica