Your browser doesn't support javascript.
loading
AssemblyQC: a Nextflow pipeline for reproducible reporting of assembly quality.
Rashid, Usman; Wu, Chen; Shiller, Jason; Smith, Ken; Crowhurst, Ross; Davy, Marcus; Chen, Ting-Hsuan; Carvajal, Ignacio; Bailey, Sarah; Thomson, Susan; Deng, Cecilia H.
Afiliação
  • Rashid U; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 1025 Auckland, New Zealand.
  • Wu C; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 1025 Auckland, New Zealand.
  • Shiller J; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 3182 Te Puke, New Zealand.
  • Smith K; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 1025 Auckland, New Zealand.
  • Crowhurst R; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 1025 Auckland, New Zealand.
  • Davy M; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 3182 Te Puke, New Zealand.
  • Chen TH; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 7608 Lincoln, New Zealand.
  • Carvajal I; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 1025 Auckland, New Zealand.
  • Bailey S; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 1025 Auckland, New Zealand.
  • Thomson S; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 7608 Lincoln, New Zealand.
  • Deng CH; Molecular & Digital Breeding, The New Zealand Institute for Plant and Food Research Limited, 1025 Auckland, New Zealand.
Bioinformatics ; 40(8)2024 Aug 02.
Article em En | MEDLINE | ID: mdl-39078114
ABSTRACT

SUMMARY:

Genome assembly projects have grown exponentially due to breakthroughs in sequencing technologies and assembly algorithms. Evaluating the quality of genome assemblies is critical to ensure the reliability of downstream analysis and interpretation. To fulfil this task, we have developed the AssemblyQC pipeline that performs file-format validation, contaminant checking, contiguity measurement, gene- and repeat-space completeness quantification, telomere inspection, taxonomic assignment, synteny alignment, scaffold examination through Hi-C contact-map visualization, and assessments of completeness, consensus quality and phasing through k-mer analysis. It produces a comprehensive HTML report with method descriptions, tables, and visualizations. AVAILABILITY AND IMPLEMENTATION The pipeline uses Nextflow for workflow orchestration and adheres to the best-practice established by the nf-core community. This pipeline offers a reproducible, scalable, and portable method to assess the quality of genome assemblies-the code is available online at GitHub https//github.com/Plant-Food-Research-Open/assemblyqc.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Nova Zelândia

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Nova Zelândia