BlobToolKit - Interactive Quality Assessment of Genome Assemblies.
Challis, Richard; Richards, Edward; Rajan, Jeena; Cochrane, Guy; Blaxter, Mark.
Affiliation
  • Challis R; Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JT, UK rc28@sanger.ac.uk.
  • Richards E; Wellcome Sanger Institute, Cambridge CB10 1SA, UK.
  • Rajan J; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK.
  • Cochrane G; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK.
  • Blaxter M; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK.
G3 (Bethesda); 10(4): 1361-1374, 2020-04-09.
Article in English | MEDLINE | ID: mdl-32071071
ABSTRACT
Reconstruction of target genomes from sequence data produced by instruments that are agnostic as to the species-of-origin may be confounded by contaminant DNA. Whether introduced during sample processing or through co-extraction alongside the target DNA, if insufficient care is taken during the assembly process, the final assembled genome may be a mixture of data from several species. Such assemblies can confound sequence-based biological inference and, when deposited in public databases, may be included in downstream analyses by users unaware of underlying problems. We present BlobToolKit, a software suite to aid researchers in identifying and isolating non-target data in draft and publicly available genome assemblies. BlobToolKit can be used to process assembly, read and analysis files for fully reproducible interactive exploration in the browser-based Viewer. BlobToolKit can be used during assembly to filter non-target DNA, helping researchers produce assemblies with high biological credibility. We have been running an automated BlobToolKit pipeline on eukaryotic assemblies publicly available in the International Nucleotide Sequence Data Collaboration and are making the results available through a public instance of the Viewer at https://blobtoolkit.genomehubs.org/view. We aim to complete analysis of all publicly available genomes and then maintain currency with the flow of new genomes. We have worked to embed these views into the presentation of genome assemblies at the European Nucleotide Archive, providing an indication of assembly quality alongside the public record with links out to allow full exploration in the Viewer.

Full text: 1 Database: MEDLINE Main subject: Software / Genome Language: En Publication year: 2020 Document type: Article