Your browser doesn't support javascript.
loading
Castanet: a pipeline for rapid analysis of targeted multi-pathogen genomic data.
Mayne, Richard; Secret, Shannah; Geoghegan, Cyndi; Trebes, Amy; Kean, Kai; Reid, Kaitlin; Lin, Gu-Lung; Ansari, M Azim; de Cesare, Mariateresa; Bonsall, David; Elliott, Ivo; Piazza, Paolo; Brown, Anthony; Bray, James; Knight, Julian C; Harvala, Heli; Breuer, Judith; Simmonds, Peter; Bowden, Rory J; Golubchik, Tanya.
Afiliação
  • Mayne R; Nuffield Department of Medicine, Peter Medawar Building for Pathogen Research, University of Oxford, Oxfordshire OX1 3SY, United Kingdom.
  • Secret S; Radcliffe Department of Medicine, University of Oxford, West Wing John Radcliffe Hospital, Oxfordshire OX3 9DU, United Kingdom.
  • Geoghegan C; Microbiology Services, NHS Blood and Transplant, London NW9 5BG, United Kingdom.
  • Trebes A; Centre for Human Genetics, University of Oxford, Oxfordshire OX3 7BN, United Kingdom.
  • Kean K; Genewiz UK Ltd, Azenta Life Sciences, Oxfordshire OX14 1SG, United Kingdom.
  • Reid K; Oxford Genomics Centre, University of Oxford, Oxfordshire OX3 7BN, United Kingdom.
  • Lin GL; Nuffield Department of Medicine, Peter Medawar Building for Pathogen Research, University of Oxford, Oxfordshire OX1 3SY, United Kingdom.
  • Ansari MA; Nuffield Department of Medicine, Peter Medawar Building for Pathogen Research, University of Oxford, Oxfordshire OX1 3SY, United Kingdom.
  • de Cesare M; Oxford Vaccine Group, University of Oxford, Oxfordshire OX3 7LE, United Kingdom.
  • Bonsall D; Nuffield Department of Medicine, Peter Medawar Building for Pathogen Research, University of Oxford, Oxfordshire OX1 3SY, United Kingdom.
  • Elliott I; National Facility for Genomics, Human Technopole, Viale Rita Levi-Montalcini, Milan 20157, Italy.
  • Piazza P; Nuffield Department of Medicine, Peter Medawar Building for Pathogen Research, University of Oxford, Oxfordshire OX1 3SY, United Kingdom.
  • Brown A; Centre for Tropical Medicine and Global Health, University of Oxford, Oxfordshire OX3 7LE, United Kingdom.
  • Bray J; Centre for Human Genetics, University of Oxford, Oxfordshire OX3 7BN, United Kingdom.
  • Knight JC; Nuffield Department of Medicine, Peter Medawar Building for Pathogen Research, University of Oxford, Oxfordshire OX1 3SY, United Kingdom.
  • Harvala H; Department of Biology, University of Oxford, Oxfordshire OX1 3SY, United Kingdom.
  • Breuer J; Oxford Genomics Centre, University of Oxford, Oxfordshire OX3 7BN, United Kingdom.
  • Simmonds P; Chinese Academy of Medical Science Oxford Institute, University of Oxford, Oxfordshire OX3 7BN, United Kingdom.
  • Bowden RJ; NIHR Oxford Biomedical Research Centre, University of Oxford, John Radcliffe Hospital, Oxfordshire OX3 9DU, United Kingdom.
  • Golubchik T; Radcliffe Department of Medicine, University of Oxford, West Wing John Radcliffe Hospital, Oxfordshire OX3 9DU, United Kingdom.
Bioinformatics ; 40(10)2024 Oct 01.
Article em En | MEDLINE | ID: mdl-39360992
ABSTRACT
MOTIVATION Target enrichment strategies generate genomic data from multiple pathogens in a single process, greatly improving sensitivity over metagenomic sequencing and enabling cost-effective, high-throughput surveillance and clinical applications. However, uptake by research and clinical laboratories is constrained by an absence of computational tools that are specifically designed for the analysis of multi-pathogen enrichment sequence data. Here we present an analysis pipeline, Castanet, for use with multi-pathogen enrichment sequencing data. Castanet is designed to work with short-read data produced by existing targeted enrichment strategies, but can be readily deployed on any BAM file generated by another methodology. Also included are an optional graphical interface and installer script.

RESULTS:

In addition to genome reconstruction, Castanet reports method-specific metrics that enable quantification of capture efficiency, estimation of pathogen load, differentiation of low-level positives from contamination, and assessment of sequencing quality. Castanet can be used as a traditional end-to-end pipeline for consensus generation, but its strength lies in the ability to process a flexible, pre-defined set of pathogens of interest directly from multi-pathogen enrichment experiments. In our tests, Castanet consensus sequences were accurate reconstructions of reference sequences, including in instances where multiple strains of the same pathogen were present. Castanet performs effectively on standard computers and can process the entire output of a 96-sample enrichment sequencing run (50M reads) using a single batch process command, in $<$2 h. AVAILABILITY AND IMPLEMENTATION Source code freely available under GPL-3 license at https//github.com/MultipathogenGenomics/castanet, implemented in Python 3.10 and supported in Ubuntu Linux 22.04. The data underlying this article are available in Europe Nucleotide Archives, at https//www.ebi.ac.uk/ena/browser/view/PRJEB77004.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article