Your browser doesn't support javascript.
loading
An assessment of bioinformatics tools for the detection of human endogenous retroviral insertions in short-read genome sequencing data.
Bowles, Harry; Kabiljo, Renata; Al Khleifat, Ahmad; Jones, Ashley; Quinn, John P; Dobson, Richard J B; Swanson, Chad M; Al-Chalabi, Ammar; Iacoangeli, Alfredo.
Afiliação
  • Bowles H; Department of Basic and Clinical Neuroscience, King's College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom.
  • Kabiljo R; Department of Basic and Clinical Neuroscience, King's College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom.
  • Al Khleifat A; Department of Biostatistics and Health Informatics, King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom.
  • Jones A; Department of Basic and Clinical Neuroscience, King's College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom.
  • Quinn JP; Department of Basic and Clinical Neuroscience, King's College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom.
  • Dobson RJB; Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom.
  • Swanson CM; Department of Biostatistics and Health Informatics, King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom.
  • Al-Chalabi A; NIHR Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London, London, United Kingdom.
  • Iacoangeli A; Institute of Health Informatics, University College London, London, United Kingdom.
Front Bioinform ; 2: 1062328, 2022.
Article em En | MEDLINE | ID: mdl-36845320
There is a growing interest in the study of human endogenous retroviruses (HERVs) given the substantial body of evidence that implicates them in many human diseases. Although their genomic characterization presents numerous technical challenges, next-generation sequencing (NGS) has shown potential to detect HERV insertions and their polymorphisms in humans. Currently, a number of computational tools to detect them in short-read NGS data exist. In order to design optimal analysis pipelines, an independent evaluation of the available tools is required. We evaluated the performance of a set of such tools using a variety of experimental designs and datasets. These included 50 human short-read whole-genome sequencing samples, matching long and short-read sequencing data, and simulated short-read NGS data. Our results highlight a great performance variability of the tools across the datasets and suggest that different tools might be suitable for different study designs. However, specialized tools designed to detect exclusively human endogenous retroviruses consistently outperformed generalist tools that detect a wider range of transposable elements. We suggest that, if sufficient computing resources are available, using multiple HERV detection tools to obtain a consensus set of insertion loci may be ideal. Furthermore, given that the false positive discovery rate of the tools varied between 8% and 55% across tools and datasets, we recommend the wet lab validation of predicted insertions if DNA samples are available.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies Idioma: En Revista: Front Bioinform Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Reino Unido

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies Idioma: En Revista: Front Bioinform Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Reino Unido