Your browser doesn't support javascript.
loading
INSurVeyor: improving insertion calling from short read sequencing data.
Rajaby, Ramesh; Liu, Dong-Xu; Au, Chun Hang; Cheung, Yuen-Ting; Lau, Amy Yuet Ting; Yang, Qing-Yong; Sung, Wing-Kin.
Afiliação
  • Rajaby R; Hong Kong Genome Institute, Hong Kong Science Park, Shatin, Hong Kong, China.
  • Liu DX; A*STAR Genome Institute of Singapore, 60 Biopolis Street, Singapore, 138672, Singapore.
  • Au CH; National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China.
  • Cheung YT; Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China.
  • Lau AYT; Hong Kong Genome Institute, Hong Kong Science Park, Shatin, Hong Kong, China.
  • Yang QY; Hong Kong Genome Institute, Hong Kong Science Park, Shatin, Hong Kong, China.
  • Sung WK; Hong Kong Genome Institute, Hong Kong Science Park, Shatin, Hong Kong, China.
Nat Commun ; 14(1): 3243, 2023 06 05.
Article em En | MEDLINE | ID: mdl-37277343
ABSTRACT
Insertions are one of the major types of structural variations and are defined as the addition of 50 nucleotides or more into a DNA sequence. Several methods exist to detect insertions from next-generation sequencing short read data, but they generally have low sensitivity. Our contribution is two-fold. First, we introduce INSurVeyor, a fast, sensitive and precise method that detects insertions from next-generation sequencing paired-end data. Using publicly available benchmark datasets (both human and non-human), we show that INSurVeyor is not only more sensitive than any individual caller we tested, but also more sensitive than all of them combined. Furthermore, for most types of insertions, INSurVeyor is almost as sensitive as long reads callers. Second, we provide state-of-the-art catalogues of insertions for 1047 Arabidopsis Thaliana genomes from the 1001 Genomes Project and 3202 human genomes from the 1000 Genomes Project, both generated with INSurVeyor. We show that they are more complete and precise than existing resources, and important insertions are missed by existing methods.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2023 Tipo de documento: Article