Your browser doesn't support javascript.
loading
Comprehensive analysis to improve the validation rate for single nucleotide variants detected by next-generation sequencing.
Park, Mi-Hyun; Rhee, Hwanseok; Park, Jung Hoon; Woo, Hae-Mi; Choi, Byung-Ok; Kim, Bo-Young; Chung, Ki Wha; Cho, Yoo-Bok; Kim, Hyung Jin; Jung, Ji-Won; Koo, Soo Kyung.
Afiliação
  • Park MH; Division of Intractable Diseases, Center for Biomedical Sciences, National Institute of Health, Chungcheongbuk-do, South Korea.
  • Rhee H; Macrogen Inc., Gasan-dong, Seoul, South Korea.
  • Park JH; Macrogen Inc., Gasan-dong, Seoul, South Korea.
  • Woo HM; Division of Intractable Diseases, Center for Biomedical Sciences, National Institute of Health, Chungcheongbuk-do, South Korea.
  • Choi BO; Department of Neurology, Samsung Medical Center, Sungkyunkwan University, School of Medicine, Seoul, South Korea.
  • Kim BY; Division of Intractable Diseases, Center for Biomedical Sciences, National Institute of Health, Chungcheongbuk-do, South Korea.
  • Chung KW; Department of Biological Science, Kongju National University, Gongju, South Korea.
  • Cho YB; Department of Biological Sciences, KAIST, Yuseong-gu, Daejeon, South Korea.
  • Kim HJ; Division of Intractable Diseases, Center for Biomedical Sciences, National Institute of Health, Chungcheongbuk-do, South Korea.
  • Jung JW; Division of Intractable Diseases, Center for Biomedical Sciences, National Institute of Health, Chungcheongbuk-do, South Korea.
  • Koo SK; Division of Intractable Diseases, Center for Biomedical Sciences, National Institute of Health, Chungcheongbuk-do, South Korea.
PLoS One ; 9(1): e86664, 2014.
Article em En | MEDLINE | ID: mdl-24489763
ABSTRACT
Next-generation sequencing (NGS) has enabled the high-throughput discovery of germline and somatic mutations. However, NGS-based variant detection is still prone to errors, resulting in inaccurate variant calls. Here, we categorized the variants detected by NGS according to total read depth (TD) and SNP quality (SNPQ), and performed Sanger sequencing with 348 selected non-synonymous single nucleotide variants (SNVs) for validation. Using the SAMtools and GATK algorithms, the validation rate was positively correlated with SNPQ but showed no correlation with TD. In addition, common variants called by both programs had a higher validation rate than caller-specific variants. We further examined several parameters to improve the validation rate, and found that strand bias (SB) was a key parameter. SB in NGS data showed a strong difference between the variants passing validation and those that failed validation, showing a validation rate of more than 92% (filtering cutoff value alternate allele forward [AF] ≥ 20 and AF<80 in SAMtools, SB<-10 in GATK). Moreover, the validation rate increased significantly (up to 97-99%) when the variant was filtered together with the suggested values of mapping quality (MQ), SNPQ and SB. This detailed and systematic study provides comprehensive recommendations for improving validation rates, saving time and lowering cost in NGS analyses.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Doença de Charcot-Marie-Tooth / Polimorfismo de Nucleotídeo Único Limite: Humans Idioma: En Revista: PLoS One Assunto da revista: CIENCIA / MEDICINA Ano de publicação: 2014 Tipo de documento: Article País de afiliação: Coréia do Sul

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Doença de Charcot-Marie-Tooth / Polimorfismo de Nucleotídeo Único Limite: Humans Idioma: En Revista: PLoS One Assunto da revista: CIENCIA / MEDICINA Ano de publicação: 2014 Tipo de documento: Article País de afiliação: Coréia do Sul