Your browser doesn't support javascript.
loading
Best practices for analyzing imputed genotypes from low-pass sequencing in dogs.
Buckley, Reuben M; Harris, Alex C; Wang, Guo-Dong; Whitaker, D Thad; Zhang, Ya-Ping; Ostrander, Elaine A.
Afiliação
  • Buckley RM; Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50, Room 5351, Bethesda, MD, 20892 , USA.
  • Harris AC; Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50, Room 5351, Bethesda, MD, 20892 , USA.
  • Wang GD; State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.
  • Whitaker DT; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
  • Zhang YP; Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50, Room 5351, Bethesda, MD, 20892 , USA.
  • Ostrander EA; State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.
Mamm Genome ; 33(1): 213-229, 2022 03.
Article em En | MEDLINE | ID: mdl-34498136
ABSTRACT
Although DNA array-based approaches for genome-wide association studies (GWAS) permit the collection of thousands of low-cost genotypes, it is often at the expense of resolution and completeness, as SNP chip technologies are ultimately limited by SNPs chosen during array development. An alternative low-cost approach is low-pass whole genome sequencing (WGS) followed by imputation. Rather than relying on high levels of genotype confidence at a set of select loci, low-pass WGS and imputation rely on the combined information from millions of randomly sampled low-confidence genotypes. To investigate low-pass WGS and imputation in the dog, we assessed accuracy and performance by downsampling 97 high-coverage (> 15×) WGS datasets from 51 different breeds to approximately 1× coverage, simulating low-pass WGS. Using a reference panel of 676 dogs from 91 breeds, genotypes were imputed from the downsampled data and compared to a truth set of genotypes generated from high-coverage WGS. Using our truth set, we optimized a variant quality filtering strategy that retained approximately 80% of 14 M imputed sites and lowered the imputation error rate from 3.0% to 1.5%. Seven million sites remained with a MAF > 5% and an average imputation quality score of 0.95. Finally, we simulated the impact of imputation errors on outcomes for case-control GWAS, where small effect sizes were most impacted and medium-to-large effect sizes were minorly impacted. These analyses provide best practice guidelines for study design and data post-processing of low-pass WGS-imputed genotypes in dogs.
Assuntos

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Polimorfismo de Nucleotídeo Único / Estudo de Associação Genômica Ampla Tipo de estudo: Guideline / Observational_studies / Risk_factors_studies Limite: Animals Idioma: En Revista: Mamm Genome Assunto da revista: GENETICA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Polimorfismo de Nucleotídeo Único / Estudo de Associação Genômica Ampla Tipo de estudo: Guideline / Observational_studies / Risk_factors_studies Limite: Animals Idioma: En Revista: Mamm Genome Assunto da revista: GENETICA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Estados Unidos