Your browser doesn't support javascript.
loading
Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing.
Hanks, Sarah C; Forer, Lukas; Schönherr, Sebastian; LeFaive, Jonathon; Martins, Taylor; Welch, Ryan; Gagliano Taliun, Sarah A; Braff, David; Johnsen, Jill M; Kenny, Eimear E; Konkle, Barbara A; Laakso, Markku; Loos, Ruth F J; McCarroll, Steven; Pato, Carlos; Pato, Michele T; Smith, Albert V; Boehnke, Michael; Scott, Laura J; Fuchsberger, Christian.
Afiliação
  • Hanks SC; Department of Biostatistics and Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Forer L; Institute of Genetic Epidemiology, Medical University of Innsbruck, Innsbruck, Austria.
  • Schönherr S; Institute of Genetic Epidemiology, Medical University of Innsbruck, Innsbruck, Austria.
  • LeFaive J; Department of Biostatistics and Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Martins T; Department of Biostatistics and Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Welch R; Department of Biostatistics and Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Gagliano Taliun SA; Department of Medicine and Department of Neurosciences, Université de Montréal, Montreal, QC, Canada; Research Centre, Montreal Heart Institute, Montreal, QC, Canada.
  • Braff D; Department of Psychiatry, University of California San Diego, La Jolla, CA, USA.
  • Johnsen JM; Research Institute, Bloodworks, Seattle, WA, USA; Department of Medicine, University of Washington, Seattle, WA, USA.
  • Kenny EE; Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
  • Konkle BA; Department of Medicine, University of Washington, Seattle, WA, USA.
  • Laakso M; Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland.
  • Loos RFJ; Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
  • McCarroll S; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Genetics, Harvard Medical School, Boston, MA, USA.
  • Pato C; Departments of Psychiatry, Rutgers University, Robert Wood Johnson Medical School and New Jersey Medical School, New Brunswick, NJ, USA.
  • Pato MT; Departments of Psychiatry, Rutgers University, Robert Wood Johnson Medical School and New Jersey Medical School, New Brunswick, NJ, USA.
  • Smith AV; Department of Biostatistics and Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Boehnke M; Department of Biostatistics and Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Scott LJ; Department of Biostatistics and Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Fuchsberger C; Institute for Biomedicine (Affiliated with the University of Lübeck), Eurac Research, Bolzano, Italy. Electronic address: cfuchsberger@eurac.edu.
Am J Hum Genet ; 109(9): 1653-1666, 2022 09 01.
Article em En | MEDLINE | ID: mdl-35981533
ABSTRACT
Understanding the genetic basis of human diseases and traits is dependent on the identification and accurate genotyping of genetic variants. Deep whole-genome sequencing (WGS), the gold standard technology for SNP and indel identification and genotyping, remains very expensive for most large studies. Here, we quantify the extent to which array genotyping followed by genotype imputation can approximate WGS in studies of individuals of African, Hispanic/Latino, and European ancestry in the US and of Finnish ancestry in Finland (a population isolate). For each study, we performed genotype imputation by using the genetic variants present on the Illumina Core, OmniExpress, MEGA, and Omni 2.5M arrays with the 1000G, HRC, and TOPMed imputation reference panels. Using the Omni 2.5M array and the TOPMed panel, ≥90% of bi-allelic single-nucleotide variants (SNVs) are well imputed (r2 > 0.8) down to minor-allele frequencies (MAFs) of 0.14% in African, 0.11% in Hispanic/Latino, 0.35% in European, and 0.85% in Finnish ancestries. There was little difference in TOPMed-based imputation quality among the arrays with >700k variants. Individual-level imputation quality varied widely between and within the three US studies. Imputation quality also varied across genomic regions, producing regions where even common (MAF > 5%) variants were consistently not well imputed across ancestries. The extent to which array genotyping and imputation can approximate WGS therefore depends on reference panel, genotype array, sample ancestry, and genomic location. Imputation quality by variant or genomic region can be queried with our new tool, RsqBrowser, now deployed on the Michigan Imputation Server.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Polimorfismo de Nucleotídeo Único / Sequenciamento de Nucleotídeos em Larga Escala Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Polimorfismo de Nucleotídeo Único / Sequenciamento de Nucleotídeos em Larga Escala Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article