Efficient identification of trait-associated loss-of-function variants in the UK Biobank cohort by exome-sequencing based genotype imputation.
Genet Epidemiol; 47(2): 121-134, 2023 Mar.
Article | En | MEDLINE | ID: mdl-36490288
The large-scale, open-access whole-exome sequencing (WES) data from ~200,000 UK Biobank (UKB) participants are accelerating a new wave of genetic association studies that aim to identify rare, functional loss-of-function (LoF) variants associated with complex traits and diseases. We proposed merging the WES genotypes and the genome-wide genotyping (GWAS) genotypes of 167,000 homogeneous European UKB participants into a combined reference panel, and then imputing 241,911 homogeneous European UKB participants who had GWAS genotypes only. We then used the imputed data to replicate associations identified in the discovery WES sample. The average imputation accuracy, measured by r², is modest to high for LoF variants across all minor allele frequency (MAF) intervals: 0.942 for MAF in (0.01, 0.5), 0.807 in (1.0 × 10⁻³, 0.01), 0.805 in (1.0 × 10⁻⁴, 1.0 × 10⁻³), 0.664 in (1.0 × 10⁻⁵, 1.0 × 10⁻⁴), and 0.410 in (0, 1.0 × 10⁻⁵). As applications, we studied associations of LoF variants with estimated heel bone mineral density (BMD) and four lipid traits. In addition to replicating dozens of previously reported genes, we identified three novel associations: two genes, PLIN1 and ANGPTL3, for high-density lipoprotein cholesterol, and one gene, PDE3B, for triglycerides. Our results highlight the strength of WES-based genotype imputation and provide useful imputed data within the UKB cohort.
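The per-interval accuracy summary in the abstract can be illustrated with a minimal Python sketch. It assumes that r² is the squared Pearson correlation between true genotypes (0/1/2 allele counts) and imputed dosages, and that each MAF interval is open on the left and closed on the right; the abstract specifies neither, and the study may instead have used the accuracy estimate reported by its imputation software. The function names `dosage_r2`, `maf_bin`, and `mean_r2_by_bin` are hypothetical and are not taken from the paper.

```python
from statistics import mean

# MAF intervals as listed in the abstract, rarest first.
# Assumption: each interval is treated as (lower, upper].
MAF_BINS = [(0.0, 1e-5), (1e-5, 1e-4), (1e-4, 1e-3), (1e-3, 1e-2), (1e-2, 0.5)]


def dosage_r2(true_genotypes, imputed_dosages):
    """Squared Pearson correlation between true genotypes and imputed dosages."""
    mx, my = mean(true_genotypes), mean(imputed_dosages)
    sxy = sum((x - mx) * (y - my) for x, y in zip(true_genotypes, imputed_dosages))
    sxx = sum((x - mx) ** 2 for x in true_genotypes)
    syy = sum((y - my) ** 2 for y in imputed_dosages)
    if sxx == 0 or syy == 0:
        return float("nan")  # undefined when either vector has no variance
    return sxy * sxy / (sxx * syy)


def maf_bin(maf):
    """Return the (lower, upper] interval containing maf, or None if outside all bins."""
    for lower, upper in MAF_BINS:
        if lower < maf <= upper:
            return (lower, upper)
    return None


def mean_r2_by_bin(variants):
    """Average r2 per MAF interval; variants is an iterable of (maf, r2) pairs."""
    grouped = {}
    for maf, r2 in variants:
        interval = maf_bin(maf)
        if interval is not None and r2 == r2:  # r2 == r2 is False for NaN
            grouped.setdefault(interval, []).append(r2)
    return {interval: mean(values) for interval, values in grouped.items()}
```

In this sketch, each variant would first be scored with `dosage_r2` in samples that have both sequenced and imputed genotypes, and the resulting `(maf, r2)` pairs would then be passed to `mean_r2_by_bin` to obtain one average per interval.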
Full text: 1
Collection: 01-international
Database: MEDLINE
Main subject: Biological Specimen Banks / Exome
Study type: Diagnostic_studies / Risk_factors_studies
Limits: Humans
Country/Region as subject: Europe
Language: En
Journal: Genet Epidemiol
Journal subject: EPIDEMIOLOGY / MEDICAL GENETICS
Year: 2023
Document type: Article