RESUMEN
BACKGROUND: In the past decade many Genome-wide Association Studies (GWAS) were performed that discovered new associations between single-nucleotide polymorphisms (SNPs) and various phenotypes. Imputation methods are widely used in GWAS. They facilitate the phenotype association with variants that are not directly genotyped. Imputation methods can also be used to combine and analyse data genotyped on different genotyping arrays. In this study we investigated the imputation quality and efficiency of two different approaches of combining GWAS data from different genotyping platforms. We investigated whether combining data from different platforms before the actual imputation performs better than combining the data from different platforms after imputation. METHODS: In total 979 unique individuals from the AMC-PAS cohort were genotyped on 3 different platforms. A total of 706 individuals were genotyped on the MetaboChip, a total of 757 individuals were genotyped on the 50K gene-centric Human CVD BeadChip, and a total of 955 individuals were genotyped on the HumanExome chip. A total of 397 individuals were genotyped on all 3 individual platforms. After pre-imputation quality control (QC), Minimac in combination with MaCH was used for the imputation of all samples with the 1,000 genomes reference panel. All imputed markers with an r2 value of <0.3 were excluded in our post-imputation QC. RESULTS: A total of 397 individuals were genotyped on all three platforms. All three datasets were carefully matched on strand, SNP ID and genomic coordinates. This resulted in a dataset of 979 unique individuals and a total of 258,925 unique markers. A total of 4,117,036 SNPs were available when imputation was performed before merging the three datasets. A total of 3,933,494 SNPs were available when imputation was done on the combined set. Our results suggest that imputation of individual datasets before merging performs slightly better than after combining the different datasets. CONCLUSIONS: Imputation of datasets genotyped by different platforms before merging generates more SNPs than imputation after putting the datasets together.
Asunto(s)
Estudio de Asociación del Genoma Completo , Genotipo , Polimorfismo de Nucleótido Simple , Estudios de Cohortes , Simulación por Computador , Exoma , Frecuencia de los Genes , Genoma Humano , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos , Fenotipo , Control de CalidadRESUMEN
It was investigated whether pharmacogenetic factors, both as single polymorphism and as gene-gene interactions, have an added value over non-genetic factors in predicting statin response. Five common polymorphisms were selected in apolipoprotein E, angiotensin-converting enzyme, hepatic lipase and toll-like receptor 4. Linear regression models were built and compared on R(2) to estimate the added value of single polymorphisms and gene-gene interactions. The selected polymorphisms and the gene-gene interactions had a small added value in predicting change in low-density lipoprotein cholesterol levels (LDL-c) as response to statins over the non-genetic predictors (P=0.104), and also in predicting LDL-c in non-treated patients (P=0.016). Moreover, four gene-gene interactions with statin therapy were identified. The added value of genetic factors over non-genetic variables is for the greater part produced by gene-gene interactions. This underlines the importance to examine gene-gene interactions in future (pharmaco)genetic research.