RESUMEN
The human leukocyte antigen (HLA) locus is associated with more complex diseases than any other locus in the human genome. In many diseases, HLA explains more heritability than all other known loci combined. In silico HLA imputation methods enable rapid and accurate estimation of HLA alleles in the millions of individuals that are already genotyped on microarrays. HLA imputation has been used to define causal variation in autoimmune diseases, such as type I diabetes, and in human immunodeficiency virus infection control. However, there are few guidelines on performing HLA imputation, association testing, and fine mapping. Here, we present a comprehensive tutorial to impute HLA alleles from genotype data. We provide detailed guidance on performing standard quality control measures for input genotyping data and describe options to impute HLA alleles and amino acids either locally or using the web-based Michigan Imputation Server, which hosts a multi-ancestry HLA imputation reference panel. We also offer best practice recommendations to conduct association tests to define the alleles, amino acids, and haplotypes that affect human traits. Along with the pipeline, we provide a step-by-step online guide with scripts and available software ( https://github.com/immunogenomics/HLA_analyses_tutorial ). This tutorial will be broadly applicable to large-scale genotyping data and will contribute to defining the role of HLA in human diseases across global populations.
Asunto(s)
Antígenos HLA , Antígenos de Histocompatibilidad Clase I , Humanos , Alelos , Antígenos HLA/genética , Genotipo , Haplotipos , Aminoácidos/genética , Polimorfismo de Nucleótido Simple , Estudio de Asociación del Genoma CompletoRESUMEN
OBJECTIVE: To investigate to what extent low-frequency genetic variants (with minor allele frequencies <5%) affect the risk of intracranial aneurysms (IAs). METHODS: One thousand fifty-six patients with IA and 2,097 population-based controls from the Netherlands were genotyped with the Illumina HumanExome BeadChip. After quality control (QC) of samples and single nucleotide variants (SNVs), we conducted a single variant analysis using the Fisher exact test. We also performed the variable threshold (VT) test and the sequence kernel association test (SKAT) at different minor allele count (MAC) thresholds of >5 and >0 to test the hypothesis that multiple variants within the same gene are associated with IA risk. Significant results were tested in a replication cohort of 425 patients with IA and 311 controls, and results of the 2 cohorts were combined in a meta-analysis. RESULTS: After QC, 995 patients with IA and 2,080 controls remained for further analysis. The single variant analysis comprising 46,534 SNVs did not identify significant loci at the genome-wide level. The gene-based tests showed a statistically significant association for fibulin 2 (FBLN2) (best p = 1 × 10-6 for the VT test, MAC >5). Associations were not statistically significant in the independent but smaller replication cohort (p > 0.57) but became slightly stronger in a meta-analysis of the 2 cohorts (best p = 4.8 × 10-7 for the SKAT, MAC ≥1). CONCLUSION: Gene-based tests indicated an association for FBLN2, a gene encoding an extracellular matrix protein implicated in vascular wall remodeling, but independent validation in larger cohorts is warranted. We did not identify any significant associations for single low-frequency genetic variants.
Asunto(s)
Proteínas de Unión al Calcio/genética , Exoma/genética , Proteínas de la Matriz Extracelular/genética , Aneurisma Intracraneal/genética , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Estudios de Casos y Controles , Femenino , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Países Bajos , Análisis de Secuencia por Matrices de Oligonucleótidos , Polimorfismo de Nucleótido Simple , Adulto JovenRESUMEN
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
RESUMEN
OBJECTIVE: We sought to assess whether genetic risk factors for atrial fibrillation (AF) can explain cardioembolic stroke risk. METHODS: We evaluated genetic correlations between a previous genetic study of AF and AF in the presence of cardioembolic stroke using genome-wide genotypes from the Stroke Genetics Network (N = 3,190 AF cases, 3,000 cardioembolic stroke cases, and 28,026 referents). We tested whether a previously validated AF polygenic risk score (PRS) associated with cardioembolic and other stroke subtypes after accounting for AF clinical risk factors. RESULTS: We observed a strong correlation between previously reported genetic risk for AF, AF in the presence of stroke, and cardioembolic stroke (Pearson r = 0.77 and 0.76, respectively, across SNPs with p < 4.4 × 10-4 in the previous AF meta-analysis). An AF PRS, adjusted for clinical AF risk factors, was associated with cardioembolic stroke (odds ratio [OR] per SD = 1.40, p = 1.45 × 10-48), explaining â¼20% of the heritable component of cardioembolic stroke risk. The AF PRS was also associated with stroke of undetermined cause (OR per SD = 1.07, p = 0.004), but no other primary stroke subtypes (all p > 0.1). CONCLUSIONS: Genetic risk of AF is associated with cardioembolic stroke, independent of clinical risk factors. Studies are warranted to determine whether AF genetic risk can serve as a biomarker for strokes caused by AF.
RESUMEN
BACKGROUND: Atherosclerosis is a chronic inflammatory disease in part caused by lipid uptake in the vascular wall, but the exact underlying mechanisms leading to acute myocardial infarction and stroke remain poorly understood. Large consortia identified genetic susceptibility loci that associate with large artery ischemic stroke and coronary artery disease. However, deciphering their underlying mechanisms are challenging. Histological studies identified destabilizing characteristics in human atherosclerotic plaques that associate with clinical outcome. To what extent established susceptibility loci for large artery ischemic stroke and coronary artery disease relate to plaque characteristics is thus far unknown but may point to novel mechanisms. METHODS: We studied the associations of 61 established cardiovascular risk loci with 7 histological plaque characteristics assessed in 1443 carotid plaque specimens from the Athero-Express Biobank Study. We also assessed if the genotyped cardiovascular risk loci impact the tissue-specific gene expression in 2 independent biobanks, Biobank of Karolinska Endarterectomy and Stockholm Atherosclerosis Gene Expression. RESULTS: A total of 21 established risk variants (out of 61) nominally associated to a plaque characteristic. One variant (rs12539895, risk allele A) at 7q22 associated to a reduction of intraplaque fat, P=5.09×10-6 after correction for multiple testing. We further characterized this 7q22 Locus and show tissue-specific effects of rs12539895 on HBP1 expression in plaques and COG5 expression in whole blood and provide data from public resources showing an association with decreased LDL (low-density lipoprotein) and increase HDL (high-density lipoprotein) in the blood. CONCLUSIONS: Our study supports the view that cardiovascular susceptibility loci may exert their effect by influencing the atherosclerotic plaque characteristics.
Asunto(s)
Aterosclerosis/genética , Enfermedades Cardiovasculares/genética , Sitios Genéticos/genética , Predisposición Genética a la Enfermedad/genética , Polimorfismo de Nucleótido Simple , Proteínas Adaptadoras del Transporte Vesicular/genética , Anciano , Alelos , Enfermedades de las Arterias Carótidas/genética , Femenino , Regulación de la Expresión Génica , Genotipo , Proteínas del Grupo de Alta Movilidad/genética , Humanos , Masculino , Persona de Mediana Edad , Placa Aterosclerótica/genética , Proteínas Represoras/genéticaRESUMEN
Aims: Sudden cardiac arrest (SCA) accounts for 10% of adult mortality in Western populations. We aim to identify potential loci associated with SCA and to identify risk factors causally associated with SCA. Methods and results: We carried out a large genome-wide association study (GWAS) for SCA (n = 3939 cases, 25 989 non-cases) to examine common variation genome-wide and in candidate arrhythmia genes. We also exploited Mendelian randomization (MR) methods using cross-trait multi-variant genetic risk score associations (GRSA) to assess causal relationships of 18 risk factors with SCA. No variants were associated with SCA at genome-wide significance, nor were common variants in candidate arrhythmia genes associated with SCA at nominal significance. Using cross-trait GRSA, we established genetic correlation between SCA and (i) coronary artery disease (CAD) and traditional CAD risk factors (blood pressure, lipids, and diabetes), (ii) height and BMI, and (iii) electrical instability traits (QT and atrial fibrillation), suggesting aetiologic roles for these traits in SCA risk. Conclusions: Our findings show that a comprehensive approach to the genetic architecture of SCA can shed light on the determinants of a complex life-threatening condition with multiple influencing factors in the general population. The results of this genetic analysis, both positive and negative findings, have implications for evaluating the genetic architecture of patients with a family history of SCA, and for efforts to prevent SCA in high-risk populations and the general community.
Asunto(s)
Arritmias Cardíacas/genética , Muerte Súbita Cardíaca/etiología , Arritmias Cardíacas/fisiopatología , Índice de Masa Corporal , Enfermedad de la Arteria Coronaria/genética , Femenino , Estudio de Asociación del Genoma Completo , Sistema de Conducción Cardíaco/fisiopatología , Humanos , Masculino , Análisis de la Aleatorización Mendeliana , Polimorfismo de Nucleótido Simple , Medición de Riesgo/métodos , Factores de Riesgo , Factores SexualesRESUMEN
A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has not been fixed in the paper.
RESUMEN
Stroke has multiple etiologies, but the underlying genes and pathways are largely unknown. We conducted a multiancestry genome-wide-association meta-analysis in 521,612 individuals (67,162 cases and 454,450 controls) and discovered 22 new stroke risk loci, bringing the total to 32. We further found shared genetic variation with related vascular traits, including blood pressure, cardiac traits, and venous thromboembolism, at individual loci (n = 18), and using genetic risk scores and linkage-disequilibrium-score regression. Several loci exhibited distinct association and pleiotropy patterns for etiological stroke subtypes. Eleven new susceptibility loci indicate mechanisms not previously implicated in stroke pathophysiology, with prioritization of risk variants and genes accomplished through bioinformatics analyses using extensive functional datasets. Stroke risk loci were significantly enriched in drug targets for antithrombotic therapy.
Asunto(s)
Accidente Cerebrovascular/genética , Biología Computacional , Bases de Datos Genéticas , Epigénesis Genética , Femenino , Redes Reguladoras de Genes , Sitios Genéticos , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Mutación INDEL , Desequilibrio de Ligamiento , Masculino , Modelos Genéticos , Polimorfismo de Nucleótido Simple , Factores de Riesgo , Accidente Cerebrovascular/clasificación , Accidente Cerebrovascular/fisiopatologíaRESUMEN
Importance: Risk factors for abdominal aortic aneurysm (AAA) are largely unknown, which has hampered the development of nonsurgical treatments to alter the natural history of disease. Objective: To investigate the association between lipid-associated single-nucleotide polymorphisms (SNPs) and AAA risk. Design, Setting, and Participants: Genetic risk scores, composed of lipid trait-associated SNPs, were constructed and tested for their association with AAA using conventional (inverse-variance weighted) mendelian randomization (MR) and data from international AAA genome-wide association studies. Sensitivity analyses to account for potential genetic pleiotropy included MR-Egger and weighted median MR, and multivariable MR method was used to test the independent association of lipids with AAA risk. The association between AAA and SNPs in loci that can act as proxies for drug targets was also assessed. Data collection took place between January 9, 2015, and January 4, 2016. Data analysis was conducted between January 4, 2015, and December 31, 2016. Exposures: Genetic elevation of low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), and triglycerides (TG). Main Outcomes and Measures: The association between genetic risk scores of lipid-associated SNPs and AAA risk, as well as the association between SNPs in lipid drug targets (HMGCR, CETP, and PCSK9) and AAA risk. Results: Up to 4914 cases and 48â¯002 controls were included in our analysis. A 1-SD genetic elevation of LDL-C was associated with increased AAA risk (odds ratio [OR], 1.66; 95% CI, 1.41-1.96; P = 1.1 × 10-9). For HDL-C, a 1-SD increase was associated with reduced AAA risk (OR, 0.67; 95% CI, 0.55-0.82; P = 8.3 × 10-5), whereas a 1-SD increase in triglycerides was associated with increased AAA risk (OR, 1.69; 95% CI, 1.38-2.07; P = 5.2 × 10-7). In multivariable MR analysis and both MR-Egger and weighted median MR methods, the association of each lipid fraction with AAA risk remained largely unchanged. The LDL-C-reducing allele of rs12916 in HMGCR was associated with AAA risk (OR, 0.93; 95% CI, 0.89-0.98; P = .009). The HDL-C-raising allele of rs3764261 in CETP was associated with lower AAA risk (OR, 0.89; 95% CI, 0.85-0.94; P = 3.7 × 10-7). Finally, the LDL-C-lowering allele of rs11206510 in PCSK9 was weakly associated with a lower AAA risk (OR, 0.94; 95% CI, 0.88-1.00; P = .04), but a second independent LDL-C-lowering variant in PCSK9 (rs2479409) was not associated with AAA risk (OR, 0.97; 95% CI, 0.92-1.02; P = .28). Conclusions and Relevance: The MR analyses in this study lend support to the hypothesis that lipids play an important role in the etiology of AAA. Analyses of individual genetic variants used as proxies for drug targets support LDL-C lowering as a potential effective treatment strategy for preventing and managing AAA.
Asunto(s)
Aneurisma de la Aorta Abdominal/genética , Hipolipemiantes/uso terapéutico , Metabolismo de los Lípidos/genética , Polimorfismo de Nucleótido Simple/genética , Triglicéridos/metabolismo , HDL-Colesterol/genética , HDL-Colesterol/metabolismo , LDL-Colesterol/genética , LDL-Colesterol/metabolismo , Estudio de Asociación del Genoma Completo , Humanos , Análisis de la Aleatorización Mendeliana , Factores de Riesgo , Triglicéridos/genéticaRESUMEN
Background: Previous genetic association studies of human immunodeficiency virus-1 (HIV-1) progression have focused on common human genetic variation ascertained through genome-wide genotyping. Methods: We sought to systematically assess the full spectrum of functional variation in protein coding gene regions on HIV-1 progression through exome sequencing of 1327 individuals. Genetic variants were tested individually and in aggregate across genes and gene sets for an influence on HIV-1 viral load. Results: Multiple single variants within the major histocompatibility complex (MHC) region were observed to be strongly associated with HIV-1 outcome, consistent with the known impact of classical HLA alleles. However, no single variant or gene located outside of the MHC region was significantly associated with HIV progression. Set-based association testing focusing on genes identified as being essential for HIV replication in genome-wide small interfering RNA (siRNA) and clustered regularly interspaced short palindromic repeats (CRISPR) studies did not reveal any novel associations. Conclusions: These results suggest that exonic variants with large effect sizes are unlikely to have a major contribution to host control of HIV infection.
Asunto(s)
Secuenciación del Exoma , Infecciones por VIH/genética , Infecciones por VIH/virología , VIH-1/genética , Interacciones Huésped-Patógeno/genética , Carga Viral/genética , Adulto , Femenino , Predisposición Genética a la Enfermedad , Variación Genética , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido SimpleRESUMEN
We aimed to replicate reported associations of 10 SNPs at eight distinct loci with overall ischemic stroke (IS) and its subtypes in an independent cohort of Dutch IS patients. We included 1,375 IS patients enrolled in a prospective multicenter hospital-based cohort in the Netherlands, and 1,533 population-level controls of Dutch descent. We tested these SNPs for association with overall IS and its subtypes (large artery atherosclerosis, small vessel disease and cardioembolic stroke (CE), as classified by TOAST) using an additive multivariable logistic regression model, adjusting for age and sex. We obtained odds ratios (OR) with 95% confidence intervals (95% CI) for the risk allele of each SNP analyzed and exact p-values by permutation. We confirmed the association at 4q25 (PITX2) (OR 1.43; 95% CI, 1.13-1.81, p = 0.029) and 16q22 (ZFHX3) (OR 1.62; 95% CI, 1.26-2.07, p = 0.001) as risk loci for CE. Locus 16q22 was also associated with overall IS (OR 1.24; 95% CI, 1.08-1.42, p = 0.016). Other loci previously associated with IS and/or its subtypes were not confirmed. In conclusion, we validated two loci (4q25, 16q22) associated with CE. In addition, our study may suggest that the association of locus 16q22 may not be limited to CE, but also includes overall IS.
Asunto(s)
Isquemia Encefálica/genética , Sitios Genéticos , Polimorfismo de Nucleótido Simple , Accidente Cerebrovascular/genética , Factores de Edad , Anciano , Isquemia Encefálica/epidemiología , Estudios de Casos y Controles , Predisposición Genética a la Enfermedad , Humanos , Persona de Mediana Edad , Países Bajos/epidemiología , Estudios Prospectivos , Factores Sexuales , Accidente Cerebrovascular/epidemiologíaRESUMEN
This corrects the article DOI: 10.1038/ncomms15805.
RESUMEN
Reduced cardiac vagal control reflected in low heart rate variability (HRV) is associated with greater risks for cardiac morbidity and mortality. In two-stage meta-analyses of genome-wide association studies for three HRV traits in up to 53,174 individuals of European ancestry, we detect 17 genome-wide significant SNPs in eight loci. HRV SNPs tag non-synonymous SNPs (in NDUFA11 and KIAA1755), expression quantitative trait loci (eQTLs) (influencing GNG11, RGS6 and NEO1), or are located in genes preferentially expressed in the sinoatrial node (GNG11, RGS6 and HCN4). Genetic risk scores account for 0.9 to 2.6% of the HRV variance. Significant genetic correlation is found for HRV with heart rate (-0.74Asunto(s)
Cardiopatías/genética
, Frecuencia Cardíaca
, Presión Sanguínea
, Estudios de Cohortes
, Predisposición Genética a la Enfermedad
, Estudio de Asociación del Genoma Completo
, Cardiopatías/fisiopatología
, Humanos
, Canales Regulados por Nucleótidos Cíclicos Activados por Hiperpolarización/genética
, Proteínas Musculares/genética
, Polimorfismo de Nucleótido Simple
, Canales de Potasio/genética
, Sitios de Carácter Cuantitativo
, Proteínas RGS/genética
, Factores de Riesgo
, Población Blanca/genética
RESUMEN
Negative selection against deleterious alleles produced by mutation influences within-population variation as the most pervasive form of natural selection. However, it is not known whether deleterious alleles affect fitness independently, so that cumulative fitness loss depends exponentially on the number of deleterious alleles, or synergistically, so that each additional deleterious allele results in a larger decrease in relative fitness. Negative selection with synergistic epistasis should produce negative linkage disequilibrium between deleterious alleles and, therefore, an underdispersed distribution of the number of deleterious alleles in the genome. Indeed, we detected underdispersion of the number of rare loss-of-function alleles in eight independent data sets from human and fly populations. Thus, selection against rare protein-disrupting alleles is characterized by synergistic epistasis, which may explain how human and fly populations persist despite high genomic mutation rates.
Asunto(s)
Drosophila melanogaster/genética , Epistasis Genética , Genoma Humano , Genoma de los Insectos , Tasa de Mutación , Selección Genética , Alelos , Animales , Aptitud Genética , Humanos , Desequilibrio de Ligamiento , Mutación MissenseRESUMEN
Genome-wide association studies (GWAS) of common disease have been hugely successful in implicating loci that modify disease risk. The bulk of these associations have proven robust and reproducible, in part due to community adoption of statistical criteria for claiming significant genotype-phenotype associations. As the cost of sequencing continues to drop, assembling large samples in global populations is becoming increasingly feasible. Sequencing studies interrogate not only common variants, as was true for genotyping-based GWAS, but variation across the full allele frequency spectrum, yielding many more (independent) statistical tests. We sought to empirically determine genome-wide significance thresholds for various analysis scenarios. Using whole-genome sequence data, we simulated sequencing-based disease studies of varying sample size and ancestry. We determined that future sequencing efforts in >2,000 samples of European, Asian, or admixed ancestry should set genome-wide significance at approximately P = 5 × 10-9 , and studies of African samples should apply a more stringent genome-wide significance threshold of P = 1 × 10-9 . Adoption of a revised multiple test correction will be crucial in avoiding irreproducible claims of association.
Asunto(s)
Etnicidad/genética , Genoma Humano , Estudio de Asociación del Genoma Completo/métodos , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Secuenciación de Nucleótidos de Alto Rendimiento/estadística & datos numéricos , Metagenómica , Polimorfismo de Nucleótido Simple/genética , Genotipo , Salud Global , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , HumanosRESUMEN
OBJECTIVE: Genome-wide association studies (GWAS) have been successful at identifying associations with stroke and stroke subtypes, but have not yet identified any associations solely with small vessel stroke (SVS). SVS comprises one quarter of all ischemic stroke and is a major manifestation of cerebral small vessel disease, the primary cause of vascular cognitive impairment. Studies across neurological traits have shown that younger-onset cases have an increased genetic burden. We leveraged this increased genetic burden by performing an age-at-onset informed GWAS meta-analysis, including a large younger-onset SVS population, to identify novel associations with stroke. METHODS: We used a three-stage age-at-onset informed GWAS to identify novel genetic variants associated with stroke. On identifying a novel locus associated with SVS, we assessed its influence on other small vessel disease phenotypes, as well as on messenger RNA (mRNA) expression of nearby genes, and on DNA methylation of nearby CpG sites in whole blood and in the fetal brain. RESULTS: We identified an association with SVS in 4,203 cases and 50,728 controls on chromosome 16q24.2 (odds ratio [OR; 95% confidence interval {CI}] = 1.16 [1.10-1.22]; p = 3.2 × 10-9 ). The lead single-nucleotide polymorphism (rs12445022) was also associated with cerebral white matter hyperintensities (OR [95% CI] = 1.10 [1.05-1.16]; p = 5.3 × 10-5 ; N = 3,670), but not intracerebral hemorrhage (OR [95% CI] = 0.97 [0.84-1.12]; p = 0.71; 1,545 cases, 1,481 controls). rs12445022 is associated with mRNA expression of ZCCHC14 in arterial tissues (p = 9.4 × 10-7 ) and DNA methylation at probe cg16596957 in whole blood (p = 5.3 × 10-6 ). INTERPRETATION: 16q24.2 is associated with SVS. Associations of the locus with expression of ZCCHC14 and DNA methylation suggest the locus acts through changes to regulatory elements. Ann Neurol 2017;81:383-394.
Asunto(s)
Enfermedades de los Pequeños Vasos Cerebrales/genética , Cromosomas Humanos Par 16/genética , Estudio de Asociación del Genoma Completo , Accidente Cerebrovascular/genética , Dedos de Zinc/genética , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Sitios Genéticos , Variación Genética , Humanos , Masculino , Persona de Mediana Edad , Accidente Vascular Cerebral Lacunar/genéticaRESUMEN
Overlap between non-coding DNA regulatory sequences and common variant associations can help to identify specific cell and tissue types that are relevant for particular diseases. In a systematic manner, we analyzed variants from 94 genome-wide association studies (reporting at least 12 loci at p<5x10-8) by projecting them onto 466 epigenetic datasets (characterizing DNase I hypersensitive sites; DHSs) derived from various adult and fetal tissue samples and cell lines including many biological replicates. We were able to confirm many expected associations, such as the involvement of specific immune cell types in immune-related diseases and tissue types in diseases that affect specific organs, for example, inflammatory bowel disease and coronary artery disease. Other notable associations include adrenal glands in coronary artery disease, the immune system in Alzheimer's disease, and the kidney for bone marrow density. The association signals for some GWAS (for example, myopia or age at menarche) did not show a clear pattern with any of the cell or tissue types studied. In general, the identified variants from GWAS tend to be located outside coding regions. Altogether, we have performed an extensive characterization of GWAS signals in relation to cell and tissue-specific DHSs, demonstrating a key role for regulatory mechanisms in common diseases and complex traits.
Asunto(s)
Enfermedad de Alzheimer/genética , Enfermedad de la Arteria Coronaria/genética , Bases de Datos de Ácidos Nucleicos , Enfermedades del Sistema Inmune/genética , Enfermedades Inflamatorias del Intestino/genética , Secuencias Reguladoras de Ácidos Nucleicos , Adulto , Femenino , Estudio de Asociación del Genoma Completo , Humanos , MasculinoRESUMEN
Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies provide neither a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously under reported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals.
Asunto(s)
Genoma Humano , Variación Estructural del Genoma , Genómica , Algoritmos , Cromosomas/ultraestructura , Biología Computacional , Eliminación de Gen , Genotipo , Haplotipos , Humanos , Mutación INDEL , Desequilibrio de Ligamiento , Países Bajos , Reacción en Cadena de la Polimerasa , Polimorfismo de Nucleótido Simple , ARN/metabolismo , Análisis de Secuencia de ADN , Análisis de Secuencia de ARN , Programas InformáticosRESUMEN
The ST-segment and adjacent T-wave (ST-T wave) amplitudes of the electrocardiogram are quantitative characteristics of cardiac repolarization. Repolarization abnormalities have been linked to ventricular arrhythmias and sudden cardiac death. We performed the first genome-wide association meta-analysis of ST-T-wave amplitudes in up to 37 977 individuals identifying 71 robust genotype-phenotype associations clustered within 28 independent loci. Fifty-four genes were prioritized as candidates underlying the phenotypes, including genes with established roles in the cardiac repolarization phase (SCN5A/SCN10A, KCND3, KCNB1, NOS1AP and HEY2) and others with as yet undefined cardiac function. These associations may provide insights in the spatiotemporal contribution of genetic variation influencing cardiac repolarization and provide novel leads for future functional follow-up.