Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 53
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 187(2): 464-480.e10, 2024 01 18.
Artículo en Inglés | MEDLINE | ID: mdl-38242088

RESUMEN

Primary open-angle glaucoma (POAG), the leading cause of irreversible blindness worldwide, disproportionately affects individuals of African ancestry. We conducted a genome-wide association study (GWAS) for POAG in 11,275 individuals of African ancestry (6,003 cases; 5,272 controls). We detected 46 risk loci associated with POAG at genome-wide significance. Replication and post-GWAS analyses, including functionally informed fine-mapping, multiple trait co-localization, and in silico validation, implicated two previously undescribed variants (rs1666698 mapping to DBF4P2; rs34957764 mapping to ROCK1P1) and one previously associated variant (rs11824032 mapping to ARHGEF12) as likely causal. For individuals of African ancestry, a polygenic risk score (PRS) for POAG from our mega-analysis (African ancestry individuals) outperformed a PRS from summary statistics of a much larger GWAS derived from European ancestry individuals. This study quantifies the genetic architecture similarities and differences between African and non-African ancestry populations for this blinding disease.


Asunto(s)
Estudio de Asociación del Genoma Completo , Glaucoma de Ángulo Abierto , Humanos , Predisposición Genética a la Enfermedad , Glaucoma de Ángulo Abierto/genética , Población Negra/genética , Polimorfismo de Nucleótido Simple/genética
2.
Am J Hum Genet ; 110(4): 575-591, 2023 04 06.
Artículo en Inglés | MEDLINE | ID: mdl-37028392

RESUMEN

Leveraging linkage disequilibrium (LD) patterns as representative of population substructure enables the discovery of additive association signals in genome-wide association studies (GWASs). Standard GWASs are well-powered to interrogate additive models; however, new approaches are required for invesigating other modes of inheritance such as dominance and epistasis. Epistasis, or non-additive interaction between genes, exists across the genome but often goes undetected because of a lack of statistical power. Furthermore, the adoption of LD pruning as customary in standard GWASs excludes detection of sites that are in LD but might underlie the genetic architecture of complex traits. We hypothesize that uncovering long-range interactions between loci with strong LD due to epistatic selection can elucidate genetic mechanisms underlying common diseases. To investigate this hypothesis, we tested for associations between 23 common diseases and 5,625,845 epistatic SNP-SNP pairs (determined by Ohta's D statistics) in long-range LD (>0.25 cM). Across five disease phenotypes, we identified one significant and four near-significant associations that replicated in two large genotype-phenotype datasets (UK Biobank and eMERGE). The genes that were most likely involved in the replicated associations were (1) members of highly conserved gene families with complex roles in multiple pathways, (2) essential genes, and/or (3) genes that were associated in the literature with complex traits that display variable expressivity. These results support the highly pleiotropic and conserved nature of variants in long-range LD under epistatic selection. Our work supports the hypothesis that epistatic interactions regulate diverse clinical mechanisms and might especially be driving factors in conditions with a wide range of phenotypic outcomes.


Asunto(s)
Epistasis Genética , Estudio de Asociación del Genoma Completo , Desequilibrio de Ligamiento/genética , Genotipo , Bancos de Muestras Biológicas , Reino Unido , Polimorfismo de Nucleótido Simple/genética
3.
PLoS Genet ; 17(6): e1009534, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-34086673

RESUMEN

Assumptions are made about the genetic model of single nucleotide polymorphisms (SNPs) when choosing a traditional genetic encoding: additive, dominant, and recessive. Furthermore, SNPs across the genome are unlikely to demonstrate identical genetic models. However, running SNP-SNP interaction analyses with every combination of encodings raises the multiple testing burden. Here, we present a novel and flexible encoding for genetic interactions, the elastic data-driven genetic encoding (EDGE), in which SNPs are assigned a heterozygous value based on the genetic model they demonstrate in a dataset prior to interaction testing. We assessed the power of EDGE to detect genetic interactions using 29 combinations of simulated genetic models and found it outperformed the traditional encoding methods across 10%, 30%, and 50% minor allele frequencies (MAFs). Further, EDGE maintained a low false-positive rate, while additive and dominant encodings demonstrated inflation. We evaluated EDGE and the traditional encodings with genetic data from the Electronic Medical Records and Genomics (eMERGE) Network for five phenotypes: age-related macular degeneration (AMD), age-related cataract, glaucoma, type 2 diabetes (T2D), and resistant hypertension. A multi-encoding genome-wide association study (GWAS) for each phenotype was performed using the traditional encodings, and the top results of the multi-encoding GWAS were considered for SNP-SNP interaction using the traditional encodings and EDGE. EDGE identified a novel SNP-SNP interaction for age-related cataract that no other method identified: rs7787286 (MAF: 0.041; intergenic region of chromosome 7)-rs4695885 (MAF: 0.34; intergenic region of chromosome 4) with a Bonferroni LRT p of 0.018. A SNP-SNP interaction was found in data from the UK Biobank within 25 kb of these SNPs using the recessive encoding: rs60374751 (MAF: 0.030) and rs6843594 (MAF: 0.34) (Bonferroni LRT p: 0.026). We recommend using EDGE to flexibly detect interactions between SNPs exhibiting diverse action.


Asunto(s)
Modelos Genéticos , Catarata/genética , Conjuntos de Datos como Asunto , Diabetes Mellitus Tipo 2/genética , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Glaucoma/genética , Humanos , Hipertensión/genética , Degeneración Macular/genética , Fenotipo , Polimorfismo de Nucleótido Simple
4.
J Transl Med ; 20(1): 550, 2022 11 28.
Artículo en Inglés | MEDLINE | ID: mdl-36443877

RESUMEN

BACKGROUND: Pharmacogenomics (PGx) aims to utilize a patient's genetic data to enable safer and more effective prescribing of medications. The Clinical Pharmacogenetics Implementation Consortium (CPIC) provides guidelines with strong evidence for 24 genes that affect 72 medications. Despite strong evidence linking PGx alleles to drug response, there is a large gap in the implementation and return of actionable pharmacogenetic findings to patients in standard clinical practice. In this study, we evaluated opportunities for genetically guided medication prescribing in a diverse health system and determined the frequencies of actionable PGx alleles in an ancestrally diverse biobank population. METHODS: A retrospective analysis of the Penn Medicine electronic health records (EHRs), which includes ~ 3.3 million patients between 2012 and 2020, provides a snapshot of the trends in prescriptions for drugs with genotype-based prescribing guidelines ('CPIC level A or B') in the Penn Medicine health system. The Penn Medicine BioBank (PMBB) consists of a diverse group of 43,359 participants whose EHRs are linked to genome-wide SNP array and whole exome sequencing (WES) data. We used the Pharmacogenomics Clinical Annotation Tool (PharmCAT), to annotate PGx alleles from PMBB variant call format (VCF) files and identify samples with actionable PGx alleles. RESULTS: We identified ~ 316.000 unique patients that were prescribed at least 2 drugs with CPIC Level A or B guidelines. Genetic analysis in PMBB identified that 98.9% of participants carry one or more PGx actionable alleles where treatment modification would be recommended. After linking the genetic data with prescription data from the EHR, 14.2% of participants (n = 6157) were prescribed medications that could be impacted by their genotype (as indicated by their PharmCAT report). For example, 856 participants received clopidogrel who carried CYP2C19 reduced function alleles, placing them at increased risk for major adverse cardiovascular events. When we stratified by genetic ancestry, we found disparities in PGx allele frequencies and clinical burden. Clopidogrel users of Asian ancestry in PMBB had significantly higher rates of CYP2C19 actionable alleles than European ancestry users of clopidrogrel (p < 0.0001, OR = 3.68). CONCLUSIONS: Clinically actionable PGx alleles are highly prevalent in our health system and many patients were prescribed medications that could be affected by PGx alleles. These results illustrate the potential utility of preemptive genotyping for tailoring of medications and implementation of PGx into routine clinical care.


Asunto(s)
Bancos de Muestras Biológicas , Farmacogenética , Humanos , Alelos , Citocromo P-450 CYP2C19 , Clopidogrel , Estudios Retrospectivos
5.
Genet Med ; 24(3): 601-609, 2022 03.
Artículo en Inglés | MEDLINE | ID: mdl-34906489

RESUMEN

PURPOSE: Genome-wide association studies have identified hundreds of single nucleotide variations (formerly single nucleotide polymorphisms) associated with several cancers, but the predictive ability of polygenic risk scores (PRSs) is unclear, especially among non-Whites. METHODS: PRSs were derived from genome-wide significant single-nucleotide variations for 15 cancers in 20,079 individuals in an academic biobank. We evaluated the improvement in discriminatory accuracy by including cancer-specific PRS in patients of genetically-determined African and European ancestry. RESULTS: Among the individuals of European genetic ancestry, PRSs for breast, colon, melanoma, and prostate were significantly associated with their respective cancers. Among the individuals of African genetic ancestry, PRSs for breast, colon, prostate, and thyroid were significantly associated with their respective cancers. The area under the curve of the model consisting of age, sex, and principal components was 0.621 to 0.710, and it increased by 1% to 4% with the inclusion of PRS in individuals of European genetic ancestry. In individuals of African genetic ancestry, area under the curve was overall higher in the model without the PRS (0.723-0.810) but increased by <1% with the inclusion of PRS for most cancers. CONCLUSION: PRS moderately increased the ability to discriminate the cancer status in individuals of European but not African ancestry. Further large-scale studies are needed to identify ancestry-specific genetic factors in non-White populations to incorporate PRS into cancer risk assessment.


Asunto(s)
Estudio de Asociación del Genoma Completo , Herencia Multifactorial , Neoplasias , Bancos de Muestras Biológicas , Población Negra/genética , Femenino , Predisposición Genética a la Enfermedad , Humanos , Masculino , Neoplasias/etnología , Neoplasias/genética , Factores de Riesgo , Población Blanca/genética
6.
Addict Biol ; 27(1): e13099, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34611967

RESUMEN

Polygenic risk scores (PRS) represent an individual's summed genetic risk for a trait and can serve as biomarkers for disease. Less is known about the utility of PRS as a means to quantify genetic risk for substance use disorders (SUDs) than for many other traits. Nonetheless, the growth of large, electronic health record-based biobanks makes it possible to evaluate the association of SUD PRS with other traits. We calculated PRS for smoking initiation, alcohol use disorder (AUD), and opioid use disorder (OUD) using summary statistics from the Million Veteran Program sample. We then tested the association of each PRS with its primary phenotype in the Penn Medicine BioBank (PMBB) using all available genotyped participants of African or European ancestry (AFR and EUR, respectively) (N = 18,612). Finally, we conducted phenome-wide association analyses (PheWAS) separately by ancestry and sex to test for associations across disease categories. Tobacco use disorder was the most common SUD in the PMBB, followed by AUD and OUD, consistent with the population prevalence of these disorders. All PRS were associated with their primary phenotype in both ancestry groups. PheWAS results yielded cross-trait associations across multiple domains, including psychiatric disorders and medical conditions. SUD PRS were associated with their primary phenotypes; however, they are not yet predictive enough to be useful diagnostically. The cross-trait associations of the SUD PRS are indicative of a broader genetic liability. Future work should extend findings to additional population groups and for other substances of abuse.


Asunto(s)
Comorbilidad , Registros Electrónicos de Salud/estadística & datos numéricos , Predisposición Genética a la Enfermedad/genética , Trastornos Relacionados con Sustancias/etnología , Trastornos Relacionados con Sustancias/genética , Adulto , Anciano , Anciano de 80 o más Años , Alcoholismo/etnología , Alcoholismo/genética , Población Negra/genética , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Herencia Multifactorial , Trastornos Relacionados con Opioides/etnología , Trastornos Relacionados con Opioides/genética , Fenotipo , Factores de Riesgo , Factores Sexuales , Tabaquismo/etnología , Tabaquismo/genética , Población Blanca/genética
7.
Am J Hum Genet ; 102(4): 592-608, 2018 04 05.
Artículo en Inglés | MEDLINE | ID: mdl-29606303

RESUMEN

Most phenome-wide association studies (PheWASs) to date have used a small to moderate number of SNPs for association with phenotypic data. We performed a large-scale single-cohort PheWAS, using electronic health record (EHR)-derived case-control status for 541 diagnoses using International Classification of Disease version 9 (ICD-9) codes and 25 median clinical laboratory measures. We calculated associations between these diagnoses and traits with ∼630,000 common frequency SNPs with minor allele frequency > 0.01 for 38,662 individuals. In this landscape PheWAS, we explored results within diseases and traits, comparing results to those previously reported in genome-wide association studies (GWASs), as well as previously published PheWASs. We further leveraged the context of functional impact from protein-coding to regulatory regions, providing a deeper interpretation of these associations. The comprehensive nature of this PheWAS allows for novel hypothesis generation, the identification of phenotypes for further study for future phenotypic algorithm development, and identification of cross-phenotype associations.


Asunto(s)
Técnicas de Laboratorio Clínico , Registros Electrónicos de Salud , Estudio de Asociación del Genoma Completo , Clasificación Internacional de Enfermedades , Cromatina/genética , ADN Intergénico/genética , Regulación de la Expresión Génica , Genoma Humano , Haplotipos/genética , Humanos , Anotación de Secuencia Molecular , Sistemas de Lectura Abierta/genética , Fenotipo , Reproducibilidad de los Resultados , Análisis de Secuencia de ARN
8.
PLoS Med ; 17(10): e1003288, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-33031386

RESUMEN

BACKGROUND: Observational studies have identified height as a strong risk factor for atrial fibrillation, but this finding may be limited by residual confounding. We aimed to examine genetic variation in height within the Mendelian randomization (MR) framework to determine whether height has a causal effect on risk of atrial fibrillation. METHODS AND FINDINGS: In summary-level analyses, MR was performed using summary statistics from genome-wide association studies of height (GIANT/UK Biobank; 693,529 individuals) and atrial fibrillation (AFGen; 65,446 cases and 522,744 controls), finding that each 1-SD increase in genetically predicted height increased the odds of atrial fibrillation (odds ratio [OR] 1.34; 95% CI 1.29 to 1.40; p = 5 × 10-42). This result remained consistent in sensitivity analyses with MR methods that make different assumptions about the presence of pleiotropy, and when accounting for the effects of traditional cardiovascular risk factors on atrial fibrillation. Individual-level phenome-wide association studies of height and a height genetic risk score were performed among 6,567 European-ancestry participants of the Penn Medicine Biobank (median age at enrollment 63 years, interquartile range 55-72; 38% female; recruitment 2008-2015), confirming prior observational associations between height and atrial fibrillation. Individual-level MR confirmed that each 1-SD increase in height increased the odds of atrial fibrillation, including adjustment for clinical and echocardiographic confounders (OR 1.89; 95% CI 1.50 to 2.40; p = 0.007). The main limitations of this study include potential bias from pleiotropic effects of genetic variants, and lack of generalizability of individual-level findings to non-European populations. CONCLUSIONS: In this study, we observed evidence that height is likely a positive causal risk factor for atrial fibrillation. Further study is needed to determine whether risk prediction tools including height or anthropometric risk factors can be used to improve screening and primary prevention of atrial fibrillation, and whether biological pathways involved in height may offer new targets for treatment of atrial fibrillation.


Asunto(s)
Fibrilación Atrial/genética , Estatura/genética , Adulto , Anciano , Causalidad , Femenino , Predicción , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Humanos , Masculino , Análisis de la Aleatorización Mendeliana/métodos , Persona de Mediana Edad , Oportunidad Relativa , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Factores de Riesgo , Población Blanca/genética
9.
Circulation ; 138(22): 2469-2481, 2018 11 27.
Artículo en Inglés | MEDLINE | ID: mdl-30571344

RESUMEN

BACKGROUND: Proteomic approaches allow measurement of thousands of proteins in a single specimen, which can accelerate biomarker discovery. However, applying these technologies to massive biobanks is not currently feasible because of the practical barriers and costs of implementing such assays at scale. To overcome these challenges, we used a "virtual proteomic" approach, linking genetically predicted protein levels to clinical diagnoses in >40 000 individuals. METHODS: We used genome-wide association data from the Framingham Heart Study (n=759) to construct genetic predictors for 1129 plasma protein levels. We validated the genetic predictors for 268 proteins and used them to compute predicted protein levels in 41 288 genotyped individuals in the Electronic Medical Records and Genomics (eMERGE) cohort. We tested associations for each predicted protein with 1128 clinical phenotypes. Lead associations were validated with directly measured protein levels and either low-density lipoprotein cholesterol or subclinical atherosclerosis in the MDCS (Malmö Diet and Cancer Study; n=651). RESULTS: In the virtual proteomic analysis in eMERGE, 55 proteins were associated with 89 distinct diagnoses at a false discovery rate q<0.1. Among these, 13 associations involved lipid (n=7) or atherosclerosis (n=6) phenotypes. We tested each association for validation in MDCS using directly measured protein levels. At Bonferroni-adjusted significance thresholds, levels of apolipoprotein E isoforms were associated with hyperlipidemia, and circulating C-type lectin domain family 1 member B and platelet-derived growth factor receptor-ß predicted subclinical atherosclerosis. Odds ratios for carotid atherosclerosis were 1.31 (95% CI, 1.08-1.58; P=0.006) per 1-SD increment in C-type lectin domain family 1 member B and 0.79 (0.66-0.94; P=0.008) per 1-SD increment in platelet-derived growth factor receptor-ß. CONCLUSIONS: We demonstrate a biomarker discovery paradigm to identify candidate biomarkers of cardiovascular and other diseases.


Asunto(s)
Biomarcadores/sangre , Enfermedades de las Arterias Carótidas/diagnóstico , Estudio de Asociación del Genoma Completo , Proteoma/análisis , Adulto , Anciano , Anciano de 80 o más Años , Enfermedades de las Arterias Carótidas/genética , Femenino , Genotipo , Humanos , Lectinas Tipo C/análisis , Masculino , Persona de Mediana Edad , Oportunidad Relativa , Fenotipo , Polimorfismo de Nucleótido Simple , Proteómica , Receptor beta de Factor de Crecimiento Derivado de Plaquetas/sangre
10.
Circ Res ; 120(2): 341-353, 2017 Jan 20.
Artículo en Inglés | MEDLINE | ID: mdl-27899403

RESUMEN

RATIONALE: Abdominal aortic aneurysm (AAA) is a complex disease with both genetic and environmental risk factors. Together, 6 previously identified risk loci only explain a small proportion of the heritability of AAA. OBJECTIVE: To identify additional AAA risk loci using data from all available genome-wide association studies. METHODS AND RESULTS: Through a meta-analysis of 6 genome-wide association study data sets and a validation study totaling 10 204 cases and 107 766 controls, we identified 4 new AAA risk loci: 1q32.3 (SMYD2), 13q12.11 (LINC00540), 20q13.12 (near PCIF1/MMP9/ZNF335), and 21q22.2 (ERG). In various database searches, we observed no new associations between the lead AAA single nucleotide polymorphisms and coronary artery disease, blood pressure, lipids, or diabetes mellitus. Network analyses identified ERG, IL6R, and LDLR as modifiers of MMP9, with a direct interaction between ERG and MMP9. CONCLUSIONS: The 4 new risk loci for AAA seem to be specific for AAA compared with other cardiovascular diseases and related traits suggesting that traditional cardiovascular risk factor management may only have limited value in preventing the progression of aneurysmal disease.


Asunto(s)
Aneurisma de la Aorta Abdominal/diagnóstico , Aneurisma de la Aorta Abdominal/genética , Sitios Genéticos/genética , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Aneurisma de la Aorta Abdominal/epidemiología , Predisposición Genética a la Enfermedad/epidemiología , Variación Genética/genética , Estudio de Asociación del Genoma Completo/tendencias , Humanos
11.
JAMA ; 322(22): 2191-2202, 2019 12 10.
Artículo en Inglés | MEDLINE | ID: mdl-31821430

RESUMEN

Importance: Hereditary transthyretin (TTR) amyloid cardiomyopathy (hATTR-CM) due to the TTR V122I variant is an autosomal-dominant disorder that causes heart failure in elderly individuals of African ancestry. The clinical associations of carrying the variant, its effect in other African ancestry populations including Hispanic/Latino individuals, and the rates of achieving a clinical diagnosis in carriers are unknown. Objective: To assess the association between the TTR V122I variant and heart failure and identify rates of hATTR-CM diagnosis among carriers with heart failure. Design, Setting, and Participants: Cross-sectional analysis of carriers and noncarriers of TTR V122I of African ancestry aged 50 years or older enrolled in the Penn Medicine Biobank between 2008 and 2017 using electronic health record data from 1996 to 2017. Case-control study in participants of African and Hispanic/Latino ancestry with and without heart failure in the Mount Sinai BioMe Biobank enrolled between 2007 and 2015 using electronic health record data from 2007 to 2018. Exposures: TTR V122I carrier status. Main Outcomes and Measures: The primary outcome was prevalent heart failure. The rate of diagnosis with hATTR-CM among TTR V122I carriers with heart failure was measured. Results: The cross-sectional cohort included 3724 individuals of African ancestry with a median age of 64 years (interquartile range, 57-71); 1755 (47%) were male, 2896 (78%) had a diagnosis of hypertension, and 753 (20%) had a history of myocardial infarction or coronary revascularization. There were 116 TTR V122I carriers (3.1%); 1121 participants (30%) had heart failure. The case-control study consisted of 2307 individuals of African ancestry and 3663 Hispanic/Latino individuals; the median age was 73 years (interquartile range, 68-80), 2271 (38%) were male, 4709 (79%) had a diagnosis of hypertension, and 1008 (17%) had a history of myocardial infarction or coronary revascularization. There were 1376 cases of heart failure. TTR V122I was associated with higher rates of heart failure (cross-sectional cohort: n = 51/116 TTR V122I carriers [44%], n = 1070/3608 noncarriers [30%], adjusted odds ratio, 1.7 [95% CI, 1.2-2.4], P = .006; case-control study: n = 36/1376 heart failure cases [2.6%], n = 82/4594 controls [1.8%], adjusted odds ratio, 1.8 [95% CI, 1.2-2.7], P = .008). Ten of 92 TTR V122I carriers with heart failure (11%) were diagnosed as having hATTR-CM; the median time from onset of symptoms to clinical diagnosis was 3 years. Conclusions and Relevance: Among individuals of African or Hispanic/Latino ancestry enrolled in 2 academic medical center-based biobanks, the TTR V122I genetic variant was significantly associated with heart failure.


Asunto(s)
Neuropatías Amiloides Familiares/genética , Negro o Afroamericano/genética , Insuficiencia Cardíaca/genética , Hispánicos o Latinos/genética , Prealbúmina/genética , Centros Médicos Académicos , Anciano , Neuropatías Amiloides Familiares/complicaciones , Neuropatías Amiloides Familiares/etnología , Bancos de Muestras Biológicas , Estudios de Casos y Controles , Estudios Transversales , Femenino , Variación Genética , Insuficiencia Cardíaca/etnología , Humanos , Masculino , Persona de Mediana Edad
12.
BMC Bioinformatics ; 19(1): 120, 2018 04 04.
Artículo en Inglés | MEDLINE | ID: mdl-29618318

RESUMEN

BACKGROUND: Phenome-wide association studies (PheWAS) are a high-throughput approach to evaluate comprehensive associations between genetic variants and a wide range of phenotypic measures. PheWAS has varying sample sizes for quantitative traits, and variable numbers of cases and controls for binary traits across the many phenotypes of interest, which can affect the statistical power to detect associations. The motivation of this study is to investigate the various parameters which affect the estimation of statistical power in PheWAS, including sample size, case-control ratio, minor allele frequency, and disease penetrance. RESULTS: We performed a PheWAS simulation study, where we investigated variations in statistical power based on different parameters, such as overall sample size, number of cases, case-control ratio, minor allele frequency, and disease penetrance. The simulation was performed on both binary and quantitative phenotypic measures. Our simulation on binary traits suggests that the number of cases has more impact on statistical power than the case to control ratio; also, we found that a sample size of 200 cases or more maintains the statistical power to identify associations for common variants. For quantitative traits, a sample size of 1000 or more individuals performed best in the power calculations. We focused on common genetic variants (MAF > 0.01) in this study; however, in future studies, we will be extending this effort to perform similar simulations on rare variants. CONCLUSIONS: This study provides a series of PheWAS simulation analyses that can be used to estimate statistical power for some potential scenarios. These results can be used to provide guidelines for appropriate study design for future PheWAS analyses.


Asunto(s)
Simulación por Computador , Enfermedad/genética , Estudios de Asociación Genética , Estudio de Asociación del Genoma Completo , Fenotipo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Algoritmos , Humanos
13.
Genet Epidemiol ; 41(4): 282-296, 2017 05.
Artículo en Inglés | MEDLINE | ID: mdl-28198095

RESUMEN

Disease risk estimation plays an important role in disease prevention. Many studies have found that the ability to predict risk improves as the number of risk single-nucleotide polymorphisms (SNPs) in the risk model increases. However, the width of the confidence interval of the risk estimate is often not considered in the evaluation of the risk model. Here, we explore how the risk and the confidence interval width change as more SNPs are added to the model in the order of decreasing effect size, using both simulated data and real data from studies of abdominal aortic aneurysms and age-related macular degeneration. Our results show that confidence interval width is positively correlated with model size and the majority of the bigger models have wider confidence interval widths than smaller models. Once the model size is bigger than a certain level, the risk does not shift markedly, as 100% of the risk estimates of the one-SNP-bigger models lie inside the confidence interval of the one-SNP-smaller models. We also created a confidence interval-augmented reclassification table. It shows that both more effective SNPs with larger odds ratios and less effective SNPs with smaller odds ratios contribute to the correct decision of whom to screen. The best screening strategy is selected and evaluated by the net benefit quantity and the reclassification rate. We suggest that individuals whose upper bound of their risk confidence interval is above the screening threshold, which corresponds to the population prevalence of the disease, should be screened.


Asunto(s)
Predisposición Genética a la Enfermedad , Modelos Genéticos , Anciano , Aneurisma de la Aorta Abdominal/genética , Simulación por Computador , Intervalos de Confianza , Bases de Datos Genéticas , Femenino , Humanos , Degeneración Macular/genética , Masculino , Persona de Mediana Edad , Oportunidad Relativa , Polimorfismo de Nucleótido Simple/genética , Factores de Riesgo , Tamaño de la Muestra
14.
Pharmacogenet Genomics ; 28(7): 179-187, 2018 07.
Artículo en Inglés | MEDLINE | ID: mdl-29847509

RESUMEN

OBJECTIVE: We characterized associations between central nervous system (CNS) adverse events and brain neurotransmitter transporter/receptor genomics among participants randomized to efavirenz-containing regimens in AIDS Clinical Trials Group studies in the USA. PARTICIPANTS AND METHODS: Four clinical trials randomly assigned treatment-naive participants to efavirenz-containing regimens. Genome-wide genotype and PrediXcan were used to infer gene expression levels in tissues including 10 brain regions. Multivariable regression models stratified by race/ethnicity were adjusted for CYP2B6/CYP2A6 genotypes that predict plasma efavirenz exposure, age, and sex. Combined analyses also adjusted for genetic ancestry. RESULTS: Analyses included 167 cases with grade 2 or greater efavirenz-consistent CNS adverse events within 48 weeks of study entry, and 653 efavirenz-tolerant controls. CYP2B6/CYP2A6 genotype level was independently associated with CNS adverse events (odds ratio: 1.07; P=0.044). Predicted expression of six genes postulated to mediate efavirenz CNS side effects (SLC6A2, SLC6A3, PGR, HTR2A, HTR2B, HTR6) were not associated with CNS adverse events after correcting for multiple testing, the lowest P value being for PGR in hippocampus (P=0.012), nor were polymorphisms in these genes or AR and HTR2C, the lowest P value being for rs12393326 in HTR2C (P=6.7×10(-4)). As a positive control, baseline plasma bilirubin concentration was associated with predicted liver UGT1A1 expression level (P=1.9×10(-27)). CONCLUSION: Efavirenz-related CNS adverse events were not associated with predicted neurotransmitter transporter/receptor gene expression levels in brain or with polymorphisms in these genes. Variable susceptibility to efavirenz-related CNS adverse events may not be explained by brain neurotransmitter transporter/receptor genomics.


Asunto(s)
Benzoxazinas/efectos adversos , Enfermedades del Sistema Nervioso Central/inducido químicamente , Enfermedades del Sistema Nervioso Central/genética , Infecciones por VIH/tratamiento farmacológico , Proteínas de Transporte de Neurotransmisores/genética , Polimorfismo de Nucleótido Simple , Receptores de Neurotransmisores/genética , Adulto , Alquinos , Ciclopropanos , Femenino , Genómica , Genotipo , VIH/efectos de los fármacos , Humanos , Masculino , Persona de Mediana Edad , Pruebas de Farmacogenómica , Inhibidores de la Transcriptasa Inversa/efectos adversos
16.
Pharmacogenet Genomics ; 27(3): 101-111, 2017 03.
Artículo en Inglés | MEDLINE | ID: mdl-28099408

RESUMEN

BACKGROUND: High-throughput approaches are increasingly being used to identify genetic associations across multiple phenotypes simultaneously. Here, we describe a pilot analysis that considered multiple on-treatment laboratory phenotypes from antiretroviral therapy-naive patients who were randomized to initiate antiretroviral regimens in a prospective clinical trial, AIDS Clinical Trials Group protocol A5202. PARTICIPANTS AND METHODS: From among 5 9545 294 polymorphisms imputed genome-wide, we analyzed 2544, including 2124 annotated in the PharmGKB, and 420 previously associated with traits in the GWAS Catalog. We derived 774 phenotypes on the basis of context from six variables: plasma atazanavir (ATV) pharmacokinetics, plasma efavirenz (EFV) pharmacokinetics, change in the CD4+ T-cell count, HIV-1 RNA suppression, fasting low-density lipoprotein-cholesterol, and fasting triglycerides. Permutation testing assessed the likelihood of associations being by chance alone. Pleiotropy was assessed for polymorphisms with the lowest P-values. RESULTS: This analysis included 1181 patients. At P less than 1.5×10, most associations were not by chance alone. Polymorphisms with the lowest P-values for EFV pharmacokinetics (CYPB26 rs3745274), low-density lipoprotein -cholesterol (APOE rs7412), and triglyceride (APOA5 rs651821) phenotypes had been associated previously with those traits in previous studies. The association between triglycerides and rs651821 was present with ATV-containing regimens, but not with EFV-containing regimens. Polymorphisms with the lowest P-values for ATV pharmacokinetics, CD4 T-cell count, and HIV-1 RNA phenotypes had not been reported previously to be associated with that trait. CONCLUSION: Using data from a prospective HIV clinical trial, we identified expected genetic associations, potentially novel associations, and at least one context-dependent association. This study supports high-throughput strategies that simultaneously explore multiple phenotypes from clinical trials' datasets for genetic associations.


Asunto(s)
Síndrome de Inmunodeficiencia Adquirida/tratamiento farmacológico , Antirretrovirales/administración & dosificación , Apolipoproteína A-V/genética , Apolipoproteínas E/genética , Citocromo P-450 CYP2B6/genética , Polimorfismo de Nucleótido Simple , Síndrome de Inmunodeficiencia Adquirida/genética , Adulto , Antirretrovirales/farmacocinética , Linfocitos T CD4-Positivos/citología , Femenino , Humanos , Recuento de Linfocitos , Masculino , Persona de Mediana Edad , Variantes Farmacogenómicas , Fenotipo , Proyectos Piloto , Estudios Prospectivos
17.
Hum Genet ; 136(2): 165-178, 2017 02.
Artículo en Inglés | MEDLINE | ID: mdl-27848076

RESUMEN

Genetic loci explain only 25-30 % of the heritability observed in plasma lipid traits. Epistasis, or gene-gene interactions may contribute to a portion of this missing heritability. Using the genetic data from five NHLBI cohorts of 24,837 individuals, we combined the use of the quantitative multifactor dimensionality reduction (QMDR) algorithm with two SNP-filtering methods to exhaustively search for SNP-SNP interactions that are associated with HDL cholesterol (HDL-C), LDL cholesterol (LDL-C), total cholesterol (TC) and triglycerides (TG). SNPs were filtered either on the strength of their independent effects (main effect filter) or the prior knowledge supporting a given interaction (Biofilter). After the main effect filter, QMDR identified 20 SNP-SNP models associated with HDL-C, 6 associated with LDL-C, 3 associated with TC, and 10 associated with TG (permutation P value <0.05). With the use of Biofilter, we identified 2 SNP-SNP models associated with HDL-C, 3 associated with LDL-C, 1 associated with TC and 8 associated with TG (permutation P value <0.05). In an independent dataset of 7502 individuals from the eMERGE network, we replicated 14 of the interactions identified after main effect filtering: 11 for HDL-C, 1 for LDL-C and 2 for TG. We also replicated 23 of the interactions found to be associated with TG after applying Biofilter. Prior knowledge supports the possible role of these interactions in the genetic etiology of lipid traits. This study also presents a computationally efficient pipeline for analyzing data from large genotyping arrays and detecting SNP-SNP interactions that are not primarily driven by strong main effects.


Asunto(s)
Enfermedades Cardiovasculares/genética , HDL-Colesterol/sangre , LDL-Colesterol/sangre , Epistasis Genética , Fenotipo , Triglicéridos/sangre , Índice de Masa Corporal , Enfermedades Cardiovasculares/sangre , Estudios de Cohortes , Femenino , Sitios Genéticos , Marcadores Genéticos , Genoma Humano , Técnicas de Genotipaje , Humanos , Modelos Lineales , Desequilibrio de Ligamiento , Masculino , Reducción de Dimensionalidad Multifactorial , Polimorfismo de Nucleótido Simple
18.
Genet Epidemiol ; 39(5): 376-84, 2015 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-25982363

RESUMEN

Bioinformatics approaches to examine gene-gene models provide a means to discover interactions between multiple genes that underlie complex disease. Extensive computational demands and adjusting for multiple testing make uncovering genetic interactions a challenge. Here, we address these issues using our knowledge-driven filtering method, Biofilter, to identify putative single nucleotide polymorphism (SNP) interaction models for cataract susceptibility, thereby reducing the number of models for analysis. Models were evaluated in 3,377 European Americans (1,185 controls, 2,192 cases) from the Marshfield Clinic, a study site of the Electronic Medical Records and Genomics (eMERGE) Network, using logistic regression. All statistically significant models from the Marshfield Clinic were then evaluated in an independent dataset of 4,311 individuals (742 controls, 3,569 cases), using independent samples from additional study sites in the eMERGE Network: Mayo Clinic, Group Health/University of Washington, Vanderbilt University Medical Center, and Geisinger Health System. Eighty-three SNP-SNP models replicated in the independent dataset at likelihood ratio test P < 0.05. Among the most significant replicating models was rs12597188 (intron of CDH1)-rs11564445 (intron of CTNNB1). These genes are known to be involved in processes that include: cell-to-cell adhesion signaling, cell-cell junction organization, and cell-cell communication. Further Biofilter analysis of all replicating models revealed a number of common functions among the genes harboring the 83 replicating SNP-SNP models, which included signal transduction and PI3K-Akt signaling pathway. These findings demonstrate the utility of Biofilter as a biology-driven method, applicable for any genome-wide association study dataset.


Asunto(s)
Catarata/genética , Biología Computacional/métodos , Interpretación Estadística de Datos , Registros Electrónicos de Salud , Interacción Gen-Ambiente , Modelos Genéticos , Factores de Edad , Estudios de Casos y Controles , Adhesión Celular , Femenino , Estudio de Asociación del Genoma Completo , Genómica/métodos , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple/genética , Grupos de Población/genética , Transducción de Señal , Programas Informáticos
19.
Mol Vis ; 20: 1281-95, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25352737

RESUMEN

PURPOSE: Cataract is the leading cause of blindness in the world, and in the United States accounts for approximately 60% of Medicare costs related to vision. The purpose of this study was to identify genetic markers for age-related cataract through a genome-wide association study (GWAS). METHODS: In the electronic medical records and genomics (eMERGE) network, we ran an electronic phenotyping algorithm on individuals in each of five sites with electronic medical records linked to DNA biobanks. We performed a GWAS using 530,101 SNPs from the Illumina 660W-Quad in a total of 7,397 individuals (5,503 cases and 1,894 controls). We also performed an age-at-diagnosis case-only analysis. RESULTS: We identified several statistically significant associations with age-related cataract (45 SNPs) as well as age at diagnosis (44 SNPs). The 45 SNPs associated with cataract at p<1×10(-5) are in several interesting genes, including ALDOB, MAP3K1, and MEF2C. All have potential biologic relationships with cataracts. CONCLUSIONS: This is the first genome-wide association study of age-related cataract, and several regions of interest have been identified. The eMERGE network has pioneered the exploration of genomic associations in biobanks linked to electronic health records, and this study is another example of the utility of such resources. Explorations of age-related cataract including validation and replication of the association results identified herein are needed in future studies.


Asunto(s)
Catarata/genética , Registros Electrónicos de Salud/estadística & datos numéricos , Fructosa-Bifosfato Aldolasa/genética , Predisposición Genética a la Enfermedad , Quinasa 1 de Quinasa de Quinasa MAP/genética , Polimorfismo de Nucleótido Simple , Factores de Edad , Anciano , Anciano de 80 o más Años , Algoritmos , Catarata/patología , Bases de Datos de Ácidos Nucleicos , Femenino , Marcadores Genéticos , Genoma Humano , Estudio de Asociación del Genoma Completo , Costos de la Atención en Salud , Humanos , Factores de Transcripción MEF2/genética , Masculino , Persona de Mediana Edad , Sitios de Carácter Cuantitativo , Estados Unidos
20.
medRxiv ; 2024 Jan 17.
Artículo en Inglés | MEDLINE | ID: mdl-38293092

RESUMEN

Importance: The effect of high percentage spliced in (hiPSI) TTN truncating variants (TTNtvs) on risk of dilated cardiomyopathy (DCM) has historically been studied among population subgroups defined by genetic similarity to European reference populations. This has raised questions about the effect of TTNtvs in diverse populations, especially among individuals genetically similar to African reference populations. Objective: To determine the effect of TTNtvs on risk of DCM in diverse population as measured by genetic distance (GD) in principal component (PC) space. Design: Cohort study. Setting: Penn Medicine Biobank (PMBB) is a large, diverse biobank. Participants: Participants were recruited from across the Penn Medicine healthcare system and volunteered to have their electronic health records linked to biospecimen data including DNA which has undergone whole exome sequencing. Main Outcomes and Measures: Risk of DCM among individuals carrying a hiPSI TTNtv. Results: Carrying a hiPSI TTNtv was associated with DCM among PMBB participants across a range of GD deciles from the 1000G European centroid; the effect estimates ranged from odds ratio (OR) = 3.29 (95% confidence interval [CI] 1.26 to 8.56) to OR = 9.39 (95% CI 3.82 to 23.13). When individuals were assigned to population subgroups based on genetic similarity to the 1000G reference populations, hiPSI TTNtvs conferred significant risk of DCM among those genetically similar to the 1000G European reference population (OR = 7.55, 95% CI 4.99 to 11.42, P<0.001) and individuals genetically similar to the 1000G African reference population (OR 3.50, 95% CI 1.48 to 8.24, P=0.004). Conclusions and Relevance: TTNtvs are associated with increased risk of DCM among a diverse cohort. There is no significant difference in effect of TTNtvs on DCM risk across deciles of GD from the 1000G European centroid, suggesting genetic background should not be considered when screening individuals for titin-related DCM.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA