Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 58
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Circulation ; 145(12): 877-891, 2022 03 22.
Article in English | MEDLINE | ID: mdl-34930020

ABSTRACT

BACKGROUND: Sequencing Mendelian arrhythmia genes in individuals without an indication for arrhythmia genetic testing can identify carriers of pathogenic or likely pathogenic (P/LP) variants. However, the extent to which these variants are associated with clinically meaningful phenotypes before or after return of variant results is unclear. In addition, the majority of discovered variants are currently classified as variants of uncertain significance, limiting clinical actionability. METHODS: The eMERGE-III study (Electronic Medical Records and Genomics Phase III) is a multicenter prospective cohort that included 21 846 participants without previous indication for cardiac genetic testing. Participants were sequenced for 109 Mendelian disease genes, including 10 linked to arrhythmia syndromes. Variant carriers were assessed with electronic health record-derived phenotypes and follow-up clinical examination. Selected variants of uncertain significance (n=50) were characterized in vitro with automated electrophysiology experiments in HEK293 cells. RESULTS: As previously reported, 3.0% of participants had P/LP variants in the 109 genes. Herein, we report 120 participants (0.6%) with P/LP arrhythmia variants. Compared with noncarriers, arrhythmia P/LP carriers had a significantly higher burden of arrhythmia phenotypes in their electronic health records. Fifty-four participants had variant results returned. Nineteen of these 54 participants had inherited arrhythmia syndrome diagnoses (primarily long-QT syndrome), and 12 of these 19 diagnoses were made only after variant results were returned (0.05%). After in vitro functional evaluation of 50 variants of uncertain significance, we reclassified 11 variants: 3 to likely benign and 8 to P/LP. CONCLUSIONS: Genome sequencing in a large population without indication for arrhythmia genetic testing identified phenotype-positive carriers of variants in congenital arrhythmia syndrome disease genes. As the genomes of large numbers of people are sequenced, the disease risk from rare variants in arrhythmia genes can be assessed by integrating genomic screening, electronic health record phenotypes, and in vitro functional studies. REGISTRATION: URL: https://www. CLINICALTRIALS: gov; Unique identifier; NCT03394859.


Subject(s)
Arrhythmias, Cardiac , Genetic Testing , Arrhythmias, Cardiac/diagnosis , Arrhythmias, Cardiac/genetics , Genetic Predisposition to Disease , Genetic Testing/methods , Genomics , HEK293 Cells , Humans , Phenotype , Prospective Studies
2.
Am J Hum Genet ; 107(3): 432-444, 2020 09 03.
Article in English | MEDLINE | ID: mdl-32758450

ABSTRACT

Accurate colorectal cancer (CRC) risk prediction models are critical for identifying individuals at low and high risk of developing CRC, as they can then be offered targeted screening and interventions to address their risks of developing disease (if they are in a high-risk group) and avoid unnecessary screening and interventions (if they are in a low-risk group). As it is likely that thousands of genetic variants contribute to CRC risk, it is clinically important to investigate whether these genetic variants can be used jointly for CRC risk prediction. In this paper, we derived and compared different approaches to generating predictive polygenic risk scores (PRS) from genome-wide association studies (GWASs) including 55,105 CRC-affected case subjects and 65,079 control subjects of European ancestry. We built the PRS in three ways, using (1) 140 previously identified and validated CRC loci; (2) SNP selection based on linkage disequilibrium (LD) clumping followed by machine-learning approaches; and (3) LDpred, a Bayesian approach for genome-wide risk prediction. We tested the PRS in an independent cohort of 101,987 individuals with 1,699 CRC-affected case subjects. The discriminatory accuracy, calculated by the age- and sex-adjusted area under the receiver operating characteristics curve (AUC), was highest for the LDpred-derived PRS (AUC = 0.654) including nearly 1.2 M genetic variants (the proportion of causal genetic variants for CRC assumed to be 0.003), whereas the PRS of the 140 known variants identified from GWASs had the lowest AUC (AUC = 0.629). Based on the LDpred-derived PRS, we are able to identify 30% of individuals without a family history as having risk for CRC similar to those with a family history of CRC, whereas the PRS based on known GWAS variants identified only top 10% as having a similar relative risk. About 90% of these individuals have no family history and would have been considered average risk under current screening guidelines, but might benefit from earlier screening. The developed PRS offers a way for risk-stratified CRC screening and other targeted interventions.


Subject(s)
Colorectal Neoplasms/epidemiology , Genetic Predisposition to Disease , Genome, Human/genetics , Risk Assessment , Aged , Asian People/genetics , Bayes Theorem , Colorectal Neoplasms/genetics , Colorectal Neoplasms/pathology , Female , Genome-Wide Association Study , Humans , Male , Middle Aged , Multifactorial Inheritance/genetics , Polymorphism, Single Nucleotide/genetics , Risk Factors
3.
J Allergy Clin Immunol ; 150(5): 1086-1096, 2022 Nov.
Article in English | MEDLINE | ID: mdl-35595084

ABSTRACT

BACKGROUND: Asthma is the most common chronic condition in children and the third leading cause of hospitalization in pediatrics. The genome-wide association study catalog reports 140 studies with genome-wide significance. A polygenic risk score (PRS) with predictive value across ancestries has not been evaluated for this important trait. OBJECTIVES: This study aimed to train and validate a PRS relying on genetic determinants for asthma to provide predictions for disease occurrence in pediatric cohorts of diverse ancestries. METHODS: This study applied a Bayesian regression framework method using the Trans-National Asthma Genetic Consortium genome-wide association study summary statistics to derive a multiancestral PRS score, used one Electronic Medical Records and Genomics (eMERGE) cohort as a training set, used a second independent eMERGE cohort to validate the score, and used the UK Biobank data to replicate the findings. A phenome-wide association study was performed using the PRS to identify shared genetic etiology with other phenotypes. RESULTS: The multiancestral asthma PRS was associated with asthma in the 2 pediatric validation datasets. Overall, the multiancestral asthma PRS has an area under the curve (AUC) of 0.70 (95% CI, 0.69-0.72) in the pediatric validation 1 and AUC of 0.66 (0.65-0.66) in the pediatric validation 2 datasets. We found significant discrimination across pediatric subcohorts of European (AUC, 95% CI, 0.60 and 0.66), African (AUC, 95% CI, 0.61 and 0.66), admixed American (AUC, 0.64 and 0.70), Southeast Asian (AUC, 0.65), and East Asian (AUC, 0.73) ancestry. Pediatric participants with the top 5% PRS had 2.80 to 5.82 increased odds of asthma compared to the bottom 5% across the training, validation 1, and validation 2 cohorts when adjusted for ancestry. Phenome-wide association study analysis confirmed the strong association of the identified PRS with asthma (odds ratio, 2.71, PFDR = 3.71 × 10-65) and related phenotypes. CONCLUSIONS: A multiancestral PRS for asthma based on Bayesian posterior genomic effect sizes identifies increased odds of pediatric asthma.


Subject(s)
Asthma , Genome-Wide Association Study , Humans , Child , Genome-Wide Association Study/methods , Multifactorial Inheritance , Genetic Predisposition to Disease , Polymorphism, Single Nucleotide , Bayes Theorem , Risk Factors , Asthma/genetics
4.
Genet Med ; 24(10): 2123-2133, 2022 10.
Article in English | MEDLINE | ID: mdl-35943490

ABSTRACT

PURPOSE: We estimated the penetrance of pathogenic/likely pathogenic (P/LP) variants in arteriopathy-related genes and assessed near-term outcomes following return of results. METHODS: Participants (N = 24,520) in phase III of the Electronic Medical Records and Genomics network underwent targeted sequencing of 68 actionable genes, including 9 genes associated with arterial aneurysmal diseases. Penetrance was estimated on the basis of the presence of relevant clinical traits. Outcomes occurring within 1 year of return of results included new diagnoses, referral to a specialist, new tests ordered, surveillance initiated, and new medications started. RESULTS: P/LP variants were present in 34 participants. The average penetrance across genes was 59%, ranging from 86% for FBN1 variants to 25% for SMAD3. Of 16 participants in whom results were returned, 1-year outcomes occurred in 63%. A new diagnosis was made in 44% of the participants, 56% were referred to a specialist, a new test was ordered in 44%, surveillance was initiated in 31%, and a new medication was started in 31%. CONCLUSION: Penetrance of P/LP variants in arteriopathy-related genes, identified in a large, targeted sequencing study, was variable and overall lower than that reported in clinical cohorts. Meaningful outcomes within the first year were noted in 63% of participants who received results.


Subject(s)
Genomics , Humans , Penetrance , Phenotype
5.
Hum Mutat ; 42(12): 1602-1614, 2021 12.
Article in English | MEDLINE | ID: mdl-34467602

ABSTRACT

Preterm birth (PTB), or birth that occurs earlier than 37 weeks of gestational age, is a major contributor to infant mortality and neonatal hospitalization. Mutations in the mitochondrial genome (mtDNA) have been linked to various rare mitochondrial disorders and may be a contributing factor in PTB given that maternal genetic factors have been strongly linked to PTB. However, to date, no study has found a conclusive connection between a particular mtDNA variant and PTB. Given the high mtDNA copy number per cell, an automated pipeline was developed for detecting mtDNA variants using low-coverage whole-genome sequencing (lcWGS) data. The pipeline was first validated against samples of known heteroplasmy, and then applied to 929 samples from a PTB cohort from diverse ethnic backgrounds with an average gestational age of 27.18 weeks (range: 21-30). Our new pipeline successfully identified haplogroups and a large number of mtDNA variants in this large PTB cohort, including 8 samples carrying known pathogenic variants and 47 samples carrying rare mtDNA variants. These results confirm that lcWGS can be utilized to reliably identify mtDNA variants. These mtDNA variants may make a contribution toward preterm birth in a small proportion of live births.


Subject(s)
Genome, Mitochondrial , Premature Birth , DNA, Mitochondrial/genetics , Humans , Infant , Infant, Newborn , Mitochondria/genetics , Premature Birth/genetics , Whole Genome Sequencing
6.
Int J Obes (Lond) ; 45(1): 155-169, 2021 01.
Article in English | MEDLINE | ID: mdl-32952152

ABSTRACT

BACKGROUND/OBJECTIVES: Melanocortin-4 receptor (MC4R) plays an essential role in food intake and energy homeostasis. More than 170 MC4R variants have been described over the past two decades, with conflicting reports regarding the prevalence and phenotypic effects of these variants in diverse cohorts. To determine the frequency of MC4R variants in large cohort of different ancestries, we evaluated the MC4R coding region for 20,537 eMERGE participants with sequencing data plus additional 77,454 independent individuals with genome-wide genotyping data at this locus. SUBJECTS/METHODS: The sequencing data were obtained from the eMERGE phase III study, in which multisample variant call format calls have been generated, curated, and annotated. In addition to penetrance estimation using body mass index (BMI) as a binary outcome, GWAS and PheWAS were performed using median BMI in linear regression analyses. All results were adjusted for principal components, age, sex, and sites of genotyping. RESULTS: Targeted sequencing data of MC4R revealed 125 coding variants in 1839 eMERGE participants including 30 unreported coding variants that were predicted to be functionally damaging. Highly penetrant unreported variants included (L325I, E308K, D298N, S270F, F261L, T248A, D111V, and Y80F) in which seven participants had obesity class III defined as BMI ≥ 40 kg/m2. In GWAS analysis, in addition to known risk haplotype upstream of MC4R (best variant rs6567160 (P = 5.36 × 10-25, Beta = 0.37), a novel rare haplotype was detected which was protective against obesity and encompassed the V103I variant with known gain-of-function properties (P = 6.23 × 10-08, Beta = -0.62). PheWAS analyses extended this protective effect of V103I to type 2 diabetes, diabetic nephropathy, and chronic renal failure independent of BMI. CONCLUSIONS: MC4R screening in a large eMERGE cohort confirmed many previous findings, extend the MC4R pleotropic effects, and discovered additional MC4R rare alleles that probably contribute to obesity.


Subject(s)
Genetic Variation/genetics , Genome-Wide Association Study , Obesity , Receptor, Melanocortin, Type 4/genetics , Adult , Aged , Body Mass Index , Cohort Studies , Female , Humans , Male , Middle Aged , Obesity/epidemiology , Obesity/genetics
7.
Hum Mol Genet ; 27(13): 2392-2404, 2018 07 01.
Article in English | MEDLINE | ID: mdl-29912393

ABSTRACT

Systemic lupus erythematosus (SLE or lupus) (OMIM: 152700) is a chronic autoimmune disease with debilitating inflammation that affects multiple organ systems. The STAT1-STAT4 locus is one of the first and most highly replicated genetic loci associated with lupus risk. We performed a fine-mapping study to identify plausible causal variants within the STAT1-STAT4 locus associated with increased lupus disease risk. Using complementary frequentist and Bayesian approaches in trans-ancestral Discovery and Replication cohorts, we found one variant whose association with lupus risk is supported across ancestries in both the Discovery and Replication cohorts: rs11889341. In B cell lines from patients with lupus and healthy controls, the lupus risk allele of rs11889341 was associated with increased STAT1 expression. We demonstrated that the transcription factor HMGA1, a member of the HMG transcription factor family with an AT-hook DNA-binding domain, has enriched binding to the risk allele compared with the non-risk allele of rs11889341. We identified a genotype-dependent repressive element in the DNA within the intron of STAT4 surrounding rs11889341. Consistent with expression quantitative trait locus (eQTL) analysis, the lupus risk allele of rs11889341 decreased the activity of this putative repressor. Altogether, we present a plausible molecular mechanism for increased lupus risk at the STAT1-STAT4 locus in which the risk allele of rs11889341, the most probable causal variant, leads to elevated STAT1 expression in B cells due to decreased repressor activity mediated by increased binding of HMGA1.


Subject(s)
Alleles , Lupus Erythematosus, Systemic/genetics , Polymorphism, Genetic , Quantitative Trait Loci , STAT1 Transcription Factor/genetics , STAT4 Transcription Factor/genetics , Female , Humans , Lupus Erythematosus, Systemic/epidemiology , Male , Risk Factors
8.
World J Surg ; 44(1): 84-94, 2020 01.
Article in English | MEDLINE | ID: mdl-31605180

ABSTRACT

BACKGROUND: The extent to which obesity and genetics determine postoperative complications is incompletely understood. METHODS: We performed a retrospective study using two population cohorts with electronic health record (EHR) data. The first included 736,726 adults with body mass index (BMI) recorded between 1990 and 2017 at Vanderbilt University Medical Center. The second cohort consisted of 65,174 individuals from 12 institutions contributing EHR and genome-wide genotyping data to the Electronic Medical Records and Genomics (eMERGE) Network. Pairwise logistic regression analyses were used to measure the association of BMI categories with postoperative complications derived from International Classification of Disease-9 codes, including postoperative infection, incisional hernia, and intestinal obstruction. A genetic risk score was constructed from 97 obesity-risk single-nucleotide polymorphisms for a Mendelian randomization study to determine the association of genetic risk of obesity on postoperative complications. Logistic regression analyses were adjusted for sex, age, site, and race/principal components. RESULTS: Individuals with overweight or obese BMI (≥25 kg/m2) had increased risk of incisional hernia (odds ratio [OR] 1.7-5.5, p < 3.1 × 10-20), and people with obesity (BMI ≥ 30 kg/m2) had increased risk of postoperative infection (OR 1.2-2.3, p < 2.5 × 10-5). In the eMERGE cohort, genetically predicted BMI was associated with incisional hernia (OR 2.1 [95% CI 1.8-2.5], p = 1.4 × 10-6) and postoperative infection (OR 1.6 [95% CI 1.4-1.9], p = 3.1 × 10-6). Association findings were similar after limitation of the cohorts to those who underwent abdominal procedures. CONCLUSIONS: Clinical and Mendelian randomization studies suggest that obesity, as measured by BMI, is associated with the development of postoperative incisional hernia and infection.


Subject(s)
Mendelian Randomization Analysis/methods , Obesity/complications , Postoperative Complications/genetics , Adult , Body Mass Index , Female , Humans , Logistic Models , Male , Middle Aged , Polymorphism, Single Nucleotide , Postoperative Complications/etiology , Retrospective Studies , Risk Factors
9.
Circulation ; 138(22): 2469-2481, 2018 11 27.
Article in English | MEDLINE | ID: mdl-30571344

ABSTRACT

BACKGROUND: Proteomic approaches allow measurement of thousands of proteins in a single specimen, which can accelerate biomarker discovery. However, applying these technologies to massive biobanks is not currently feasible because of the practical barriers and costs of implementing such assays at scale. To overcome these challenges, we used a "virtual proteomic" approach, linking genetically predicted protein levels to clinical diagnoses in >40 000 individuals. METHODS: We used genome-wide association data from the Framingham Heart Study (n=759) to construct genetic predictors for 1129 plasma protein levels. We validated the genetic predictors for 268 proteins and used them to compute predicted protein levels in 41 288 genotyped individuals in the Electronic Medical Records and Genomics (eMERGE) cohort. We tested associations for each predicted protein with 1128 clinical phenotypes. Lead associations were validated with directly measured protein levels and either low-density lipoprotein cholesterol or subclinical atherosclerosis in the MDCS (Malmö Diet and Cancer Study; n=651). RESULTS: In the virtual proteomic analysis in eMERGE, 55 proteins were associated with 89 distinct diagnoses at a false discovery rate q<0.1. Among these, 13 associations involved lipid (n=7) or atherosclerosis (n=6) phenotypes. We tested each association for validation in MDCS using directly measured protein levels. At Bonferroni-adjusted significance thresholds, levels of apolipoprotein E isoforms were associated with hyperlipidemia, and circulating C-type lectin domain family 1 member B and platelet-derived growth factor receptor-ß predicted subclinical atherosclerosis. Odds ratios for carotid atherosclerosis were 1.31 (95% CI, 1.08-1.58; P=0.006) per 1-SD increment in C-type lectin domain family 1 member B and 0.79 (0.66-0.94; P=0.008) per 1-SD increment in platelet-derived growth factor receptor-ß. CONCLUSIONS: We demonstrate a biomarker discovery paradigm to identify candidate biomarkers of cardiovascular and other diseases.


Subject(s)
Biomarkers/blood , Carotid Artery Diseases/diagnosis , Genome-Wide Association Study , Proteome/analysis , Adult , Aged , Aged, 80 and over , Carotid Artery Diseases/genetics , Female , Genotype , Humans , Lectins, C-Type/analysis , Male , Middle Aged , Odds Ratio , Phenotype , Polymorphism, Single Nucleotide , Proteomics , Receptor, Platelet-Derived Growth Factor beta/blood
11.
BMC Med ; 17(1): 135, 2019 07 17.
Article in English | MEDLINE | ID: mdl-31311600

ABSTRACT

BACKGROUND: Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver illness with a genetically heterogeneous background that can be accompanied by considerable morbidity and attendant health care costs. The pathogenesis and progression of NAFLD is complex with many unanswered questions. We conducted genome-wide association studies (GWASs) using both adult and pediatric participants from the Electronic Medical Records and Genomics (eMERGE) Network to identify novel genetic contributors to this condition. METHODS: First, a natural language processing (NLP) algorithm was developed, tested, and deployed at each site to identify 1106 NAFLD cases and 8571 controls and histological data from liver tissue in 235 available participants. These include 1242 pediatric participants (396 cases, 846 controls). The algorithm included billing codes, text queries, laboratory values, and medication records. Next, GWASs were performed on NAFLD cases and controls and case-only analyses using histologic scores and liver function tests adjusting for age, sex, site, ancestry, PC, and body mass index (BMI). RESULTS: Consistent with previous results, a robust association was detected for the PNPLA3 gene cluster in participants with European ancestry. At the PNPLA3-SAMM50 region, three SNPs, rs738409, rs738408, and rs3747207, showed strongest association (best SNP rs738409 p = 1.70 × 10- 20). This effect was consistent in both pediatric (p = 9.92 × 10- 6) and adult (p = 9.73 × 10- 15) cohorts. Additionally, this variant was also associated with disease severity and NAFLD Activity Score (NAS) (p = 3.94 × 10- 8, beta = 0.85). PheWAS analysis link this locus to a spectrum of liver diseases beyond NAFLD with a novel negative correlation with gout (p = 1.09 × 10- 4). We also identified novel loci for NAFLD disease severity, including one novel locus for NAS score near IL17RA (rs5748926, p = 3.80 × 10- 8), and another near ZFP90-CDH1 for fibrosis (rs698718, p = 2.74 × 10- 11). Post-GWAS and gene-based analyses identified more than 300 genes that were used for functional and pathway enrichment analyses. CONCLUSIONS: In summary, this study demonstrates clear confirmation of a previously described NAFLD risk locus and several novel associations. Further collaborative studies including an ethnically diverse population with well-characterized liver histologic features of NAFLD are needed to further validate the novel findings.


Subject(s)
Non-alcoholic Fatty Liver Disease/genetics , Adult , Aged , Body Mass Index , Case-Control Studies , Community Networks/organization & administration , Community Networks/statistics & numerical data , Disease Progression , Electronic Health Records/organization & administration , Electronic Health Records/statistics & numerical data , Female , Genetic Predisposition to Disease , Genome-Wide Association Study , Genomics/organization & administration , Genomics/statistics & numerical data , Humans , Lipase/genetics , Male , Membrane Proteins/genetics , Middle Aged , Morbidity , Non-alcoholic Fatty Liver Disease/epidemiology , Phenotype , Polymorphism, Single Nucleotide , Signal Transduction/genetics
12.
Am J Hum Genet ; 96(5): 731-9, 2015 May 07.
Article in English | MEDLINE | ID: mdl-25865496

ABSTRACT

Genetic variants at chromosomal region 11q23.3, near the gene ETS1, have been associated with systemic lupus erythematosus (SLE), or lupus, in independent cohorts of Asian ancestry. Several recent studies have implicated ETS1 as a critical driver of immune cell function and differentiation, and mice deficient in ETS1 develop an SLE-like autoimmunity. We performed a fine-mapping study of 14,551 subjects from multi-ancestral cohorts by starting with genotyped variants and imputing to all common variants spanning ETS1. By constructing genetic models via frequentist and Bayesian association methods, we identified 16 variants that are statistically likely to be causal. We functionally assessed each of these variants on the basis of their likelihood of affecting transcription factor binding, miRNA binding, or chromatin state. Of the four variants that we experimentally examined, only rs6590330 differentially binds lysate from B cells. Using mass spectrometry, we found more binding of the transcription factor signal transducer and activator of transcription 1 (STAT1) to DNA near the risk allele of rs6590330 than near the non-risk allele. Immunoblot analysis and chromatin immunoprecipitation of pSTAT1 in B cells heterozygous for rs6590330 confirmed that the risk allele increased binding to the active form of STAT1. Analysis with expression quantitative trait loci indicated that the risk allele of rs6590330 is associated with decreased ETS1 expression in Han Chinese, but not other ancestral cohorts. We propose a model in which the risk allele of rs6590330 is associated with decreased ETS1 expression and increases SLE risk by enhancing the binding of pSTAT1.


Subject(s)
Genetic Predisposition to Disease , Lupus Erythematosus, Systemic/genetics , Proto-Oncogene Protein c-ets-1/genetics , STAT1 Transcription Factor/genetics , Alleles , Animals , Asian People , Bayes Theorem , Genotype , Haplotypes , Humans , Mice , Protein Binding , Proto-Oncogene Protein c-ets-1/metabolism , STAT1 Transcription Factor/metabolism
13.
Hum Mol Genet ; 24(2): 582-96, 2015 Jan 15.
Article in English | MEDLINE | ID: mdl-25205108

ABSTRACT

Exploiting genotyping, DNA sequencing, imputation and trans-ancestral mapping, we used Bayesian and frequentist approaches to model the IRF5-TNPO3 locus association, now implicated in two immunotherapies and seven autoimmune diseases. Specifically, in systemic lupus erythematosus (SLE), we resolved separate associations in the IRF5 promoter (all ancestries) and with an extended European haplotype. We captured 3230 IRF5-TNPO3 high-quality, common variants across 5 ethnicities in 8395 SLE cases and 7367 controls. The genetic effect from the IRF5 promoter can be explained by any one of four variants in 5.7 kb (P-valuemeta = 6 × 10(-49); OR = 1.38-1.97). The second genetic effect spanned an 85.5-kb, 24-variant haplotype that included the genes IRF5 and TNPO3 (P-valuesEU = 10(-27)-10(-32), OR = 1.7-1.81). Many variants at the IRF5 locus with previously assigned biological function are not members of either final credible set of potential causal variants identified herein. In addition to the known biologically functional variants, we demonstrated that the risk allele of rs4728142, a variant in the promoter among the lowest frequentist probability and highest Bayesian posterior probability, was correlated with IRF5 expression and differentially binds the transcription factor ZBTB3. Our analytical strategy provides a novel framework for future studies aimed at dissecting etiological genetic effects. Finally, both SLE elements of the statistical model appear to operate in Sjögren's syndrome and systemic sclerosis whereas only the IRF5-TNPO3 gene-spanning haplotype is associated with primary biliary cirrhosis, demonstrating the nuance of similarity and difference in autoimmune disease risk mechanisms at IRF5-TNPO3.


Subject(s)
Interferon Regulatory Factors/genetics , Lupus Erythematosus, Systemic/genetics , beta Karyopherins/genetics , Autoimmune Diseases/genetics , Bayes Theorem , Case-Control Studies , Cohort Studies , DNA-Binding Proteins/genetics , Haplotypes , Humans , Male , Polymorphism, Single Nucleotide , Promoter Regions, Genetic
14.
Sci Rep ; 14(1): 14009, 2024 06 18.
Article in English | MEDLINE | ID: mdl-38890458

ABSTRACT

Type 2 diabetes (T2D) is caused by both genetic and environmental factors and is associated with an increased risk of cardiorenal complications and mortality. Though disproportionately affected by the condition, African Americans (AA) are largely underrepresented in genetic studies of T2D, and few estimates of heritability have been calculated in this race group. Using genome-wide association study (GWAS) data paired with phenotypic data from ~ 19,300 AA participants of the Reasons for Geographic and Racial Differences in Stroke (REGARDS) study, Genetics of Hypertension Associated Treatments (GenHAT) study, and the Electronic Medical Records and Genomics (eMERGE) network, we estimated narrow-sense heritability using two methods: Linkage-Disequilibrium Adjusted Kinships (LDAK) and Genome-Wide Complex Trait Analysis (GCTA). Study-level heritability estimates adjusting for age, sex, and genetic ancestry ranged from 18% to 34% across both methods. Overall, the current study narrows the expected range for T2D heritability in this race group compared to prior estimates, while providing new insight into the genetic basis of T2D in AAs for ongoing genetic discovery efforts.


Subject(s)
Black or African American , Diabetes Mellitus, Type 2 , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Diabetes Mellitus, Type 2/genetics , Black or African American/genetics , Male , Female , Middle Aged , Aged , Polymorphism, Single Nucleotide , Linkage Disequilibrium , Phenotype , Multifactorial Inheritance/genetics
15.
Nat Med ; 30(2): 480-487, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38374346

ABSTRACT

Polygenic risk scores (PRSs) have improved in predictive performance, but several challenges remain to be addressed before PRSs can be implemented in the clinic, including reduced predictive performance of PRSs in diverse populations, and the interpretation and communication of genetic results to both providers and patients. To address these challenges, the National Human Genome Research Institute-funded Electronic Medical Records and Genomics (eMERGE) Network has developed a framework and pipeline for return of a PRS-based genome-informed risk assessment to 25,000 diverse adults and children as part of a clinical study. From an initial list of 23 conditions, ten were selected for implementation based on PRS performance, medical actionability and potential clinical utility, including cardiometabolic diseases and cancer. Standardized metrics were considered in the selection process, with additional consideration given to strength of evidence in African and Hispanic populations. We then developed a pipeline for clinical PRS implementation (score transfer to a clinical laboratory, validation and verification of score performance), and used genetic ancestry to calibrate PRS mean and variance, utilizing genetically diverse data from 13,475 participants of the All of Us Research Program cohort to train and test model parameters. Finally, we created a framework for regulatory compliance and developed a PRS clinical report for return to providers and for inclusion in an additional genome-informed risk assessment. The initial experience from eMERGE can inform the approach needed to implement PRS-based testing in diverse clinical settings.


Subject(s)
Chronic Disease , Genetic Risk Score , Population Health , Adult , Child , Humans , Communication , Genetic Predisposition to Disease , Genome-Wide Association Study , Risk Factors , United States
16.
Arthritis Rheum ; 64(6): 1960-9, 2012 Jun.
Article in English | MEDLINE | ID: mdl-22231568

ABSTRACT

OBJECTIVE: Systemic lupus erythematosus (SLE) is a heterogeneous autoimmune disease with significant immune system aberrations resulting from complex heritable genetics as well as environmental factors. We undertook to study the role of TRAF6 as a candidate gene for SLE, since it plays a major role in several signaling pathways that are important for immunity and organ development. METHODS: Fifteen single-nucleotide polymorphisms (SNPs) across TRAF6 were evaluated in 7,490 SLE patients and 6,780 control subjects from different ancestries. Population-based case-control association analyses and meta-analyses were performed. P values, false discovery rate q values, and odds ratios (ORs) with 95% confidence intervals (95% CIs) were calculated. RESULTS: Evidence of associations was detected in multiple SNPs. The best overall P values were obtained for SNPs rs5030437 and rs4755453 (P = 7.85 × 10(-5) and P = 4.73 × 10(-5) , respectively) without significant heterogeneity among populations (P = 0.67 and P = 0.50, respectively, in Q statistic). In addition, SNP rs540386, which was previously reported to be associated with rheumatoid arthritis (RA), was found to be in linkage disequilibrium with these 2 SNPs (r(2) = 0.95) and demonstrated evidence of association with SLE in the same direction (meta-analysis P = 9.15 × 10(-4) , OR 0.89 [95% CI 0.83-0.95]). The presence of thrombocytopenia improved the overall results in different populations (meta-analysis P = 1.99 × 10(-6) , OR 0.57 [95% CI 0.45-0.72], for rs5030470). Finally, evidence of family-based association in 34 African American pedigrees with the presence of thrombocytopenia was detected in 1 available SNP (rs5030437) with a Z score magnitude of 2.28 (P = 0.02) under a dominant model. CONCLUSION: Our data indicate the presence of association of TRAF6 with SLE, consistent with the previous report of association with RA. These data provide further support for the involvement of TRAF6 in the pathogenesis of autoimmunity.


Subject(s)
Lupus Erythematosus, Systemic/genetics , Polymorphism, Single Nucleotide , TNF Receptor-Associated Factor 6/genetics , Alleles , Case-Control Studies , Cohort Studies , Female , Gene Frequency , Genetic Association Studies , Genetic Predisposition to Disease , Genotype , Haplotypes , Humans , Male
17.
Pac Symp Biocomput ; 28: 437-448, 2023.
Article in English | MEDLINE | ID: mdl-36540998

ABSTRACT

Polygenic risk scores (PRS) have led to enthusiasm for precision medicine. However, it is well documented that PRS do not generalize across groups differing in ancestry or sample characteristics e.g., age. Quantifying performance of PRS across different groups of study participants, using genome-wide association study (GWAS) summary statistics from multiple ancestry groups and sample sizes, and using different linkage disequilibrium (LD) reference panels may clarify which factors are limiting PRS transferability. To evaluate these factors in the PRS generation process, we generated body mass index (BMI) PRS (PRSBMI) in the Electronic Medical Records and Genomics (eMERGE) network (N=75,661). Analyses were conducted in two ancestry groups (European and African) and three age ranges (adult, teenagers, and children). For PRSBMI calculations, we evaluated five LD reference panels and three sets of GWAS summary statistics of varying sample size and ancestry. PRSBMI performance increased for both African and European ancestry individuals using cross-ancestry GWAS summary statistics compared to European-only summary statistics (6.3% and 3.7% relative R2 increase, respectively, pAfrican=0.038, pEuropean=6.26x10-4). The effects of LD reference panels were more pronounced in African ancestry study datasets. PRSBMI performance degraded in children; R2 was less than half of teenagers or adults. The effect of GWAS summary statistics sample size was small when modeled with the other factors. Additionally, the potential of using a PRS generated for one trait to predict risk for comorbid diseases is not well understood especially in the context of cross-ancestry analyses - we explored clinical comorbidities from the electronic health record associated with PRSBMI and identified significant associations with type 2 diabetes and coronary atherosclerosis. In summary, this study quantifies the effects that ancestry, GWAS summary statistic sample size, and LD reference panel have on PRS performance, especially in cross-ancestry and age-specific analyses.


Subject(s)
Diabetes Mellitus, Type 2 , Adult , Adolescent , Child , Humans , Diabetes Mellitus, Type 2/genetics , Body Mass Index , Genome-Wide Association Study , Genetic Predisposition to Disease , Computational Biology , Risk Factors , Multifactorial Inheritance
18.
Nat Commun ; 14(1): 5562, 2023 09 09.
Article in English | MEDLINE | ID: mdl-37689782

ABSTRACT

Genes act in concert with each other in specific contexts to perform their functions. Determining how these genes influence complex traits requires a mechanistic understanding of expression regulation across different conditions. It has been shown that this insight is critical for developing new therapies. Transcriptome-wide association studies have helped uncover the role of individual genes in disease-relevant mechanisms. However, modern models of the architecture of complex traits predict that gene-gene interactions play a crucial role in disease origin and progression. Here we introduce PhenoPLIER, a computational approach that maps gene-trait associations and pharmacological perturbation data into a common latent representation for a joint analysis. This representation is based on modules of genes with similar expression patterns across the same conditions. We observe that diseases are significantly associated with gene modules expressed in relevant cell types, and our approach is accurate in predicting known drug-disease pairs and inferring mechanisms of action. Furthermore, using a CRISPR screen to analyze lipid regulation, we find that functionally important players lack associations but are prioritized in trait-associated modules by PhenoPLIER. By incorporating groups of co-expressed genes, PhenoPLIER can contextualize genetic associations and reveal potential targets missed by single-gene strategies.


Subject(s)
Clustered Regularly Interspaced Short Palindromic Repeats , Epistasis, Genetic , Causality , Gene Regulatory Networks , Transcriptome
19.
JAMIA Open ; 6(2): ooad032, 2023 Jul.
Article in English | MEDLINE | ID: mdl-37181728

ABSTRACT

With the burgeoning development of computational phenotypes, it is increasingly difficult to identify the right phenotype for the right tasks. This study uses a mixed-methods approach to develop and evaluate a novel metadata framework for retrieval of and reusing computational phenotypes. Twenty active phenotyping researchers from 2 large research networks, Electronic Medical Records and Genomics and Observational Health Data Sciences and Informatics, were recruited to suggest metadata elements. Once consensus was reached on 39 metadata elements, 47 new researchers were surveyed to evaluate the utility of the metadata framework. The survey consisted of 5-Likert multiple-choice questions and open-ended questions. Two more researchers were asked to use the metadata framework to annotate 8 type-2 diabetes mellitus phenotypes. More than 90% of the survey respondents rated metadata elements regarding phenotype definition and validation methods and metrics positively with a score of 4 or 5. Both researchers completed annotation of each phenotype within 60 min. Our thematic analysis of the narrative feedback indicates that the metadata framework was effective in capturing rich and explicit descriptions and enabling the search for phenotypes, compliance with data standards, and comprehensive validation metrics. Current limitations were its complexity for data collection and the entailed human costs.

20.
medRxiv ; 2023 Jun 05.
Article in English | MEDLINE | ID: mdl-37333246

ABSTRACT

Polygenic risk scores (PRS) have improved in predictive performance supporting their use in clinical practice. Reduced predictive performance of PRS in diverse populations can exacerbate existing health disparities. The NHGRI-funded eMERGE Network is returning a PRS-based genome-informed risk assessment to 25,000 diverse adults and children. We assessed PRS performance, medical actionability, and potential clinical utility for 23 conditions. Standardized metrics were considered in the selection process with additional consideration given to strength of evidence in African and Hispanic populations. Ten conditions were selected with a range of high-risk thresholds: atrial fibrillation, breast cancer, chronic kidney disease, coronary heart disease, hypercholesterolemia, prostate cancer, asthma, type 1 diabetes, obesity, and type 2 diabetes. We developed a pipeline for clinical PRS implementation, used genetic ancestry to calibrate PRS mean and variance, created a framework for regulatory compliance, and developed a PRS clinical report. eMERGE's experience informs the infrastructure needed to implement PRS-based implementation in diverse clinical settings.

SELECTION OF CITATIONS
SEARCH DETAIL