Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 54
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Hum Mol Genet ; 33(4): 374-385, 2024 Feb 01.
Article in English | MEDLINE | ID: mdl-37934784

ABSTRACT

Genome-wide association studies have contributed extensively to the discovery of disease-associated common variants. However, the genetic contribution to complex traits is still largely difficult to interpret. We report a genome-wide association study of 2394 cases and 2393 controls for age-related macular degeneration (AMD) via whole-genome sequencing, with 46.9 million genetic variants. Our study reveals significant single-variant association signals at four loci and independent gene-based signals in CFH, C2, C3, and NRTN. Using data from the Exome Aggregation Consortium (ExAC) for a gene-based test, we demonstrate an enrichment of predicted rare loss-of-function variants in CFH, CFI, and an as-yet unreported gene in AMD, ORMDL2. Our method of using a large variant list without individual-level genotypes as an external reference provides a flexible and convenient approach to leverage the publicly available variant datasets to augment the search for rare variant associations, which can explain additional disease risk in AMD.


Subject(s)
Genome-Wide Association Study , Macular Degeneration , Humans , Genome-Wide Association Study/methods , Macular Degeneration/genetics , Genotype , Genetic Testing , Whole Genome Sequencing , Polymorphism, Single Nucleotide/genetics , Genetic Predisposition to Disease , Complement Factor H/genetics
2.
Am J Hum Genet ; 110(9): 1522-1533, 2023 09 07.
Article in English | MEDLINE | ID: mdl-37607538

ABSTRACT

Population-scale biobanks linked to electronic health record data provide vast opportunities to extend our knowledge of human genetics and discover new phenotype-genotype associations. Given their dense phenotype data, biobanks can also facilitate replication studies on a phenome-wide scale. Here, we introduce the phenotype-genotype reference map (PGRM), a set of 5,879 genetic associations from 523 GWAS publications that can be used for high-throughput replication experiments. PGRM phenotypes are standardized as phecodes, ensuring interoperability between biobanks. We applied the PGRM to five ancestry-specific cohorts from four independent biobanks and found evidence of robust replications across a wide array of phenotypes. We show how the PGRM can be used to detect data corruption and to empirically assess parameters for phenome-wide studies. Finally, we use the PGRM to explore factors associated with replicability of GWAS results.


Subject(s)
Biological Specimen Banks , Data Science , Humans , Phenomics , Phenotype , Genotype
3.
Br J Anaesth ; 131(1): 37-46, 2023 07.
Article in English | MEDLINE | ID: mdl-37188560

ABSTRACT

BACKGROUND: Acute kidney injury (AKI) is a frequent yet understudied postoperative total joint arthroplasty complication. This study aimed to describe cardiometabolic disease co-occurrence using latent class analysis, and associated postoperative AKI risk. METHODS: This retrospective analysis examined patients ≥18 years old undergoing primary total knee or hip arthroplasties within the US Multicenter Perioperative Outcomes Group of hospitals from 2008 to 2019. AKI was defined using modified Kidney Disease: Improving Global Outcomes (KDIGO) criteria. Latent classes were constructed from eight cardiometabolic diseases including hypertension, diabetes, and coronary artery disease, excluding obesity. A mixed-effects logistic regression model was constructed for the outcome of any AKI and the exposure of interaction between latent class and obesity status adjusting for preoperative and intraoperative covariates. RESULTS: Of 81 639 cases, 4007 (4.9%) developed AKI. Patients with AKI were more commonly older and non-Hispanic Black, with more significant comorbidity. A latent class model selected three groups of cardiometabolic patterning, labelled 'hypertension only' (n=37 223), 'metabolic syndrome (MetS)' (n=36 503), and 'MetS+cardiovascular disease (CVD)' (n=7913). After adjustment, latent class/obesity interaction groups had differential risk of AKI compared with those in 'hypertension only'/non-obese. Those 'hypertension only'/obese had 1.7-fold increased odds of AKI (95% confidence interval [CI]: 1.5-2.0). Compared with 'hypertension only'/non-obese, those 'MetS+CVD'/obese had the highest odds of AKI (odds ratio 3.1, 95% CI: 2.6-3.7), whereas 'MetS+CVD'/non-obese had 2.2 times the odds of AKI (95% CI: 1.8-2.7; model area under the curve 0.76). CONCLUSIONS: The risk of postoperative AKI varies widely between patients. The current study suggests that the co-occurrence of metabolic conditions (diabetes mellitus, hypertension), with or without obesity, is a more important risk factor for acute kidney injury than individual comorbid diseases.


Subject(s)
Acute Kidney Injury , Arthroplasty, Replacement , Cardiovascular Diseases , Hypertension , Metabolic Syndrome , Humans , Adolescent , Retrospective Studies , Obesity/complications , Obesity/epidemiology , Risk Factors , Arthroplasty, Replacement/adverse effects , Metabolic Syndrome/complications , Postoperative Complications/epidemiology , Postoperative Complications/etiology , Acute Kidney Injury/epidemiology , Acute Kidney Injury/etiology , Hypertension/complications , Hypertension/epidemiology , Cardiovascular Diseases/epidemiology , Cardiovascular Diseases/etiology
4.
PLoS Genet ; 16(11): e1009077, 2020 11.
Article in English | MEDLINE | ID: mdl-33175840

ABSTRACT

Phenotypes extracted from Electronic Health Records (EHRs) are increasingly prevalent in genetic studies. EHRs contain hundreds of distinct clinical laboratory test results, providing a trove of health data beyond diagnoses. Such lab data is complex and lacks a ubiquitous coding scheme, making it more challenging than diagnosis data. Here we describe the first large-scale cross-health system genome-wide association study (GWAS) of EHR-based quantitative laboratory-derived phenotypes. We meta-analyzed 70 lab traits matched between the BioVU cohort from the Vanderbilt University Health System and the Michigan Genomics Initiative (MGI) cohort from Michigan Medicine. We show high replication of known association for these traits, validating EHR-based measurements as high-quality phenotypes for genetic analysis. Notably, our analysis provides the first replication for 699 previous GWAS associations across 46 different traits. We discovered 31 novel associations at genome-wide significance for 22 distinct traits, including the first reported associations for two lab-based traits. We replicated 22 of these novel associations in an independent tranche of BioVU samples. The summary statistics for all association tests are freely available to benefit other researchers. Finally, we performed mirrored analyses in BioVU and MGI to assess competing analytic practices for EHR lab traits. We find that using the mean of all available lab measurements provides a robust summary value, but alternate summarizations can improve power in certain circumstances. This study provides a proof-of-principle for cross health system GWAS and is a framework for future studies of quantitative EHR lab traits.


Subject(s)
Electronic Health Records/statistics & numerical data , Genetic Association Studies/methods , Genome-Wide Association Study/methods , Biological Specimen Banks , Cohort Studies , Electronic Health Records/trends , Genomics , Humans , Michigan , Phenotype , Polymorphism, Single Nucleotide/genetics , Quantitative Trait, Heritable
5.
Mol Psychiatry ; 26(9): 5239-5250, 2021 09.
Article in English | MEDLINE | ID: mdl-33483695

ABSTRACT

Bipolar disorder (BD) is a serious mental illness with substantial common variant heritability. However, the role of rare coding variation in BD is not well established. We examined the protein-coding (exonic) sequences of 3,987 unrelated individuals with BD and 5,322 controls of predominantly European ancestry across four cohorts from the Bipolar Sequencing Consortium (BSC). We assessed the burden of rare, protein-altering, single nucleotide variants classified as pathogenic or likely pathogenic (P-LP) both exome-wide and within several groups of genes with phenotypic or biologic plausibility in BD. While we observed an increased burden of rare coding P-LP variants within 165 genes identified as BD GWAS regions in 3,987 BD cases (meta-analysis OR = 1.9, 95% CI = 1.3-2.8, one-sided p = 6.0 × 10-4), this enrichment did not replicate in an additional 9,929 BD cases and 14,018 controls (OR = 0.9, one-side p = 0.70). Although BD shares common variant heritability with schizophrenia, in the BSC sample we did not observe a significant enrichment of P-LP variants in SCZ GWAS genes, in two classes of neuronal synaptic genes (RBFOX2 and FMRP) associated with SCZ or in loss-of-function intolerant genes. In this study, the largest analysis of exonic variation in BD, individuals with BD do not carry a replicable enrichment of rare P-LP variants across the exome or in any of several groups of genes with biologic plausibility. Moreover, despite a strong shared susceptibility between BD and SCZ through common genetic variation, we do not observe an association between BD risk and rare P-LP coding variants in genes known to modulate risk for SCZ.


Subject(s)
Bipolar Disorder , Schizophrenia , Bipolar Disorder/genetics , Exome/genetics , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Genome-Wide Association Study , Humans , Polymorphism, Single Nucleotide/genetics , Schizophrenia/genetics
6.
Support Care Cancer ; 30(9): 7355-7363, 2022 Sep.
Article in English | MEDLINE | ID: mdl-35606478

ABSTRACT

PURPOSE: Cyclophosphamide is a commonly used cancer agent that is metabolically activated by polymorphic enzymes. This study aims to investigate the association between predicted activity of candidate pharmacogenes with severe toxicity during cyclophosphamide treatment. METHODS: Genome-wide genetic data was collected from an institutional genetic data repository for CYP2B6, CYP3A4, CYP2C9, CYP2C19, GSTA1, GSTP1, ALDH1A1, ALDH3A1, ABCC1, ABCB1, and ERCC1. Treatment and toxicity data were retrospectively collected from the patient's medical record. The a priori selected primary hypothesis was that patients who have CYP2B6 reduced metabolizer activity (poor or intermediate (PM/IM) vs. normal (NM) metabolizer) have lower risk of severe toxicity or cyclophosphamide treatment modification due to toxicity. RESULTS: In the primary analysis of 510 cyclophosphamide-treated patients with available genetic data, there was no difference in the odds of severe toxicity or treatment modification due to toxicity in CYP2B6 PM/IM vs. NM (odds ratio = 0.97, 95% Confidence Interval: 0.62-1.50, p = 0.88). In an exploratory, statistically uncorrected secondary analysis, carriers of the ALDH1A1 rs8187996 variant had a lower risk of the primary toxicity endpoint compared with wild-type homozygous patients (odds ratio = 0.31, 95% Confidence Interval: 0.09-0.78, p = 0.028). None of the other tested phenotypes or genotypes was associated with the primary or secondary endpoints in unadjusted analysis (all p > 0.05). CONCLUSION: The finding that patients who carry ALDH1A1 rs8187996 may have a lower risk of cyclophosphamide toxicity than wild-type patients contradicts a prior finding for this variant and should be viewed with skepticism. We found weak evidence that any of these candidate pharmacogenetic predictors of cyclophosphamide toxicity may be useful to personalize cyclophosphamide dosing to optimize therapeutic outcomes in patients with cancer.


Subject(s)
Aldehyde Dehydrogenase 1 Family , Cytochrome P-450 CYP2B6 , Neoplasms , Pharmacogenetics , Retinal Dehydrogenase , Aldehyde Dehydrogenase 1 Family/genetics , Cyclophosphamide , Cytochrome P-450 CYP2B6/genetics , Genotype , Humans , Neoplasms/drug therapy , Neoplasms/genetics , Retinal Dehydrogenase/genetics , Retrospective Studies
7.
BMC Nephrol ; 23(1): 339, 2022 10 21.
Article in English | MEDLINE | ID: mdl-36271344

ABSTRACT

BACKGROUND: Prior studies support a genetic basis for postoperative acute kidney injury (AKI). We conducted a genome-wide association study (GWAS), assessed the clinical utility of a polygenic risk score (PRS), and estimated the heritable component of AKI in patients who underwent noncardiac surgery. METHODS: We performed a retrospective large-scale genome-wide association study followed by a meta-analysis of patients who underwent noncardiac surgery at the Vanderbilt University Medical Center ("Vanderbilt" cohort) or Michigan Medicine, the academic medical center of the University of Michigan ("Michigan" cohort). In the Vanderbilt cohort, the relationship between polygenic risk score for estimated glomerular filtration rate and postoperative AKI was also tested to explore the predictive power of aggregating multiple common genetic variants associated with AKI risk. Similarly, in the Vanderbilt cohort genome-wide complex trait analysis was used to estimate the heritable component of AKI due to common genetic variants. RESULTS: The study population included 8248 adults in the Vanderbilt cohort (mean [SD] 58.05 [15.23] years, 50.2% men) and 5998 adults in Michigan cohort (56.24 [14.76] years, 49% men). Incident postoperative AKI events occurred in 959 patients (11.6%) and in 277 patients (4.6%), respectively. No loci met genome-wide significance in the GWAS and meta-analysis. PRS for estimated glomerular filtration rate explained a very small percentage of variance in rates of postoperative AKI and was not significantly associated with AKI (odds ratio 1.050 per 1 SD increase in polygenic risk score [95% CI, 0.971-1.134]). The estimated heritability among common variants for AKI was 4.5% (SE = 4.5%) suggesting low heritability. CONCLUSION: The findings of this study indicate that common genetic variation minimally contributes to postoperative AKI after noncardiac surgery, and likely has little clinical utility for identifying high-risk patients.


Subject(s)
Acute Kidney Injury , Genome-Wide Association Study , Male , Adult , Humans , Female , Retrospective Studies , Acute Kidney Injury/diagnosis , Acute Kidney Injury/epidemiology , Acute Kidney Injury/genetics , Glomerular Filtration Rate , Risk Factors , Postoperative Complications/genetics , Postoperative Complications/epidemiology
8.
PLoS Genet ; 15(6): e1008202, 2019 06.
Article in English | MEDLINE | ID: mdl-31194742

ABSTRACT

Polygenic risk scores (PRS) are designed to serve as single summary measures that are easy to construct, condensing information from a large number of genetic variants associated with a disease. They have been used for stratification and prediction of disease risk. The primary focus of this paper is to demonstrate how we can combine PRS and electronic health records data to better understand the shared and unique genetic architecture and etiology of disease subtypes that may be both related and heterogeneous. PRS construction strategies often depend on the purpose of the study, the available data/summary estimates, and the underlying genetic architecture of a disease. We consider several choices for constructing a PRS using data obtained from various publicly-available sources including the UK Biobank and evaluate their abilities to predict not just the primary phenotype but also secondary phenotypes derived from electronic health records (EHR). This study was conducted using data from 30,702 unrelated, genotyped patients of recent European descent from the Michigan Genomics Initiative (MGI), a longitudinal biorepository effort within Michigan Medicine. We examine the three most common skin cancer subtypes in the USA: basal cell carcinoma, cutaneous squamous cell carcinoma, and melanoma. Using these PRS for various skin cancer subtypes, we conduct a phenome-wide association study (PheWAS) within the MGI data to evaluate PRS associations with secondary traits. PheWAS results are then replicated using population-based UK Biobank data and compared across various PRS construction methods. We develop an accompanying visual catalog called PRSweb that provides detailed PheWAS results and allows users to directly compare different PRS construction methods.


Subject(s)
Genetic Predisposition to Disease , Genomics , Multifactorial Inheritance/genetics , Skin Neoplasms/genetics , Biological Specimen Banks , Electronic Health Records , Genome-Wide Association Study , Genotype , Humans , Michigan/epidemiology , Phenotype , Polymorphism, Single Nucleotide/genetics , Risk Factors , Skin Neoplasms/pathology , United Kingdom/epidemiology
9.
Am J Hum Genet ; 102(6): 1048-1061, 2018 06 07.
Article in English | MEDLINE | ID: mdl-29779563

ABSTRACT

Health systems are stewards of patient electronic health record (EHR) data with extraordinarily rich depth and breadth, reflecting thousands of diagnoses and exposures. Measures of genomic variation integrated with EHRs offer a potential strategy to accurately stratify patients for risk profiling and discover new relationships between diagnoses and genomes. The objective of this study was to evaluate whether polygenic risk scores (PRS) for common cancers are associated with multiple phenotypes in a phenome-wide association study (PheWAS) conducted in 28,260 unrelated, genotyped patients of recent European ancestry who consented to participate in the Michigan Genomics Initiative, a longitudinal biorepository effort within Michigan Medicine. PRS for 12 cancer traits were calculated using summary statistics from the NHGRI-EBI catalog. A total of 1,711 synthetic case-control studies was used for PheWAS analyses. There were 13,490 (47.7%) patients with at least one cancer diagnosis in this study sample. PRS exhibited strong association for several cancer traits they were designed for, including female breast cancer, prostate cancer, melanoma, basal cell carcinoma, squamous cell carcinoma, and thyroid cancer. Phenome-wide significant associations were observed between PRS and many non-cancer diagnoses. To differentiate PRS associations driven by the primary trait from associations arising through shared genetic risk profiles, the idea of "exclusion PRS PheWAS" was introduced. Further analysis of temporal order of the diagnoses improved our understanding of these secondary associations. This comprehensive PheWAS used PRS instead of a single variant.


Subject(s)
Genetic Association Studies , Genomics , Multifactorial Inheritance/genetics , Neoplasms/genetics , Neoplasms/pathology , Calibration , Female , Genome-Wide Association Study , Humans , Male , Middle Aged , Neoplasms/diagnosis , Phenotype , Polymorphism, Single Nucleotide/genetics , Quantitative Trait, Heritable , Reproducibility of Results , Risk Factors , Time Factors
10.
Arterioscler Thromb Vasc Biol ; 40(11): 2686-2699, 2020 11.
Article in English | MEDLINE | ID: mdl-32938213

ABSTRACT

OBJECTIVE: While rare variants in the COL5A1 gene have been associated with classical Ehlers-Danlos syndrome and rarely with arterial dissections, recurrent variants in COL5A1 underlying a systemic arteriopathy have not been described. Monogenic forms of multifocal fibromuscular dysplasia (mFMD) have not been previously defined. Approach and Results: We studied 4 independent probands with the COL5A1 pathogenic variant c.1540G>A, p.(Gly514Ser) who presented with arterial aneurysms, dissections, tortuosity, and mFMD affecting multiple arteries. Arterial medial fibroplasia and smooth muscle cell disorganization were confirmed histologically. The COL5A1 c.1540G>A variant is predicted to be pathogenic in silico and absent in gnomAD. The c.1540G>A variant is on a shared 160.1 kb haplotype with 0.4% frequency in Europeans. Furthermore, exome sequencing data from a cohort of 264 individuals with mFMD were examined for COL5A1 variants. In this mFMD cohort, COL5A1 c.1540G>A and 6 additional relatively rare COL5A1 variants predicted to be deleterious in silico were identified and were associated with arterial dissections (P=0.005). CONCLUSIONS: COL5A1 c.1540G>A is the first recurring variant recognized to be associated with arterial dissections and mFMD. This variant presents with a phenotype reminiscent of vascular Ehlers-Danlos syndrome. A shared haplotype among probands supports the existence of a common founder. Relatively rare COL5A1 genetic variants predicted to be deleterious by in silico analysis were identified in ≈2.7% of mFMD cases, and as they were enriched in patients with arterial dissections, may act as disease modifiers. Molecular testing for COL5A1 should be considered in patients with a phenotype overlapping with vascular Ehlers-Danlos syndrome and mFMD.


Subject(s)
Aortic Dissection/genetics , Arteries/pathology , Collagen Type V/genetics , Ehlers-Danlos Syndrome/genetics , Fibromuscular Dysplasia/genetics , Polymorphism, Single Nucleotide , Adult , Aortic Dissection/diagnostic imaging , Aortic Dissection/pathology , Arteries/diagnostic imaging , Ehlers-Danlos Syndrome/diagnostic imaging , Ehlers-Danlos Syndrome/pathology , Female , Fibromuscular Dysplasia/diagnostic imaging , Fibromuscular Dysplasia/pathology , Genetic Predisposition to Disease , Haplotypes , Humans , Male , Middle Aged , Phenotype , Young Adult
11.
Proc Natl Acad Sci U S A ; 115(2): 379-384, 2018 01 09.
Article in English | MEDLINE | ID: mdl-29279374

ABSTRACT

A major challenge in evaluating the contribution of rare variants to complex disease is identifying enough copies of the rare alleles to permit informative statistical analysis. To investigate the contribution of rare variants to the risk of type 2 diabetes (T2D) and related traits, we performed deep whole-genome analysis of 1,034 members of 20 large Mexican-American families with high prevalence of T2D. If rare variants of large effect accounted for much of the diabetes risk in these families, our experiment was powered to detect association. Using gene expression data on 21,677 transcripts for 643 pedigree members, we identified evidence for large-effect rare-variant cis-expression quantitative trait loci that could not be detected in population studies, validating our approach. However, we did not identify any rare variants of large effect associated with T2D, or the related traits of fasting glucose and insulin, suggesting that large-effect rare variants account for only a modest fraction of the genetic risk of these traits in this sample of families. Reliable identification of large-effect rare variants will require larger samples of extended pedigrees or different study designs that further enrich for such variants.


Subject(s)
Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease/genetics , Genetic Variation , Mexican Americans/genetics , Diabetes Mellitus, Type 2/ethnology , Diabetes Mellitus, Type 2/pathology , Family Health , Female , Gene Frequency , Genetic Predisposition to Disease/ethnology , Genome-Wide Association Study/methods , Genotype , Humans , Male , Pedigree , Phenotype , Quantitative Trait Loci/genetics , Whole Genome Sequencing/methods
12.
Genet Epidemiol ; 43(7): 800-814, 2019 10.
Article in English | MEDLINE | ID: mdl-31433078

ABSTRACT

The power of genetic association analyses can be increased by jointly meta-analyzing multiple correlated phenotypes. Here, we develop a meta-analysis framework, Meta-MultiSKAT, that uses summary statistics to test for association between multiple continuous phenotypes and variants in a region of interest. Our approach models the heterogeneity of effects between studies through a kernel matrix and performs a variance component test for association. Using a genotype kernel, our approach can test for rare-variants and the combined effects of both common and rare-variants. To achieve robust power, within Meta-MultiSKAT, we developed fast and accurate omnibus tests combining different models of genetic effects, functional genomic annotations, multiple correlated phenotypes, and heterogeneity across studies. In addition, Meta-MultiSKAT accommodates situations where studies do not share exactly the same set of phenotypes or have differing correlation patterns among the phenotypes. Simulation studies confirm that Meta-MultiSKAT can maintain the type-I error rate at the exome-wide level of 2.5 × 10-6 . Further simulations under different models of association show that Meta-MultiSKAT can improve the power of detection from 23% to 38% on average over single phenotype-based meta-analysis approaches. We demonstrate the utility and improved power of Meta-MultiSKAT in the meta-analyses of four white blood cell subtype traits from the Michigan Genomics Initiative (MGI) and SardiNIA studies.


Subject(s)
Genetic Association Studies , Meta-Analysis as Topic , Gene Frequency/genetics , Genotype , Humans , Italy , Leukocytes/metabolism , Models, Genetic , Mutation/genetics , Phenotype
13.
Hum Mol Genet ; 26(21): 4301-4313, 2017 11 01.
Article in English | MEDLINE | ID: mdl-28973304

ABSTRACT

Psoriasis is a common inflammatory skin disorder for which multiple genetic susceptibility loci have been identified, but few resolved to specific functional variants. In this study, we sought to identify common and rare psoriasis-associated gene-centric variation. Using exome arrays we genotyped four independent cohorts, totalling 11 861 psoriasis cases and 28 610 controls, aggregating the dataset through statistical meta-analysis. Single variant analysis detected a previously unreported risk locus at TNFSF15 (rs6478108; P = 1.50 × 10-8, OR = 1.10), and association of common protein-altering variants at 11 loci previously implicated in psoriasis susceptibility. We validate previous reports of protective low-frequency protein-altering variants within IFIH1 (encoding an innate antiviral receptor) and TYK2 (encoding a Janus kinase), in each case establishing a further series of protective rare variants (minor allele frequency < 0.01) via gene-wide aggregation testing (IFIH1: pburden = 2.53 × 10-7, OR = 0.707; TYK2: pburden = 6.17 × 10-4, OR = 0.744). Both genes play significant roles in type I interferon (IFN) production and signalling. Several of the protective rare and low-frequency variants in IFIH1 and TYK2 disrupt conserved protein domains, highlighting potential mechanisms through which their effect may be exerted.


Subject(s)
Psoriasis/genetics , Tumor Necrosis Factor Ligand Superfamily Member 15/genetics , Alleles , Case-Control Studies , Cohort Studies , Exome , Female , Gene Frequency/genetics , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Genome-Wide Association Study , Genotype , Humans , Interferon-Induced Helicase, IFIH1/genetics , Interferon-Induced Helicase, IFIH1/metabolism , Male , Polymorphism, Single Nucleotide/genetics , Psoriasis/physiopathology , Risk Factors , TYK2 Kinase/genetics , TYK2 Kinase/metabolism , Tumor Necrosis Factor Ligand Superfamily Member 15/metabolism , Exome Sequencing
14.
Pharmacogenet Genomics ; 27(3): 89-100, 2017 03.
Article in English | MEDLINE | ID: mdl-27984508

ABSTRACT

OBJECTIVE: Proteins involving absorption, distribution, metabolism, and excretion (ADME) play a critical role in drug pharmacokinetics. The type and frequency of genetic variation in the ADME genes differ among populations. The aim of this study was to systematically investigate common and rare ADME coding variation in diverse ethnic populations by exome sequencing. MATERIALS AND METHODS: Data derived from commercial exome capture arrays and next-generation sequencing were used to characterize coding variation in 298 ADME genes in 251 Northeast Asians and 1181 individuals from the 1000 Genomes Project. RESULTS: Approximately 75% of the ADME coding sequence was captured at high quality across the joint samples harboring more than 8000 variants, with 49% of individuals carrying at least one 'knockout' allele. ADME genes carried 50% more nonsynonymous variation than non-ADME genes (P=8.2×10) and showed significantly greater levels of population differentiation (P=7.6×10). Out of the 2135 variants identified that were predicted to be deleterious, 633 were not on commercially available ADME or general-purpose genotyping arrays. Forty deleterious variants within important ADME genes, with frequencies of at least 2% in at least one population, were identified as candidates for future pharmacogenetic studies. CONCLUSION: Exome sequencing was effective in accurately genotyping most ADME variants important for pharmacogenetic research, in addition to identifying rare or potentially de novo coding variants that may be clinically meaningful. Furthermore, as a class, ADME genes are more variable and less sensitive to purifying selection than non-ADME genes.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Oligonucleotide Array Sequence Analysis/methods , Population Groups/genetics , Sequence Analysis, DNA/methods , Exome , Genetic Variation , Genetics, Population , Humans , Male , Polymorphism, Single Nucleotide , Population Groups/ethnology , Principal Component Analysis
15.
Ann Rheum Dis ; 76(7): 1321-1324, 2017 Jul.
Article in English | MEDLINE | ID: mdl-28501801

ABSTRACT

OBJECTIVES: Psoriatic arthritis (PsA) is an inflammatory arthritis associated with psoriasis. While many common risk alleles have been reported for association with PsA as well as psoriasis, few rare coding alleles have yet been identified. METHODS: To identify rare coding variation associated with PsA risk or protection, we genotyped 41 267 variants with the exome chip and investigated association within an initial cohort of 1980 PsA cases and 5913 controls. Genotype data for an independent cohort of 2234 PsA cases and 5708 controls was also made available, allowing for a meta-analysis to be performed with the discovery dataset. RESULTS: We identified an association with the rare variant rs35667974 (p=2.39x10-6, OR=0.47), encoding an Ile923Val amino acid change in the IFIH1 gene protein product. The association was reproduced in our independent cohort, which reached a high level of significance on meta-analysis with the discovery and replication datasets (p=4.67x10-10). We identified a strong association with IFIH1 when performing multiple-variant analysis (p=6.77x10-6), and found evidence of independent effects between the rare allele and the common PsA variant at the same locus. CONCLUSION: For the first time, we report a rare coding allele in IFIH1 to be protective for PsA. This rare allele has also been identified to have the same direction of effect on type I diabetes and psoriasis. While this association further supports existing evidence for IFIH1 as a causal gene for PsA, mechanistic studies will need to be pursued to confirm that IFIH1 is indeed causal.


Subject(s)
Arthritis, Psoriatic/genetics , Interferon-Induced Helicase, IFIH1/genetics , Alleles , Case-Control Studies , Genetic Predisposition to Disease , Genotype , Humans , Logistic Models , Polymorphism, Single Nucleotide , Principal Component Analysis , Protective Factors
16.
Med Care ; 55(9): 864-870, 2017 09.
Article in English | MEDLINE | ID: mdl-28763374

ABSTRACT

BACKGROUND: Accurately estimating cardiovascular risk is fundamental to good decision-making in cardiovascular disease (CVD) prevention, but risk scores developed in one population often perform poorly in dissimilar populations. We sought to examine whether a large integrated health system can use their electronic health data to better predict individual patients' risk of developing CVD. METHODS: We created a cohort using all patients ages 45-80 who used Department of Veterans Affairs (VA) ambulatory care services in 2006 with no history of CVD, heart failure, or loop diuretics. Our outcome variable was new-onset CVD in 2007-2011. We then developed a series of recalibrated scores, including a fully refit "VA Risk Score-CVD (VARS-CVD)." We tested the different scores using standard measures of prediction quality. RESULTS: For the 1,512,092 patients in the study, the Atherosclerotic cardiovascular disease risk score had similar discrimination as the VARS-CVD (c-statistic of 0.66 in men and 0.73 in women), but the Atherosclerotic cardiovascular disease model had poor calibration, predicting 63% more events than observed. Calibration was excellent in the fully recalibrated VARS-CVD tool, but simpler techniques tested proved less reliable. CONCLUSIONS: We found that local electronic health record data can be used to estimate CVD better than an established risk score based on research populations. Recalibration improved estimates dramatically, and the type of recalibration was important. Such tools can also easily be integrated into health system's electronic health record and can be more readily updated.


Subject(s)
Cardiovascular Diseases/epidemiology , Electronic Health Records/statistics & numerical data , Health Status Indicators , Age Distribution , Aged , Atherosclerosis/epidemiology , Female , Humans , Male , Middle Aged , Risk Assessment , Risk Factors , Sex Distribution , Socioeconomic Factors , United States , United States Department of Veterans Affairs
18.
Stat Med ; 36(13): 2148-2160, 2017 06 15.
Article in English | MEDLINE | ID: mdl-28245528

ABSTRACT

Creating accurate risk prediction models from Big Data resources such as Electronic Health Records (EHRs) is a critical step toward achieving precision medicine. A major challenge in developing these tools is accounting for imperfect aspects of EHR data, particularly the potential for misclassified outcomes. Misclassification, the swapping of case and control outcome labels, is well known to bias effect size estimates for regression prediction models. In this paper, we study the effect of misclassification on accuracy assessment for risk prediction models and find that it leads to bias in the area under the curve (AUC) metric from standard ROC analysis. The extent of the bias is determined by the false positive and false negative misclassification rates as well as disease prevalence. Notably, we show that simply correcting for misclassification while building the prediction model is not sufficient to remove the bias in AUC. We therefore introduce an intuitive misclassification-adjusted ROC procedure that accounts for uncertainty in observed outcomes and produces bias-corrected estimates of the true AUC. The method requires that misclassification rates are either known or can be estimated, quantities typically required for the modeling step. The computational simplicity of our method is a key advantage, making it ideal for efficiently comparing multiple prediction models on very large datasets. Finally, we apply the correction method to a hospitalization prediction model from a cohort of over 1 million patients from the Veterans Health Administrations EHR. Implementations of the ROC correction are provided for Stata and R. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.


Subject(s)
Models, Statistical , ROC Curve , Area Under Curve , Bias , Electronic Health Records , Hospitalization/statistics & numerical data , Humans , Risk Assessment/methods , United States , United States Department of Veterans Affairs/statistics & numerical data
19.
Genet Epidemiol ; 39(4): 227-38, 2015 May.
Article in English | MEDLINE | ID: mdl-25740221

ABSTRACT

Advances in exome sequencing and the development of exome genotyping arrays are enabling explorations of association between rare coding variants and complex traits. To ensure power for these rare variant analyses, a variety of association tests that group variants by gene or functional unit have been proposed. Here, we extend these tests to family-based studies. We develop family-based burden tests, variable frequency threshold tests and sequence kernel association tests. Through simulations, we compare the performance of different tests. We describe situations where family-based studies provide greater power than studies of unrelated individuals to detect rare variants associated with moderate to large changes in trait values. Broadly speaking, we find that when sample sizes are limited and only a modest fraction of all trait-associated variants can be identified, family samples are more powerful. Finally, we illustrate our approach by analyzing the relationship between coding variants and levels of high-density lipoprotein (HDL) cholesterol in 11,556 individuals from the HUNT and SardiNIA studies, demonstrating association for coding variants in the APOC3, CETP, LIPC, LIPG, and LPL genes and illustrating the value of family samples, meta-analysis, and gene-level tests. Our methods are implemented in freely available C++ code.


Subject(s)
Genetic Association Studies/methods , Genetic Variation/genetics , Models, Genetic , Software , Apolipoprotein C-III/genetics , Cholesterol Ester Transfer Proteins/genetics , Cholesterol, HDL/genetics , Computer Simulation , Exome/genetics , Family , Genotype , Humans , Lipase/genetics , Lipoprotein Lipase/genetics , Phenotype
20.
Genome Res ; 23(12): 1974-84, 2013 Dec.
Article in English | MEDLINE | ID: mdl-23990608

ABSTRACT

Understanding patterns of spontaneous mutations is of fundamental interest in studies of human genome evolution and genetic disease. Here, we used extremely rare variants in humans to model the molecular spectrum of single-nucleotide mutations. Compared to common variants in humans and human-chimpanzee fixed differences (substitutions), rare variants, on average, arose more recently in the human lineage and are less affected by the potentially confounding effects of natural selection, population demographic history, and biased gene conversion. We analyzed variants obtained from a population-based sequencing study of 202 genes in >14,000 individuals. We observed considerable variability in the per-gene mutation rate, which was correlated with local GC content, but not recombination rate. Using >20,000 variants with a derived allele frequency ≤ 10(-4), we examined the effect of local GC content and recombination rate on individual variant subtypes and performed comparisons with common variants and substitutions. The influence of local GC content on rare variants differed from that on common variants or substitutions, and the differences varied by variant subtype. Furthermore, recombination rate and recombination hotspots have little effect on rare variants of any subtype, yet both have a relatively strong impact on multiple variant subtypes in common variants and substitutions. This observation is consistent with the effect of biased gene conversion or selection-dependent processes. Our results highlight the distinct biases inherent in the initial mutation patterns and subsequent evolutionary processes that affect segregating variants.


Subject(s)
Genetic Variation , Genome, Human , Point Mutation , Animals , Base Composition , Evolution, Molecular , Gene Conversion , Gene Frequency , Genomics , Humans , Logistic Models , Models, Genetic , Mutation Rate , Pan troglodytes/genetics , Phylogeny , Recombination, Genetic , Selection, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL