ABSTRACT
Genetic studies have identified ≥240 loci associated with the risk of type 2 diabetes (T2D), yet most of these loci lie in non-coding regions, masking the underlying molecular mechanisms. Recent studies investigating mRNA expression in human pancreatic islets have yielded important insights into the molecular drivers of normal islet function and T2D pathophysiology. However, similar studies investigating microRNA (miRNA) expression remain limited. Here, we present data from 63 individuals, the largest sequencing-based analysis of miRNA expression in human islets to date. We characterized the genetic regulation of miRNA expression by decomposing the expression of highly heritable miRNAs into cis- and trans-acting genetic components and mapping cis-acting loci associated with miRNA expression [miRNA-expression quantitative trait loci (eQTLs)]. We found i) 84 heritable miRNAs, primarily regulated by trans-acting genetic effects, and ii) 5 miRNA-eQTLs. We also used several different strategies to identify T2D-associated miRNAs. First, we colocalized miRNA-eQTLs with genetic loci associated with T2D and multiple glycemic traits, identifying one miRNA, miR-1908, that shares genetic signals for blood glucose and glycated hemoglobin (HbA1c). Next, we intersected miRNA seed regions and predicted target sites with credible set SNPs associated with T2D and glycemic traits and found 32 miRNAs that may have altered binding and function due to disrupted seed regions. Finally, we performed differential expression analysis and identified 14 miRNAs associated with T2D status-including miR-187-3p, miR-21-5p, miR-668, and miR-199b-5p-and 4 miRNAs associated with a polygenic score for HbA1c levels-miR-216a, miR-25, miR-30a-3p, and miR-30a-5p.
Subject(s)
Diabetes Mellitus, Type 2 , Islets of Langerhans , MicroRNAs , Humans , MicroRNAs/metabolism , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Glycated Hemoglobin , Islets of Langerhans/metabolism , Quantitative Trait Loci/geneticsABSTRACT
Genetic association studies have identified hundreds of independent signals associated with type 2 diabetes (T2D) and related traits. Despite these successes, the identification of specific causal variants underlying a genetic association signal remains challenging. In this study, we describe a deep learning (DL) method to analyze the impact of sequence variants on enhancers. Focusing on pancreatic islets, a T2D relevant tissue, we show that our model learns islet-specific transcription factor (TF) regulatory patterns and can be used to prioritize candidate causal variants. At 101 genetic signals associated with T2D and related glycemic traits where multiple variants occur in linkage disequilibrium, our method nominates a single causal variant for each association signal, including three variants previously shown to alter reporter activity in islet-relevant cell types. For another signal associated with blood glucose levels, we biochemically test all candidate causal variants from statistical fine-mapping using a pancreatic islet beta cell line and show biochemical evidence of allelic effects on TF binding for the model-prioritized variant. To aid in future research, we publicly distribute our model and islet enhancer perturbation scores across ~67 million genetic variants. We anticipate that DL methods like the one presented in this study will enhance the prioritization of candidate causal variants for functional studies.
Subject(s)
Deep Learning , Diabetes Mellitus, Type 2 , Enhancer Elements, Genetic , Islets of Langerhans , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Diabetes Mellitus, Type 2/pathology , Islets of Langerhans/metabolism , Islets of Langerhans/pathology , Genetic Variation , Humans , Computer SimulationABSTRACT
AIMS/HYPOTHESIS: Disruption of pancreatic islet function and glucose homeostasis can lead to the development of sustained hyperglycaemia, beta cell glucotoxicity and subsequently type 2 diabetes. In this study, we explored the effects of in vitro hyperglycaemic conditions on human pancreatic islet gene expression across 24 h in six pancreatic cell types: alpha; beta; gamma; delta; ductal; and acinar. We hypothesised that genes associated with hyperglycaemic conditions may be relevant to the onset and progression of diabetes. METHODS: We exposed human pancreatic islets from two donors to low (2.8 mmol/l) and high (15.0 mmol/l) glucose concentrations over 24 h in vitro. To assess the transcriptome, we performed single-cell RNA-seq (scRNA-seq) at seven time points. We modelled time as both a discrete and continuous variable to determine momentary and longitudinal changes in transcription associated with islet time in culture or glucose exposure. Additionally, we integrated genomic features and genetic summary statistics to nominate candidate effector genes. For three of these genes, we functionally characterised the effect on insulin production and secretion using CRISPR interference to knock down gene expression in EndoC-ßH1 cells, followed by a glucose-stimulated insulin secretion assay. RESULTS: In the discrete time models, we identified 1344 genes associated with time and 668 genes associated with glucose exposure across all cell types and time points. In the continuous time models, we identified 1311 genes associated with time, 345 genes associated with glucose exposure and 418 genes associated with interaction effects between time and glucose across all cell types. By integrating these expression profiles with summary statistics from genetic association studies, we identified 2449 candidate effector genes for type 2 diabetes, HbA1c, random blood glucose and fasting blood glucose. Of these candidate effector genes, we showed that three (ERO1B, HNRNPA2B1 and RHOBTB3) exhibited an effect on glucose-stimulated insulin production and secretion in EndoC-ßH1 cells. CONCLUSIONS/INTERPRETATION: The findings of our study provide an in-depth characterisation of the 24 h transcriptomic response of human pancreatic islets to glucose exposure at a single-cell resolution. By integrating differentially expressed genes with genetic signals for type 2 diabetes and glucose-related traits, we provide insights into the molecular mechanisms underlying glucose homeostasis. Finally, we provide functional evidence to support the role of three candidate effector genes in insulin secretion and production. DATA AVAILABILITY: The scRNA-seq data from the 24 h glucose exposure experiment performed in this study are available in the database of Genotypes and Phenotypes (dbGap; https://www.ncbi.nlm.nih.gov/gap/ ) with accession no. phs001188.v3.p1. Study metadata and summary statistics for the differential expression, gene set enrichment and candidate effector gene prediction analyses are available in the Zenodo data repository ( https://zenodo.org/ ) under accession number 11123248. The code used in this study is publicly available at https://github.com/CollinsLabBioComp/publication-islet_glucose_timecourse .
Subject(s)
Gene Expression Profiling , Glucose , Islets of Langerhans , Single-Cell Analysis , Humans , Islets of Langerhans/metabolism , Islets of Langerhans/drug effects , Glucose/pharmacology , Glucose/metabolism , Transcriptome , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Insulin/metabolism , Insulin-Secreting Cells/metabolism , Insulin-Secreting Cells/drug effects , Hyperglycemia/genetics , Hyperglycemia/metabolismABSTRACT
Patients with classic hydroa vacciniforme-like lymphoproliferative disorder (HVLPD) typically have high levels of Epstein-Barr virus (EBV) DNA in T cells and/or natural killer (NK) cells in blood and skin lesions induced by sun exposure that are infiltrated with EBV-infected lymphocytes. HVLPD is very rare in the United States and Europe but more common in Asia and South America. The disease can progress to a systemic form that may result in fatal lymphoma. We report our 11-year experience with 16 HVLPD patients from the United States and England and found that whites were less likely to develop systemic EBV disease (1/10) than nonwhites (5/6). All (10/10) of the white patients were generally in good health at last follow-up, while two-thirds (4/6) of the nonwhite patients required hematopoietic stem cell transplantation. Nonwhite patients had later age of onset of HVLPD than white patients (median age, 8 vs 5 years) and higher levels of EBV DNA (median, 1 515 000 vs 250 000 copies/ml) and more often had low numbers of NK cells (83% vs 50% of patients) and T-cell clones in the blood (83% vs 30% of patients). RNA-sequencing analysis of an HVLPD skin lesion in a white patient compared with his normal skin showed increased expression of interferon-γ and chemokines that attract T cells and NK cells. Thus, white patients with HVLPD were less likely to have systemic disease with EBV and had a much better prognosis than nonwhite patients. This trial was registered at www.clinicaltrials.gov as #NCT00369421 and #NCT00032513.
Subject(s)
Epstein-Barr Virus Infections/pathology , Hydroa Vacciniforme/virology , Lymphoproliferative Disorders/pathology , Lymphoproliferative Disorders/virology , Child , Child, Preschool , Epstein-Barr Virus Infections/ethnology , Epstein-Barr Virus Infections/immunology , Female , Humans , Lymphoproliferative Disorders/ethnology , Male , White PeopleABSTRACT
Genome-wide association studies (GWAS) have identified ~100 loci associated with blood lipid levels, but much of the trait heritability remains unexplained, and at most loci the identities of the trait-influencing variants remain unknown. We conducted a trans-ethnic fine-mapping study at 18, 22, and 18 GWAS loci on the Metabochip for their association with triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C), respectively, in individuals of African American (n = 6,832), East Asian (n = 9,449), and European (n = 10,829) ancestry. We aimed to identify the variants with strongest association at each locus, identify additional and population-specific signals, refine association signals, and assess the relative significance of previously described functional variants. Among the 58 loci, 33 exhibited evidence of association at P<1 × 10(-4) in at least one ancestry group. Sequential conditional analyses revealed that ten, nine, and four loci in African Americans, Europeans, and East Asians, respectively, exhibited two or more signals. At these loci, accounting for all signals led to a 1.3- to 1.8-fold increase in the explained phenotypic variance compared to the strongest signals. Distinct signals across ancestry groups were identified at PCSK9 and APOA5. Trans-ethnic analyses narrowed the signals to smaller sets of variants at GCKR, PPP1R3B, ABO, LCAT, and ABCA1. Of 27 variants reported previously to have functional effects, 74% exhibited the strongest association at the respective signal. In conclusion, trans-ethnic high-density genotyping and analysis confirm the presence of allelic heterogeneity, allow the identification of population-specific variants, and limit the number of candidate SNPs for functional studies.
Subject(s)
Apolipoproteins A/genetics , Genome-Wide Association Study , Proprotein Convertases/genetics , Serine Endopeptidases/genetics , Black or African American/genetics , Apolipoprotein A-V , Cholesterol, HDL/blood , Cholesterol, HDL/genetics , Cholesterol, LDL/blood , Cholesterol, LDL/genetics , Humans , Lipoproteins, HDL/blood , Lipoproteins, HDL/genetics , Lipoproteins, LDL/blood , Lipoproteins, LDL/genetics , Proprotein Convertase 9 , Triglycerides/blood , Triglycerides/genetics , White People/geneticsABSTRACT
Transgenic animals are extensively used to model human disease. Typically, the transgene copy number is estimated, but the exact integration site and configuration of the foreign DNA remains uncharacterized. When transgenes have been closely examined, some unexpected configurations have been found. Here, we describe a method to recover transgene insertion sites and assess structural rearrangements of host and transgene DNA using microarray hybridization and targeted sequence capture. We used information about the transgene insertion site to develop a polymerase chain reaction genotyping assay to distinguish heterozygous from homozygous transgenic animals. Although we worked with a bacterial artificial chromosome transgenic mouse line, this method can be used to analyse the integration site and configuration of any foreign DNA in a sequenced genome.
Subject(s)
Genotyping Techniques , High-Throughput Nucleotide Sequencing , Oligonucleotide Array Sequence Analysis/methods , Sequence Analysis, DNA , Transgenes , Animals , Chromosomes, Artificial, Bacterial , Mice , Mice, Transgenic , Polymerase Chain ReactionABSTRACT
We developed an efficient CRISPR prime editing protocol and generated isogenic-induced pluripotent stem cell (iPSC) lines carrying heterozygous or homozygous alleles for putatively causal single nucleotide variants at six type 2 diabetes loci (ABCC8, MTNR1B, TCF7L2, HNF4A, CAMK1D, and GCK). Our two-step sequence-based approach to first identify transfected cell pools with the highest fraction of edited cells significantly reduced the downstream efforts to isolate single clones of edited cells. We found that prime editing can make targeted genetic changes in iPSC and optimization of system components and guide RNA designs that were critical to achieve acceptable efficiency. Systems utilizing PEmax, epegRNA modifications, and MLH1dn provided significant benefit, producing editing efficiencies of 36-73%. Editing success and pegRNA design optimization required for each variant differed depending on the sequence at the target site. With attention to design, prime editing is a promising approach to generate isogenic iPSC lines, enabling the study of specific genetic changes in a common genetic background.
Subject(s)
Diabetes Mellitus, Type 2 , Induced Pluripotent Stem Cells , Humans , Clustered Regularly Interspaced Short Palindromic Repeats/genetics , CRISPR-Cas Systems/genetics , Gene Editing , RNA, Guide, CRISPR-Cas SystemsABSTRACT
Massively parallel DNA sequencing technologies have greatly increased our ability to generate large amounts of sequencing data at a rapid pace. Several methods have been developed to enrich for genomic regions of interest for targeted sequencing. We have compared three of these methods: Molecular Inversion Probes (MIP), Solution Hybrid Selection (SHS), and Microarray-based Genomic Selection (MGS). Using HapMap DNA samples, we compared each of these methods with respect to their ability to capture an identical set of exons and evolutionarily conserved regions associated with 528 genes (2.61 Mb). For sequence analysis, we developed and used a novel Bayesian genotype-assigning algorithm, Most Probable Genotype (MPG). All three capture methods were effective, but sensitivities (percentage of targeted bases associated with high-quality genotypes) varied for an equivalent amount of pass-filtered sequence: for example, 70% (MIP), 84% (SHS), and 91% (MGS) for 400 Mb. In contrast, all methods yielded similar accuracies of >99.84% when compared to Infinium 1M SNP BeadChip-derived genotypes and >99.998% when compared to 30-fold coverage whole-genome shotgun sequencing data. We also observed a low false-positive rate with all three methods; of the heterozygous positions identified by each of the capture methods, >99.57% agreed with 1M SNP BeadChip, and >98.840% agreed with the whole-genome shotgun data. In addition, we successfully piloted the genomic enrichment of a set of 12 pooled samples via the MGS method using molecular bar codes. We find that these three genomic enrichment methods are highly accurate and practical, with sensitivities comparable to that of 30-fold coverage whole-genome shotgun data.
Subject(s)
Diabetes Mellitus, Type 2/genetics , Genome, Human , Oligonucleotide Array Sequence Analysis/methods , Sequence Analysis, DNA/methods , Algorithms , Bayes Theorem , DNA/genetics , DNA Probes/genetics , Exons , Genotype , Humans , Reproducibility of Results , Sensitivity and SpecificityABSTRACT
Disruption of pancreatic islet function and glucose homeostasis can lead to the development of sustained hyperglycemia, beta cell glucotoxicity, and ultimately type 2 diabetes (T2D). In this study, we sought to explore the effects of hyperglycemia on human pancreatic islet (HPI) gene expression by exposing HPIs from two donors to low (2.8mM) and high (15.0mM) glucose concentrations over 24 hours, assaying the transcriptome at seven time points using single-cell RNA sequencing (scRNA-seq). We modeled time as both a discrete and continuous variable to determine momentary and longitudinal changes in transcription associated with islet time in culture or glucose exposure. Across all cell types, we identified 1,528 genes associated with time, 1,185 genes associated with glucose exposure, and 845 genes associated with interaction effects between time and glucose. We clustered differentially expressed genes across cell types and found 347 modules of genes with similar expression patterns across time and glucose conditions, including two beta cell modules enriched in genes associated with T2D. Finally, by integrating genomic features from this study and genetic summary statistics for T2D and related traits, we nominate 363 candidate effector genes that may underlie genetic associations for T2D and related traits.
ABSTRACT
Genetic studies have identified numerous loci associated with type 2 diabetes (T2D), but the functional roles of many loci remain unexplored. Here, we engineered isogenic knockout human embryonic stem cell lines for 20 genes associated with T2D risk. We examined the impacts of each knockout on ß cell differentiation, functions, and survival. We generated gene expression and chromatin accessibility profiles on ß cells derived from each knockout line. Analyses of T2D-association signals overlapping HNF4A-dependent ATAC peaks identified a likely causal variant at the FAIM2 T2D-association signal. Additionally, the integrative association analyses identified four genes (CP, RNASE1, PCSK1N, and GSTA2) associated with insulin production, and two genes (TAGLN3 and DHRS2) associated with ß cell sensitivity to lipotoxicity. Finally, we leveraged deep ATAC-seq read coverage to assess allele-specific imbalance at variants heterozygous in the parental line and identified a single likely functional variant at each of 23 T2D-association signals.
Subject(s)
Diabetes Mellitus, Type 2 , Human Embryonic Stem Cells , Insulin-Secreting Cells , Humans , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Human Embryonic Stem Cells/metabolism , Genetic Predisposition to Disease , Genome-Wide Association Study , Insulin-Secreting Cells/metabolism , Polymorphism, Single Nucleotide , Carbonyl Reductase (NADPH)/genetics , Carbonyl Reductase (NADPH)/metabolismABSTRACT
Genetic studies have identified numerous loci associated with type 2 diabetes (T2D), but the functional role of many loci has remained unexplored. In this study, we engineered isogenic knockout human embryonic stem cell (hESC) lines for 20 genes associated with T2D risk. We systematically examined ß-cell differentiation, insulin production and secretion, and survival. We performed RNA-seq and ATAC-seq on hESC-ß cells from each knockout line. Analyses of T2D GWAS signals overlapping with HNF4A-dependent ATAC peaks identified a specific SNP as a likely causal variant. In addition, we performed integrative association analyses and identified four genes ( CP, RNASE1, PCSK1N and GSTA2 ) associated with insulin production, and two genes ( TAGLN3 and DHRS2 ) associated with sensitivity to lipotoxicity. Finally, we leveraged deep ATAC-seq read coverage to assess allele-specific imbalance at variants heterozygous in the parental hESC line, to identify a single likely functional variant at each of 23 T2D GWAS signals.
ABSTRACT
Identifying the genetic variants that regulate fasting glucose concentrations may further our understanding of the pathogenesis of diabetes. We therefore investigated the association of fasting glucose levels with SNPs in 2 genome-wide scans including a total of 5,088 nondiabetic individuals from Finland and Sardinia. We found a significant association between the SNP rs563694 and fasting glucose concentrations (P = 3.5 x 10(-7)). This association was further investigated in an additional 18,436 nondiabetic individuals of mixed European descent from 7 different studies. The combined P value for association in these follow-up samples was 6.9 x 10(-26), and combining results from all studies resulted in an overall P value for association of 6.4 x 10(-33). Across these studies, fasting glucose concentrations increased 0.01-0.16 mM with each copy of the major allele, accounting for approximately 1% of the total variation in fasting glucose. The rs563694 SNP is located between the genes glucose-6-phosphatase catalytic subunit 2 (G6PC2) and ATP-binding cassette, subfamily B (MDR/TAP), member 11 (ABCB11). Our results in combination with data reported in the literature suggest that G6PC2, a glucose-6-phosphatase almost exclusively expressed in pancreatic islet cells, may underlie variation in fasting glucose, though it is possible that ABCB11, which is expressed primarily in liver, may also contribute to such variation.
Subject(s)
ATP-Binding Cassette Transporters/genetics , Blood Glucose/analysis , Glucose-6-Phosphatase/genetics , Polymorphism, Single Nucleotide , ATP Binding Cassette Transporter, Subfamily B, Member 11 , Adult , Aged , Analysis of Variance , Fasting/blood , Finland , Follow-Up Studies , Gene Frequency , Genotype , Humans , Italy , Linkage Disequilibrium , Middle Aged , White People/geneticsABSTRACT
Single-cell RNA sequencing (scRNA-seq) of human primary tissues is a rapidly emerging tool for investigating human health and disease at the molecular level. However, optimal processing of solid tissues presents a number of technical and logistical challenges, especially for tissues that are only available at autopsy, which includes pancreatic islets, a tissue that is highly relevant to diabetes. To assess the possible effects of different sample preparation protocols on fresh islet samples, we performed a detailed comparison of scRNA-seq data generated with islets isolated from a human donor but processed according to four treatment strategies, including fixation and cryopreservation. We found significant and reproducible differences in the proportion of cell types identified, and more minor effects on cell-specific patterns of gene expression. Fresh islets from a second donor confirmed gene expression signatures of alpha and beta subclusters. These findings may well apply to other tissues, emphasizing the need for careful consideration when choosing processing methods, comparing results between different studies, and/or interpreting data in the context of multiple cell types from preserved tissue.
ABSTRACT
More than 120 published reports have described associations between single nucleotide polymorphisms (SNPs) and type 2 diabetes. However, multiple studies of the same variant have often been discordant. From a literature search, we identified previously reported type 2 diabetes-associated SNPs. We initially genotyped 134 SNPs on 786 index case subjects from type 2 diabetes families and 617 control subjects with normal glucose tolerance from Finland and excluded from analysis 20 SNPs in strong linkage disequilibrium (r(2) > 0.8) with another typed SNP. Of the 114 SNPs examined, we followed up the 20 most significant SNPs (P < 0.10) on an additional 384 case subjects and 366 control subjects from a population-based study in Finland. In the combined data, we replicated association (P < 0.05) for 12 SNPs: PPARG Pro12Ala and His447, KCNJ11 Glu23Lys and rs5210, TNF -857, SLC2A2 Ile110Thr, HNF1A/TCF1 rs2701175 and GE117881_360, PCK1 -232, NEUROD1 Thr45Ala, IL6 -598, and ENPP1 Lys121Gln. The replication of 12 SNPs of 114 tested was significantly greater than expected by chance under the null hypothesis of no association (P = 0.012). We observed that SNPs from genes that had three or more previous reports of association were significantly more likely to be replicated in our sample (P = 0.03), although we also replicated 4 of 58 SNPs from genes that had only one previous report of association.
Subject(s)
Chromosome Mapping , Diabetes Mellitus, Type 2/genetics , Genetic Testing , Polymorphism, Single Nucleotide , Aged , Blood Glucose/analysis , Body Mass Index , Fasting , Female , Humans , Infant , Insulin/blood , Male , Middle AgedABSTRACT
Prior reports have suggested that variants in the genes for maturity-onset diabetes of the young (MODY) may confer susceptibility to type 2 diabetes, but results have been conflicting and coverage of the MODY genes has been incomplete. To complement our previous studies of HNF4A, we examined the other five known MODY genes for association with type 2 diabetes in Finnish individuals. For each of the five genes, we selected 1) nonredundant single nucleotide polymorphisms (SNPs) (r(2)< 0.8 with other SNPs) from the HapMap database or another linkage disequilibrium map, 2) SNPs with previously reported type 2 diabetes association, and 3) nonsynonymous coding SNPs. We tested 128 SNPs for association with type 2 diabetes in 786 index cases from type 2 diabetic families and 619 normal glucose-tolerant control subjects. We followed up 35 of the most significant SNPs by genotyping them on another 384 case subjects and 366 control subjects from Finland. We also supplemented our previous HNF4A results by genotyping 12 SNPs on additional Finnish samples. After correcting for testing multiple correlated SNPs within a gene, we find evidence of type 2 diabetes association with SNPs in five of the six known MODY genes: GCK, HNF1A, HNF1B, NEUROD1, and HNF4A. Our data suggest that common variants in several MODY genes play a modest role in type 2 diabetes susceptibility.
Subject(s)
Basic Helix-Loop-Helix Transcription Factors/genetics , Diabetes Mellitus, Type 2/genetics , Hepatocyte Nuclear Factor 1-alpha/genetics , Hepatocyte Nuclear Factor 1-beta/genetics , Hepatocyte Nuclear Factor 4/genetics , Diabetes Mellitus, Type 2/etiology , Female , Finland , Glucokinase/genetics , Homeodomain Proteins/genetics , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide , Trans-Activators/geneticsABSTRACT
High blood pressure is a major risk factor for cardiovascular disease and premature death. However, there is limited knowledge on specific causal genes and pathways. To better understand the genetics of blood pressure, we genotyped 242,296 rare, low-frequency and common genetic variants in up to 192,763 individuals and used â¼155,063 samples for independent replication. We identified 30 new blood pressure- or hypertension-associated genetic regions in the general population, including 3 rare missense variants in RBM47, COL21A1 and RRAS with larger effects (>1.5 mm Hg/allele) than common variants. Multiple rare nonsense and missense variant associations were found in A2ML1, and a low-frequency nonsense variant in ENPEP was identified. Our data extend the spectrum of allelic variation underlying blood pressure traits and hypertension, provide new insights into the pathophysiology of hypertension and indicate new targets for clinical intervention.
Subject(s)
Blood Pressure/genetics , Genetic Variation , Hypertension/genetics , Genetic Predisposition to Disease , Genome-Wide Association Study , Genotype , HumansABSTRACT
Patients with established type 2 diabetes display both ß-cell dysfunction and insulin resistance. To define fundamental processes leading to the diabetic state, we examined the relationship between type 2 diabetes risk variants at 37 established susceptibility loci, and indices of proinsulin processing, insulin secretion, and insulin sensitivity. We included data from up to 58,614 nondiabetic subjects with basal measures and 17,327 with dynamic measures. We used additive genetic models with adjustment for sex, age, and BMI, followed by fixed-effects, inverse-variance meta-analyses. Cluster analyses grouped risk loci into five major categories based on their relationship to these continuous glycemic phenotypes. The first cluster (PPARG, KLF14, IRS1, GCKR) was characterized by primary effects on insulin sensitivity. The second cluster (MTNR1B, GCK) featured risk alleles associated with reduced insulin secretion and fasting hyperglycemia. ARAP1 constituted a third cluster characterized by defects in insulin processing. A fourth cluster (TCF7L2, SLC30A8, HHEX/IDE, CDKAL1, CDKN2A/2B) was defined by loci influencing insulin processing and secretion without a detectable change in fasting glucose levels. The final group contained 20 risk loci with no clear-cut associations to continuous glycemic traits. By assembling extensive data on continuous glycemic traits, we have exposed the diverse mechanisms whereby type 2 diabetes risk variants impact disease predisposition.
Subject(s)
Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease , Insulin Resistance/genetics , Insulin-Secreting Cells/metabolism , Insulin/metabolism , Quantitative Trait Loci/genetics , Alleles , Cluster Analysis , Female , Gene Frequency , Genetic Variation , Genome-Wide Association Study , Humans , Insulin Secretion , Male , Polymorphism, Single Nucleotide , Risk Factors , Transcription Factors/metabolismABSTRACT
We used an unbiased genome-wide approach to identify exonic variants segregating with diabetes in a multigenerational Finnish family. At least eight members of this family presented with diabetes with age of diagnosis ranging from 18 to 51 years and a pattern suggesting autosomal dominant inheritance. We sequenced the exomes of four affected members of this family and performed follow-up genotyping of additional affected and unaffected family members. We uncovered a novel nonsynonymous variant (p.Trp314Arg) in the Wolfram syndrome 1 (WFS1) gene that segregates completely with the diabetic phenotype. Multipoint parametric linkage analysis with 13 members of this family identified a single linkage signal with maximum logarithm of odds score 3.01 at 4p16.2-p16.1, corresponding to a region harboring the WFS1 locus. Functional studies demonstrate a role for this variant in endoplasmic reticulum stress, which is consistent with the ß-cell failure phenotype seen in mutation carriers. This represents the first compelling report of a mutation in WFS1 associated with dominantly inherited nonsyndromic adult-onset diabetes.
Subject(s)
Diabetes Mellitus/genetics , Membrane Proteins/genetics , Wolfram Syndrome/genetics , Adolescent , Adult , Endoplasmic Reticulum Stress , Exome/genetics , Female , Genes, Dominant , Genetic Linkage , Humans , Male , Middle Aged , PedigreeABSTRACT
Defining the genetic contribution of rare variants to common diseases is a major basic and clinical science challenge that could offer new insights into disease etiology and provide potential for directed gene- and pathway-based prevention and treatment. Common and rare nonsynonymous variants in the GCKR gene are associated with alterations in metabolic traits, most notably serum triglyceride levels. GCKR encodes glucokinase regulatory protein (GKRP), a predominantly nuclear protein that inhibits hepatic glucokinase (GCK) and plays a critical role in glucose homeostasis. The mode of action of rare GCKR variants remains unexplored. We identified 19 nonsynonymous GCKR variants among 800 individuals from the ClinSeq medical sequencing project. Excluding the previously described common missense variant p.Pro446Leu, all variants were rare in the cohort. Accordingly, we functionally characterized all variants to evaluate their potential phenotypic effects. Defects were observed for the majority of the rare variants after assessment of cellular localization, ability to interact with GCK, and kinetic activity of the encoded proteins. Comparing the individuals with functional rare variants to those without such variants showed associations with lipid phenotypes. Our findings suggest that, while nonsynonymous GCKR variants, excluding p.Pro446Leu, are rare in individuals of mixed European descent, the majority do affect protein function. In sum, this study utilizes computational, cell biological, and biochemical methods to present a model for interpreting the clinical significance of rare genetic variants in common disease.
Subject(s)
Adaptor Proteins, Signal Transducing/genetics , Adaptor Proteins, Signal Transducing/metabolism , Genetic Variation , Active Transport, Cell Nucleus , Aged , Amino Acid Substitution , Codon, Nonsense , Cohort Studies , Evolution, Molecular , Exons , Female , Frameshift Mutation , Glucokinase/metabolism , HeLa Cells , Humans , Kinetics , Male , Middle Aged , Mutation, Missense , Phenotype , Polymorphism, Single Nucleotide , Recombinant Proteins/genetics , Recombinant Proteins/metabolism , Triglycerides/bloodABSTRACT
A mutation in the LMNA gene is responsible for the most dramatic form of premature aging, Hutchinson-Gilford progeria syndrome (HGPS). Several recent studies have suggested that protein products of this gene might have a role in normal physiological cellular senescence. To explore further LMNA's possible role in normal aging, we genotyped 16 SNPs over a span of 75.4 kb of the LMNA gene on a sample of long-lived individuals (LLI) (US Caucasians with age ≥ 95 years, N=873) and genetically matched younger controls (N=443). We tested all common nonredundant haplotypes (frequency ≥ 0.05) based on subgroups of these 16 SNPs for association with longevity. The most significant haplotype, based on four SNPs, remained significant after adjustment for multiple testing (OR=1.56, P=2.5 × 10(-5) , multiple-testing-adjusted P=0.0045). To attempt to replicate these results, we genotyped 3619 subjects from four independent samples of LLI and control subjects from (i) the New England Centenarian Study (NECS) (N=738), (ii) the Southern Italian Centenarian Study (SICS) (N=905), (iii) France (N=1103), and (iv) the Einstein Ashkenazi Longevity Study (N= 702). We replicated the association with the most significant haplotype from our initial analysis in the NECS sample (OR=1.60, P=0.0023), but not in the other three samples (P > 0.15). In a meta-analysis combining all five samples, the best haplotype remained significantly associated with longevity after adjustment for multiple testing in the initial and follow-up samples (OR=1.18, P=7.5 × 10(-4) , multiple-testing-adjusted P=0.037). These results suggest that LMNA variants may play a role in human lifespan.