ABSTRACT
Type 2 diabetes (T2D) affects Latinos at twice the rate seen in populations of European descent. We recently identified a risk haplotype spanning SLC16A11 that explains â¼20% of the increased T2D prevalence in Mexico. Here, through genetic fine-mapping, we define a set of tightly linked variants likely to contain the causal allele(s). We show that variants on the T2D-associated haplotype have two distinct effects: (1) decreasing SLC16A11 expression in liver and (2) disrupting a key interaction with basigin, thereby reducing cell-surface localization. Both independent mechanisms reduce SLC16A11 function and suggest SLC16A11 is the causal gene at this locus. To gain insight into how SLC16A11 disruption impacts T2D risk, we demonstrate that SLC16A11 is a proton-coupled monocarboxylate transporter and that genetic perturbation of SLC16A11 induces changes in fatty acid and lipid metabolism that are associated with increased T2D risk. Our findings suggest that increasing SLC16A11 function could be therapeutically beneficial for T2D. VIDEO ABSTRACT.
Subject(s)
Diabetes Mellitus, Type 2/metabolism , Monocarboxylic Acid Transporters/genetics , Monocarboxylic Acid Transporters/metabolism , Basigin/metabolism , Cell Membrane/metabolism , Chromosomes, Human, Pair 17/metabolism , Gene Knockdown Techniques , Haplotypes , Hepatocytes/metabolism , Heterozygote , Histone Code , Humans , Liver/metabolism , Models, Molecular , Monocarboxylic Acid Transporters/chemistryABSTRACT
Protein-coding genetic variants that strongly affect disease risk can yield relevant clues to disease pathogenesis. Here we report exome-sequencing analyses of 20,791 individuals with type 2 diabetes (T2D) and 24,440 non-diabetic control participants from 5 ancestries. We identify gene-level associations of rare variants (with minor allele frequencies of less than 0.5%) in 4 genes at exome-wide significance, including a series of more than 30 SLC30A8 alleles that conveys protection against T2D, and in 12 gene sets, including those corresponding to T2D drug targets (P = 6.1 × 10-3) and candidate genes from knockout mice (P = 5.2 × 10-3). Within our study, the strongest T2D gene-level signals for rare variants explain at most 25% of the heritability of the strongest common single-variant signals, and the gene-level effect sizes of the rare variants that we observed in established T2D drug targets will require 75,000-185,000 sequenced cases to achieve exome-wide significance. We propose a method to interpret these modest rare-variant associations and to incorporate these associations into future target or gene prioritization efforts.
Subject(s)
Diabetes Mellitus, Type 2/genetics , Exome Sequencing , Exome/genetics , Animals , Case-Control Studies , Decision Support Techniques , Female , Gene Frequency , Genome-Wide Association Study , Humans , Male , Mice , Mice, KnockoutABSTRACT
AIMS/HYPOTHESIS: Characterisation of genetic variation that influences the response to glucose-lowering medications is instrumental to precision medicine for treatment of type 2 diabetes. The Study to Understand the Genetics of the Acute Response to Metformin and Glipizide in Humans (SUGAR-MGH) examined the acute response to metformin and glipizide in order to identify new pharmacogenetic associations for the response to common glucose-lowering medications in individuals at risk of type 2 diabetes. METHODS: One thousand participants at risk for type 2 diabetes from diverse ancestries underwent sequential glipizide and metformin challenges. A genome-wide association study was performed using the Illumina Multi-Ethnic Genotyping Array. Imputation was performed with the TOPMed reference panel. Multiple linear regression using an additive model tested for association between genetic variants and primary endpoints of drug response. In a more focused analysis, we evaluated the influence of 804 unique type 2 diabetes- and glycaemic trait-associated variants on SUGAR-MGH outcomes and performed colocalisation analyses to identify shared genetic signals. RESULTS: Five genome-wide significant variants were associated with metformin or glipizide response. The strongest association was between an African ancestry-specific variant (minor allele frequency [MAFAfr]=0.0283) at rs149403252 and lower fasting glucose at Visit 2 following metformin (p=1.9×10-9); carriers were found to have a 0.94 mmol/l larger decrease in fasting glucose. rs111770298, another African ancestry-specific variant (MAFAfr=0.0536), was associated with a reduced response to metformin (p=2.4×10-8), where carriers had a 0.29 mmol/l increase in fasting glucose compared with non-carriers, who experienced a 0.15 mmol/l decrease. This finding was validated in the Diabetes Prevention Program, where rs111770298 was associated with a worse glycaemic response to metformin: heterozygous carriers had an increase in HbA1c of 0.08% and non-carriers had an HbA1c increase of 0.01% after 1 year of treatment (p=3.3×10-3). We also identified associations between type 2 diabetes-associated variants and glycaemic response, including the type 2 diabetes-protective C allele of rs703972 near ZMIZ1 and increased levels of active glucagon-like peptide 1 (GLP-1) (p=1.6×10-5), supporting the role of alterations in incretin levels in type 2 diabetes pathophysiology. CONCLUSIONS/INTERPRETATION: We present a well-phenotyped, densely genotyped, multi-ancestry resource to study gene-drug interactions, uncover novel variation associated with response to common glucose-lowering medications and provide insight into mechanisms of action of type 2 diabetes-related variation. DATA AVAILABILITY: The complete summary statistics from this study are available at the Common Metabolic Diseases Knowledge Portal ( https://hugeamp.org ) and the GWAS Catalog ( www.ebi.ac.uk/gwas/ , accession IDs: GCST90269867 to GCST90269899).
Subject(s)
Diabetes Mellitus, Type 2 , Metformin , Humans , Metformin/therapeutic use , Glipizide/therapeutic use , Diabetes Mellitus, Type 2/drug therapy , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Genome-Wide Association Study , Blood Glucose/metabolism , Glucose , Genetic Variation/genetics , Hypoglycemic Agents/therapeutic useABSTRACT
AIMS/HYPOTHESIS: Type 2 diabetes is highly polygenic and influenced by multiple biological pathways. Rapid expansion in the number of type 2 diabetes loci can be leveraged to identify such pathways. METHODS: We developed a high-throughput pipeline to enable clustering of type 2 diabetes loci based on variant-trait associations. Our pipeline extracted summary statistics from genome-wide association studies (GWAS) for type 2 diabetes and related traits to generate a matrix of 323 variants × 64 trait associations and applied Bayesian non-negative matrix factorisation (bNMF) to identify genetic components of type 2 diabetes. Epigenomic enrichment analysis was performed in 28 cell types and single pancreatic cells. We generated cluster-specific polygenic scores and performed regression analysis in an independent cohort (N=25,419) to assess for clinical relevance. RESULTS: We identified ten clusters of genetic loci, recapturing the five from our prior analysis as well as novel clusters related to beta cell dysfunction, pronounced insulin secretion, and levels of alkaline phosphatase, lipoprotein A and sex hormone-binding globulin. Four clusters related to mechanisms of insulin deficiency, five to insulin resistance and one had an unclear mechanism. The clusters displayed tissue-specific epigenomic enrichment, notably with the two beta cell clusters differentially enriched in functional and stressed pancreatic beta cell states. Additionally, cluster-specific polygenic scores were differentially associated with patient clinical characteristics and outcomes. The pipeline was applied to coronary artery disease and chronic kidney disease, identifying multiple overlapping clusters with type 2 diabetes. CONCLUSIONS/INTERPRETATION: Our approach stratifies type 2 diabetes loci into physiologically interpretable genetic clusters associated with distinct tissues and clinical outcomes. The pipeline allows for efficient updating as additional GWAS become available and can be readily applied to other conditions, facilitating clinical translation of GWAS findings. Software to perform this clustering pipeline is freely available.
Subject(s)
Diabetes Mellitus, Type 2 , Humans , Diabetes Mellitus, Type 2/genetics , Genome-Wide Association Study , Genetic Predisposition to Disease/genetics , Bayes Theorem , Cluster Analysis , Polymorphism, Single NucleotideABSTRACT
The burden of several common diseases including obesity, diabetes, hypertension, asthma, and depression is increasing in most world populations. However, the mechanisms underlying the numerous epidemiological and genetic correlations among these disorders remain largely unknown. We investigated whether common polymorphic inversions underlie the shared genetic influence of these disorders. We performed an inversion association analysis including 21 inversions and 25 obesity-related traits on a total of 408,898 Europeans and validated the results in 67,299 independent individuals. Seven inversions were associated with multiple diseases while inversions at 8p23.1, 16p11.2, and 11q13.2 were strongly associated with the co-occurrence of obesity with other common diseases. Transcriptome analysis across numerous tissues revealed strong candidate genes for obesity-related traits. Analyses in human pancreatic islets indicated the potential mechanism of inversions in the susceptibility of diabetes by disrupting the cis-regulatory effect of SNPs from their target genes. Our data underscore the role of inversions as major genetic contributors to the joint susceptibility to common complex diseases.
Subject(s)
Chromosome Inversion/genetics , Diabetes Mellitus/genetics , Genetic Predisposition to Disease , Hypertension/genetics , Obesity/complications , Obesity/genetics , Polymorphism, Genetic , Adolescent , Adult , Aged , Aged, 80 and over , Alleles , Chromosomes, Human, Pair 16/genetics , Chromosomes, Human, Pair 8/genetics , Datasets as Topic/standards , Diabetes Mellitus/pathology , Europe/ethnology , Female , Gene Expression Profiling , Haplotypes , Humans , Hypertension/complications , Islets of Langerhans/metabolism , Islets of Langerhans/pathology , Male , Middle Aged , Polymorphism, Single Nucleotide/genetics , Reproducibility of Results , Young AdultABSTRACT
Metformin is the first-line treatment for type 2 diabetes (T2D) in youth but with limited sustained glycemic response. To identify common variants associated with metformin response, we used a genome-wide approach in 506 youth from the Treatment Options for Type 2 Diabetes in Adolescents and Youth (TODAY) study and examined the relationship between T2D partitioned polygenic scores (pPS), glycemic traits, and metformin response in these youth. Several variants met a suggestive threshold (P < 1 × 10-6), though none including published adult variants reached genome-wide significance. We pursued replication of top nine variants in three cohorts, and rs76195229 in ATRNL1 was associated with worse metformin response in the Metformin Genetics Consortium (n = 7,812), though statistically not being significant after Bonferroni correction (P = 0.06). A higher ß-cell pPS was associated with a lower insulinogenic index (P = 0.02) and C-peptide (P = 0.047) at baseline and higher pPS related to two insulin resistance processes were associated with increased C-peptide at baseline (P = 0.04,0.02). Although pPS were not associated with changes in glycemic traits or metformin response, our results indicate a trend in the association of the ß-cell pPS with reduced ß-cell function over time. Our data show initial evidence for genetic variation associated with metformin response in youth with T2D.
Subject(s)
Diabetes Mellitus, Type 2 , Metformin , Adult , Humans , Adolescent , Metformin/therapeutic use , Diabetes Mellitus, Type 2/drug therapy , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/complications , C-Peptide , Treatment Failure , Genetic Variation , Blood Glucose , Hypoglycemic Agents/therapeutic useABSTRACT
BACKGROUND: The high heterogeneity in the symptoms and severity of COVID-19 makes it challenging to identify high-risk patients early in the disease. Cardiometabolic comorbidities have shown strong associations with COVID-19 severity in epidemiologic studies. Cardiometabolic protein biomarkers, therefore, may provide predictive insight regarding which patients are most susceptible to severe illness from COVID-19. METHODS: In plasma samples collected from 343 patients hospitalized with COVID-19 during the first wave of the pandemic, we measured 92 circulating protein biomarkers previously implicated in cardiometabolic disease. We performed proteomic analysis and developed predictive models for severe outcomes. We then used these models to predict the outcomes of out-of-sample patients hospitalized with COVID-19 later in the surge (N = 194). RESULTS: We identified a set of seven protein biomarkers predictive of admission to the intensive care unit and/or death (ICU/death) within 28 days of presentation to care. Two of the biomarkers, ADAMTS13 and VEGFD, were associated with a lower risk of ICU/death. The remaining biomarkers, ACE2, IL-1RA, IL6, KIM1, and CTSL1, were associated with higher risk. When used to predict the outcomes of the future, out-of-sample patients, the predictive models built with these protein biomarkers outperformed all models built from standard clinical data, including known COVID-19 risk factors. CONCLUSIONS: These findings suggest that proteomic profiling can inform the early clinical impression of a patient's likelihood of developing severe COVID-19 outcomes and, ultimately, accelerate the recognition and treatment of high-risk patients.
Subject(s)
COVID-19 , Cardiovascular Diseases , Biomarkers , Cardiovascular Diseases/diagnosis , Humans , Proteomics , SARS-CoV-2ABSTRACT
Birth weight (BW) has been shown to be influenced by both fetal and maternal factors and in observational studies is reproducibly associated with future risk of adult metabolic diseases including type 2 diabetes (T2D) and cardiovascular disease. These life-course associations have often been attributed to the impact of an adverse early life environment. Here, we performed a multi-ancestry genome-wide association study (GWAS) meta-analysis of BW in 153,781 individuals, identifying 60 loci where fetal genotype was associated with BW (P < 5 × 10-8). Overall, approximately 15% of variance in BW was captured by assays of fetal genetic variation. Using genetic association alone, we found strong inverse genetic correlations between BW and systolic blood pressure (Rg = -0.22, P = 5.5 × 10-13), T2D (Rg = -0.27, P = 1.1 × 10-6) and coronary artery disease (Rg = -0.30, P = 6.5 × 10-9). In addition, using large -cohort datasets, we demonstrated that genetic factors were the major contributor to the negative covariance between BW and future cardiometabolic risk. Pathway analyses indicated that the protein products of genes within BW-associated regions were enriched for diverse processes including insulin signalling, glucose homeostasis, glycogen biosynthesis and chromatin remodelling. There was also enrichment of associations with BW in known imprinted regions (P = 1.9 × 10-4). We demonstrate that life-course associations between early growth phenotypes and adult cardiometabolic disease are in part the result of shared genetic effects and identify some of the pathways through which these causal genetic effects are mediated.
Subject(s)
Aging/genetics , Birth Weight/genetics , Coronary Artery Disease/genetics , Diabetes Mellitus, Type 2/genetics , Fetus/metabolism , Genetic Predisposition to Disease , Genome-Wide Association Study , Adult , Anthropometry , Blood Pressure/genetics , Chromatin Assembly and Disassembly , Cohort Studies , Datasets as Topic , Female , Genetic Loci/genetics , Genetic Variation/genetics , Genomic Imprinting/genetics , Genotype , Glucose/metabolism , Glycogen/biosynthesis , Humans , Insulin/metabolism , Male , Phenotype , Signal TransductionABSTRACT
BACKGROUND: Epidemiological studies report associations of diverse cardiometabolic conditions including obesity with COVID-19 illness, but causality has not been established. We sought to evaluate the associations of 17 cardiometabolic traits with COVID-19 susceptibility and severity using 2-sample Mendelian randomization (MR) analyses. METHODS AND FINDINGS: We selected genetic variants associated with each exposure, including body mass index (BMI), at p < 5 × 10-8 from genome-wide association studies (GWASs). We then calculated inverse-variance-weighted averages of variant-specific estimates using summary statistics for susceptibility and severity from the COVID-19 Host Genetics Initiative GWAS meta-analyses of population-based cohorts and hospital registries comprising individuals with self-reported or genetically inferred European ancestry. Susceptibility was defined as testing positive for COVID-19 and severity was defined as hospitalization with COVID-19 versus population controls (anyone not a case in contributing cohorts). We repeated the analysis for BMI with effect estimates from the UK Biobank and performed pairwise multivariable MR to estimate the direct effects and indirect effects of BMI through obesity-related cardiometabolic diseases. Using p < 0.05/34 tests = 0.0015 to declare statistical significance, we found a nonsignificant association of genetically higher BMI with testing positive for COVID-19 (14,134 COVID-19 cases/1,284,876 controls, p = 0.002; UK Biobank: odds ratio 1.06 [95% CI 1.02, 1.10] per kg/m2; p = 0.004]) and a statistically significant association with higher risk of COVID-19 hospitalization (6,406 hospitalized COVID-19 cases/902,088 controls, p = 4.3 × 10-5; UK Biobank: odds ratio 1.14 [95% CI 1.07, 1.21] per kg/m2, p = 2.1 × 10-5). The implied direct effect of BMI was abolished upon conditioning on the effect on type 2 diabetes, coronary artery disease, stroke, and chronic kidney disease. No other cardiometabolic exposures tested were associated with a higher risk of poorer COVID-19 outcomes. Small study samples and weak genetic instruments could have limited the detection of modest associations, and pleiotropy may have biased effect estimates away from the null. CONCLUSIONS: In this study, we found genetic evidence to support higher BMI as a causal risk factor for COVID-19 susceptibility and severity. These results raise the possibility that obesity could amplify COVID-19 disease burden independently or through its cardiometabolic consequences and suggest that targeting obesity may be a strategy to reduce the risk of severe COVID-19 outcomes.
Subject(s)
Body Mass Index , COVID-19 , Coronary Artery Disease , Diabetes Mellitus, Type 2 , Disease Susceptibility , Obesity , Renal Insufficiency, Chronic , Stroke , COVID-19/diagnosis , COVID-19/epidemiology , COVID-19/genetics , Cardiometabolic Risk Factors , Causality , Coronary Artery Disease/epidemiology , Coronary Artery Disease/genetics , Diabetes Mellitus, Type 2/epidemiology , Diabetes Mellitus, Type 2/genetics , Genetic Variation , Genome-Wide Association Study/statistics & numerical data , Humans , Mendelian Randomization Analysis , Meta-Analysis as Topic , Obesity/diagnosis , Obesity/epidemiology , Obesity/metabolism , Renal Insufficiency, Chronic/epidemiology , Renal Insufficiency, Chronic/genetics , SARS-CoV-2 , Severity of Illness Index , Stroke/epidemiology , Stroke/geneticsABSTRACT
There is a limited understanding about the impact of rare protein-truncating variants across multiple phenotypes. We explore the impact of this class of variants on 13 quantitative traits and 10 diseases using whole-exome sequencing data from 100,296 individuals. Protein-truncating variants in genes intolerant to this class of mutations increased risk of autism, schizophrenia, bipolar disorder, intellectual disability, and ADHD. In individuals without these disorders, there was an association with shorter height, lower education, increased hospitalization, and reduced age at enrollment. Gene sets implicated from GWASs did not show a significant protein-truncating variants burden beyond what was captured by established Mendelian genes. In conclusion, we provide a thorough investigation of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.
Subject(s)
Mutation/genetics , Open Reading Frames/genetics , Databases, Genetic , Ethnicity/genetics , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Phenotype , Proteins/geneticsABSTRACT
Cytokinetic abscission facilitates the irreversible separation of daughter cells. This process requires the endosomal-sorting complexes required for transport (ESCRT) machinery and is tightly regulated by charged multivesicular body protein 4C (CHMP4C), an ESCRT-III subunit that engages the abscission checkpoint (NoCut) in response to mitotic problems such as persisting chromatin bridges within the midbody. Importantly, a human polymorphism in CHMP4C (rs35094336, CHMP4CT232) increases cancer susceptibility. Here, we explain the structural and functional basis for this cancer association: The CHMP4CT232 allele unwinds the C-terminal helix of CHMP4C, impairs binding to the early-acting ESCRT factor ALIX, and disrupts the abscission checkpoint. Cells expressing CHMP4CT232 exhibit increased levels of DNA damage and are sensitized to several conditions that increase chromosome missegregation, including DNA replication stress, inhibition of the mitotic checkpoint, and loss of p53. Our data demonstrate the biological importance of the abscission checkpoint and suggest that dysregulation of abscission by CHMP4CT232 may synergize with oncogene-induced mitotic stress to promote genomic instability and tumorigenesis.
Subject(s)
Cell Cycle Checkpoints/genetics , Endosomal Sorting Complexes Required for Transport/genetics , Genetic Predisposition to Disease/genetics , Genomic Instability/genetics , Neoplasms/genetics , Calcium-Binding Proteins/metabolism , Carcinogenesis/genetics , Cell Cycle Proteins/metabolism , Cell Line, Tumor , Chromatin/metabolism , Crystallography, X-Ray , DNA Damage/genetics , DNA Replication/genetics , Endosomal Sorting Complexes Required for Transport/metabolism , Humans , Mitosis/genetics , Phosphorylation , Polymorphism, Genetic , RNA, Small Interfering/metabolism , Tumor Suppressor Protein p53/genetics , Tumor Suppressor Protein p53/metabolismABSTRACT
BACKGROUND: Obesity and its associated diseases are major health problems characterized by extensive metabolic disturbances. Understanding the causal connections between these phenotypes and variation in metabolite levels can uncover relevant biology and inform novel intervention strategies. Recent studies have combined metabolite profiling with genetic instrumental variable (IV) analysis (Mendelian randomization) to infer the direction of causality between metabolites and obesity, but often omitted a large portion of untargeted profiling data consisting of unknown, unidentified metabolite signals. METHODS: We expanded upon previous research by identifying body mass index (BMI)-associated metabolites in multiple untargeted metabolomics datasets, and then performing bidirectional IV analysis to classify metabolites based on their inferred causal relationships with BMI. Meta-analysis and pathway analysis of both known and unknown metabolites across datasets were enabled by our recently developed bioinformatics suite, PAIRUP-MS. RESULTS: We identified ten known metabolites that are more likely to be causes (e.g., alpha-hydroxybutyrate) or effects (e.g., valine) of BMI, or may have more complex bidirectional cause-effect relationships with BMI (e.g., glycine). Importantly, we also identified about five times more unknown than known metabolites in each of these three categories. Pathway analysis incorporating both known and unknown metabolites prioritized 40 enriched (p < 0.05) metabolite sets for the cause versus effect groups, providing further support that these two metabolite groups are linked to obesity via distinct biological mechanisms. CONCLUSIONS: These findings demonstrate the potential utility of our approach to uncover causal connections with obesity from untargeted metabolomics datasets. Combining genetically informed causal inference with the ability to map unknown metabolites across datasets provides a path to jointly analyze many untargeted datasets with obesity or other phenotypes. This approach, applied to larger datasets with genotype and untargeted metabolite data, should generate sufficient power for robust discovery and replication of causal biological connections between metabolites and various human diseases.
Subject(s)
Metabolome , Obesity/metabolism , Body Mass Index , Causality , Computational Biology , Humans , Metabolomics , Obesity/geneticsABSTRACT
During adipogenesis, preadipocytes' cytoskeleton reorganizes in parallel with lipid accumulation. Failure to do so may impact the ability of adipose tissue (AT) to shift between lipid storage and mobilization. Here, we identify cytoskeletal transgelin 2 (TAGLN2) as a protein expressed in AT and associated with obesity and inflammation, being normalized upon weight loss. TAGLN2 was primarily found in the adipose stromovascular cell fraction, but inflammation, TGF-ß, and estradiol also prompted increased expression in human adipocytes. Tagln2 knockdown revealed a key functional role, being required for proliferation and differentiation of fat cells, whereas transgenic mice overexpressing Tagln2 using the adipocyte protein 2 promoter disclosed remarkable sex-dependent variations, in which females displayed "healthy" obesity and hypertrophied adipocytes but preserved insulin sensitivity, and males exhibited physiologic changes suggestive of defective AT expandability, including increased number of small adipocytes, activation of immune cells, mitochondrial dysfunction, and impaired metabolism together with decreased insulin sensitivity. The metabolic relevance and sexual dimorphism of TAGLN2 was also outlined by genetic variants that may modulate its expression and are associated with obesity and the risk of ischemic heart disease in men. Collectively, current findings highlight the contribution of cytoskeletal TAGLN2 to the obese phenotype in a gender-dependent manner.-Ortega, F. J., Moreno-Navarrete, J. M., Mercader, J. M., Gómez-Serrano, M., García-Santos, E., Latorre, J., Lluch, A., Sabater, M., Caballano-Infantes, E., Guzmán, R., Macías-González, M., Buxo, M., Gironés, J., Vilallonga, R., Naon, D., Botas, P., Delgado, E., Corella, D., Burcelin, R., Frühbeck, G., Ricart, W., Simó, R., Castrillon-Rodríguez, I., Tinahones, F. J., Bosch, F., Vidal-Puig, A., Malagón, M. M., Peral, B., Zorzano, A., Fernández-Real, J. M. Cytoskeletal transgelin 2 contributes to gender-dependent adipose tissue expandability and immune function.
Subject(s)
Adipose Tissue/immunology , Adipose Tissue/metabolism , Diet, High-Fat/adverse effects , Microfilament Proteins/metabolism , Muscle Proteins/metabolism , Obesity/immunology , Obesity/metabolism , Animals , Blotting, Western , Cytoskeleton/metabolism , Female , Humans , Immunohistochemistry , Male , Mice , Mice, Inbred C57BL , Mice, Transgenic , Microfilament Proteins/genetics , Muscle Proteins/genetics , Obesity/etiology , Sex Factors , THP-1 CellsABSTRACT
Metabolomics is a powerful approach for discovering biomarkers and for characterizing the biochemical consequences of genetic variation. While untargeted metabolite profiling can measure thousands of signals in a single experiment, many biologically meaningful signals cannot be readily identified as known metabolites nor compared across datasets, making it difficult to infer biology and to conduct well-powered meta-analyses across studies. To overcome these challenges, we developed a suite of computational methods, PAIRUP-MS, to match metabolite signals across mass spectrometry-based profiling datasets and to generate metabolic pathway annotations for these signals. To pair up signals measured in different datasets, where retention times (RT) are often not comparable or even available, we implemented an imputation-based approach that only requires mass-to-charge ratios (m/z). As validation, we treated each shared known metabolite as an unmatched signal and showed that PAIRUP-MS correctly matched 70-88% of these metabolites from among thousands of signals, equaling or outperforming a standard m/z- and RT-based approach. We performed further validation using genetic data: the most stringent set of matched signals and shared knowns showed comparable consistency of genetic associations across datasets. Next, we developed a pathway reconstitution method to annotate unknown signals using curated metabolic pathways containing known metabolites. We performed genetic validation for the generated annotations, showing that annotated signals associated with gene variants were more likely to be enriched for pathways functionally related to the genes compared to random expectation. Finally, we applied PAIRUP-MS to study associations between metabolites and genetic variants or body mass index (BMI) across multiple datasets, identifying up to ~6 times more significant signals and many more BMI-associated pathways compared to the standard practice of only analyzing known metabolites. These results demonstrate that PAIRUP-MS enables analysis of unknown signals in a robust, biologically meaningful manner and provides a path to more comprehensive, well-powered studies of untargeted metabolomics data.
Subject(s)
Computational Biology/methods , Mass Spectrometry/methods , Metabolome , Metabolomics/methods , Aged , Aged, 80 and over , Biomarkers/analysis , Biomarkers/metabolism , Databases, Factual , Humans , Metabolic Networks and Pathways/physiology , Metabolome/genetics , Metabolome/physiologyABSTRACT
BACKGROUND: Type 2 diabetes (T2D) is a heterogeneous disease for which (1) disease-causing pathways are incompletely understood and (2) subclassification may improve patient management. Unlike other biomarkers, germline genetic markers do not change with disease progression or treatment. In this paper, we test whether a germline genetic approach informed by physiology can be used to deconstruct T2D heterogeneity. First, we aimed to categorize genetic loci into groups representing likely disease mechanistic pathways. Second, we asked whether the novel clusters of genetic loci we identified have any broad clinical consequence, as assessed in four separate subsets of individuals with T2D. METHODS AND FINDINGS: In an effort to identify mechanistic pathways driven by established T2D genetic loci, we applied Bayesian nonnegative matrix factorization (bNMF) clustering to genome-wide association study (GWAS) results for 94 independent T2D genetic variants and 47 diabetes-related traits. We identified five robust clusters of T2D loci and traits, each with distinct tissue-specific enhancer enrichment based on analysis of epigenomic data from 28 cell types. Two clusters contained variant-trait associations indicative of reduced beta cell function, differing from each other by high versus low proinsulin levels. The three other clusters displayed features of insulin resistance: obesity mediated (high body mass index [BMI] and waist circumference [WC]), "lipodystrophy-like" fat distribution (low BMI, adiponectin, and high-density lipoprotein [HDL] cholesterol, and high triglycerides), and disrupted liver lipid metabolism (low triglycerides). Increased cluster genetic risk scores were associated with distinct clinical outcomes, including increased blood pressure, coronary artery disease (CAD), and stroke. We evaluated the potential for clinical impact of these clusters in four studies containing individuals with T2D (Metabolic Syndrome in Men Study [METSIM], N = 487; Ashkenazi, N = 509; Partners Biobank, N = 2,065; UK Biobank [UKBB], N = 14,813). Individuals with T2D in the top genetic risk score decile for each cluster reproducibly exhibited the predicted cluster-associated phenotypes, with approximately 30% of all individuals assigned to just one cluster top decile. Limitations of this study include that the genetic variants used in the cluster analysis were restricted to those associated with T2D in populations of European ancestry. CONCLUSION: Our approach identifies salient T2D genetically anchored and physiologically informed pathways, and supports the use of genetics to deconstruct T2D heterogeneity. Classification of patients by these genetic pathways may offer a step toward genetically informed T2D patient management.
Subject(s)
Diabetes Mellitus, Type 2/classification , Diabetes Mellitus, Type 2/genetics , Genetic Loci , Multigene Family , Algorithms , Bayes Theorem , Cluster Analysis , Cohort Studies , Cross-Sectional Studies , Databases, Genetic , Female , Founder Effect , Genetic Markers , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Insulin/deficiency , Insulin/genetics , Insulin Resistance/genetics , Male , Phenotype , Prospective Studies , Risk FactorsABSTRACT
Type 2 Diabetes (T2D) is a highly prevalent chronic metabolic disease with strong co-morbidity with obesity and cardiovascular diseases. There is growing evidence supporting the notion that a crosstalk between mitochondria and the insulin signaling cascade could be involved in the etiology of T2D and insulin resistance. In this study we investigated the molecular basis of this crosstalk by using systems biology approaches. We combined, filtered, and interrogated different types of functional interaction data, such as direct protein-protein interactions, co-expression analyses, and metabolic and signaling dependencies. As a result, we constructed the mitochondria-insulin (MITIN) network, which highlights 286 genes as candidate functional linkers between these two systems. The results of internal gene expression analysis of three independent experimental models of mitochondria and insulin signaling perturbations further support the connecting roles of these genes. In addition, we further assessed whether these genes are involved in the etiology of T2D using the genome-wide association study meta-analysis from the DIAGRAM consortium, involving 8,130 T2D cases and 38,987 controls. We found modest enrichment of genes associated with T2D amongst our linker genes (pâ=â0.0549), including three already validated T2D SNPs and 15 additional SNPs, which, when combined, were collectively associated to increased fasting glucose levels according to MAGIC genome wide meta-analysis (pâ=â8.12×10(-5)). This study highlights the potential of combining systems biology, experimental, and genome-wide association data mining for identifying novel genes and related variants that increase vulnerability to complex diseases.
Subject(s)
Diabetes Mellitus, Type 2 , Genome-Wide Association Study , Insulin Resistance/genetics , Mitochondria , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Gene Expression Regulation , Genetic Predisposition to Disease , Glucose/metabolism , Humans , Insulin/genetics , Insulin/metabolism , Metabolic Networks and Pathways , Mitochondria/genetics , Mitochondria/metabolism , Obesity/genetics , Polymorphism, Single Nucleotide , Systems BiologyABSTRACT
BACKGROUND: It has been shown in a number of metagenomic studies that the addition and removal of specific genes have allowed microbiomes to adapt to specific environmental conditions by losing and gaining specific functions. But it is not known whether and how the regulation of gene expression also contributes to adaptation. RESULTS: We have here characterized and analyzed the metaregulome of three different environments, as well as their impact in the adaptation to particular variable physico-chemical conditions. For this, we have developed a computational protocol to extract regulatory regions and their corresponding transcription factors binding sites directly from metagenomic reads and applied it to three well known environments: Acid Mine, Whale Fall, and Waseca Farm. Taking the density of regulatory sites in promoters as a measure of the potential and complexity of gene regulation, we found it to be quantitatively the same in all three environments, despite their different physico-chemical conditions and species composition. However, we found that each environment distributes their regulatory potential differently across their functional space. Among the functions with highest regulatory potential in each niche, we found significant enrichment of processes related to sensing and buffering external variable factors specific to each environment, like for example, the availability of co-factors in deep sea, of oligosaccharides in soil and the regulation of pH in the acid mine. CONCLUSIONS: These results highlight the potential impact of gene regulation in the adaptation of bacteria to the different habitats through the distribution of their regulatory potential among specific functions, and point to critical environmental factors that challenge the growth of any microbial community.
Subject(s)
Bacteria/genetics , Metagenome , Adaptation, Physiological/genetics , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Binding Sites , DNA, Bacterial/chemistry , DNA, Bacterial/metabolism , Ecosystem , Promoter Regions, Genetic , Protein Binding , Regulatory Sequences, Nucleic Acid/genetics , Soil Microbiology , Transcription Factors/chemistry , Transcription Factors/genetics , Transcription Factors/metabolism , Wastewater/microbiology , Water MicrobiologyABSTRACT
IMPORTANCE: Latino populations have one of the highest prevalences of type 2 diabetes worldwide. OBJECTIVES: To investigate the association between rare protein-coding genetic variants and prevalence of type 2 diabetes in a large Latino population and to explore potential molecular and physiological mechanisms for the observed relationships. DESIGN, SETTING, AND PARTICIPANTS: Whole-exome sequencing was performed on DNA samples from 3756 Mexican and US Latino individuals (1794 with type 2 diabetes and 1962 without diabetes) recruited from 1993 to 2013. One variant was further tested for allele frequency and association with type 2 diabetes in large multiethnic data sets of 14,276 participants and characterized in experimental assays. MAIN OUTCOME AND MEASURES: Prevalence of type 2 diabetes. Secondary outcomes included age of onset, body mass index, and effect on protein function. RESULTS: A single rare missense variant (c.1522G>A [p.E508K]) was associated with type 2 diabetes prevalence (odds ratio [OR], 5.48; 95% CI, 2.83-10.61; P = 4.4 × 10(-7)) in hepatocyte nuclear factor 1-α (HNF1A), the gene responsible for maturity onset diabetes of the young type 3 (MODY3). This variant was observed in 0.36% of participants without type 2 diabetes and 2.1% of participants with it. In multiethnic replication data sets, the p.E508K variant was seen only in Latino patients (n = 1443 with type 2 diabetes and 1673 without it) and was associated with type 2 diabetes (OR, 4.16; 95% CI, 1.75-9.92; P = .0013). In experimental assays, HNF-1A protein encoding the p.E508K mutant demonstrated reduced transactivation activity of its target promoter compared with a wild-type protein. In our data, carriers and noncarriers of the p.E508K mutation with type 2 diabetes had no significant differences in compared clinical characteristics, including age at onset. The mean (SD) age for carriers was 45.3 years (11.2) vs 47.5 years (11.5) for noncarriers (P = .49) and the mean (SD) BMI for carriers was 28.2 (5.5) vs 29.3 (5.3) for noncarriers (P = .19). CONCLUSIONS AND RELEVANCE: Using whole-exome sequencing, we identified a single low-frequency variant in the MODY3-causing gene HNF1A that is associated with type 2 diabetes in Latino populations and may affect protein function. This finding may have implications for screening and therapeutic modification in this population, but additional studies are required.
Subject(s)
Diabetes Mellitus, Type 2/genetics , Hepatocyte Nuclear Factor 1-alpha/genetics , Adult , Age of Onset , Aged , Female , Genotype , Hispanic or Latino/genetics , Humans , Male , Mexico , Middle Aged , Mutation, Missense , Sequence Analysis, DNA , United StatesABSTRACT
OBJECTIVE: Individuals with diabetes who carry genetic variants that lower hemoglobin A1c (HbA1c) independently of glycemia may have higher real, but undetected, hyperglycemia compared with those without these variants despite achieving similar HbA1c targets, potentially placing them at greater risk for diabetes-related complications. We sought to determine whether these genetic variants, aggregated in a polygenic score, and the large-effect African ancestry-specific missense variant in G6PD (rs1050828) that lower HbA1c were associated with higher retinopathy risk. RESEARCH DESIGN AND METHODS: Using data from 29,828 type 2 diabetes cases of genetically inferred African American/African British and European ancestries, we calculated ancestry-specific nonglycemic HbA1c polygenic scores (ngA1cPS) composed of 122 variants associated with HbA1c at genome-wide significance, but not with glucose. We tested the association of the ngA1cPS and the G6PD variant with retinopathy, adjusting for measured HbA1c and retinopathy risk factors. RESULTS: Participants in the bottom quintile of the ngA1cPS showed between 20% and 50% higher retinopathy prevalence, compared with those above this quintile, despite similar levels of measured HbA1c. The adjusted meta-analytic odds ratio for the bottom quintile was 1.31 (95% CI 1.0, 1.73; P = 0.05) in African ancestry and 1.31 (95% CI 1.15, 1.50; P = 6.5 × 10-5) in European ancestry. Among individuals of African ancestry with HbA1c below 7%, retinopathy prevalence was higher in individuals below, compared with above, the 50th percentile of the ngA1cPS regardless of sex or G6PD carrier status. CONCLUSIONS: Genetic effects need to be considered to personalize HbA1c targets and improve outcomes of people with diabetes from diverse ancestries.
Subject(s)
Diabetic Retinopathy , Glycated Hemoglobin , Adult , Aged , Female , Humans , Male , Middle Aged , Black or African American/genetics , Black People/genetics , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/epidemiology , Diabetic Retinopathy/genetics , Diabetic Retinopathy/epidemiology , Diabetic Retinopathy/ethnology , Glycated Hemoglobin/metabolism , Glycated Hemoglobin/analysis , Risk Factors , White/geneticsABSTRACT
BACKGROUND: The clinical utility of genetic information for type 2 diabetes (T2D) prediction with polygenic scores (PGS) in ancestrally diverse, real-world US healthcare systems is unclear, especially for those at low clinical phenotypic risk for T2D. METHODS: We tested the association of PGS with T2D incidence in patients followed within a primary care practice network over 16 years in four hypothetical scenarios that varied by clinical data availability (N = 14,712): (1) age and sex; (2) age, sex, body mass index (BMI), systolic blood pressure, and family history of T2D; (3) all variables in (2) and random glucose; and (4) all variables in (3), HDL, total cholesterol, and triglycerides, combined in a clinical risk score (CRS). To determine whether genetic effects differed by baseline clinical risk, we tested for interaction with the CRS. RESULTS: PGS was associated with incident T2D in all models. Adjusting for age and sex only, the Hazard Ratio (HR) per PGS standard deviation (SD) was 1.76 (95% CI 1.68, 1.84) and the HR of top 5% of PGS vs interquartile range (IQR) was 2.80 (2.39, 3.28). Adjusting for the CRS, the HR per SD was 1.48 (1.40, 1.57) and HR of the top 5% of PGS vs IQR was 2.09 (1.72, 2.55). Genetic effects differed by baseline clinical risk ((PGS-CRS interaction p = 0.05; CRS below the median: HR 1.60 (1.43, 1.79); CRS above the median: HR 1.45 (1.35, 1.55)). CONCLUSIONS: Genetic information can help identify high-risk patients even among those perceived to be low risk in a clinical evaluation.