RESUMO
Mosaic loss of chromosome Y (LOY) in circulating white blood cells is the most common form of clonal mosaicism1-5, yet our knowledge of the causes and consequences of this is limited. Here, using a computational approach, we estimate that 20% of the male population represented in the UK Biobank study (n = 205,011) has detectable LOY. We identify 156 autosomal genetic determinants of LOY, which we replicate in 757,114 men of European and Japanese ancestry. These loci highlight genes that are involved in cell-cycle regulation and cancer susceptibility, as well as somatic drivers of tumour growth and targets of cancer therapy. We demonstrate that genetic susceptibility to LOY is associated with non-haematological effects on health in both men and women, which supports the hypothesis that clonal haematopoiesis is a biomarker of genomic instability in other tissues. Single-cell RNA sequencing identifies dysregulated expression of autosomal genes in leukocytes with LOY and provides insights into why clonal expansion of these cells may occur. Collectively, these data highlight the value of studying clonal mosaicism to uncover fundamental mechanisms that underlie cancer and other ageing-related diseases.
Assuntos
Deleção Cromossômica , Cromossomos Humanos Y/genética , Predisposição Genética para Doença/genética , Instabilidade Genômica/genética , Leucócitos/patologia , Mosaicismo , Adulto , Idoso , Biologia Computacional , Bases de Dados Genéticas , Feminino , Marcadores Genéticos/genética , Humanos , Masculino , Pessoa de Meia-Idade , Neoplasias/genética , Reino UnidoRESUMO
AIMS/HYPOTHESIS: The identification of people who are at high risk of developing type 2 diabetes is a key part of population-level prevention strategies. Previous studies have evaluated the predictive utility of omics measurements, such as metabolites, proteins or polygenic scores, but have considered these separately. The improvement that combined omics biomarkers can provide over and above current clinical standard models is unclear. The aim of this study was to test the predictive performance of genome, proteome, metabolome and clinical biomarkers when added to established clinical prediction models for type 2 diabetes. METHODS: We developed sparse interpretable prediction models in a prospective, nested type 2 diabetes case-cohort study (N=1105, incident type 2 diabetes cases=375) with 10,792 person-years of follow-up, selecting from 5759 features across the genome, proteome, metabolome and clinical biomarkers using least absolute shrinkage and selection operator (LASSO) regression. We compared the predictive performance of omics-derived predictors with a clinical model including the variables from the Cambridge Diabetes Risk Score and HbA1c. RESULTS: Among single omics prediction models that did not include clinical risk factors, the top ten proteins alone achieved the highest performance (concordance index [C index]=0.82 [95% CI 0.75, 0.88]), suggesting the proteome as the most informative single omic layer in the absence of clinical information. However, the largest improvement in prediction of type 2 diabetes incidence over and above the clinical model was achieved by the top ten features across several omic layers (C index=0.87 [95% CI 0.82, 0.92], Δ C index=0.05, p=0.045). This improvement by the top ten omic features was also evident in individuals with HbA1c <42 mmol/mol (6.0%), the threshold for prediabetes (C index=0.84 [95% CI 0.77, 0.90], Δ C index=0.07, p=0.03), the group in whom prediction would be most useful since they are not targeted for preventative interventions by current clinical guidelines. In this subgroup, the type 2 diabetes polygenic risk score was the major contributor to the improvement in prediction, and achieved a comparable improvement in performance when added onto the clinical model alone (C index=0.83 [95% CI 0.75, 0.90], Δ C index=0.06, p=0.002). However, compared with those with prediabetes, individuals at high polygenic risk in this group had only around half the absolute risk for type 2 diabetes over a 20 year period. CONCLUSIONS/INTERPRETATION: Omic approaches provided marginal improvements in prediction of incident type 2 diabetes. However, while a polygenic risk score does improve prediction in people with an HbA1c in the normoglycaemic range, the group in whom prediction would be most useful, even individuals with a high polygenic burden in that subgroup had a low absolute type 2 diabetes risk. This suggests a limited feasibility of implementing targeted population-based genetic screening for preventative interventions.
Assuntos
Diabetes Mellitus Tipo 2 , Estado Pré-Diabético , Humanos , Diabetes Mellitus Tipo 2/diagnóstico , Diabetes Mellitus Tipo 2/epidemiologia , Diabetes Mellitus Tipo 2/genética , Estado Pré-Diabético/complicações , Estudos Prospectivos , Estudos de Coortes , Proteoma , Multiômica , Fatores de Risco , BiomarcadoresRESUMO
Several genetic discoveries robustly implicate five single-nucleotide variants in the progression of non-alcoholic fatty liver disease to non-alcoholic steatohepatitis and fibrosis (NASH-fibrosis), including a recently identified variant in MTARC1. To better understand these variants as potential therapeutic targets, we aimed to characterize their impact on metabolism using comprehensive metabolomics data from two population-based studies. A total of 9135 participants from the Fenland study and 9902 participants from the EPIC-Norfolk cohort were included in the study. We identified individuals with risk alleles associated with NASH-fibrosis: rs738409C>G in PNPLA3, rs58542926C>T in TM6SF2, rs641738C>T near MBOAT7, rs72613567TA>T in HSD17B13 and rs2642438A>G in MTARC1. Circulating levels of 1449 metabolites were measured using targeted and untargeted metabolomics. Associations between NASH-fibrosis variants and metabolites were assessed using linear regression. The specificity of variant-metabolite associations were compared to metabolite associations with ultrasound-defined steatosis, gene variants linked to liver fat (in GCKR, PPP1R3B and LYPLAL1) and gene variants linked to cirrhosis (in HFE and SERPINA1). Each NASH-fibrosis variant demonstrated a specific metabolite profile with little overlap (8/97 metabolites) comprising diverse aspects of lipid metabolism. Risk alleles in PNPLA3 and HSD17B13 were both associated with higher 3-methylglutarylcarnitine and three variants were associated with lower lysophosphatidylcholine C14:0. The risk allele in MTARC1 was associated with higher levels of sphingomyelins. There was no overlap with metabolites that associated with HFE or SERPINA1 variants. Our results suggest a link between the NASH-protective variant in MTARC1 to the metabolism of sphingomyelins and identify distinct molecular patterns associated with each of the NASH-fibrosis variants under investigation.
Assuntos
Predisposição Genética para Doença , Cirrose Hepática/patologia , Metaboloma , Hepatopatia Gordurosa não Alcoólica/patologia , Polimorfismo de Nucleotídeo Único , Adulto , Idoso , Feminino , Estudos de Associação Genética , Humanos , Cirrose Hepática/complicações , Cirrose Hepática/genética , Cirrose Hepática/metabolismo , Masculino , Pessoa de Meia-Idade , Hepatopatia Gordurosa não Alcoólica/complicações , Hepatopatia Gordurosa não Alcoólica/genética , Hepatopatia Gordurosa não Alcoólica/metabolismo , Prognóstico , Estudos ProspectivosRESUMO
BACKGROUND: Population-specificity of exploratory dietary patterns limits their generalizability in investigations with type 2 diabetes incidence. OBJECTIVE: The aim of this study was to derive country-specific exploratory dietary patterns, investigate their association with type 2 diabetes incidence, and replicate diabetes-associated dietary patterns in other countries. METHODS: Dietary intake data were used, assessed by country-specific questionnaires at baseline of 11,183 incident diabetes cases and 14,694 subcohort members (mean age 52.9 y) from 8 countries, nested within the European Prospective Investigation into Cancer and Nutrition study (mean follow-up time 6.9 y). Exploratory dietary patterns were derived by principal component analysis. HRs for incident type 2 diabetes were calculated by Prentice-weighted Cox proportional hazard regression models. Diabetes-associated dietary patterns were simplified or replicated to be applicable in other countries. A meta-analysis across all countries evaluated the generalizability of the diabetes-association. RESULTS: Two dietary patterns per country/UK-center, of which overall 3 dietary patterns were diabetes-associated, were identified. A risk-lowering French dietary pattern was not confirmed across other countries: pooled HRFrance per 1 SD: 1.00; 95% CI: 0.90, 1.10. Risk-increasing dietary patterns, derived in Spain and UK-Norfolk, were confirmed, but only the latter statistically significantly: HRSpain: 1.09; 95% CI: 0.97, 1.22 and HRUK-Norfolk: 1.12; 95% CI: 1.04, 1.20. Respectively, this dietary pattern was characterized by relatively high intakes of potatoes, processed meat, vegetable oils, sugar, cake and cookies, and tea. CONCLUSIONS: Only few country/center-specific dietary patterns (3 of 18) were statistically significantly associated with diabetes incidence in this multicountry European study population. One pattern, whose association with diabetes was confirmed across other countries, showed overlaps in the food groups potatoes and processed meat with identified diabetes-associated dietary patterns from other studies. The study demonstrates that replication of associations of exploratory patterns with health outcomes is feasible and a necessary step to overcome population-specificity in associations from such analyses.
Assuntos
Diabetes Mellitus Tipo 2/etiologia , Dieta/efeitos adversos , Adulto , Idoso , Estudos de Casos e Controles , Estudos de Coortes , Diabetes Mellitus Tipo 2/epidemiologia , Suscetibilidade a Doenças , Europa (Continente)/epidemiologia , Estudos de Viabilidade , Feminino , Humanos , Incidência , Masculino , Pessoa de Meia-Idade , Análise de Componente Principal , Fatores de RiscoRESUMO
BACKGROUND: Understanding of the genetic basis of type 2 diabetes (T2D) has progressed rapidly, but the interactions between common genetic variants and lifestyle risk factors have not been systematically investigated in studies with adequate statistical power. Therefore, we aimed to quantify the combined effects of genetic and lifestyle factors on risk of T2D in order to inform strategies for prevention. METHODS AND FINDINGS: The InterAct study includes 12,403 incident T2D cases and a representative sub-cohort of 16,154 individuals from a cohort of 340,234 European participants with 3.99 million person-years of follow-up. We studied the combined effects of an additive genetic T2D risk score and modifiable and non-modifiable risk factors using Prentice-weighted Cox regression and random effects meta-analysis methods. The effect of the genetic score was significantly greater in younger individuals (p for interaction â=â1.20×10-4). Relative genetic risk (per standard deviation [4.4 risk alleles]) was also larger in participants who were leaner, both in terms of body mass index (p for interaction â=â1.50×10-3) and waist circumference (p for interaction â=â7.49×10-9). Examination of absolute risks by strata showed the importance of obesity for T2D risk. The 10-y cumulative incidence of T2D rose from 0.25% to 0.89% across extreme quartiles of the genetic score in normal weight individuals, compared to 4.22% to 7.99% in obese individuals. We detected no significant interactions between the genetic score and sex, diabetes family history, physical activity, or dietary habits assessed by a Mediterranean diet score. CONCLUSIONS: The relative effect of a T2D genetic risk score is greater in younger and leaner participants. However, this sub-group is at low absolute risk and would not be a logical target for preventive interventions. The high absolute risk associated with obesity at any level of genetic risk highlights the importance of universal rather than targeted approaches to lifestyle intervention.
Assuntos
Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença , Estilo de Vida , Alelos , Índice de Massa Corporal , Estudos de Coortes , Diabetes Mellitus Tipo 2/dietoterapia , Dieta Mediterrânea , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Atividade Motora , Polimorfismo de Nucleotídeo Único/genética , Modelos de Riscos Proporcionais , Fatores de Risco , Circunferência da Cintura/genéticaRESUMO
BACKGROUND: Broad-capture proteomic technologies have the potential to improve disease prediction, enabling targeted prevention and management, but studies have so far been limited to very few selected diseases and have not evaluated predictive performance across multiple conditions. We aimed to evaluate the potential of serum proteins to improve risk prediction over and above health-derived information and polygenic risk scores across a diverse set of 24 outcomes. METHODS: We designed multiple case-cohorts nested in the EPIC-Norfolk prospective study, from participants with available serum samples and genome-wide genotype data, with more than 32â974 person-years of follow-up. Participants were middle-aged individuals (aged 40-79 years at baseline) of European ancestry who were recruited from the general population of Norfolk, England, between March, 1993 and December, 1997. We selected participants who developed one of ten less common diseases within 10 years of follow-up; we also subsampled a randomly drawn control subcohort, which also served to investigate 14 more common outcomes (n>70), including all-cause premature mortality (death before the age of 75 years; case numbers 71-437; controls 608-1556). Individuals were excluded from the current study owing to failed genotyping or proteomic quality control, relatedness, or missing information on age, sex, BMI, or smoking status. We used a machine learning framework to derive sparse predictive protein models for the onset of the the 23 individual diseases and all-cause premature mortality, and to derive a single common sparse multimorbidity signature that was predictive across multiple diseases from 2923 serum proteins. FINDINGS: Participants who developed one of ten less common diseases within 10 years of follow-up included 482 women and 507 men, with a mean age at baseline of 64·56 years (8·08). The random subcohort included 990 women and 769 men, with a mean age of 58·79 years (9·31). As few as five proteins alone outperformed polygenic risk scores for 17 of 23 outcomes (median dfference in concordance index [C-index] 0·13 [0·10-0·17]) and improved predictive performance when added over basic patient-derived information models for seven outcomes, achieving a median C-index of 0·82 (IQR 0·77-0·82). This included diseases with poor prognosis such as lung cancer (C-index 0·85 [+/- cross-validation error 0·83-0·87]), for which we identified unreported biomarkers such as C-X-C motif chemokine ligand 17. A sparse multimorbidity signature of ten proteins improved prediction across seven outcomes over patient-derived information models, achieving performances (median C-index 0·81 [IQR 0·80-0·82]) similar to those of disease-specific signatures. INTERPRETATION: We show the value of broad-capture proteomic biomarker discovery studies across multiple diseases of diverse causes, pointing to those that might benefit the most from proteomic approaches, and the potential to derive common sparse biomarker panels for prediction of multiple diseases at once. This framework could enable follow-up studies to explore the generalisability of proteomic models and to benchmark these against clinical assays, which are required to understand the translational potential of these findings. FUNDING: Medical Research Council, Health Data Research UK, UK Research and Innovation-National Institute for Health and Care Research, Cancer Research UK, and Wellcome Trust.
Assuntos
Biomarcadores , Aprendizado de Máquina , Proteômica , Humanos , Pessoa de Meia-Idade , Masculino , Feminino , Estudos Prospectivos , Biomarcadores/sangue , Proteômica/métodos , Idoso , Adulto , Inglaterra , Medição de Risco/métodos , Fatores de RiscoRESUMO
Higher cardiorespiratory fitness is associated with lower risk of type 2 diabetes. However, the causality of this relationship and the biological mechanisms that underlie it are unclear. Here, we examine genetic determinants of cardiorespiratory fitness in 450k European-ancestry individuals in UK Biobank, by leveraging the genetic overlap between fitness measured by an exercise test and resting heart rate. We identified 160 fitness-associated loci which we validated in an independent cohort, the Fenland study. Gene-based analyses prioritised candidate genes, such as CACNA1C, SCN10A, MYH11 and MYH6, that are enriched in biological processes related to cardiac muscle development and muscle contractility. In a Mendelian Randomisation framework, we demonstrate that higher genetically predicted fitness is causally associated with lower risk of type 2 diabetes independent of adiposity. Integration with proteomic data identified N-terminal pro B-type natriuretic peptide, hepatocyte growth factor-like protein and sex hormone-binding globulin as potential mediators of this relationship. Collectively, our findings provide insights into the biological mechanisms underpinning cardiorespiratory fitness and highlight the importance of improving fitness for diabetes prevention.
Assuntos
Aptidão Cardiorrespiratória , Diabetes Mellitus Tipo 2 , Humanos , Diabetes Mellitus Tipo 2/genética , Aptidão Cardiorrespiratória/fisiologia , Proteômica , Obesidade , Fatores de RiscoRESUMO
Studying the plasma proteome as the intermediate layer between the genome and the phenome has the potential to identify new disease processes. Here, we conducted a cis-focused proteogenomic analysis of 2,923 plasma proteins measured in 1,180 individuals using antibody-based assays. We (1) identify 256 unreported protein quantitative trait loci (pQTL); (2) demonstrate shared genetic regulation of 224 cis-pQTLs with 575 specific health outcomes, revealing examples for notable metabolic diseases (such as gastrin-releasing peptide as a potential therapeutic target for type 2 diabetes); (3) improve causal gene assignment at 40% (n = 192) of overlapping risk loci; and (4) observe convergence of phenotypic consequences of cis-pQTLs and rare loss-of-function gene burden for 12 proteins, such as TIMD4 for lipoprotein metabolism. Our findings demonstrate the value of integrating complementary proteomic technologies with genomics even at moderate scale to identify new mediators of metabolic diseases with the potential for therapeutic interventions.
Assuntos
Diabetes Mellitus Tipo 2 , Proteogenômica , Humanos , Proteômica , Diabetes Mellitus Tipo 2/genética , Locos de Características Quantitativas , Proteínas Sanguíneas/genéticaRESUMO
BACKGROUND: Waist circumference (WC) is a simple and reliable measure of fat distribution that may add to the prediction of type 2 diabetes (T2D), but previous studies have been too small to reliably quantify the relative and absolute risk of future diabetes by WC at different levels of body mass index (BMI). METHODS AND FINDINGS: The prospective InterAct case-cohort study was conducted in 26 centres in eight European countries and consists of 12,403 incident T2D cases and a stratified subcohort of 16,154 individuals from a total cohort of 340,234 participants with 3.99 million person-years of follow-up. We used Prentice-weighted Cox regression and random effects meta-analysis methods to estimate hazard ratios for T2D. Kaplan-Meier estimates of the cumulative incidence of T2D were calculated. BMI and WC were each independently associated with T2D, with WC being a stronger risk factor in women than in men. Risk increased across groups defined by BMI and WC; compared to low normal weight individuals (BMI 18.5-22.4 kg/m(2)) with a low WC (<94/80 cm in men/women), the hazard ratio of T2D was 22.0 (95% confidence interval 14.3; 33.8) in men and 31.8 (25.2; 40.2) in women with grade 2 obesity (BMI≥35 kg/m(2)) and a high WC (>102/88 cm). Among the large group of overweight individuals, WC measurement was highly informative and facilitated the identification of a subgroup of overweight people with high WC whose 10-y T2D cumulative incidence (men, 70 per 1,000 person-years; women, 44 per 1,000 person-years) was comparable to that of the obese group (50-103 per 1,000 person-years in men and 28-74 per 1,000 person-years in women). CONCLUSIONS: WC is independently and strongly associated with T2D, particularly in women, and should be more widely measured for risk stratification. If targeted measurement is necessary for reasons of resource scarcity, measuring WC in overweight individuals may be an effective strategy, since it identifies a high-risk subgroup of individuals who could benefit from individualised preventive action.
Assuntos
Diabetes Mellitus Tipo 2/complicações , Diabetes Mellitus Tipo 2/epidemiologia , Obesidade/complicações , Obesidade/fisiopatologia , Antropometria , Índice de Massa Corporal , Estudos de Coortes , Europa (Continente)/epidemiologia , Feminino , Humanos , Incidência , Masculino , Pessoa de Meia-Idade , Modelos de Riscos Proporcionais , Fatores de Risco , Fatores de Tempo , Circunferência da Cintura/fisiologiaRESUMO
Type 2 diabetes (T2D) is a heritable metabolic disorder. While population studies have identified hundreds of common genetic variants associated with T2D, the role of rare (frequency < 0.1%) protein-coding variation is less clear. We performed exome sequence analysis in 418,436 (n = 32,374 T2D cases) individuals in the UK Biobank. We identified previously reported genes (GCK, GIGYF1, HNF1A) in addition to missense variants in ZEB2 (n = 31 carriers; odds ratio [OR] = 5.5 [95% confidence interval = 2.5-12.0]; p = 6.4 × 10-7), MLXIPL (n = 245; OR = 2.3 [1.6-3.2]; p = 3.2 × 10-7), and IGF1R (n = 394; OR = 2.4 [1.8-3.2]; p = 1.3 × 10-10). Carriers of damaging missense variants within IGF1R were also shorter (-2.2 cm [-1.8 to -2.7]; p = 1.2 × 10-19) and had higher circulating insulin-like growth factor-1 (IGF-1) protein levels (2.3 nmol/L [1.7-2.9]; p = 2.8 × 10-14), indicating relative IGF-1 resistance. A likely causal role of IGF-1 resistance was supported by Mendelian randomization analyses using common variants. These results increase understanding of the genetic architecture of T2D and highlight the growth hormone/IGF-1 axis as a potential therapeutic target.
RESUMO
CONTEXT: Biological and translational insights from large-scale, array-based genetic studies of fat distribution, a key determinant of metabolic health, have been limited by the difficulty in linking predominantly noncoding variants to specific gene targets. Rare coding variant analyses provide greater confidence that a specific gene is involved, but do not necessarily indicate whether gain or loss of function (LoF) would be of most therapeutic benefit. OBJECTIVE: This work aimed to identify genes/proteins involved in determining fat distribution. METHODS: We combined the power of genome-wide analysis of array-based rare, nonsynonymous variants in 450â 562 individuals in the UK Biobank with exome-sequence-based rare LoF gene burden testing in 184â 246 individuals. RESULTS: The data indicate that the LoF of 4 genes (PLIN1 [LoF variants, Pâ =â 5.86â ×â 10-7], INSR [LoF variants, Pâ =â 6.21â ×â 10-7], ACVR1C [LoFâ +â moderate impact variants, Pâ =â 1.68â ×â 10-7; moderate impact variants, Pâ =â 4.57â ×â 10-7], and PDE3B [LoF variants, Pâ =â 1.41â ×â 10-6]) is associated with a beneficial effect on body mass index-adjusted waist-to-hip ratio and increased gluteofemoral fat mass, whereas LoF of PLIN4 (LoF variants, Pâ =â 5.86â ×â 10-7 adversely affects these parameters. Phenotypic follow-up suggests that LoF of PLIN1, PDE3B, and ACVR1C favorably affects metabolic phenotypes (eg, triglycerides [TGs] and high-density lipoprotein [HDL] cholesterol concentrations) and reduces the risk of cardiovascular disease, whereas PLIN4 LoF has adverse health consequences. INSR LoF is associated with lower TG and HDL levels but may increase the risk of type 2 diabetes. CONCLUSION: This study robustly implicates these genes in the regulation of fat distribution, providing new and in some cases somewhat counterintuitive insight into the potential consequences of targeting these molecules therapeutically.
Assuntos
Diabetes Mellitus Tipo 2 , Receptores de Ativinas Tipo I/genética , Distribuição da Gordura Corporal , Diabetes Mellitus Tipo 2/genética , Exoma , Variação Genética , Estudo de Associação Genômica Ampla , HumanosRESUMO
Mosaic loss of chromosome Y (LOY) in leukocytes is the most common form of clonal mosaicism, caused by dysregulation in cell-cycle and DNA damage response pathways. Previous genetic studies have focussed on identifying common variants associated with LOY, which we now extend to rarer, protein-coding variation using exome sequences from 82,277 male UK Biobank participants. We find that loss of function of two genes-CHEK2 and GIGYF1-reach exome-wide significance. Rare alleles in GIGYF1 have not previously been implicated in any complex trait, but here loss-of-function carriers exhibit six-fold higher susceptibility to LOY (OR = 5.99 [3.04-11.81], p = 1.3 × 10-10). These same alleles are also associated with adverse metabolic health, including higher susceptibility to Type 2 Diabetes (OR = 6.10 [3.51-10.61], p = 1.8 × 10-12), 4 kg higher fat mass (p = 1.3 × 10-4), 2.32 nmol/L lower serum IGF1 levels (p = 1.5 × 10-4) and 4.5 kg lower handgrip strength (p = 4.7 × 10-7) consistent with proposed GIGYF1 enhancement of insulin and IGF-1 receptor signalling. These associations are mirrored by a common variant nearby associated with the expression of GIGYF1. Our observations highlight a potential direct connection between clonal mosaicism and metabolic health.
Assuntos
Proteínas de Transporte/genética , Cromossomos Humanos Y/genética , Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença , Mosaicismo , Adulto , Idoso , Proteínas de Transporte/metabolismo , Estudos de Casos e Controles , Análise Mutacional de DNA , Diabetes Mellitus Tipo 2/metabolismo , Feminino , Estudo de Associação Genômica Ampla , Humanos , Insulina/metabolismo , Leucócitos , Mutação com Perda de Função , Masculino , Pessoa de Meia-Idade , Receptor IGF Tipo 1/metabolismo , Transdução de Sinais/genética , Sequenciamento do ExomaRESUMO
Affinity-based proteomics has enabled scalable quantification of thousands of protein targets in blood enhancing biomarker discovery, understanding of disease mechanisms, and genetic evaluation of drug targets in humans through protein quantitative trait loci (pQTLs). Here, we integrate two partly complementary techniques-the aptamer-based SomaScan® v4 assay and the antibody-based Olink assays-to systematically assess phenotypic consequences of hundreds of pQTLs discovered for 871 protein targets across both platforms. We create a genetically anchored cross-platform proteome-phenome network comprising 547 protein-phenotype connections, 36.3% of which were only seen with one of the two platforms suggesting that both techniques capture distinct aspects of protein biology. We further highlight discordance of genetically predicted effect directions between assays, such as for PILRA and Alzheimer's disease. Our results showcase the synergistic nature of these technologies to better understand and identify disease mechanisms and provide a benchmark for future cross-platform discoveries.
Assuntos
Proteoma/genética , Proteômica/métodos , Locos de Características Quantitativas , Adulto , Doença de Alzheimer/genética , Anticorpos/metabolismo , Aptâmeros de Peptídeos/metabolismo , Estudos de Coortes , Feminino , Humanos , Masculino , Glicoproteínas de Membrana/genética , Pessoa de Meia-Idade , Fenótipo , Mapeamento de Interação de Proteínas , Mapas de Interação de Proteínas/genética , Proteoma/metabolismo , Receptores Imunológicos/genéticaRESUMO
Characterization of the genetic regulation of proteins is essential for understanding disease etiology and developing therapies. We identified 10,674 genetic associations for 3892 plasma proteins to create a cis-anchored gene-protein-disease map of 1859 connections that highlights strong cross-disease biological convergence. This proteo-genomic map provides a framework to connect etiologically related diseases, to provide biological context for new or emerging disorders, and to integrate different biological domains to establish mechanisms for known gene-disease links. Our results identify proteo-genomic connections within and between diseases and establish the value of cis-protein variants for annotation of likely causal disease genes at loci identified in genome-wide association studies, thereby addressing a major barrier to experimental validation and clinical translation of genetic discoveries.
Assuntos
Proteínas Sanguíneas/genética , Doença/genética , Genoma Humano , Genômica , Proteínas/genética , Proteoma , Envelhecimento , Processamento Alternativo , Proteínas Sanguíneas/metabolismo , COVID-19/genética , Doenças do Tecido Conjuntivo/genética , Doença/etiologia , Desenvolvimento de Medicamentos , Feminino , Cálculos Biliares/genética , Estudos de Associação Genética , Variação Genética , Estudo de Associação Genômica Ampla , Humanos , Internet , Masculino , Fenótipo , Proteínas/metabolismo , Locos de Características Quantitativas , Caracteres SexuaisRESUMO
Understanding the genetic architecture of host proteins interacting with SARS-CoV-2 or mediating the maladaptive host response to COVID-19 can help to identify new or repurpose existing drugs targeting those proteins. We present a genetic discovery study of 179 such host proteins among 10,708 individuals using an aptamer-based technique. We identify 220 host DNA sequence variants acting in cis (MAF 0.01-49.9%) and explaining 0.3-70.9% of the variance of 97 of these proteins, including 45 with no previously known protein quantitative trait loci (pQTL) and 38 encoding current drug targets. Systematic characterization of pQTLs across the phenome identified protein-drug-disease links and evidence that putative viral interaction partners such as MARK3 affect immune response. Our results accelerate the evaluation and prioritization of new drug development programmes and repurposing of trials to prevent, treat or reduce adverse outcomes. Rapid sharing and detailed interrogation of results is facilitated through an interactive webserver ( https://omicscience.org/apps/covidpgwas/ ).
Assuntos
COVID-19/genética , COVID-19/virologia , Interações Hospedeiro-Patógeno/genética , Proteínas/genética , SARS-CoV-2/fisiologia , Sistema ABO de Grupos Sanguíneos/metabolismo , Aptâmeros de Peptídeos/sangue , Aptâmeros de Peptídeos/metabolismo , Coagulação Sanguínea , Sistemas de Liberação de Medicamentos , Feminino , Regulação da Expressão Gênica , Fatores Celulares Derivados do Hospedeiro/metabolismo , Humanos , Internet , Masculino , Pessoa de Meia-Idade , Locos de Características Quantitativas/genéticaRESUMO
Strategies to develop therapeutics for SARS-CoV-2 infection may be informed by experimental identification of viral-host protein interactions in cellular assays and measurement of host response proteins in COVID-19 patients. Identification of genetic variants that influence the level or activity of these proteins in the host could enable rapid 'in silico' assessment in human genetic studies of their causal relevance as molecular targets for new or repurposed drugs to treat COVID-19. We integrated large-scale genomic and aptamer-based plasma proteomic data from 10,708 individuals to characterize the genetic architecture of 179 host proteins reported to interact with SARS-CoV-2 proteins or to participate in the host response to COVID-19. We identified 220 host DNA sequence variants acting in cis (MAF 0.01-49.9%) and explaining 0.3-70.9% of the variance of 97 of these proteins, including 45 with no previously known protein quantitative trait loci (pQTL) and 38 encoding current drug targets. Systematic characterization of pQTLs across the phenome identified protein-drug-disease links, evidence that putative viral interaction partners such as MARK3 affect immune response, and establish the first link between a recently reported variant for respiratory failure of COVID-19 patients at the ABO locus and hypercoagulation, i.e. maladaptive host response. Our results accelerate the evaluation and prioritization of new drug development programmes and repurposing of trials to prevent, treat or reduce adverse outcomes. Rapid sharing and dynamic and detailed interrogation of results is facilitated through an interactive webserver ( https://omicscience.org/apps/covidpgwas/ ).
RESUMO
Type 2 diabetes (T2D) is a global public health challenge. Whilst the advent of genome-wide association studies has identified >400 genetic variants associated with T2D, our understanding of its biological mechanisms and translational insights is still limited. The EPIC-InterAct project, centred in 8 countries in the European Prospective Investigations into Cancer and Nutrition study, is one of the largest prospective studies of T2D. Established as a nested case-cohort study to investigate the interplay between genetic and lifestyle behavioural factors on the risk of T2D, a total of 12,403 individuals were identified as incident T2D cases, and a representative sub-cohort of 16,154 individuals was selected from a larger cohort of 340,234 participants with a follow-up time of 3.99 million person-years. We describe the results from a genome-wide association analysis between more than 8.9 million SNPs and T2D risk among 22,326 individuals (9,978 cases and 12,348 non-cases) from the EPIC-InterAct study. The summary statistics to be shared provide a valuable resource to facilitate further investigations into the genetics of T2D.
Assuntos
Diabetes Mellitus Tipo 2/genética , Estudo de Associação Genômica Ampla , Estilo de Vida , Europa (Continente) , Humanos , Estudos Prospectivos , Fatores de RiscoRESUMO
Epigenetic changes may contribute substantially to risks of diseases of aging. Previous studies reported seven methylation variable positions (MVPs) robustly associated with incident type 2 diabetes mellitus (T2DM). However, their causal roles in T2DM are unclear. In an incident T2DM case-cohort study nested within the population-based European Prospective Investigation into Cancer and Nutrition (EPIC)-Norfolk cohort, we used whole blood DNA collected at baseline, up to 11 years before T2DM onset, to investigate the role of methylation in the etiology of T2DM. We identified 15 novel MVPs with robust associations with incident T2DM and robustly confirmed three MVPs identified previously (near to TXNIP, ABCG1, and SREBF1). All 18 MVPs showed directionally consistent associations with incident and prevalent T2DM in independent studies. Further conditional analyses suggested that the identified epigenetic signals appear related to T2DM via glucose and obesity-related pathways acting before the collection of baseline samples. We integrated genome-wide genetic data to identify methylation-associated quantitative trait loci robustly associated with 16 of the 18 MVPs and found one MVP, cg00574958 at CPT1A, with a possible direct causal role in T2DM. None of the implicated genes were previously highlighted by genetic association studies, suggesting that DNA methylation studies may reveal novel biological mechanisms involved in tissue responses to glycemia.
Assuntos
Metilação de DNA , Diabetes Mellitus Tipo 2/genética , Epigenoma , Adulto , Idoso , Glicemia , Diabetes Mellitus Tipo 2/epidemiologia , Inglaterra/epidemiologia , Epigenômica , Feminino , Estudos de Associação Genética , Humanos , Incidência , Masculino , Pessoa de Meia-IdadeRESUMO
Importance: Pharmacological enhancers of lipoprotein lipase (LPL) are in preclinical or early clinical development for cardiovascular prevention. Studying whether these agents will reduce cardiovascular events or diabetes risk when added to existing lipid-lowering drugs would require large outcome trials. Human genetics studies can help prioritize or deprioritize these resource-demanding endeavors. Objective: To investigate the independent and combined associations of genetically determined differences in LPL-mediated lipolysis and low-density lipoprotein cholesterol (LDL-C) metabolism with risk of coronary disease and diabetes. Design, Setting, and Participants: In this genetic association study, individual-level genetic data from 392â¯220 participants from 2 population-based cohort studies and 1 case-cohort study conducted in Europe were included. Data were collected from January 1991 to July 2018, and data were analyzed from July 2014 to July 2018. Exposures: Six conditionally independent triglyceride-lowering alleles in LPL, the p.Glu40Lys variant in ANGPTL4, rare loss-of-function variants in ANGPTL3, and LDL-C-lowering polymorphisms at 58 independent genomic regions, including HMGCR, NPC1L1, and PCSK9. Main Outcomes and Measures: Odds ratio for coronary artery disease and type 2 diabetes. Results: Of the 392â¯220 participants included, 211â¯915 (54.0%) were female, and the mean (SD) age was 57 (8) years. Triglyceride-lowering alleles in LPL were associated with protection from coronary disease (approximately 40% lower odds per SD of genetically lower triglycerides) and type 2 diabetes (approximately 30% lower odds) in people above or below the median of the population distribution of LDL-C-lowering alleles at 58 independent genomic regions, HMGCR, NPC1L1, or PCSK9. Associations with lower risk were consistent in quintiles of the distribution of LDL-C-lowering alleles and 2 × 2 factorial genetic analyses. The 40Lys variant in ANGPTL4 was associated with protection from coronary disease and type 2 diabetes in groups with genetically higher or lower LDL-C. For a genetic difference of 0.23 SDs in LDL-C, ANGPTL3 loss-of-function variants, which also have beneficial associations with LPL lipolysis, were associated with greater protection against coronary disease than other LDL-C-lowering genetic mechanisms (ANGPTL3 loss-of-function variants: odds ratio, 0.66; 95% CI, 0.52-0.83; 58 LDL-C-lowering variants: odds ratio, 0.90; 95% CI, 0.89-0.91; P for heterogeneity = .009). Conclusions and Relevance: Triglyceride-lowering alleles in the LPL pathway are associated with lower risk of coronary disease and type 2 diabetes independently of LDL-C-lowering genetic mechanisms. These findings provide human genetics evidence to support the development of agents that enhance LPL-mediated lipolysis for further clinical benefit in addition to LDL-C-lowering therapy.