RESUMO
The Pharma Proteomics Project is a precompetitive biopharmaceutical consortium characterizing the plasma proteomic profiles of 54,219 UK Biobank participants. Here we provide a detailed summary of this initiative, including technical and biological validations, insights into proteomic disease signatures, and prediction modelling for various demographic and health indicators. We present comprehensive protein quantitative trait locus (pQTL) mapping of 2,923 proteins that identifies 14,287 primary genetic associations, of which 81% are previously undescribed, alongside ancestry-specific pQTL mapping in non-European individuals. The study provides an updated characterization of the genetic architecture of the plasma proteome, contextualized with projected pQTL discovery rates as sample sizes and proteomic assay coverages increase over time. We offer extensive insights into trans pQTLs across multiple biological domains, highlight genetic influences on ligand-receptor interactions and pathway perturbations across a diverse collection of cytokines and complement networks, and illustrate long-range epistatic effects of ABO blood group and FUT2 secretor status on proteins with gastrointestinal tissue-enriched expression. We demonstrate the utility of these data for drug discovery by extending the genetic proxied effects of protein targets, such as PCSK9, on additional endpoints, and disentangle specific genes and proteins perturbed at loci associated with COVID-19 susceptibility. This public-private partnership provides the scientific community with an open-access proteomics resource of considerable breadth and depth to help to elucidate the biological mechanisms underlying proteo-genomic discoveries and accelerate the development of biomarkers, predictive models and therapeutics1.
Assuntos
Bancos de Espécimes Biológicos , Proteínas Sanguíneas , Bases de Dados Factuais , Genômica , Saúde , Proteoma , Proteômica , Humanos , Sistema ABO de Grupos Sanguíneos/genética , Proteínas Sanguíneas/análise , Proteínas Sanguíneas/genética , COVID-19/genética , Descoberta de Drogas , Epistasia Genética , Fucosiltransferases/metabolismo , Predisposição Genética para Doença , Plasma/química , Pró-Proteína Convertase 9/metabolismo , Proteoma/análise , Proteoma/genética , Parcerias Público-Privadas , Locos de Características Quantitativas , Reino Unido , Galactosídeo 2-alfa-L-FucosiltransferaseRESUMO
Transcriptomics data have been integrated with genome-wide association studies (GWASs) to help understand disease/trait molecular mechanisms. The utility of metabolomics, integrated with transcriptomics and disease GWASs, to understand molecular mechanisms for metabolite levels or diseases has not been thoroughly evaluated. We performed probabilistic transcriptome-wide association and locus-level colocalization analyses to integrate transcriptomics results for 49 tissues in 706 individuals from the GTEx project, metabolomics results for 1,391 plasma metabolites in 6,136 Finnish men from the METSIM study, and GWAS results for 2,861 disease traits in 260,405 Finnish individuals from the FinnGen study. We found that genetic variants that regulate metabolite levels were more likely to influence gene expression and disease risk compared to the ones that do not. Integrating transcriptomics with metabolomics results prioritized 397 genes for 521 metabolites, including 496 previously identified gene-metabolite pairs with strong functional connections and suggested 33.3% of such gene-metabolite pairs shared the same causal variants with genetic associations of gene expression. Integrating transcriptomics and metabolomics individually with FinnGen GWAS results identified 1,597 genes for 790 disease traits. Integrating transcriptomics and metabolomics jointly with FinnGen GWAS results helped pinpoint metabolic pathways from genes to diseases. We identified putative causal effects of UGT1A1/UGT1A4 expression on gallbladder disorders through regulating plasma (E,E)-bilirubin levels, of SLC22A5 expression on nasal polyps and plasma carnitine levels through distinct pathways, and of LIPC expression on age-related macular degeneration through glycerophospholipid metabolic pathways. Our study highlights the power of integrating multiple sets of molecular traits and GWAS results to deepen understanding of disease pathophysiology.
Assuntos
Estudo de Associação Genômica Ampla , Transcriptoma , Bilirrubina , Carnitina , Glicerofosfolipídeos , Humanos , Masculino , Metabolômica , Locos de Características Quantitativas/genética , Membro 5 da Família 22 de Carreadores de Soluto/genética , Transcriptoma/genéticaRESUMO
By sampling 500 time points of an electrocardiogram (ECG) trace in a genome-wide association study (GWAS), Verweig et al. identified novel correlates for cardiac dysfunction. Clustering of SNPs based on their impact at all time points revealed natural sets of SNPs with similar effects and mechanisms. Similar approaches may be applicable to other quantitative, time-ordered traits.
Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Eletrocardiografia , Fenótipo , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
Genome-wide association studies have discovered hundreds of genomic loci associated with psychiatric traits, but the causal genes underlying these associations are often unclear, a research gap that has hindered clinical translation. Here, we present a Psychiatric Omnilocus Prioritization Score (PsyOPS) derived from just three binary features encapsulating high-level assumptions about psychiatric disease etiology - namely, that causal psychiatric disease genes are likely to be mutationally constrained, be specifically expressed in the brain, and overlap with known neurodevelopmental disease genes. To our knowledge, PsyOPS is the first method specifically tailored to prioritizing causal genes at psychiatric GWAS loci. We show that, despite its extreme simplicity, PsyOPS achieves state-of-the-art performance at this task, comparable to a prior domain-agnostic approach relying on tens of thousands of features. Genes prioritized by PsyOPS are substantially more likely than other genes at the same loci to have convergent evidence of direct regulation by the GWAS variant according to both DNA looping assays and expression or splicing quantitative trait locus (QTL) maps. We provide examples of genes hundreds of kilobases away from the lead variant, like GABBR1 for schizophrenia, that are prioritized by all three of PsyOPS, DNA looping and QTLs. Our results underscore the power of incorporating high-level knowledge of trait etiology into causal gene prediction at GWAS loci, and comprise a resource for researchers interested in experimentally characterizing psychiatric gene candidates.
Assuntos
Estudo de Associação Genômica Ampla , Locos de Características Quantitativas , DNA , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla/métodos , Genômica , Humanos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genéticaRESUMO
BACKGROUND: A genome-wide association study (GWAS) correlates variation in the genotype with variation in the phenotype across a cohort, but the causal gene mediating that impact is often unclear. When the phenotype is protein abundance, a reasonable hypothesis is that the gene encoding that protein is the causal gene. However, as variants impacting protein levels can occur thousands or even millions of base pairs from the gene encoding the protein, it is unclear at what distance this simple hypothesis breaks down. RESULTS: By making the simple assumption that cis-pQTLs should be distance dependent while trans-pQTLs are distance independent, we arrive at a simple and empirical distance cutoff separating cis- and trans-pQTLs. Analyzing a recent large-scale pQTL study (Pietzner in Science 374:eabj1541, 2021) we arrive at an estimated distance cutoff of 944 kilobasepairs (95% confidence interval: 767-1,161) separating the cis and trans regimes. CONCLUSIONS: We demonstrate that this simple model can be applied to other molecular GWAS traits. Since much of biology is built on molecular traits like protein, transcript and metabolite abundance, we posit that the mathematical models for cis and trans distance distributions derived here will also apply to more complex phenotypes and traits.
Assuntos
Estudo de Associação Genômica Ampla , Locos de Características Quantitativas , Genótipo , Humanos , Fenótipo , Polimorfismo de Nucleotídeo Único , Proteínas/genética , Proteínas/metabolismoRESUMO
Several genetic discoveries robustly implicate five single-nucleotide variants in the progression of non-alcoholic fatty liver disease to non-alcoholic steatohepatitis and fibrosis (NASH-fibrosis), including a recently identified variant in MTARC1. To better understand these variants as potential therapeutic targets, we aimed to characterize their impact on metabolism using comprehensive metabolomics data from two population-based studies. A total of 9135 participants from the Fenland study and 9902 participants from the EPIC-Norfolk cohort were included in the study. We identified individuals with risk alleles associated with NASH-fibrosis: rs738409C>G in PNPLA3, rs58542926C>T in TM6SF2, rs641738C>T near MBOAT7, rs72613567TA>T in HSD17B13 and rs2642438A>G in MTARC1. Circulating levels of 1449 metabolites were measured using targeted and untargeted metabolomics. Associations between NASH-fibrosis variants and metabolites were assessed using linear regression. The specificity of variant-metabolite associations were compared to metabolite associations with ultrasound-defined steatosis, gene variants linked to liver fat (in GCKR, PPP1R3B and LYPLAL1) and gene variants linked to cirrhosis (in HFE and SERPINA1). Each NASH-fibrosis variant demonstrated a specific metabolite profile with little overlap (8/97 metabolites) comprising diverse aspects of lipid metabolism. Risk alleles in PNPLA3 and HSD17B13 were both associated with higher 3-methylglutarylcarnitine and three variants were associated with lower lysophosphatidylcholine C14:0. The risk allele in MTARC1 was associated with higher levels of sphingomyelins. There was no overlap with metabolites that associated with HFE or SERPINA1 variants. Our results suggest a link between the NASH-protective variant in MTARC1 to the metabolism of sphingomyelins and identify distinct molecular patterns associated with each of the NASH-fibrosis variants under investigation.
Assuntos
Predisposição Genética para Doença , Cirrose Hepática/patologia , Metaboloma , Hepatopatia Gordurosa não Alcoólica/patologia , Polimorfismo de Nucleotídeo Único , Adulto , Idoso , Feminino , Estudos de Associação Genética , Humanos , Cirrose Hepática/complicações , Cirrose Hepática/genética , Cirrose Hepática/metabolismo , Masculino , Pessoa de Meia-Idade , Hepatopatia Gordurosa não Alcoólica/complicações , Hepatopatia Gordurosa não Alcoólica/genética , Hepatopatia Gordurosa não Alcoólica/metabolismo , Prognóstico , Estudos ProspectivosRESUMO
Drug development and biological discovery require effective strategies to map existing genetic associations to causal genes. To approach this problem, we selected 12 common diseases and quantitative traits for which highly powered genome-wide association studies (GWAS) were available. For each disease or trait, we systematically curated positive control gene sets from Mendelian forms of the disease and from targets of medicines used for disease treatment. We found that these positive control genes were highly enriched in proximity of GWAS-associated single-nucleotide variants (SNVs). We then performed quantitative assessment of the contribution of commonly used genomic features, including open chromatin maps, expression quantitative trait loci (eQTL), and chromatin conformation data. Using these features, we trained and validated an Effector Index (Ei), to map target genes for these 12 common diseases and traits. Ei demonstrated high predictive performance, both with cross-validation on the training set, and an independently derived set for type 2 diabetes. Key predictive features included coding or transcript-altering SNVs, distance to gene, and open chromatin-based metrics. This work outlines a simple, understandable approach to prioritize genes at GWAS loci for functional follow-up and drug development, and provides a systematic strategy for prioritization of GWAS target genes.
Assuntos
Diabetes Mellitus Tipo 2 , Estudo de Associação Genômica Ampla , Cromatina/genética , Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença , Humanos , Polimorfismo de Nucleotídeo Único , Locos de Características QuantitativasRESUMO
BACKGROUND: Genetic, lifestyle, and environmental factors can lead to perturbations in circulating lipid levels and increase the risk of cardiovascular and metabolic diseases. However, how changes in individual lipid species contribute to disease risk is often unclear. Moreover, little is known about the role of lipids on cardiovascular disease in Pakistan, a population historically underrepresented in cardiovascular studies. METHODS: We characterised the genetic architecture of the human blood lipidome in 5662 hospital controls from the Pakistan Risk of Myocardial Infarction Study (PROMIS) and 13,814 healthy British blood donors from the INTERVAL study. We applied a candidate causal gene prioritisation tool to link the genetic variants associated with each lipid to the most likely causal genes, and Gaussian Graphical Modelling network analysis to identify and illustrate relationships between lipids and genetic loci. RESULTS: We identified 253 genetic associations with 181 lipids measured using direct infusion high-resolution mass spectrometry in PROMIS, and 502 genetic associations with 244 lipids in INTERVAL. Our analyses revealed new biological insights at genetic loci associated with cardiometabolic diseases, including novel lipid associations at the LPL, MBOAT7, LIPC, APOE-C1-C2-C4, SGPP1, and SPTLC3 loci. CONCLUSIONS: Our findings, generated using a distinctive lipidomics platform in an understudied South Asian population, strengthen and expand the knowledge base of the genetic determinants of lipids and their association with cardiometabolic disease-related loci.
Assuntos
Estudo de Associação Genômica Ampla , Infarto do Miocárdio , Povo Asiático/genética , Predisposição Genética para Doença , Humanos , Lipídeos , Polimorfismo de Nucleotídeo Único , População BrancaRESUMO
Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the 'Prioritization of candidate causal Genes at Molecular QTLs' (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories. We assessed the performance of ProGeM using a reference set of 227 previously reported and extensively curated metabolite QTLs. For 98% of these loci, the expert-curated gene was one of the candidate causal genes prioritized by ProGeM. Benchmarking analyses revealed that 69% of the causal candidates were nearest to the sentinel variant at the investigated molecular QTLs, indicating that genomic proximity is the most reliable indicator of 'true positive' causal genes. In contrast, cis-gene expression QTL data led to three false positive candidate causal gene assignments for every one true positive assignment. We provide evidence that these conclusions also apply to other molecular phenotypes, suggesting that ProGeM is a powerful and versatile tool for annotating molecular QTLs. ProGeM is freely available via GitHub.
Assuntos
Estudos de Associação Genética , Estudo de Associação Genômica Ampla/métodos , Anotação de Sequência Molecular/métodos , Locos de Características Quantitativas/genética , Mapeamento Cromossômico/métodos , Humanos , Lipídeos/genética , Fenótipo , Proteínas/genéticaRESUMO
Recent advances in highly multiplexed immunoassays have allowed systematic large-scale measurement of hundreds of plasma proteins in large cohort studies. In combination with genotyping, such studies offer the prospect to 1) identify mechanisms involved with regulation of protein expression in plasma, and 2) determine whether the plasma proteins are likely to be causally implicated in disease. We report here the results of genome-wide association (GWA) studies of 83 proteins considered relevant to cardiovascular disease (CVD), measured in 3,394 individuals with multiple CVD risk factors. We identified 79 genome-wide significant (p<5e-8) association signals, 55 of which replicated at P<0.0007 in separate validation studies (n = 2,639 individuals). Using automated text mining, manual curation, and network-based methods incorporating information on expression quantitative trait loci (eQTL), we propose plausible causal mechanisms for 25 trans-acting loci, including a potential post-translational regulation of stem cell factor by matrix metalloproteinase 9 and receptor-ligand pairs such as RANK-RANK ligand. Using public GWA study data, we further evaluate all 79 loci for their causal effect on coronary artery disease, and highlight several potentially causal associations. Overall, a majority of the plasma proteins studied showed evidence of regulation at the genetic level. Our results enable future studies of the causal architecture of human disease, which in turn should aid discovery of new drug targets.
Assuntos
Biomarcadores/sangue , Proteínas Sanguíneas/genética , Doenças Cardiovasculares/sangue , Doenças Cardiovasculares/genética , Locos de Características Quantitativas , Doença da Artéria Coronariana/sangue , Doença da Artéria Coronariana/genética , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , MasculinoRESUMO
Direct infusion high-resolution mass spectrometry (DIHRMS) is a novel, high-throughput approach to rapidly and accurately profile hundreds of lipids in human serum without prior chromatography, facilitating in-depth lipid phenotyping for large epidemiological studies to reveal the detailed associations of individual lipids with coronary heart disease (CHD) risk factors. Intact lipid profiling by DIHRMS was performed on 5662 serum samples from healthy participants in the Pakistan Risk of Myocardial Infarction Study (PROMIS). We developed a novel semi-targeted peak-picking algorithm to detect mass-to-charge ratios in positive and negative ionization modes. We analyzed lipid partial correlations, assessed the association of lipid principal components with established CHD risk factors and genetic variants, and examined differences between lipids for a common genetic polymorphism. The DIHRMS method provided information on 360 lipids (including fatty acyls, glycerolipids, glycerophospholipids, sphingolipids, and sterol lipids), with a median coefficient of variation of 11.6% (range: 5.4-51.9). The lipids were highly correlated and exhibited a range of associations with clinical chemistry biomarkers and lifestyle factors. This platform can provide many novel insights into the effects of physiology and lifestyle on lipid metabolism, genetic determinants of lipids, and the relationship between individual lipids and CHD risk factors.
Assuntos
Biomarcadores/sangue , Doença das Coronárias/genética , Lipídeos/genética , Doença das Coronárias/sangue , Doença das Coronárias/patologia , Feminino , Variação Genética , Glicerofosfolipídeos/sangue , Humanos , Metabolismo dos Lipídeos/genética , Lipídeos/sangue , Masculino , Pessoa de Meia-Idade , Fatores de Risco , Esfingolipídeos/sangue , Esfingolipídeos/genética , Esteróis/sangueRESUMO
Understanding the interaction between humans and mosquitoes is a critical area of study due to the phenomenal burdens on public health from mosquito-transmitted diseases. In this study, we conducted the first genome-wide association studies (GWAS) of self-reported mosquito bite reaction size (n = 84,724), itchiness caused by bites (n = 69,057), and perceived attractiveness to mosquitoes (n = 16,576). In total, 15 independent significant (P < 5×10-8) associations were identified. These loci were enriched for immunity-related genes that are involved in multiple cytokine signalling pathways. We also detected suggestive enrichment of these loci in enhancer regions that are active in stimulated T-cells, as well as within loci previously identified as controlling central memory T-cell levels. Egger regression analysis between the traits suggests that perception of itchiness and attractiveness to mosquitoes is driven, at least in part, by the genetic determinants of bite reaction size.Our findings illustrate the complex genetic and immunological landscapes underpinning human interactions with mosquitoes.
Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Mordeduras e Picadas de Insetos/genética , Prurido/genética , Animais , Culicidae/genética , Culicidae/patogenicidade , Genótipo , Humanos , Mordeduras e Picadas de Insetos/patologia , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Prurido/patologia , Autorrelato , Linfócitos T/imunologia , Linfócitos T/metabolismoRESUMO
Diabetes is the leading cause of ESRD. Despite evidence for a substantial heritability of diabetic kidney disease, efforts to identify genetic susceptibility variants have had limited success. We extended previous efforts in three dimensions, examining a more comprehensive set of genetic variants in larger numbers of subjects with type 1 diabetes characterized for a wider range of cross-sectional diabetic kidney disease phenotypes. In 2843 subjects, we estimated that the heritability of diabetic kidney disease was 35% (P=6.4×10-3). Genome-wide association analysis and replication in 12,540 individuals identified no single variants reaching stringent levels of significance and, despite excellent power, provided little independent confirmation of previously published associated variants. Whole-exome sequencing in 997 subjects failed to identify any large-effect coding alleles of lower frequency influencing the risk of diabetic kidney disease. However, sets of alleles increasing body mass index (P=2.2×10-5) and the risk of type 2 diabetes (P=6.1×10-4) associated with the risk of diabetic kidney disease. We also found genome-wide genetic correlation between diabetic kidney disease and failure at smoking cessation (P=1.1×10-4). Pathway analysis implicated ascorbate and aldarate metabolism (P=9.0×10-6), and pentose and glucuronate interconversions (P=3.0×10-6) in pathogenesis of diabetic kidney disease. These data provide further evidence for the role of genetic factors influencing diabetic kidney disease in those with type 1 diabetes and highlight some key pathways that may be responsible. Altogether these results reveal important biology behind the major cause of kidney disease.
Assuntos
Diabetes Mellitus Tipo 1/complicações , Diabetes Mellitus Tipo 1/genética , Nefropatias Diabéticas/genética , Adolescente , Adulto , Feminino , Estudo de Associação Genômica Ampla , Humanos , Masculino , Pessoa de Meia-Idade , Adulto JovemRESUMO
Motivation: Biobank scale genetic associations results over thousands of traits can be difficult to visualize and navigate. Results: We have created LAVAA, a visualization web-application to generate genetic volcano plots for simultaneously considering the P-value, effect size, case counts, trait class and fine-mapping posterior probability at a single-nucleotide polymorphism (SNP) across a range of traits from a large set of genome-wide association study. We find that user interaction with association results in LAVAA can enrich and enhance the biological interpretation of individual loci. Availability and implementation: LAVAA is available as a stand-alone web service (https://geneviz.aalto.fi/LAVAA/) and will be available in future releases of the finngen.fi website starting with release 10 in late 2023.
RESUMO
Natural human knockouts of genes associated with desirable outcomes, such as PCSK9 with low levels of LDL-cholesterol, can lead to the discovery of new drug targets and treatments. Rare loss-of-function variants are more likely to be found in the homozygous state in consanguineous populations, and deep molecular phenotyping of blood samples from homozygous carriers can help to discriminate between silent and functional variants. Here, we combined whole-genome sequencing with proteomics and metabolomics for 2,935 individuals from the Qatar Biobank (QBB) to evaluate the power of this approach for finding genes of clinical and pharmaceutical interest. As proof-of-concept, we identified a homozygous carrier of a very rare PCSK9 variant with extremely low circulating PCSK9 levels and low LDL. Our study demonstrates that the chances of finding such variants are about 168 times higher in QBB compared with GnomAD and emphasizes the potential of consanguineous populations for drug discovery.
RESUMO
Studies of the genetic regulation of cerebrospinal fluid (CSF) proteins may reveal pathways for treatment of neurological diseases. 398 proteins in CSF were measured in 1,591 participants from the BioFINDER study. Protein quantitative trait loci (pQTL) were identified as associations between genetic variants and proteins, with 176 pQTLs for 145 CSF proteins (P < 1.25 × 10-10 , 117 cis-pQTLs and 59 trans-pQTLs). Ventricular volume (measured with brain magnetic resonance imaging) was a confounder for several pQTLs. pQTLs for CSF and plasma proteins were overall correlated, but CSF-specific pQTLs were also observed. Mendelian randomization analyses suggested causal roles for several proteins, for example, ApoE, CD33, and GRN in Alzheimer's disease, MMP-10 in preclinical Alzheimer's disease, SIGLEC9 in amyotrophic lateral sclerosis, and CD38, GPNMB, and ADAM15 in Parkinson's disease. CSF levels of GRN, MMP-10, and GPNMB were altered in Alzheimer's disease, preclinical Alzheimer's disease, and Parkinson's disease, respectively. These findings point to pathways to be explored for novel therapies. The novel finding that ventricular volume confounded pQTLs has implications for design of future studies of the genetic regulation of the CSF proteome.
Assuntos
Doença de Alzheimer , Doença de Parkinson , Humanos , Doença de Alzheimer/genética , Doença de Alzheimer/líquido cefalorraquidiano , Metaloproteinase 10 da Matriz/genética , Doença de Parkinson/genética , Proteômica , Locos de Características Quantitativas , Biomarcadores/líquido cefalorraquidiano , Antígenos CD , Lectinas Semelhantes a Imunoglobulina de Ligação ao Ácido Siálico/genética , Proteínas de Membrana/genética , Proteínas ADAM/genética , Glicoproteínas de Membrana/genéticaRESUMO
Metabolites are small molecules that are useful for estimating disease risk and elucidating disease biology. Nevertheless, their causal effects on human diseases have not been evaluated comprehensively. We performed two-sample Mendelian randomization to systematically infer the causal effects of 1,099 plasma metabolites measured in 6,136 Finnish men from the METSIM study on risk of 2,099 binary disease endpoints measured in 309,154 Finnish individuals from FinnGen. We identified evidence for 282 causal effects of 70 metabolites on 183 disease endpoints (FDR<1%). We found 25 metabolites with potential causal effects across multiple disease domains, including ascorbic acid 2-sulfate affecting 26 disease endpoints in 12 disease domains. Our study suggests that N-acetyl-2-aminooctanoate and glycocholenate sulfate affect risk of atrial fibrillation through two distinct metabolic pathways and that N-methylpipecolate may mediate the causal effect of N6, N6-dimethyllysine on anxious personality disorder. This study highlights the broad causal impact of plasma metabolites and widespread metabolic connections across diseases.
RESUMO
Associations between human genetic variation and clinical phenotypes have become a foundation of biomedical research. Most repositories of these data seek to be disease-agnostic and therefore lack disease-focused views. The Type 2 Diabetes Knowledge Portal (T2DKP) is a public resource of genetic datasets and genomic annotations dedicated to type 2 diabetes (T2D) and related traits. Here, we seek to make the T2DKP more accessible to prospective users and more useful to existing users. First, we evaluate the T2DKP's comprehensiveness by comparing its datasets with those of other repositories. Second, we describe how researchers unfamiliar with human genetic data can begin using and correctly interpreting them via the T2DKP. Third, we describe how existing users can extend their current workflows to use the full suite of tools offered by the T2DKP. We finally discuss the lessons offered by the T2DKP toward the goal of democratizing access to complex disease genetic results.
Assuntos
Diabetes Mellitus Tipo 2 , Humanos , Diabetes Mellitus Tipo 2/genética , Acesso à Informação , Estudos Prospectivos , Genômica/métodos , FenótipoRESUMO
Pleiotropy and genetic correlation are widespread features in genome-wide association studies (GWAS), but they are often difficult to interpret at the molecular level. Here, we perform GWAS of 16 metabolites clustered at the intersection of amino acid catabolism, glycolysis, and ketone body metabolism in a subset of UK Biobank. We utilize the well-documented biochemistry jointly impacting these metabolites to analyze pleiotropic effects in the context of their pathways. Among the 213 lead GWAS hits, we find a strong enrichment for genes encoding pathway-relevant enzymes and transporters. We demonstrate that the effect directions of variants acting on biology between metabolite pairs often contrast with those of upstream or downstream variants as well as the polygenic background. Thus, we find that these outlier variants often reflect biology local to the traits. Finally, we explore the implications for interpreting disease GWAS, underscoring the potential of unifying biochemistry with dense metabolomics data to understand the molecular basis of pleiotropy in complex traits and diseases.
Assuntos
Pleiotropia Genética , Estudo de Associação Genômica Ampla , Aminoácidos/genética , Cetonas , Fenótipo , Polimorfismo de Nucleotídeo ÚnicoRESUMO
The recognition that individual GPCRs can activate multiple signaling pathways has raised the possibility of developing drugs selectively targeting therapeutically relevant ones. This requires tools to determine which G proteins and ßarrestins are activated by a given receptor. Here, we present a set of BRET sensors monitoring the activation of the 12 G protein subtypes based on the translocation of their effectors to the plasma membrane (EMTA). Unlike most of the existing detection systems, EMTA does not require modification of receptors or G proteins (except for Gs). EMTA was found to be suitable for the detection of constitutive activity, inverse agonism, biased signaling and polypharmacology. Profiling of 100 therapeutically relevant human GPCRs resulted in 1500 pathway-specific concentration-response curves and revealed a great diversity of coupling profiles ranging from exquisite selectivity to broad promiscuity. Overall, this work describes unique resources for studying the complexities underlying GPCR signaling and pharmacology.