RESUMO
The Pharma Proteomics Project is a precompetitive biopharmaceutical consortium characterizing the plasma proteomic profiles of 54,219 UK Biobank participants. Here we provide a detailed summary of this initiative, including technical and biological validations, insights into proteomic disease signatures, and prediction modelling for various demographic and health indicators. We present comprehensive protein quantitative trait locus (pQTL) mapping of 2,923 proteins that identifies 14,287 primary genetic associations, of which 81% are previously undescribed, alongside ancestry-specific pQTL mapping in non-European individuals. The study provides an updated characterization of the genetic architecture of the plasma proteome, contextualized with projected pQTL discovery rates as sample sizes and proteomic assay coverages increase over time. We offer extensive insights into trans pQTLs across multiple biological domains, highlight genetic influences on ligand-receptor interactions and pathway perturbations across a diverse collection of cytokines and complement networks, and illustrate long-range epistatic effects of ABO blood group and FUT2 secretor status on proteins with gastrointestinal tissue-enriched expression. We demonstrate the utility of these data for drug discovery by extending the genetic proxied effects of protein targets, such as PCSK9, on additional endpoints, and disentangle specific genes and proteins perturbed at loci associated with COVID-19 susceptibility. This public-private partnership provides the scientific community with an open-access proteomics resource of considerable breadth and depth to help to elucidate the biological mechanisms underlying proteo-genomic discoveries and accelerate the development of biomarkers, predictive models and therapeutics1.
Assuntos
Bancos de Espécimes Biológicos , Proteínas Sanguíneas , Bases de Dados Factuais , Genômica , Saúde , Proteoma , Proteômica , Humanos , Sistema ABO de Grupos Sanguíneos/genética , Proteínas Sanguíneas/análise , Proteínas Sanguíneas/genética , COVID-19/genética , Descoberta de Drogas , Epistasia Genética , Fucosiltransferases/metabolismo , Predisposição Genética para Doença , Plasma/química , Pró-Proteína Convertase 9/metabolismo , Proteoma/análise , Proteoma/genética , Parcerias Público-Privadas , Locos de Características Quantitativas , Reino Unido , Galactosídeo 2-alfa-L-FucosiltransferaseRESUMO
The manganese (Mn) export protein SLC30A10 is essential for Mn excretion via the liver and intestines. Patients with SLC30A10 deficiency develop Mn excess, dystonia, liver disease, and polycythemia. Recent genome-wide association studies revealed a link between the SLC30A10 variant T95I and markers of liver disease. The in vivo relevance of this variant has yet to be investigated. Using in vitro and in vivo models, we explore the impact of the T95I variant on SLC30A10 function. While SLC30A10 I95 expressed at lower levels than T95 in transfected cell lines, both T95 and I95 variants protected cells similarly from Mn-induced toxicity. Adeno-associated virus 8-mediated expression of T95 or I95 SLC30A10 using the liver-specific thyroxine binding globulin promoter normalized liver Mn levels in mice with hepatocyte Slc30a10 deficiency. Furthermore, Adeno-associated virus-mediated expression of T95 or I95 SLC30A10 normalized red blood cell parameters and body weights and attenuated Mn levels and differential gene expression in livers and brains of mice with whole body Slc30a10 deficiency. While our in vivo data do not indicate that the T95I variant significantly compromises SLC30A10 function, it does reinforce the notion that the liver is a key site of SLC30A10 function. It also supports the idea that restoration of hepatic SLC30A10 expression is sufficient to attenuate phenotypes in SLC30A10 deficiency.
Assuntos
Substituição de Aminoácidos , Proteínas de Transporte de Cátions , Dependovirus , Fígado , Manganês , Mutação , Animais , Camundongos , Peso Corporal , Encéfalo/metabolismo , Proteínas de Transporte de Cátions/deficiência , Proteínas de Transporte de Cátions/genética , Proteínas de Transporte de Cátions/metabolismo , Linhagem Celular , Dependovirus/genética , Eritrócitos , Estudo de Associação Genômica Ampla , Hepatócitos/metabolismo , Fígado/citologia , Fígado/metabolismo , Hepatopatias/genética , Hepatopatias/metabolismo , Manganês/metabolismo , Intoxicação por Manganês/metabolismo , Fenótipo , Regiões Promotoras Genéticas , Globulina de Ligação a Tiroxina/genéticaRESUMO
Chromatin is important for the regulation of transcription and other functions, yet the diversity of chromatin composition and the distribution along chromosomes are still poorly characterized. By integrative analysis of genome-wide binding maps of 53 broadly selected chromatin components in Drosophila cells, we show that the genome is segmented into five principal chromatin types that are defined by unique yet overlapping combinations of proteins and form domains that can extend over > 100 kb. We identify a repressive chromatin type that covers about half of the genome and lacks classic heterochromatin markers. Furthermore, transcriptionally active euchromatin consists of two types that differ in molecular organization and H3K36 methylation and regulate distinct classes of genes. Finally, we provide evidence that the different chromatin types help to target DNA-binding factors to specific genomic regions. These results provide a global view of chromatin diversity and domain organization in a metazoan cell.
Assuntos
Cromatina/classificação , Proteínas de Ligação a DNA/análise , Proteínas de Drosophila/análise , Drosophila melanogaster/genética , Animais , Linhagem Celular , Cromatina/metabolismo , Proteínas de Ligação a DNA/metabolismo , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/metabolismo , Eucromatina/metabolismo , Heterocromatina/metabolismo , Histonas/metabolismo , Análise de Componente PrincipalRESUMO
The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32-0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45-1.57). The number of clustered mutations increases faster with the mother's age than with the father's, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19-0.33%) decrease in cytosine-phosphate-guanine to thymine-phosphate-guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28-0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.
Assuntos
Envelhecimento/genética , Mutação em Linhagem Germinativa/genética , Idade Materna , Mutagênese , Pais , Idade Paterna , Adolescente , Adulto , Idoso , Animais , Criança , Cromossomos Humanos Par 8/genética , Evolução Molecular , Feminino , Sequência Rica em GC , Genoma Humano/genética , Gorilla gorilla/genética , Humanos , Mutação INDEL , Islândia , Desequilíbrio de Ligação/genética , Masculino , Pessoa de Meia-Idade , Taxa de Mutação , Pan troglodytes/genética , Polimorfismo de Nucleotídeo Único , Pongo/genética , Adulto JovemRESUMO
The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.
Assuntos
Epigênese Genética/genética , Epigenômica , Genoma Humano/genética , Sequência de Bases , Linhagem da Célula/genética , Células Cultivadas , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Cromossomos Humanos/química , Cromossomos Humanos/genética , Cromossomos Humanos/metabolismo , DNA/química , DNA/genética , DNA/metabolismo , Metilação de DNA , Conjuntos de Dados como Assunto , Elementos Facilitadores Genéticos/genética , Variação Genética/genética , Estudo de Associação Genômica Ampla , Histonas/metabolismo , Humanos , Especificidade de Órgãos/genética , RNA/genética , Valores de ReferênciaRESUMO
Romosozumab (EVENITY™ [romosozumab-aqqg in the US]) is a humanized monoclonal antibody that inhibits sclerostin and has been approved in several countries for the treatment of osteoporosis in postmenopausal women at high risk of fracture. Sclerostin is expressed in bone and aortic vascular smooth muscle (AVSM). Its function in AVSM is unclear but it has been proposed to inhibit vascular calcification, atheroprogression, and inflammation. An increased incidence of positively adjudicated serious cardiovascular adverse events driven by an increase in myocardial infarction and stroke was observed in romosozumab-treated subjects in a clinical trial comparing alendronate with romosozumab (ARCH; NCT01631214) but not in a placebo-controlled trial (FRAME; NCT01575834). To investigate the effects of sclerostin inhibition with sclerostin antibody on the cardiovascular system, a comprehensive nonclinical toxicology package with additional cardiovascular studies was conducted. Although pharmacodynamic effects were observed in the bone, there were no functional, morphological, or transcriptional effects on the cardiovascular system in animal models in the presence or absence of atherosclerosis. These nonclinical studies did not identify evidence that proves the association between sclerostin inhibition and adverse cardiovascular function, increased cardiovascular calcification, and atheroprogression.
Assuntos
Proteínas Adaptadoras de Transdução de Sinal/antagonistas & inibidores , Anticorpos Monoclonais/farmacologia , Conservadores da Densidade Óssea/farmacologia , Sistema Cardiovascular/efeitos dos fármacos , Animais , Anticorpos Monoclonais/uso terapêutico , Conservadores da Densidade Óssea/uso terapêutico , Avaliação Pré-Clínica de Medicamentos , Feminino , Fraturas Ósseas/prevenção & controle , Humanos , Macaca fascicularis , Masculino , Camundongos Endogâmicos C57BL , Camundongos Knockout para ApoE , Osteoporose/tratamento farmacológico , Ratos Sprague-Dawley , RiscoRESUMO
More than 90% of common variants associated with complex traits do not affect proteins directly, but instead the circuits that control gene expression. This has increased the urgency of understanding the regulatory genome as a key component for translating genetic results into mechanistic insights and ultimately therapeutics. To address this challenge, we developed HaploReg (http://compbio.mit.edu/HaploReg) to aid the functional dissection of genome-wide association study (GWAS) results, the prediction of putative causal variants in haplotype blocks, the prediction of likely cell types of action, and the prediction of candidate target genes by systematic mining of comparative, epigenomic and regulatory annotations. Since first launching the website in 2011, we have greatly expanded HaploReg, increasing the number of chromatin state maps to 127 reference epigenomes from ENCODE 2012 and Roadmap Epigenomics, incorporating regulator binding data, expanding regulatory motif disruption annotations, and integrating expression quantitative trait locus (eQTL) variants and their tissue-specific target genes from GTEx, Geuvadis, and other recent studies. We present these updates as HaploReg v4, and illustrate a use case of HaploReg for attention deficit hyperactivity disorder (ADHD)-associated SNPs with putative brain regulatory mechanisms.
Assuntos
Bases de Dados Genéticas , Doença/genética , Estudo de Associação Genômica Ampla , Haplótipos , Polimorfismo de Nucleotídeo Único , Transtorno do Deficit de Atenção com Hiperatividade/genética , Sítios de Ligação , Mineração de Dados , Epigenômica , Expressão Gênica , Genes , Variação Genética , Humanos , Locos de Características Quantitativas , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/metabolismoRESUMO
Chromatin profiling has emerged as a powerful means of genome annotation and detection of regulatory activity. The approach is especially well suited to the characterization of non-coding portions of the genome, which critically contribute to cellular phenotypes yet remain largely uncharted. Here we map nine chromatin marks across nine cell types to systematically characterize regulatory elements, their cell-type specificities and their functional interactions. Focusing on cell-type-specific patterns of promoters and enhancers, we define multicell activity profiles for chromatin state, gene expression, regulatory motif enrichment and regulator expression. We use correlations between these profiles to link enhancers to putative target genes, and predict the cell-type-specific activators and repressors that modulate them. The resulting annotations and regulatory predictions have implications for the interpretation of genome-wide association studies. Top-scoring disease single nucleotide polymorphisms are frequently positioned within enhancer elements specifically active in relevant cell types, and in some cases affect a motif instance for a predicted regulator, thus suggesting a mechanism for the association. Our study presents a general framework for deciphering cis-regulatory connections and their roles in disease.
Assuntos
Fenômenos Fisiológicos Celulares , Cromatina/genética , Cromatina/metabolismo , Mapeamento Cromossômico , Sítios de Ligação , Linhagem Celular , Linhagem Celular Tumoral , Células Cultivadas , Regulação da Expressão Gênica , Genoma Humano/genética , Células Hep G2 , Humanos , Regiões Promotoras Genéticas/genética , Reprodutibilidade dos Testes , Fatores de Transcrição/genéticaRESUMO
The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering â¼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for â¼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.
Assuntos
Evolução Molecular , Genoma Humano/genética , Genoma/genética , Mamíferos/genética , Animais , Doença , Éxons/genética , Genômica , Saúde , Humanos , Anotação de Sequência Molecular , Filogenia , RNA/classificação , RNA/genética , Seleção Genética/genética , Alinhamento de Sequência , Análise de Sequência de DNARESUMO
With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease.
Assuntos
DNA/genética , Genoma Humano/genética , Evolução Biológica , Doença/genética , Humanos , Sequências Reguladoras de Ácido Nucleico/genética , SoftwareRESUMO
BACKGROUND: Recent chromatin immunoprecipitation (ChIP) experiments in fly, mouse, and human have revealed the existence of high-occupancy target (HOT) regions or "hotspots" that show enrichment across many assayed DNA-binding proteins. Similar co-enrichment observed in yeast so far has been treated as artifactual, and has not been fully characterized. RESULTS: Here we reanalyze ChIP data from both array-based and sequencing-based experiments to show that in the yeast S. cerevisiae, the collective enrichment phenomenon is strongly associated with proximity to noncoding RNA genes and with nucleosome depletion. DNA sequence motifs that confer binding affinity for the proteins are largely absent from these hotspots, suggesting that protein-protein interactions play a prominent role. The hotspots are condition-specific, suggesting that they reflect a chromatin state or protein state, and are not a static feature of underlying sequence. Additionally, only a subset of all assayed factors is associated with these loci, suggesting that the co-enrichment cannot be simply explained by a chromatin state that is universally more prone to immunoprecipitation. CONCLUSIONS: Together our results suggest that the co-enrichment patterns observed in yeast represent transcription factor co-occupancy. More generally, they make clear that great caution must be used when interpreting ChIP enrichment profiles for individual factors in isolation, as they will include factor-specific as well as collective contributions.
Assuntos
Cromatina/metabolismo , Saccharomyces cerevisiae/genética , Imunoprecipitação da Cromatina , Nucleossomos/genética , Nucleossomos/metabolismo , RNA não Traduzido/genética , RNA não Traduzido/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Fatores de Transcrição/genéticaRESUMO
The resolution of genome-wide association studies (GWAS) is limited by the linkage disequilibrium (LD) structure of the population being studied. Selecting the most likely causal variants within an LD block is relatively straightforward within coding sequence, but is more difficult when all variants are intergenic. Predicting functional non-coding sequence has been recently facilitated by the availability of conservation and epigenomic information. We present HaploReg, a tool for exploring annotations of the non-coding genome among the results of published GWAS or novel sets of variants. Using LD information from the 1000 Genomes Project, linked SNPs and small indels can be visualized along with their predicted chromatin state in nine cell types, conservation across mammals and their effect on regulatory motifs. Sets of SNPs, such as those resulting from GWAS, are analyzed for an enrichment of cell type-specific enhancers. HaploReg will be useful to researchers developing mechanistic hypotheses of the impact of non-coding variants on clinical phenotypes and normal variation. The HaploReg database is available at http://compbio.mit.edu/HaploReg.
Assuntos
Cromatina/química , Bases de Dados de Ácidos Nucleicos , Estudo de Associação Genômica Ampla , Mutação INDEL , Polimorfismo de Nucleotídeo Único , Animais , Sequência de Bases , Sequência Conservada , Elementos Facilitadores Genéticos , Ligação Genética , Genômica , Desequilíbrio de Ligação , Anotação de Sequência Molecular , Motivos de Nucleotídeos , Sequências Reguladoras de Ácido NucleicoRESUMO
A single gene can produce multiple transcripts with distinct molecular functions. Rare-variant association tests often aggregate all coding variants across individual genes, without accounting for the variants' presence or consequence in resulting transcript isoforms. To evaluate the utility of transcript-aware variant sets, rare predicted loss-of-function (pLOF) variants were aggregated for 17,035 protein-coding genes using 55,558 distinct transcript-specific variant sets. These sets were tested for their association with 728 circulating proteins and 188 quantitative phenotypes across 406,921 individuals in the UK Biobank. The transcript-specific approach resulted in larger estimated effects of pLOF variants decreasing serum cis-protein levels compared to the gene-based approach (pbinom ≤ 2x10-16). Additionally, 251 quantitative trait associations were identified as being significant using the transcript-specific approach but not the gene-based approach, including PCSK5 transcript ENST00000376752 and standing height (transcript-specific statistic, P = 1.3x10-16, effect = 0.7 SD decrease; gene-based statistic, P = 0.02, effect = 0.05 SD decrease) and LDLR transcript ENST00000252444 and apolipoprotein B (transcript-specific statistic, P = 5.7x10-20, effect = 1.0 SD increase; gene-based statistic, P = 3.0x10-4, effect = 0.2 SD increase). This approach demonstrates the importance of considering the effect of pLOFs on specific transcript isoforms when performing rare-variant association studies.
Assuntos
Bancos de Espécimes Biológicos , Biobanco do Reino Unido , Humanos , Biologia Computacional , Fenótipo , Isoformas de Proteínas/genéticaRESUMO
Human genetics research has discovered thousands of proteins associated with complex and rare diseases. Genome-wide association studies (GWAS) and studies of Mendelian disease have resulted in an increased understanding of the role of gene function and regulation in human conditions. Although the application of human genetics has been explored primarily as a method to identify potential drug targets and support their relevance to disease in humans, there is increasing interest in using genetic data to identify potential safety liabilities of modulating a given target. Human genetic variants can be used as a model to anticipate the effect of lifelong modulation of therapeutic targets and identify the potential risk for on-target adverse events. This approach is particularly useful for non-clinical safety evaluation of novel therapeutics that lack pharmacologically relevant animal models and can contribute to the intrinsic safety profile of a drug target. This Review illustrates applications of human genetics to safety studies during drug discovery and development, including assessing the potential for on- and off-target associated adverse events, carcinogenicity risk assessment, and guiding translational safety study designs and monitoring strategies. A summary of available human genetic resources and recommended best practices is provided. The challenges and future perspectives of translating human genetic information to identify risks for potential drug effects in preclinical and clinical development are discussed.
Assuntos
Estudo de Associação Genômica Ampla , Genética Humana , Animais , HumanosRESUMO
BACKGROUND: Whole-exome sequencing (WES) is an effective tool for diagnosis in patients who remain undiagnosed despite a comprehensive clinical work-up. While WES is being used increasingly in pediatrics and oncology, it remains underutilized in non-oncological adult medicine, including in patients with liver disease, in part based on the faulty premise that adults are unlikely to harbor rare genetic variants with large effect size. Here, we aim to assess the burden of rare genetic variants underlying liver disease in adults at two major tertiary referral academic medical centers. METHODS: WES analysis paired with comprehensive clinical evaluation was performed in fifty-two adult patients with liver disease of unknown etiology evaluated at two US tertiary academic health care centers. FINDINGS: Exome analysis uncovered a definitive or presumed diagnosis in 33% of patients (17/52) providing insight into their disease pathogenesis, with most of these patients (12/17) not having a known family history of liver disease. Our data shows that over two-thirds of undiagnosed liver disease patients attaining a genetic diagnosis were being evaluated for cholestasis or hepatic steatosis of unknown etiology. INTERPRETATION: This study reveals an underappreciated incidence and spectrum of genetic diseases presenting in adulthood and underscores the clinical value of incorporating exome sequencing in the evaluation and management of adults with liver disease of unknown etiology. FUNDING: S.V. is supported by the NIH/NIDDK (K08 DK113109 and R01 DK131033-01A1) and the Doris Duke Charitable Foundation Grant #2019081. This work was supported in part by NIH-funded Yale Liver Center, P30 DK34989.
Assuntos
Fígado Gorduroso , Hepatopatias , Humanos , Adulto , Criança , Sequenciamento do Exoma , Hepatopatias/diagnóstico , Hepatopatias/genética , Hepatopatias/terapia , Fígado Gorduroso/genética , Exoma/genéticaRESUMO
The age of menopause is associated with fertility and disease risk, and its genetic control is of great interest. We use whole-exome sequences from 132,370 women in the UK Biobank to test for associations between rare damaging variants and age at natural menopause. Rare damaging variants in five genes are significantly associated with menopause: CHEK2 (p = 3.3 × 10-51), DCLRE1A (p = 8.4 × 10-13), and HELB (p = 5.7 × 10-7) with later menopause and TOP3A (p = 7.6 × 10-8) and CLPB (p = 8.1 × 10-7) with earlier menopause. Two additional genes are suggestive: RAD54L (p = 2.4 × 10-6) with later menopause and HROB (p = 2.9 × 10-6) with earlier menopause. In a follow-up analysis of repeated questionnaires in women who were initially premenopausal, CHEK2, TOP3A, and RAD54L genotypes are associated with subsequent menopause. Consistent with previous genome-wide association studies (GWASs), six of the seven genes are involved in the DNA damage repair pathway. Phenome-wide scans across 398,569 men and women revealed that in addition to known associations with cancers and blood cell counts, rare variants in CHEK2 are also associated with increased risk for uterine fibroids, polycystic ovary syndrome, and prostate hypertrophy; these associations are not shared with higher-penetrance breast cancer genes. Causal mediation analysis suggests that approximately 8% of the breast cancer risk conferred by CHEK2 pathogenic variants after menopause is mediated through delayed menopause.
RESUMO
Identifying genetic variants associated with lower waist-to-hip ratio can reveal new therapeutic targets for abdominal obesity. We use exome sequences from 362,679 individuals to identify genes associated with waist-to-hip ratio adjusted for BMI (WHRadjBMI), a surrogate for abdominal fat that is causally linked to type 2 diabetes and coronary heart disease. Predicted loss of function (pLOF) variants in INHBE associate with lower WHRadjBMI and this association replicates in data from AMP-T2D-GENES. INHBE encodes a secreted protein, the hepatokine activin E. In vitro characterization of the most common INHBE pLOF variant in our study, indicates an in-frame deletion resulting in a 90% reduction in secreted protein levels. We detect associations with lower WHRadjBMI for variants in ACVR1C, encoding an activin receptor, further highlighting the involvement of activins in regulating fat distribution. These findings highlight activin E as a potential therapeutic target for abdominal obesity, a phenotype linked to cardiometabolic disease.
Assuntos
Diabetes Mellitus Tipo 2 , Subunidades beta de Inibinas/genética , Receptores de Ativinas Tipo I/genética , Índice de Massa Corporal , Diabetes Mellitus Tipo 2/genética , Humanos , Obesidade/genética , Obesidade Abdominal/genética , Relação Cintura-QuadrilRESUMO
Novel modalities such as PROTAC and RNAi have the ability to inadvertently alter the abundance of endogenous proteins. Currently available in vitro secondary pharmacology assays, which evaluate off-target binding or activity of small molecules, do not fully assess the off-target effects of PROTAC and are not applicable to RNAi. To address this gap, we developed a proteomics-based platform to comprehensively evaluate the abundance of off-target proteins. First, we selected off-target proteins using genetics and pharmacology evidence. This process yielded 2813 proteins, which we refer to as the "selected off-target proteome" (SOTP). An iterative algorithm was then used to identify four human cell lines out of 932. The 4 cell lines collectively expressed ~ 80% of the SOTP based on transcriptome data. Second, we used mass spectrometry to quantify the intracellular and extracellular proteins from the selected cell lines. Among over 10,000 quantifiable proteins identified, 1828 were part of the predefined SOTP. The SOTP was designed to be easily modified or expanded, owing to the rational selection process developed and the label free LC-MS/MS approach chosen. This versatility inherent to our platform is essential to design fit-for-purpose studies that can address the dynamic questions faced in investigative toxicology.
Assuntos
Biomarcadores Tumorais/metabolismo , Regulação Neoplásica da Expressão Gênica , Neoplasias/metabolismo , Neoplasias/patologia , Proteoma/análise , Proteoma/metabolismo , Proliferação de Células , Humanos , Células Tumorais CultivadasRESUMO
Sequencing of large cohorts offers an unprecedented opportunity to identify rare genetic variants and to find novel contributors to human disease. We used gene-based collapsing tests to identify genes associated with glucose, HbA1c and type 2 diabetes (T2D) diagnosis in 379,066 exome-sequenced participants in the UK Biobank. We identified associations for variants in GCK, HNF1A and PDX1, which are known to be involved in Mendelian forms of diabetes. Notably, we uncovered novel associations for GIGYF1, a gene not previously implicated by human genetics in diabetes. GIGYF1 predicted loss of function (pLOF) variants associated with increased levels of glucose (0.77 mmol/L increase, p = 4.42 × 10-12) and HbA1c (4.33 mmol/mol, p = 1.28 × 10-14) as well as T2D diagnosis (OR = 4.15, p = 6.14 × 10-11). Multiple rare variants contributed to these associations, including singleton variants. GIGYF1 pLOF also associated with decreased cholesterol levels as well as an increased risk of hypothyroidism. The association of GIGYF1 pLOF with T2D diagnosis replicated in an independent cohort from the Geisinger Health System. In addition, a common variant association for glucose and T2D was identified at the GIGYF1 locus. Our results highlight the role of GIGYF1 in regulating insulin signaling and protecting from diabetes.