RESUMEN
Genome-wide association studies (GWASs) have identified numerous lung cancer risk-associated loci. However, decoding molecular mechanisms of these associations is challenging since most of these genetic variants are non-protein-coding with unknown function. Here, we implemented massively parallel reporter assays (MPRAs) to simultaneously measure the allelic transcriptional activity of risk-associated variants. We tested 2,245 variants at 42 loci from 3 recent GWASs in East Asian and European populations in the context of two major lung cancer histological types and exposure to benzo(a)pyrene. This MPRA approach identified one or more variants (median 11 variants) with significant effects on transcriptional activity at 88% of GWAS loci. Multimodal integration of lung-specific epigenomic data demonstrated that 63% of the loci harbored multiple potentially functional variants in linkage disequilibrium. While 22% of the significant variants showed allelic effects in both A549 (adenocarcinoma) and H520 (squamous cell carcinoma) cell lines, a subset of the functional variants displayed a significant cell-type interaction. Transcription factor analyses nominated potential regulators of the functional variants, including those with cell-type-specific expression and those predicted to bind multiple potentially functional variants across the GWAS loci. Linking functional variants to target genes based on four complementary approaches identified candidate susceptibility genes, including those affecting lung cancer cell growth. CRISPR interference of the top functional variant at 20q13.33 validated variant-to-gene connections, including RTEL1, SOX18, and ARFRP1. Our data provide a comprehensive functional analysis of lung cancer GWAS loci and help elucidate the molecular basis of heterogeneity and polygenicity underlying lung cancer susceptibility.
Asunto(s)
Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Neoplasias Pulmonares , Polimorfismo de Nucleótido Simple , Humanos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patología , Desequilibrio de Ligamiento , Herencia Multifactorial/genética , Línea Celular Tumoral , Alelos , Células A549RESUMEN
Genome-wide association studies (GWASs) have discovered 20 risk loci in the human genome where germline variants associate with risk of pancreatic ductal adenocarcinoma (PDAC) in populations of European ancestry. Here, we fine-mapped one such locus on chr16q23.1 (rs72802365, p = 2.51 × 10-17, OR = 1.36, 95% CI = 1.31-1.40) and identified colocalization (PP = 0.87) with aberrant exon 5-7 CTRB2 splicing in pancreatic tissues (pGTEx = 1.40 × 10-69, ßGTEx = 1.99; pLTG = 1.02 × 10-30, ßLTG = 1.99). Imputation of a 584 bp structural variant overlapping exon 6 of CTRB2 into the GWAS datasets resulted in a highly significant association with pancreatic cancer risk (p = 2.83 × 10-16, OR = 1.36, 95% CI = 1.31-1.42), indicating that it may underlie this signal. Exon skipping attributable to the deletion (risk) allele introduces a premature stop codon in exon 7 of CTRB2, yielding a truncated chymotrypsinogen B2 protein that lacks chymotrypsin activity, is poorly secreted, and accumulates intracellularly in the endoplasmic reticulum (ER). We propose that intracellular accumulation of a nonfunctional chymotrypsinogen B2 protein leads to ER stress and pancreatic inflammation, which may explain the increased pancreatic cancer risk in carriers of CTRB2 exon 6 deletion alleles.
Asunto(s)
Quimotripsina/genética , Neoplasias Pancreáticas/patología , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Eliminación de Secuencia , Estudios de Casos y Controles , Quimotripsina/antagonistas & inhibidores , Quimotripsina/metabolismo , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Neoplasias Pancreáticas/etiología , Neoplasias Pancreáticas/metabolismoRESUMEN
Expression QTL (eQTL) analyses have suggested many genes mediating genome-wide association study (GWAS) signals but most GWAS signals still lack compelling explanatory genes. We have leveraged an adipose-specific gene regulatory network to infer expression regulator activities and phenotypic master regulators (MRs), which were used to detect activity QTLs (aQTLs) at cardiometabolic trait GWAS loci. Regulator activities were inferred with the VIPER algorithm that integrates enrichment of expected expression changes among a regulator's target genes with confidence in their regulator-target network interactions and target overlap between different regulators (i.e., pleiotropy). Phenotypic MRs were identified as those regulators whose activities were most important in predicting their respective phenotypes using random forest modeling. While eQTLs were typically more significant than aQTLs in cis, the opposite was true among candidate MRs in trans. Several GWAS loci colocalized with MR trans-eQTLs/aQTLs in the absence of colocalized cis-QTLs. Intriguingly, at the 1p36.1 BMI GWAS locus the EPHB2 cis-aQTL was stronger than its cis-eQTL and colocalized with the GWAS signal and 35 BMI MR trans-aQTLs, suggesting the GWAS signal may be mediated by effects on EPHB2 activity and its downstream effects on a network of BMI MRs. These MR and aQTL analyses represent systems genetic methods that may be broadly applied to supplement standard eQTL analyses for suggesting molecular effects mediating GWAS signals.
Asunto(s)
Redes Reguladoras de Genes , Miocardio/metabolismo , Estudio de Asociación del Genoma Completo/métodos , Humanos , Fenotipo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Receptor EphB2/genética , Grasa Subcutánea/metabolismo , TranscriptomaRESUMEN
OBJECTIVE: To elucidate the genetic architecture of gene expression in pancreatic tissues. DESIGN: We performed expression quantitative trait locus (eQTL) analysis in histologically normal pancreatic tissue samples (n=95) using RNA sequencing and the corresponding 1000 genomes imputed germline genotypes. Data from pancreatic tumour-derived tissue samples (n=115) from The Cancer Genome Atlas were included for comparison. RESULTS: We identified 38 615 cis-eQTLs (in 484 genes) in histologically normal tissues and 39 713 cis-eQTL (in 237 genes) in tumour-derived tissues (false discovery rate <0.1), with the strongest effects seen near transcriptional start sites. Approximately 23% and 42% of genes with significant cis-eQTLs appeared to be specific for tumour-derived and normal-derived tissues, respectively. Significant enrichment of cis-eQTL variants was noted in non-coding regulatory regions, in particular for pancreatic tissues (1.53-fold to 3.12-fold, p≤0.0001), indicating tissue-specific functional relevance. A common pancreatic cancer risk locus on 9q34.2 (rs687289) was associated with ABO expression in histologically normal (p=5.8×10-8) and tumour-derived (p=8.3×10-5) tissues. The high linkage disequilibrium between this variant and the O blood group generating deletion variant in ABO (exon 6) suggested that nonsense-mediated decay (NMD) of the 'O' mRNA might explain this finding. However, knockdown of crucial NMD regulators did not influence decay of the ABO 'O' mRNA, indicating that a gene regulatory element influenced by pancreatic cancer risk alleles may underlie the eQTL. CONCLUSIONS: We have identified cis-eQTLs representing potential functional regulatory variants in the pancreas and generated a rich data set for further studies on gene expression and its regulation in pancreatic tissues.
Asunto(s)
Sistema del Grupo Sanguíneo ABO/genética , Expresión Génica , Páncreas , Neoplasias Pancreáticas/genética , Sitios de Carácter Cuantitativo , ARN Neoplásico/análisis , Transcriptoma , Alelos , Cromosomas Humanos Par 9 , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Degradación de ARNm Mediada por Codón sin Sentido , Polimorfismo de Nucleótido Simple , Secuencias Reguladoras de Ácidos Nucleicos , Análisis de Secuencia de ARNRESUMEN
Genome-wide association studies (GWAS) have identified multiple common susceptibility loci for pancreatic cancer. Here we report fine-mapping and functional analysis of one such locus residing in a 610 kb gene desert on chr13q22.1 (marked by rs9543325). The closest candidate genes, KLF5, KLF12, PIBF1, DIS3 and BORA, range in distance from 265-586 kb. Sequencing three sub-regions containing the top ranked SNPs by imputation P-value revealed a 30 bp insertion/deletion (indel) variant that was significantly associated with pancreatic cancer risk (rs386772267, P = 2.30 × 10-11, OR = 1.22, 95% CI 1.15-1.28) and highly correlated to rs9543325 (r2 = 0.97 in the 1000 Genomes EUR population). This indel was the most significant cis-eQTL variant in a set of 222 histologically normal pancreatic tissue samples (ß = 0.26, P = 0.004), with the insertion (risk-increasing) allele associated with reduced DIS3 expression. DIS3 encodes a catalytic subunit of the nuclear RNA exosome complex that mediates RNA processing and decay, and is mutated in several cancers. Chromosome conformation capture revealed a long range (570 kb) physical interaction between a sub-region of the risk locus, containing rs386772267, and a region â¼6 kb upstream of DIS3 Finally, repressor regulatory activity and allele-specific protein binding by transcription factors of the TCF/LEF family were observed for the risk-increasing allele of rs386772267, indicating that expression regulation at this risk locus may be influenced by the Wnt signaling pathway. In conclusion, we have identified a putative functional indel variant at chr13q22.1 that associates with decreased DIS3 expression in carriers of pancreatic cancer risk-increasing alleles, and could therefore affect nuclear RNA processing and/or decay.
Asunto(s)
Cromosomas Humanos Par 13 , Complejo Multienzimático de Ribonucleasas del Exosoma/genética , Neoplasias Pancreáticas/genética , Alelos , Línea Celular Tumoral , Cromatina/genética , Cromatina/metabolismo , Mapeo Cromosómico/métodos , Complejo Multienzimático de Ribonucleasas del Exosoma/metabolismo , Sitios Genéticos , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Mutación INDEL , Neoplasias Pancreáticas/enzimología , Neoplasias Pancreáticas/metabolismo , Análisis de Secuencia de ADN , Factores de Transcripción/genéticaRESUMEN
Genome-wide association studies (GWAS) have mapped risk alleles for at least 10 distinct cancers to a small region of 63 000 bp on chromosome 5p15.33. This region harbors the TERT and CLPTM1L genes; the former encodes the catalytic subunit of telomerase reverse transcriptase and the latter may play a role in apoptosis. To investigate further the genetic architecture of common susceptibility alleles in this region, we conducted an agnostic subset-based meta-analysis (association analysis based on subsets) across six distinct cancers in 34 248 cases and 45 036 controls. Based on sequential conditional analysis, we identified as many as six independent risk loci marked by common single-nucleotide polymorphisms: five in the TERT gene (Region 1: rs7726159, P = 2.10 × 10(-39); Region 3: rs2853677, P = 3.30 × 10(-36) and PConditional = 2.36 × 10(-8); Region 4: rs2736098, P = 3.87 × 10(-12) and PConditional = 5.19 × 10(-6), Region 5: rs13172201, P = 0.041 and PConditional = 2.04 × 10(-6); and Region 6: rs10069690, P = 7.49 × 10(-15) and PConditional = 5.35 × 10(-7)) and one in the neighboring CLPTM1L gene (Region 2: rs451360; P = 1.90 × 10(-18) and PConditional = 7.06 × 10(-16)). Between three and five cancers mapped to each independent locus with both risk-enhancing and protective effects. Allele-specific effects on DNA methylation were seen for a subset of risk loci, indicating that methylation and subsequent effects on gene expression may contribute to the biology of risk variants on 5p15.33. Our results provide strong support for extensive pleiotropy across this region of 5p15.33, to an extent not previously observed in other cancer susceptibility loci.
Asunto(s)
Cromosomas Humanos Par 5/química , Regulación Neoplásica de la Expresión Génica , Sitios Genéticos , Proteínas de la Membrana/genética , Proteínas de Neoplasias/genética , Neoplasias/genética , Telomerasa/genética , Alelos , Biología Computacional , Metilación de ADN , Epigénesis Genética , Femenino , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Neoplasias/patología , Oportunidad Relativa , Polimorfismo de Nucleótido Simple , RiesgoRESUMEN
Myotonic dystrophy type 1 (DM1) is a dominantly inherited neuromuscular disorder resulting from expression of RNA containing an expanded CUG repeat (CUG(exp)). The pathogenic RNA is retained in nuclear foci. Poly-(CUG) binding proteins in the Muscleblind-like (MBNL) family are sequestered in foci, causing misregulated alternative splicing of specific pre-mRNAs. Inhibitors of MBNL1-CUG(exp) binding have been shown to restore splicing regulation and correct phenotypes in DM1 models. We therefore conducted a high-throughput screen to identify novel inhibitors of MBNL1-(CUG)12 binding. The most active compound was lomofungin, a natural antimicrobial agent. We found that lomofungin undergoes spontaneous dimerization in DMSO, producing dilomofungin, whose inhibition of MBNL1-(CUG)12 binding was 17-fold more potent than lomofungin itself. However, while dilomofungin displayed the desired binding characteristics in vitro, when applied to cells it produced a large increase of CUG(exp) RNA in nuclear foci, owing to reduced turnover of the CUG(exp) transcript. By comparison, the monomer did not induce CUG(exp) accumulation in cells and was more effective at rescuing a CUG(exp)-induced splicing defect. These results support the feasibility of high-throughput screens to identify compounds targeting toxic RNA, but also demonstrate that ligands for repetitive sequences may have unexpected effects on RNA decay.
Asunto(s)
Fenazinas/farmacología , Proteínas de Unión al ARN/metabolismo , ARN/química , ARN/metabolismo , Regiones no Traducidas 3' , Empalme Alternativo/efectos de los fármacos , Dimerización , Humanos , Fenazinas/química , Fenazinas/metabolismo , Estabilidad del ARN/efectos de los fármacos , Secuencias Repetitivas de Ácidos NucleicosRESUMEN
Pancreatic ductal adenocarcinoma (PDAC) is driven by the accumulation of somatic mutations, epigenetic modifications and changes in the micro-environment. New approaches to investigating disruptions of gene expression networks promise to uncover key regulators and pathways in carcinogenesis. We performed messenger RNA-sequencing in pancreatic normal (n = 10) and tumor (n = 8) derived tissue samples, as well as in pancreatic cancer cell lines (n = 9), to determine differential gene expression (DE) patterns. Sub-network enrichment analyses identified HNF1A as the regulator of the most significantly and consistently dysregulated expression sub-network in pancreatic tumor tissues and cells (median P = 7.56×10(-7), median rank = 1, range = 1-25). To explore the effects of HNF1A expression in pancreatic tumor-derived cells, we generated stable HNF1A-inducible clones in two pancreatic cancer cell lines (PANC-1 and MIA PaCa-2) and observed growth inhibition (5.3-fold, P = 4.5×10(-5) for MIA PaCa-2 clones; 7.2-fold, P = 2.2×10(-5) for PANC-1 clones), and a G0/G1 cell cycle arrest and apoptosis upon induction. These effects correlated with HNF1A-induced down-regulation of 51 of 84 cell cycle genes (e.g. E2F1, CDK2, CDK4, MCM2/3/4/5, SKP2 and CCND1), decreased expression of anti-apoptotic genes (e.g. BIRC2/5/6 and AKT) and increased expression of pro-apoptotic genes (e.g. CASP4/9/10 and APAF1). In light of the established role of HNF1A in the regulation of pancreatic development and homeostasis, our data suggest that it also functions as an important tumor suppressor in the pancreas.
Asunto(s)
Biomarcadores de Tumor/genética , Carcinoma Ductal Pancreático/genética , Perfilación de la Expresión Génica , Genes Supresores de Tumor , Factor Nuclear 1-alfa del Hepatocito/genética , Neoplasias Pancreáticas/genética , Apoptosis , Biomarcadores de Tumor/metabolismo , Western Blotting , Carcinoma Ductal Pancreático/metabolismo , Carcinoma Ductal Pancreático/patología , Ciclo Celular , Proliferación Celular , Células Cultivadas , Citometría de Flujo , Redes Reguladoras de Genes , Factor Nuclear 1-alfa del Hepatocito/metabolismo , Humanos , Técnicas para Inmunoenzimas , Páncreas/metabolismo , Neoplasias Pancreáticas/metabolismo , Neoplasias Pancreáticas/patología , ARN Mensajero/genética , Reacción en Cadena en Tiempo Real de la Polimerasa , Reacción en Cadena de la Polimerasa de Transcriptasa InversaRESUMEN
Genetic and epigenetic variations in regulatory enhancer elements increase susceptibility to a range of pathologies. Despite recent advances, linking enhancer elements to target genes and predicting transcriptional outcomes of enhancer dysfunction remain significant challenges. Using 3D chromatin conformation assays, we generated an extensive enhancer interaction dataset for the human pancreas, encompassing more than 20 donors and five major cell types, including both exocrine and endocrine compartments. We employed a network approach to parse chromatin interactions into enhancer-promoter tree models, facilitating a quantitative, genome-wide analysis of enhancer connectivity. With these tree models, we developed a machine learning algorithm to estimate the impact of enhancer perturbations on cell type-specific gene expression in the human pancreas. Orthogonal to our computational approach, we perturbed enhancer function in primary human pancreas cells using CRISPR interference and quantified the effects at the single-cell level through RNA FISH coupled with high-throughput imaging. Our enhancer tree models enabled the annotation of common germline risk variants associated with pancreas diseases, linking them to putative target genes in specific cell types. For pancreatic ductal adenocarcinoma, we found a stronger enrichment of disease susceptibility variants within acinar cell regulatory elements, despite ductal cells historically being assumed as the primary cell-of-origin. Our integrative approach-combining cell type-specific enhancer-promoter interaction mapping, computational models, and single-cell enhancer perturbation assays-produced a robust resource for studying the genetic basis of pancreas disorders.
RESUMEN
Objective: Genome wide association studies have identified an exon 6 CTRB2 deletion variant that associates with increased risk of pancreatic cancer. To acquire evidence on its causal role, we developed a new mouse strain carrying an equivalent variant in Ctrb1 , the mouse orthologue of CTRB2 . Design: We used CRISPR/Cas9 to introduce a 707bp deletion in Ctrb1 encompassing exon 6 ( Ctrb1 Δexon6 ). This mutation closely mimics the human deletion variant. Mice carrying the mutant allele were extensively profiled at 3 months to assess their phenotype. Results: Ctrb1 Δexon6 mutant mice express a truncated CTRB1 that accumulates in the ER. The pancreas of homozygous mutant mice displays reduced chymotrypsin activity and total protein synthesis. The histological aspect of the pancreas is inconspicuous but ultrastructural analysis shows evidence of dramatic ER stress and cytoplasmic and nuclear inclusions. Transcriptomic analyses of the pancreas of mutant mice reveals acinar program down-regulation and increased activity of ER stress-related and inflammatory pathways. Heterozygous mice have an intermediate phenotype. Agr2 is one of the most up-regulated genes in mutant pancreata. Ctrb1 Δexon6 mice exhibit impaired recovery from acute caerulein-induced pancreatitis. Administration of TUDCA or sulindac partially alleviates the phenotype. A transcriptomic signature derived from the mutant pancreata is significantly enriched in normal human pancreas of CTRB2 exon 6 deletion variant carriers from the GTEx cohort. Conclusions: This mouse strain provides formal evidence that the Ctrb1 Δexon6 variant causes ER stress and inflammation in vivo , providing an excellent model to understand its contribution to pancreatic ductal adenocarcinoma development and to identify preventive strategies. SUMMARY BOX: What is already known about this subject?: - CTRB2 is one of the most abundant proteins produced by human pancreatic acinar cells. - A common exon 6 deletion variant in CTRB2 has been associated with an increased risk of pancreatic ductal adenocarcinoma. - Misfolding of digestive enzymes is associated with pancreatic pathology.What are the new findings?: - We developed a novel genetic model that recapitulates the human CTRB2 deletion variant in the mouse orthologue, Ctrb1 . - Truncated CTRB1 misfolds and accumulates in the ER; yet, mutant mice display a histologically normal pancreas at 3 months age.- CTRB1 and associated chaperones colocalize in the ER, the cytoplasm, and the nucleus of acinar cells.- Transcriptomics analysis reveals reduced activity of the acinar program and increased activity of pathways involved in ER stress, unfolded protein response, and inflammation.- Mutant mice are sensitized to pancreatic damage and do not recover properly from a mild caerulein-induced pancreatitis.- TUDCA administration partially relieves the ER stress in mutant mice.How might it impact on clinical practice in the foreseeable future?: - The new mouse model provides a tool to identify the mechanisms leading to increased pancreatic cancer risk in CTRB2 exon 6 carriers. - The findings suggest that drugs that cause ER stress relief and/or reduce inflammation might provide preventive opportunities.
RESUMEN
Pancreatic Ductal Adenocarcinoma (PDAC) is the third leading cause of cancer-related deaths in the U.S. Both rare and common germline variants contribute to PDAC risk. Here, we fine-map and functionally characterize a common PDAC risk signal at 1p36.33 (tagged by rs13303010) identified through a genome wide association study (GWAS). One of the fine-mapped SNPs, rs13303160 (r2=0.93 in 1000G EUR samples, OR=1.23, P value=2.74x10-9) demonstrated allele-preferential gene regulatory activity in vitro and allele-preferential binding of JunB and JunD in vitro and in vivo. Expression Quantitative Trait Locus (eQTL) analysis identified KLHL17 as a likely target gene underlying the signal. Proteomic analysis identified KLHL17 as a member of the Cullin-E3 ubiquitin ligase complex in PDAC-derived cells. In silico differential gene expression analysis of the GTExv8 pancreas data suggested an association between lower KLHL17 (risk associated) and pro-inflammatory pathways. We hypothesize that KLHL17 may mitigate inflammation by recruiting pro-inflammatory proteins for ubiquitination and degradation thereby influencing PDAC risk.
RESUMEN
Identification of somatic driver mutations in the noncoding genome remains challenging. To comprehensively characterize noncoding driver mutations for pancreatic ductal adenocarcinoma (PDAC), we first created genome-scale maps of accessible chromatin regions (ACRs) and histone modification marks (HMMs) in pancreatic cell lines and purified pancreatic acinar and duct cells. Integration with whole-genome mutation calls from 506 PDACs revealed 314 ACRs/HMMs significantly enriched with 3,614 noncoding somatic mutations (NCSMs). Functional assessment using massively parallel reporter assays (MPRA) identified 178 NCSMs impacting reporter activity (19.45% of those tested). Focused luciferase validation confirmed negative effects on gene regulatory activity for NCSMs near CDKN2A and ZFP36L2. For the latter, CRISPR interference (CRISPRi) further identified ZFP36L2 as a target gene (16.0 - 24.0% reduced expression, P = 0.023-0.0047) with disrupted KLF9 binding likely mediating the effect. Our integrative approach provides a catalog of potentially functional noncoding driver mutations and nominates ZFP36L2 as a PDAC driver gene.
RESUMEN
The prevalence of childhood obesity is increasing worldwide, along with the associated common comorbidities of type 2 diabetes and cardiovascular disease in later life. Motivated by evidence for a strong genetic component, our prior genome-wide association study (GWAS) efforts for childhood obesity revealed 19 independent signals for the trait; however, the mechanism of action of these loci remains to be elucidated. To molecularly characterize these childhood obesity loci we sought to determine the underlying causal variants and the corresponding effector genes within diverse cellular contexts. Integrating childhood obesity GWAS summary statistics with our existing 3D genomic datasets for 57 human cell types, consisting of high-resolution promoter-focused Capture-C/Hi-C, ATAC-seq, and RNA-seq, we applied stratified LD score regression and calculated the proportion of genome-wide SNP heritability attributable to cell type-specific features, revealing pancreatic alpha cell enrichment as the most statistically significant. Subsequent chromatin contact-based fine-mapping was carried out for genome-wide significant childhood obesity loci and their linkage disequilibrium proxies to implicate effector genes, yielded the most abundant number of candidate variants and target genes at the BDNF, ADCY3, TMEM18 and FTO loci in skeletal muscle myotubes and the pancreatic beta-cell line, EndoC-BH1. One novel implicated effector gene, ALKAL2 - an inflammation-responsive gene in nerve nociceptors - was observed at the key TMEM18 locus across multiple immune cell types. Interestingly, this observation was also supported through colocalization analysis using expression quantitative trait loci (eQTL) derived from the Genotype-Tissue Expression (GTEx) dataset, supporting an inflammatory and neurologic component to the pathogenesis of childhood obesity. Our comprehensive appraisal of 3D genomic datasets generated in a myriad of different cell types provides genomic insights into pediatric obesity pathogenesis.
RESUMEN
A portion of the genetic basis for many common autoimmune disorders has been uncovered by genome-wide association studies (GWAS), but GWAS do not reveal causal variants, effector genes, or the cell types impacted by disease-associated variation. We have generated 3D genomic datasets consisting of promoter-focused Capture-C, Hi-C, ATAC-seq, and RNA-seq and integrated these data with GWAS of 16 autoimmune traits to physically map disease-associated variants to the effector genes they likely regulate in 57 human cell types. These 3D maps of gene cis-regulatory architecture are highly powered to identify the cell types most likely impacted by disease-associated genetic variation compared to 1D genomic features, and tend to implicate different effector genes than eQTL approaches in the same cell types. Most of the variants implicated by these cis-regulatory architectures are highly trait-specific, but nearly half of the target genes connected to these variants are shared across multiple autoimmune disorders in multiple cell types, suggesting a high level of genetic diversity and complexity among autoimmune diseases that nonetheless converge at the level of target gene and cell type. Substantial effector gene sharing led to the common enrichment of similar biological networks across disease and cell types. However, trait-specific pathways representing potential areas for disease-specific intervention were identified. To test this, we pharmacologically validated squalene synthase, a cholesterol biosynthetic enzyme encoded by the FDFT1 gene implicated by our approach in MS and SLE, as a novel immunomodulatory drug target controlling inflammatory cytokine production by human T cells. These data represent a comprehensive resource for basic discovery of gene cis-regulatory mechanisms, and the analyses reported reveal mechanisms by which autoimmune-associated variants act to regulate gene expression, function, and pathology across multiple, distinct tissues and cell types.
RESUMEN
Here, we present a protocol to identify transcriptional regulators potentially mediating downstream biological effects of germline variants associated with complex traits of interest, which enables functional hypothesis generation independent of colocalizing expression quantitative trait loci (eQTLs). We describe steps for tissue-/cell-type-specific co-expression network modeling, expression regulator activity inference, and identification of representative phenotypic master regulators. Finally, we detail activity QTL and eQTL analyses. This protocol requires genotype, expression, and relevant covariables and phenotype data from existing eQTL datasets. For complete details on the use and execution of this protocol, please refer to Hoskins et al.1.
Asunto(s)
Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Sitios de Carácter Cuantitativo/genética , Genotipo , Fenotipo , Herencia MultifactorialRESUMEN
BACKGROUND: Although 20 pancreatic cancer susceptibility loci have been identified through genome-wide association studies in individuals of European ancestry, much of its heritability remains unexplained and the genes responsible largely unknown. METHODS: To discover novel pancreatic cancer risk loci and possible causal genes, we performed a pancreatic cancer transcriptome-wide association study in Europeans using three approaches: FUSION, MetaXcan, and Summary-MulTiXcan. We integrated genome-wide association studies summary statistics from 9040 pancreatic cancer cases and 12 496 controls, with gene expression prediction models built using transcriptome data from histologically normal pancreatic tissue samples (NCI Laboratory of Translational Genomics [n = 95] and Genotype-Tissue Expression v7 [n = 174] datasets) and data from 48 different tissues (Genotype-Tissue Expression v7, n = 74-421 samples). RESULTS: We identified 25 genes whose genetically predicted expression was statistically significantly associated with pancreatic cancer risk (false discovery rate < .05), including 14 candidate genes at 11 novel loci (1p36.12: CELA3B; 9q31.1: SMC2, SMC2-AS1; 10q23.31: RP11-80H5.9; 12q13.13: SMUG1; 14q32.33: BTBD6; 15q23: HEXA; 15q26.1: RCCD1; 17q12: PNMT, CDK12, PGAP3; 17q22: SUPT4H1; 18q11.22: RP11-888D10.3; and 19p13.11: PGPEP1) and 11 at six known risk loci (5p15.33: TERT, CLPTM1L, ZDHHC11B; 7p14.1: INHBA; 9q34.2: ABO; 13q12.2: PDX1; 13q22.1: KLF5; and 16q23.1: WDR59, CFDP1, BCAR1, TMEM170A). The association for 12 of these genes (CELA3B, SMC2, and PNMT at novel risk loci and TERT, CLPTM1L, INHBA, ABO, PDX1, KLF5, WDR59, CFDP1, and BCAR1 at known loci) remained statistically significant after Bonferroni correction. CONCLUSIONS: By integrating gene expression and genotype data, we identified novel pancreatic cancer risk loci and candidate functional genes that warrant further investigation.
Asunto(s)
Neoplasias Pancreáticas/genética , Bases de Datos Genéticas , Expresión Génica , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , TranscriptomaRESUMEN
This corrects the article DOI: 10.1038/ncomms15034.
RESUMEN
In 2020, 146,063 deaths due to pancreatic cancer are estimated to occur in Europe and the United States combined. To identify common susceptibility alleles, we performed the largest pancreatic cancer GWAS to date, including 9040 patients and 12,496 controls of European ancestry from the Pancreatic Cancer Cohort Consortium (PanScan) and the Pancreatic Cancer Case-Control Consortium (PanC4). Here, we find significant evidence of a novel association at rs78417682 (7p12/TNS3, P = 4.35 × 10-8). Replication of 10 promising signals in up to 2737 patients and 4752 controls from the PANcreatic Disease ReseArch (PANDoRA) consortium yields new genome-wide significant loci: rs13303010 at 1p36.33 (NOC2L, P = 8.36 × 10-14), rs2941471 at 8q21.11 (HNF4G, P = 6.60 × 10-10), rs4795218 at 17q12 (HNF1B, P = 1.32 × 10-8), and rs1517037 at 18q21.32 (GRP, P = 3.28 × 10-8). rs78417682 is not statistically significantly associated with pancreatic cancer in PANDoRA. Expression quantitative trait locus analysis in three independent pancreatic data sets provides molecular support of NOC2L as a pancreatic cancer susceptibility gene.
Asunto(s)
Carcinoma Ductal Pancreático/genética , Neoplasias Pancreáticas/genética , Bases de Datos Genéticas , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Factor Nuclear 1-beta del Hepatocito/genética , Factor Nuclear 4 del Hepatocito/genética , Humanos , Péptidos y Proteínas de Señalización Intercelular , Péptidos y Proteínas de Señalización Intracelular , Polimorfismo de Nucleótido Simple , Proteínas/genética , Proteínas Represoras/genética , Tensinas/genéticaRESUMEN
Genome wide association studies (GWAS) have mapped multiple independent cancer susceptibility loci to chr5p15.33. Here, we show that fine-mapping of pancreatic and testicular cancer GWAS within one of these loci (Region 2 in CLPTM1L) focuses the signal to nine highly correlated SNPs. Of these, rs36115365-C associated with increased pancreatic and testicular but decreased lung cancer and melanoma risk, and exhibited preferred protein-binding and enhanced regulatory activity. Transcriptional gene silencing of this regulatory element repressed TERT expression in an allele-specific manner. Proteomic analysis identifies allele-preferred binding of Zinc finger protein 148 (ZNF148) to rs36115365-C, further supported by binding of purified recombinant ZNF148. Knockdown of ZNF148 results in reduced TERT expression, telomerase activity and telomere length. Our results indicate that the association with chr5p15.33-Region 2 may be explained by rs36115365, a variant influencing TERT expression via ZNF148 in a manner consistent with elevated TERT in carriers of the C allele.
Asunto(s)
Proteínas de Unión al ADN/genética , Regulación Neoplásica de la Expresión Génica , Neoplasias Pulmonares/genética , Melanoma/genética , Neoplasias Pancreáticas/genética , Neoplasias Cutáneas/genética , Telomerasa/genética , Neoplasias Testiculares/genética , Factores de Transcripción/genética , Alelos , Línea Celular Tumoral , Mapeo Cromosómico , Cromosomas Humanos Par 5 , Proteínas de Unión al ADN/antagonistas & inhibidores , Proteínas de Unión al ADN/metabolismo , Femenino , Sitios Genéticos , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Histonas/genética , Histonas/metabolismo , Humanos , Neoplasias Pulmonares/metabolismo , Neoplasias Pulmonares/patología , Masculino , Melanoma/metabolismo , Melanoma/patología , Neoplasias Pancreáticas/metabolismo , Neoplasias Pancreáticas/patología , Polimorfismo de Nucleótido Simple , ARN Interferente Pequeño/genética , ARN Interferente Pequeño/metabolismo , Transducción de Señal , Neoplasias Cutáneas/metabolismo , Neoplasias Cutáneas/patología , Telomerasa/antagonistas & inhibidores , Telomerasa/metabolismo , Homeostasis del Telómero , Neoplasias Testiculares/metabolismo , Neoplasias Testiculares/patología , Factores de Transcripción/antagonistas & inhibidores , Factores de Transcripción/metabolismoRESUMEN
Genome-wide association studies (GWAS) of 10 different cancers have identified pleiotropic cancer predisposition loci across a region of chromosome 5p15.33 that includes the TERT and CLPTM1L genes. Of these, susceptibility alleles for pancreatic cancer have mapped to the CLPTM1L gene, thus prompting an investigation of the function of CLPTM1L in the pancreas. Immunofluorescence analysis indicated that CLPTM1L localized to the endoplasmic reticulum where it is likely embedded in the membrane, in accord with multiple predicted transmembrane domains. Overexpression of CLPTM1L enhanced growth of pancreatic cancer cells in vitro (1.3-1.5-fold; PDAY7 < 0.003) and in vivo (3.46-fold; PDAY68 = 0.039), suggesting a role in tumor growth; this effect was abrogated by deletion of two hydrophilic domains. Affinity purification followed by mass spectrometry identified an interaction between CLPTM1L and non-muscle myosin II (NMM-II), a protein involved in maintaining cell shape, migration, and cytokinesis. The two proteins colocalized in the cytoplasm and, after treatment with a DNA-damaging agent, at the centrosomes. Overexpression of CLPTM1L and depletion of NMM-II induced aneuploidy, indicating that CLPTM1L may interfere with normal NMM-II function in regulating cytokinesis. Immunohistochemical analysis revealed enhanced staining of CLPTM1L in human pancreatic ductal adenocarcinoma (n = 378) as compared with normal pancreatic tissue samples (n = 17; P = 1.7 × 10(-4)). Our results suggest that CLPTM1L functions as a growth-promoting gene in the pancreas and that overexpression may lead to an abrogation of normal cytokinesis, indicating that it should be considered as a plausible candidate gene that could explain the effect of pancreatic cancer susceptibility alleles on chr5p15.33.