ABSTRACT
Mosaic loss of the X chromosome (mLOX) is the most common clonal somatic alteration in leukocytes of female individuals (refs. 1,2), but little is known about its genetic determinants or phenotypic consequences. Here, to address this, we used data from 883,574 female participants across 8 biobanks; 12% of participants exhibited detectable mLOX in approximately 2% of leukocytes. Female participants with mLOX had an increased risk of myeloid and lymphoid leukaemias. Genetic analyses identified 56 common variants associated with mLOX, implicating genes with roles in chromosomal missegregation, cancer predisposition and autoimmune diseases. Exome-sequence analyses identified rare missense variants in FBXO10 that confer a twofold increased risk of mLOX. Only a small fraction of associations was shared with mosaic Y chromosome loss, suggesting that distinct biological processes drive formation and clonal expansion of sex chromosome missegregation. Allelic shift analyses identified X chromosome alleles that are preferentially retained in mLOX, demonstrating variation at many loci under cellular selection. A polygenic score including 44 allelic shift loci correctly inferred the retained X chromosomes in 80.7% of mLOX cases in the top decile. Our results support a model in which germline variants predispose female individuals to acquiring mLOX, with the allelic content of the X chromosome possibly shaping the magnitude of clonal expansion.
Subject(s)
Aneuploidy , Chromosomes, Human, X , Clone Cells , Leukocytes , Mosaicism , Adult , Female , Humans , Male , Middle Aged , Alleles , Autoimmune Diseases/genetics , Biological Specimen Banks , Chromosome Segregation/genetics , Chromosomes, Human, X/genetics , Chromosomes, Human, Y/genetics , Clone Cells/metabolism , Clone Cells/pathology , Exome/genetics , F-Box Proteins/genetics , Genetic Predisposition to Disease/genetics , Germ-Line Mutation , Leukemia/genetics , Leukocytes/metabolism , Models, Genetic , Multifactorial Inheritance/genetics , Mutation, Missense/genetics
ABSTRACT
Large-scale human genetic data (refs. 1-3) have shown that cancer mutations display strong tissue-selectivity, but how this selectivity arises remains unclear. Here, using experimental models, functional genomics and analyses of patient samples, we demonstrate that the lineage transcription factor paired box 8 (PAX8) is required for oncogenic signalling by two common genetic alterations that cause clear cell renal cell carcinoma (ccRCC) in humans: the germline variant rs7948643 at 11q13.3 and somatic inactivation of the von Hippel-Lindau tumour suppressor (VHL) (refs. 4-6). VHL loss, which is observed in about 90% of ccRCCs, can lead to hypoxia-inducible factor 2α (HIF2A) stabilization (refs. 6,7). We show that HIF2A is preferentially recruited to PAX8-bound transcriptional enhancers, including a pro-tumorigenic cyclin D1 (CCND1) enhancer that is controlled by PAX8 and HIF2A. The ccRCC-protective allele C at rs7948643 inhibits PAX8 binding at this enhancer and downstream activation of CCND1 expression. Co-option of a PAX8-dependent physiological programme that supports the proliferation of normal renal epithelial cells is also required for MYC expression from the ccRCC metastasis-associated amplicons at 8q21.3-q24.3 (ref. 8). These results demonstrate that transcriptional lineage factors are essential for oncogenic signalling and that they mediate tissue-specific cancer risk associated with somatic and inherited genetic variants.
Subject(s)
Carcinogenesis , Kidney Neoplasms , PAX8 Transcription Factor , Signal Transduction , Alleles , Basic Helix-Loop-Helix Transcription Factors/metabolism , Carcinogenesis/genetics , Carcinoma, Renal Cell/metabolism , Carcinoma, Renal Cell/pathology , Cyclin D1/genetics , Gene Expression Regulation, Neoplastic , Humans , Kidney/metabolism , Kidney/pathology , Kidney Neoplasms/metabolism , Kidney Neoplasms/pathology , Mutation , PAX8 Transcription Factor/genetics , PAX8 Transcription Factor/metabolism , Proto-Oncogene Proteins c-myc/genetics , Von Hippel-Lindau Tumor Suppressor Protein/genetics
ABSTRACT
We performed a series of integrative analyses including transcriptome-wide association studies (TWASs) and proteome-wide association studies (PWASs) of renal cell carcinoma (RCC) to nominate and prioritize molecular targets for laboratory investigation. On the basis of a genome-wide association study (GWAS) of 29,020 affected individuals and 835,670 control individuals and prediction models trained in transcriptomic reference models, our TWAS across four kidney transcriptomes (GTEx kidney cortex, kidney tubules, TCGA-KIRC [The Cancer Genome Atlas kidney renal clear-cell carcinoma], and TCGA-KIRP [TCGA kidney renal papillary cell carcinoma]) identified 38 gene associations (false-discovery rate <5%) in at least two of four transcriptomic panels and identified 12 genes that were independent of GWAS susceptibility regions. Analyses combining TWAS associations across 48 tissues from GTEx identified associations that were replicable in tumor transcriptomes for 23 additional genes. Analyses by the two major histologic types (clear-cell RCC and papillary RCC) revealed subtype-specific associations, although at least three gene associations were common to both subtypes. PWAS identified 13 associated proteins, all mapping to GWAS-significant loci. TWAS-identified genes were enriched for active enhancer or promoter regions in RCC tumors and hypoxia-inducible factor binding sites in relevant cell lines. Using gene expression correlation, common cancers (breast and prostate) and RCC risk factors (e.g., hypertension and BMI) display genetic contributions shared with RCC. Our work identifies potential molecular targets for RCC susceptibility for downstream functional investigation.
Subject(s)
Carcinoma, Renal Cell , Genome-Wide Association Study , Kidney Neoplasms , Proteome , Transcriptome , Carcinoma, Renal Cell/genetics , Humans , Kidney Neoplasms/genetics , Proteome/genetics , Genetic Predisposition to Disease , Gene Expression Regulation, Neoplastic , Polymorphism, Single Nucleotide , Gene Expression Profiling
ABSTRACT
Leukocyte telomere length (LTL) varies significantly across human populations, with individuals of African ancestry having longer LTL than non-Africans. However, the genetic and environmental drivers of LTL variation in Africans remain largely unknown. We report here on the relationship between LTL, genetics, and a variety of environmental and climatic factors in ethnically diverse African adults (n = 1,818) originating from Botswana, Tanzania, Ethiopia, and Cameroon. We observe significant variation in LTL among populations, finding that the San hunter-gatherers from Botswana have the longest leukocyte telomeres and that the Fulani pastoralists from Cameroon have the shortest telomeres. Genetic factors explain ~50% of LTL variation among individuals. Moreover, we observe a significant negative association between Plasmodium falciparum malaria endemicity and LTL while adjusting for age, sex, and genetics. Within Africa, adults from populations indigenous to areas with high malaria exposure have shorter LTL than those in populations indigenous to areas with low malaria exposure. Finally, we explore to what degree the genetic architecture underlying LTL in Africa covaries with malaria exposure.
Subject(s)
Malaria, Falciparum , Telomere , Adult , Female , Humans , Male , Middle Aged , Young Adult , Africa South of the Sahara/epidemiology , Black People/ethnology , Black People/genetics , Endemic Diseases , Leukocytes/metabolism , Malaria, Falciparum/genetics , Malaria, Falciparum/epidemiology , Malaria, Falciparum/parasitology , Plasmodium falciparum/genetics , Plasmodium falciparum/pathogenicity , Sub-Saharan African People , Telomere/genetics , Telomere Homeostasis/genetics , Botswana , Tanzania , Cameroon , Southern African People
ABSTRACT
Co-observation of a gene variant with a pathogenic variant in another gene that explains the disease presentation has been designated as evidence against pathogenicity for commonly used variant classification guidelines. Multiple variant curation expert panels have specified, from consensus opinion, that this evidence type is not applicable for the classification of breast cancer predisposition gene variants. Statistical analysis of sequence data for 55,815 individuals diagnosed with breast cancer from the BRIDGES sequencing project was undertaken to formally assess the utility of co-observation data for germline variant classification. Our analysis included expected loss-of-function variants in 11 breast cancer predisposition genes and pathogenic missense variants in BRCA1, BRCA2, and TP53. We assessed whether co-observation of pathogenic variants in two different genes occurred more or less often than expected under the assumption of independence. Co-observation of pathogenic variants in each of BRCA1, BRCA2, and PALB2 with the remaining genes was less frequent than expected. This evidence for depletion remained after adjustment for age at diagnosis, study design (familial versus population-based), and country. Co-observation of a variant of uncertain significance in BRCA1, BRCA2, or PALB2 with a pathogenic variant in another breast cancer gene equated to supporting evidence against pathogenicity following criterion strength assignment based on the likelihood ratio and showed utility in reclassification of missense BRCA1 and BRCA2 variants identified in BRIDGES. Our approach has applicability for assessing the value of co-observation as a predictor of variant pathogenicity in other clinical contexts, including for gene-specific guidelines developed by ClinGen Variant Curation Expert Panels.
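The comparison of observed and expected co-observation under independence described above can be illustrated with a minimal sketch. The function names and the toy counts are hypothetical and are not taken from the BRIDGES data; the sketch ignores the covariate adjustment (age at diagnosis, study design, country) and the likelihood-ratio calibration that the abstract mentions.

```python
def expected_cooccurrence(n_total: int, n_gene_a: int, n_gene_b: int) -> float:
    """Expected number of individuals carrying pathogenic variants in both
    genes if carrier status in gene A is independent of carrier status in gene B."""
    # P(A and B) = P(A) * P(B) under independence, so
    # E[count] = n_total * (n_a / n_total) * (n_b / n_total) = n_a * n_b / n_total
    return n_gene_a * n_gene_b / n_total


def observed_to_expected(n_both: int, n_total: int, n_gene_a: int, n_gene_b: int) -> float:
    """Ratio below 1 suggests depletion of co-observation; above 1 suggests enrichment."""
    return n_both / expected_cooccurrence(n_total, n_gene_a, n_gene_b)


# Hypothetical toy counts, for illustration only.
n_total, n_a, n_b, n_both = 10_000, 200, 100, 1
expected = expected_cooccurrence(n_total, n_a, n_b)
ratio = observed_to_expected(n_both, n_total, n_a, n_b)
print(expected, ratio)
```

A ratio well below 1, as in this toy case, is the pattern that would count as depletion; a real analysis would also need a significance test and the adjustments listed in the abstract.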
Subject(s)
Breast Neoplasms , Genetic Predisposition to Disease , Germ-Line Mutation , Humans , Breast Neoplasms/genetics , Germ-Line Mutation/genetics , Female , BRCA2 Protein/genetics , BRCA1 Protein/genetics , Fanconi Anemia Complementation Group N Protein/genetics , Middle Aged , Mutation, Missense/genetics , Adult , Tumor Suppressor Protein p53/genetics
ABSTRACT
To identify credible causal risk variants (CCVs) associated with different histotypes of epithelial ovarian cancer (EOC), we performed genome-wide association analysis for 470,825 genotyped and 10,163,797 imputed SNPs in 25,981 EOC cases and 105,724 controls of European origin. We identified five histotype-specific EOC risk regions (p value <5 × 10⁻⁸) and confirmed previously reported associations for 27 risk regions. Conditional analyses identified an additional 11 signals independent of the primary signal at six risk regions (p value <10⁻⁵). Fine mapping identified 4,008 CCVs in these regions, of which 1,452 CCVs were located in ovarian cancer-related chromatin marks with significant enrichment in active enhancers, active promoters, and active regions for CCVs from each EOC histotype. Transcriptome-wide association and colocalization analyses across histotypes using tissue-specific and cross-tissue datasets identified 86 candidate susceptibility genes in known EOC risk regions and 32 genes in 23 additional genomic regions that may represent novel EOC risk loci (false discovery rate <0.05). Finally, by integrating genome-wide HiChIP interactome analysis with transcriptome-wide association study (TWAS), variant effect predictor, transcription factor ChIP-seq, and motifbreakR data, we identified candidate gene-CCV interactions at each locus. This included risk loci where TWAS identified one or more candidate susceptibility genes (e.g., HOXD-AS2, HOXD8, and HOXD3 at 2q31) and other loci where no candidate gene was identified (e.g., MYC and PVT1 at 8q24) by TWAS. In summary, this study describes a functional framework and provides a greater understanding of the biological significance of risk alleles and candidate gene targets at EOC susceptibility loci identified by a genome-wide association study.
Subject(s)
Genetic Predisposition to Disease , Genome-Wide Association Study , Ovarian Neoplasms , Polymorphism, Single Nucleotide , Humans , Female , Ovarian Neoplasms/genetics , Ovarian Neoplasms/pathology , Carcinoma, Ovarian Epithelial/genetics , Transcriptome , Risk Factors , Genomics/methods , Case-Control Studies , Multiomics
ABSTRACT
Age-related clonal expansion of cells harbouring mosaic chromosomal alterations (mCAs) is one manifestation of clonal haematopoiesis. Identifying factors that influence the generation and promotion of clonal expansion of mCAs is key to investigating the role of mCAs in health and disease. Herein, we report on widely measured serum biomarkers and their possible association with mCAs, which could provide new insights into molecular alterations that promote acquisition and clonal expansion. We performed a cross-sectional investigation of the association of 32 widely measured serum biomarkers with autosomal mCAs, mosaic loss of the Y chromosome, and mosaic loss of the X chromosome in 436 784 cancer-free participants from the UK Biobank. mCAs were associated with a range of commonly measured serum biomarkers such as lipid levels, circulating sex hormones, blood sugar homeostasis, inflammation and immune function, vitamins and minerals, kidney function, and liver function. Biomarker levels in participants with mCAs were estimated to differ by up to 5% relative to mCA-free participants, and individuals with higher cell fraction mCAs had greater deviation in mean biomarker values. Polygenic scores associated with sex hormone binding globulin, vitamin D, and total cholesterol were also associated with mCAs. Overall, we observed that commonly used clinical serum biomarkers related to disease risk are associated with mCAs, suggesting that mechanisms involved in these diseases could be related to mCA proliferation and clonal expansion.
Subject(s)
Chromosomes, Human, Y , Mosaicism , Humans , Male , Biological Specimen Banks , Cross-Sectional Studies , Biomarkers , United Kingdom
ABSTRACT
Little is known regarding the potential relationship between clonal hematopoiesis (CH) of indeterminate potential (CHIP), which is the expansion of hematopoietic stem cells with somatic mutations, and risk of prostate cancer, the fifth leading cause of cancer death of men worldwide. We evaluated the association of age-related CHIP with overall and aggressive prostate cancer risk in two large whole-exome sequencing studies of 75 047 European ancestry men, including 7663 prostate cancer cases, 2770 of which had aggressive disease, and 3266 men carrying CHIP variants. We found that CHIP, defined by over 50 CHIP genes individually and in aggregate, was not significantly associated with overall (aggregate HR = 0.93, 95% CI = 0.76-1.13, P = 0.46) or aggressive (aggregate OR = 1.14, 95% CI = 0.92-1.41, P = 0.22) prostate cancer risk. CHIP was weakly associated with genetic risk of overall prostate cancer, measured using a polygenic risk score (OR = 1.05 per unit increase, 95% CI = 1.01-1.10, P = 0.01). CHIP was not significantly associated with carrying pathogenic/likely pathogenic/deleterious variants in DNA repair genes, which have previously been found to be associated with aggressive prostate cancer. While findings from this study suggest that CHIP is likely not a risk factor for prostate cancer, it will be important to investigate other types of CH in association with prostate cancer risk.
Subject(s)
Clonal Hematopoiesis , Prostatic Neoplasms , Male , Humans , Hematopoiesis/genetics , Risk Factors , Hematopoietic Stem Cells , Prostatic Neoplasms/genetics , Mutation
ABSTRACT
The most recent genome-wide association study (GWAS) of cutaneous melanoma identified 54 risk-associated loci, but functional variants and their target genes for most have not been established. Here, we performed massively parallel reporter assays (MPRAs) by using malignant melanoma and normal melanocyte cells and further integrated multi-layer annotation to systematically prioritize functional variants and susceptibility genes from these GWAS loci. Of 1,992 risk-associated variants tested in MPRAs, we identified 285 from 42 loci (78% of the known loci) displaying significant allelic transcriptional activities in either cell type (FDR < 1%). We further characterized MPRA-significant variants by motif prediction, epigenomic annotation, and statistical/functional fine-mapping to create integrative variant scores, which prioritized one to six plausible candidate variants per locus for the 42 loci and nominated a single variant for 43% of these loci. Overlaying the MPRA-significant variants with genome-wide significant expression or methylation quantitative trait loci (eQTLs or meQTLs, respectively) from melanocytes or melanomas identified candidate susceptibility genes for 60% of variants (172 of 285 variants). CRISPRi of top-scoring variants validated their cis-regulatory effect on the eQTL target genes, MAFF (22q13.1) and GPRC5A (12p13.1). Finally, we identified 36 melanoma-specific and 45 melanocyte-specific MPRA-significant variants, a subset of which are linked to cell-type-specific target genes. Analyses of transcription factor availability in MPRA datasets and variant-transcription-factor interaction in eQTL datasets highlighted the roles of transcription factors in cell-type-specific variant functionality. In conclusion, MPRAs along with variant scoring effectively prioritized plausible candidates for most melanoma GWAS loci and highlighted cellular contexts where the susceptibility variants are functional.
Subject(s)
Melanoma , Skin Neoplasms , Humans , Melanoma/genetics , Skin Neoplasms/genetics , Genome-Wide Association Study , Biological Assay , Transcription Factors , Receptors, G-Protein-Coupled , Melanoma, Cutaneous Malignant
ABSTRACT
Mosaic loss of chromosome Y (LOY) in circulating white blood cells is the most common form of clonal mosaicism (refs. 1-5), yet our knowledge of the causes and consequences of this is limited. Here, using a computational approach, we estimate that 20% of the male population represented in the UK Biobank study (n = 205,011) has detectable LOY. We identify 156 autosomal genetic determinants of LOY, which we replicate in 757,114 men of European and Japanese ancestry. These loci highlight genes that are involved in cell-cycle regulation and cancer susceptibility, as well as somatic drivers of tumour growth and targets of cancer therapy. We demonstrate that genetic susceptibility to LOY is associated with non-haematological effects on health in both men and women, which supports the hypothesis that clonal haematopoiesis is a biomarker of genomic instability in other tissues. Single-cell RNA sequencing identifies dysregulated expression of autosomal genes in leukocytes with LOY and provides insights into why clonal expansion of these cells may occur. Collectively, these data highlight the value of studying clonal mosaicism to uncover fundamental mechanisms that underlie cancer and other ageing-related diseases.
Subject(s)
Chromosome Deletion , Chromosomes, Human, Y/genetics , Genetic Predisposition to Disease/genetics , Genomic Instability/genetics , Leukocytes/pathology , Mosaicism , Adult , Aged , Computational Biology , Databases, Genetic , Female , Genetic Markers/genetics , Humans , Male , Middle Aged , Neoplasms/genetics , United Kingdom
ABSTRACT
An estimated 38 million people live with human immunodeficiency virus (HIV) worldwide and are at excess risk for multiple cancer types. Elevated cancer risks in people living with HIV (PLWH) are driven primarily by increased exposure to carcinogens, most notably oncogenic viruses acquired through shared transmission routes, plus acceleration of viral carcinogenesis by HIV-related immunosuppression. In the era of widespread antiretroviral therapy (ART), life expectancy of PLWH has increased, with cancer now a leading cause of co-morbidity and death. Furthermore, the types of cancers occurring among PLWH are shifting over time and vary in their relative burden in different parts of the world. In this context, the International Agency for Research on Cancer (IARC) and the US National Cancer Institute (NCI) convened a meeting in September 2022 of multinational and multidisciplinary experts to focus on cancer in PLWH. This report summarizes the proceedings, including a review of the state of the science of cancer descriptive epidemiology, etiology, molecular tumor characterization, primary and secondary prevention, treatment disparities and survival in PLWH around the world. A consensus of key research priorities and recommendations in these domains is also presented.
Subject(s)
Anti-HIV Agents , HIV Infections , Neoplasms , United States/epidemiology , Humans , HIV , National Cancer Institute (U.S.) , Neoplasms/drug therapy , HIV Infections/complications , HIV Infections/drug therapy , HIV Infections/epidemiology , Anti-HIV Agents/therapeutic use
ABSTRACT
Polygenic risk scores (PRSs) are useful for predicting breast cancer risk, but the prediction accuracy of existing PRSs in women of African ancestry (AA) remains relatively low. We aim to develop optimal PRSs for the prediction of overall and estrogen receptor (ER) subtype-specific breast cancer risk in AA women. The AA dataset comprised 9235 cases and 10 184 controls from four genome-wide association study (GWAS) consortia and a GWAS study in Ghana. We randomly divided samples into training and validation sets. We built PRSs using individual-level AA data by a forward stepwise logistic regression and then developed joint PRSs that combined (1) the PRSs built in the AA training dataset and (2) a 313-variant PRS previously developed in women of European ancestry. PRSs were evaluated in the AA validation set. For overall breast cancer, the odds ratio per standard deviation of the joint PRS in the validation set was 1.34 [95% confidence interval (CI): 1.27-1.42] with the area under receiver operating characteristic curve (AUC) of 0.581. Compared with women with average risk (40th-60th PRS percentile), women in the top decile of the PRS had a 1.98-fold increased risk (95% CI: 1.63-2.39). For PRSs of ER-positive and ER-negative breast cancer, the AUCs were 0.608 and 0.576, respectively. Compared with existing methods, the proposed joint PRSs can improve prediction of breast cancer risk in AA women.
Subject(s)
Breast Neoplasms , Genome-Wide Association Study , Breast Neoplasms/genetics , Female , Genetic Predisposition to Disease , Humans , Multifactorial Inheritance/genetics , Receptors, Estrogen/genetics , Risk Factors
ABSTRACT
BACKGROUND: The association of fitness with cancer risk is not clear. METHODS: We used Cox proportional hazards models to estimate hazard ratios (HRs) and 95% confidence intervals (CIs) for risk of lung, colorectal, endometrial, breast, and prostate cancer in a subset of UK Biobank participants who completed a submaximal fitness test in 2009-12 (N = 72,572). We also investigated relationships using two-sample Mendelian randomisation (MR); odds ratios (ORs) were estimated using the inverse-variance weighted method. RESULTS: After a median of 11 years of follow-up, 4290 cancers of interest were diagnosed. A 3.5 ml O₂·min⁻¹·kg⁻¹ total-body mass increase in fitness (equivalent to 1 metabolic equivalent of task (MET), approximately 0.5 standard deviation (SD)) was associated with lower risks of endometrial (HR = 0.81, 95% CI: 0.73-0.89), colorectal (0.94, 0.90-0.99), and breast cancer (0.96, 0.92-0.99). In MR analyses, a 0.5 SD increase in genetically predicted O₂·min⁻¹·kg⁻¹ fat-free mass was associated with a lower risk of breast cancer (OR = 0.92, 95% CI: 0.86-0.98). After adjusting for adiposity, both the observational and genetic associations were attenuated. DISCUSSION: Higher fitness levels may reduce risks of endometrial, colorectal, and breast cancer, though relationships with adiposity are complex and may mediate these relationships. Increasing fitness, including via changes in body composition, may be an effective strategy for cancer prevention.
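As an illustration of the inverse-variance weighted method named in the METHODS, the following is a minimal sketch of the standard fixed-effect estimator with first-order weights. It is not the study's actual pipeline: the function name and the per-variant summary statistics are hypothetical, and the sketch assumes uncorrelated instruments and a log-odds outcome scale.

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect inverse-variance weighted (IVW) causal estimate.

    Each variant j contributes a ratio estimate beta_outcome[j] / beta_exposure[j],
    weighted by beta_exposure[j]**2 / se_outcome[j]**2 (first-order weights).
    Returns (estimate, standard_error) on the log-odds scale.
    """
    numerator = sum(bx * by / se**2
                    for bx, by, se in zip(beta_exposure, beta_outcome, se_outcome))
    denominator = sum(bx**2 / se**2
                      for bx, se in zip(beta_exposure, se_outcome))
    estimate = numerator / denominator
    standard_error = math.sqrt(1.0 / denominator)
    return estimate, standard_error


# Hypothetical summary statistics for two variants, for illustration only.
beta_x = [0.1, 0.2]    # variant effects on the exposure
beta_y = [0.05, 0.10]  # variant effects on the outcome (log odds)
se_y = [0.01, 0.02]    # standard errors of the outcome effects

log_or, se = ivw_estimate(beta_x, beta_y, se_y)
odds_ratio = math.exp(log_or)
ci_low = math.exp(log_or - 1.96 * se)
ci_high = math.exp(log_or + 1.96 * se)
print(odds_ratio, ci_low, ci_high)
```

Exponentiating the estimate and its Wald interval gives an odds ratio with a 95% CI in the form the abstract reports.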
Subject(s)
Breast Neoplasms , Cardiorespiratory Fitness , Colorectal Neoplasms , Male , Humans , Biological Specimen Banks , UK Biobank , Breast Neoplasms/epidemiology , Breast Neoplasms/genetics , Colorectal Neoplasms/epidemiology , Colorectal Neoplasms/genetics , Colorectal Neoplasms/diagnosis , Risk Factors
ABSTRACT
Our study investigated the underlying mechanism for the 14q24 renal cell carcinoma (RCC) susceptibility risk locus identified by a genome-wide association study (GWAS). The sentinel single-nucleotide polymorphism (SNP), rs4903064, at 14q24 confers an allele-specific effect on expression of the double PHD fingers 3 (DPF3) of the BAF SWI/SNF complex as assessed by massively parallel reporter assay, confirmatory luciferase assays, and eQTL analyses. Overexpression of DPF3 in renal cell lines increases growth rates and alters chromatin accessibility and gene expression, leading to inhibition of apoptosis and activation of oncogenic pathways. siRNA interference of multiple DPF3-deregulated genes reduces growth. Our results indicate that germline variation in DPF3, a component of the BAF complex, part of the SWI/SNF complexes, can lead to reduced apoptosis and activation of the STAT3 pathway, both critical in RCC carcinogenesis. In addition, we show that altered DPF3 expression in the 14q24 RCC locus could influence the effectiveness of immunotherapy treatment for RCC by regulating tumor cytokine secretion and immune cell activation.
Subject(s)
Carcinoma, Renal Cell/genetics , Chromosomes, Human, Pair 14 , DNA-Binding Proteins/genetics , Genetic Loci , Kidney Neoplasms/genetics , STAT3 Transcription Factor/genetics , Transcription Factors/genetics , Carcinogenesis/genetics , Carcinogenesis/immunology , Carcinogenesis/pathology , Carcinoma, Renal Cell/immunology , Carcinoma, Renal Cell/pathology , Carcinoma, Renal Cell/therapy , Cell Line, Tumor , Chromatin/chemistry , Chromatin/immunology , Chromatin Assembly and Disassembly/immunology , Cytokines/genetics , Cytokines/immunology , DNA-Binding Proteins/immunology , Gene Expression Regulation , Genetic Predisposition to Disease , Genome, Human , Genome-Wide Association Study , High-Throughput Nucleotide Sequencing , Humans , Immunotherapy/methods , Kidney Neoplasms/immunology , Kidney Neoplasms/pathology , Kidney Neoplasms/therapy , Polymorphism, Single Nucleotide , STAT3 Transcription Factor/immunology , T-Lymphocytes, Cytotoxic , Transcription Factors/immunology
ABSTRACT
Genome-wide association studies (GWASs) have identified a melanoma-associated locus on chromosome band 7p21.1 with rs117132860 as the lead SNP and a secondary independent signal marked by rs73069846. rs117132860 is also associated with tanning ability and cutaneous squamous cell carcinoma (cSCC). Because ultraviolet radiation (UVR) is a key environmental exposure for all three traits, we investigated the mechanisms by which this locus contributes to melanoma risk, focusing on cellular response to UVR. Fine-mapping of melanoma GWASs identified four independent sets of candidate causal variants. A GWAS region-focused Capture-C study of primary melanocytes identified physical interactions between two causal sets and the promoter of the aryl hydrocarbon receptor (AHR). Subsequent chromatin state annotation, eQTL, and luciferase assays identified rs117132860 as a functional variant and reinforced AHR as a likely causal gene. Because AHR plays critical roles in cellular response to dioxin and UVR, we explored links between this SNP and AHR expression after both 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) and ultraviolet B (UVB) exposure. Allele-specific AHR binding to rs117132860-G was enhanced following both, consistent with predicted weakened AHR binding to the risk/poor-tanning rs117132860-A allele, and allele-preferential AHR expression driven from the protective rs117132860-G allele was observed following UVB exposure. Small deletions surrounding rs117132860 introduced via CRISPR abrogate AHR binding, reduce melanocyte cell growth, and prolong growth arrest following UVB exposure. These data suggest AHR is a melanoma susceptibility gene at the 7p21.1 risk locus and rs117132860 is a functional variant within a UVB-responsive element, leading to allelic AHR expression and altering melanocyte growth phenotypes upon exposure.
Subject(s)
Basic Helix-Loop-Helix Transcription Factors/genetics , Carcinoma, Squamous Cell/genetics , Chromosomes, Human, Pair 7 , Genetic Loci , Melanocytes/metabolism , Melanoma/genetics , Receptors, Aryl Hydrocarbon/genetics , Skin Neoplasms/genetics , Alleles , Basic Helix-Loop-Helix Transcription Factors/metabolism , Carcinogenesis/genetics , Carcinogenesis/metabolism , Carcinogenesis/pathology , Carcinoma, Squamous Cell/metabolism , Carcinoma, Squamous Cell/pathology , Chromatin/chemistry , Chromatin/metabolism , Gene Expression Regulation , Genetic Predisposition to Disease , Genome, Human , Genome-Wide Association Study , Humans , Melanocytes/drug effects , Melanocytes/pathology , Melanocytes/radiation effects , Melanoma/metabolism , Melanoma/pathology , Polychlorinated Dibenzodioxins/toxicity , Polymorphism, Single Nucleotide , Primary Cell Culture , Promoter Regions, Genetic , Receptors, Aryl Hydrocarbon/metabolism , Skin Neoplasms/metabolism , Skin Neoplasms/pathology , Sunbathing , Ultraviolet Rays/adverse effects
ABSTRACT
Genome-wide association studies (GWASs) have discovered 20 risk loci in the human genome where germline variants associate with risk of pancreatic ductal adenocarcinoma (PDAC) in populations of European ancestry. Here, we fine-mapped one such locus on chr16q23.1 (rs72802365, p = 2.51 × 10⁻¹⁷, OR = 1.36, 95% CI = 1.31-1.40) and identified colocalization (PP = 0.87) with aberrant exon 5-7 CTRB2 splicing in pancreatic tissues (pGTEx = 1.40 × 10⁻⁶⁹, βGTEx = 1.99; pLTG = 1.02 × 10⁻³⁰, βLTG = 1.99). Imputation of a 584 bp structural variant overlapping exon 6 of CTRB2 into the GWAS datasets resulted in a highly significant association with pancreatic cancer risk (p = 2.83 × 10⁻¹⁶, OR = 1.36, 95% CI = 1.31-1.42), indicating that it may underlie this signal. Exon skipping attributable to the deletion (risk) allele introduces a premature stop codon in exon 7 of CTRB2, yielding a truncated chymotrypsinogen B2 protein that lacks chymotrypsin activity, is poorly secreted, and accumulates intracellularly in the endoplasmic reticulum (ER). We propose that intracellular accumulation of a nonfunctional chymotrypsinogen B2 protein leads to ER stress and pancreatic inflammation, which may explain the increased pancreatic cancer risk in carriers of CTRB2 exon 6 deletion alleles.
Subject(s)
Chymotrypsin/genetics , Pancreatic Neoplasms/pathology , Polymorphism, Single Nucleotide , Quantitative Trait Loci , Sequence Deletion , Case-Control Studies , Chymotrypsin/antagonists & inhibitors , Chymotrypsin/metabolism , Genome-Wide Association Study , Genotype , Humans , Pancreatic Neoplasms/etiology , Pancreatic Neoplasms/metabolism
ABSTRACT
Although many loci have been associated with height in European ancestry populations, very few have been identified in African ancestry individuals. Furthermore, many of the known loci have yet to be generalized to and fine-mapped within a large-scale African ancestry sample. We performed sex-combined and sex-stratified meta-analyses in up to 52,764 individuals with height and genome-wide genotyping data from the African Ancestry Anthropometry Genetics Consortium (AAAGC). We additionally combined our African ancestry meta-analysis results with published European genome-wide association study (GWAS) data. In the African ancestry analyses, we identified three novel loci (SLC4A3, NCOA2, ECD/FAM149B1) in sex-combined results and two loci (CRB1, KLF6) in women only. In the African plus European sex-combined GWAS, we identified an additional three novel loci (RCCD1, G6PC3, CEP95) which were equally driven by AAAGC and European results. Among 39 genome-wide significant signals at known loci, conditioning index SNPs from European studies identified 20 secondary signals. Two of the 20 new secondary signals and none of the 8 novel loci had minor allele frequencies (MAF) < 5%. Of 802 known European height signals, 643 displayed directionally consistent associations with height, of which 205 were nominally significant (p < 0.05) in the African ancestry sex-combined sample. Furthermore, 148 of 241 loci contained ≤20 variants in the credible sets that jointly account for 99% of the posterior probability of driving the associations. In summary, trans-ethnic meta-analyses revealed novel signals and further improved fine-mapping of putative causal variants in loci shared between African and European ancestry populations.
Subject(s)
Black People/genetics , Body Height/genetics , Genome-Wide Association Study , Africa/ethnology , Black or African American/genetics , Europe/ethnology , Female , Humans , Male , Polymorphism, Single Nucleotide/genetics
ABSTRACT
A combination of genetic and functional approaches has identified three independent breast cancer risk loci at 2q35. A recent fine-scale mapping analysis to refine these associations resulted in 1 (signal 1), 5 (signal 2), and 42 (signal 3) credible causal variants at these loci. We used publicly available in silico DNase I and ChIP-seq data with in vitro reporter gene and CRISPR assays to annotate signals 2 and 3. We identified putative regulatory elements that enhanced cell-type-specific transcription from the IGFBP5 promoter at both signals (30- to 40-fold increased expression by the putative regulatory element at signal 2, 2- to 3-fold by the putative regulatory element at signal 3). We further identified one of the five credible causal variants at signal 2, a 1.4 kb deletion (esv3594306), as the likely causal variant; the deletion allele of this variant was associated with an average additional increase in IGFBP5 expression of 1.3-fold (MCF-7) and 2.2-fold (T-47D). We propose a model in which the deletion allele of esv3594306 juxtaposes two transcription factor binding regions (annotated by estrogen receptor alpha ChIP-seq peaks) to generate a single extended regulatory element. This regulatory element increases cell-type-specific expression of the tumor suppressor gene IGFBP5 and, thereby, reduces risk of estrogen receptor-positive breast cancer (odds ratio = 0.77, 95% CI 0.74-0.81, p = 3.1 × 10⁻³¹).
Subject(s)
Insulin-Like Growth Factor Binding Protein 5/genetics , Molecular Sequence Annotation , Promoter Regions, Genetic , Breast Neoplasms/genetics , CRISPR-Cas Systems , Cell Line , Chromosome Mapping , Chromosomes, Human, Pair 2 , Female , Genetic Association Studies , Genetic Variation , Humans , Risk Factors , Sequence Deletion
ABSTRACT
Burkitt lymphoma (BL) is an aggressive B-cell lymphoma that significantly contributes to childhood cancer burden in sub-Saharan Africa. Plasmodium falciparum, which causes malaria, is geographically associated with BL, but the evidence remains insufficient for causal inference. Inference could be strengthened by demonstrating that Mendelian genes known to protect against malaria, such as the sickle cell trait variant HBB-rs334(T), also protect against BL. We investigated this hypothesis among 800 BL cases and 3845 controls in four East African countries using genome-scan data to detect polymorphisms in 22 genes known to affect malaria risk. We fit generalized linear mixed models to estimate odds ratios (OR) and 95% confidence intervals (95% CI), controlling for age, sex, country, and ancestry. The ORs of the loci with BL and P. falciparum infection among controls were correlated (Spearman's ρ = 0.37, p = .039). HBB-rs334(T) was associated with lower P. falciparum infection risk among controls (OR = 0.752, 95% CI 0.628-0.9; p = .00189) and BL risk (OR = 0.687, 95% CI 0.533-0.885; p = .0037). ABO-rs8176703(T) was associated with decreased risk of BL (OR = 0.591, 95% CI 0.379-0.992; p = .00271), but not of P. falciparum infection. Our results increase support for the etiological correlation between P. falciparum and BL risk.
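As a reading aid, the reported odds ratios and confidence intervals can be cross-checked against their p-values. The sketch below is not the authors' mixed-model analysis; it assumes a symmetric Wald interval on the log-odds scale and a normal test statistic. The function name `wald_from_ci` is hypothetical and only the Python standard library is used.

```python
import math

def wald_from_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover SE(log OR), the Wald z statistic, and a two-sided p-value
    from an odds ratio and its 95% CI (assumes a symmetric Wald interval
    on the log scale; hypothetical helper, not the authors' code)."""
    # Width of the CI on the log scale equals 2 * z_crit * SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided normal tail probability: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p

# HBB-rs334(T) and BL risk as reported above: OR = 0.687, 95% CI 0.533-0.885.
se, z, p = wald_from_ci(0.687, 0.533, 0.885)
print(f"SE(log OR) = {se:.3f}, z = {z:.2f}, p = {p:.4f}")
```

Under these assumptions the recovered p-value for HBB-rs334(T) and BL risk comes out close to the reported .0037. Small discrepancies for other loci would be expected from rounding of the published interval limits.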
Subject(s)
Burkitt Lymphoma , Malaria, Falciparum , Malaria , Sickle Cell Trait , Humans , Africa, Eastern , Alleles , Burkitt Lymphoma/epidemiology , Burkitt Lymphoma/genetics , Malaria, Falciparum/epidemiology , Malaria, Falciparum/genetics , Malaria, Falciparum/complications , Sickle Cell Trait/epidemiology , Sickle Cell Trait/genetics , Sickle Cell Trait/complications , Nectins/metabolism
ABSTRACT
BACKGROUND: Genome-wide studies of gene-environment interactions (G×E) may identify variants associated with disease risk in conjunction with lifestyle/environmental exposures. We conducted a genome-wide G×E analysis of ~7.6 million common variants and seven lifestyle/environmental risk factors for breast cancer risk overall and for estrogen receptor positive (ER+) breast cancer. METHODS: Analyses were conducted using 72,285 breast cancer cases and 80,354 controls of European ancestry from the Breast Cancer Association Consortium. Gene-environment interactions were evaluated using standard unconditional logistic regression models and likelihood ratio tests for breast cancer risk overall and for ER+ breast cancer. Bayesian False Discovery Probability was employed to assess the noteworthiness of each SNP-risk factor pair. RESULTS: Assuming a 1 × 10⁻⁵ prior probability of a true association for each SNP-risk factor pair and a Bayesian False Discovery Probability < 15%, we identified two independent SNP-risk factor pairs: rs80018847(9p13)-LINGO2 and adult height in association with overall breast cancer risk (ORint = 0.94, 95% CI 0.92-0.96), and rs4770552(13q12)-SPATA13 and age at menarche for ER+ breast cancer risk (ORint = 0.91, 95% CI 0.88-0.94). CONCLUSIONS: Overall, the contribution of G×E interactions to the heritability of breast cancer is very small. At the population level, multiplicative G×E interactions do not make an important contribution to risk prediction in breast cancer.
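The Bayesian False Discovery Probability mentioned above can be illustrated with a minimal sketch based on Wakefield's approximate Bayes factor. This is not the consortium's pipeline. The prior variance `prior_var` (here chosen so that 95% of prior mass has an OR below 1.5), the back-calculated standard error, and the function name `bfdp` are all assumptions made for illustration; the abstract specifies only the prior probability and the 15% threshold.

```python
import math

def bfdp(log_or, se, prior_prob, prior_var):
    """Bayesian False Discovery Probability via Wakefield's approximate
    Bayes factor (hypothetical helper; prior_var is an assumed N(0, W)
    prior variance on the log odds ratio, not a value from the abstract)."""
    v = se ** 2                            # sampling variance of the estimate
    z_sq = (log_or / se) ** 2              # squared Wald statistic
    shrink = prior_var / (v + prior_var)   # W / (V + W)
    # Approximate Bayes factor in favour of the null hypothesis.
    abf = math.sqrt((v + prior_var) / v) * math.exp(-0.5 * z_sq * shrink)
    prior_odds_null = (1 - prior_prob) / prior_prob
    x = abf * prior_odds_null
    return x / (1 + x)

# Illustrative inputs: interaction OR 0.94 (95% CI 0.92-0.96), prior 1e-5.
se = (math.log(0.96) - math.log(0.92)) / (2 * 1.959964)  # assumes a Wald CI
w = (math.log(1.5) / 1.959964) ** 2                      # assumed prior variance
print(f"BFDP = {bfdp(math.log(0.94), se, 1e-5, w):.3f}")
```

The result is sensitive to the assumed prior variance and to rounding in the published interval, so it should not be expected to reproduce the values used in the study. The sketch only shows how a prior probability of 1 × 10⁻⁵ and an effect estimate combine into a quantity that can be compared with a 15% threshold.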