RESUMO
INTRODUCTION: This study investigates how quantitative texture analysis can be used to non-invasively identify novel radiogenomic correlations with clear cell renal cell carcinoma (ccRCC) biomarkers. METHODS: The Cancer Genome Atlas-Kidney Renal Clear Cell Carcinoma open-source database was used to identify 190 sets of patient genomic data that had corresponding multiphase contrast-enhanced CT images in The Cancer Imaging Archive. 2,824 radiomic features spanning fifteen texture families were extracted from CT images using a custom-built MATLAB software package. Robust radiomic features with strong inter-scanner reproducibility were selected. Random forest, AdaBoost, and elastic net machine learning (ML) algorithms evaluated the ability of the selected radiomic features to predict the presence of 12 clinically relevant molecular biomarkers identified from the literature. ML analysis was repeated with cases stratified by stage (I/II vs. III/IV) and grade (1/2 vs. 3/4). 10-fold cross validation was used to evaluate model performance. RESULTS: Before stratification by tumor grade and stage, radiomics predicted the presence of several biomarkers with weak discrimination (AUC 0.60-0.68). Once stratified, radiomics predicted KDM5C, SETD2, PBRM1, and mTOR mutation status with acceptable to excellent predictive discrimination (AUC ranges from 0.70 to 0.86). CONCLUSIONS: Radiomic texture analysis can potentially identify a variety of clinically relevant biomarkers in patients with ccRCC and may have a prognostic implication.
Assuntos
Carcinoma de Células Renais , Neoplasias Renais , Humanos , Carcinoma de Células Renais/diagnóstico por imagem , Carcinoma de Células Renais/genética , Carcinoma de Células Renais/patologia , Neoplasias Renais/diagnóstico por imagem , Neoplasias Renais/genética , Neoplasias Renais/patologia , Reprodutibilidade dos Testes , Tomografia Computadorizada por Raios X/métodos , Aprendizado de Máquina , Estudos RetrospectivosRESUMO
Lung cancer is the leading cause of cancer-related death and lung adenocarcinoma is its most common subtype. Although genetic alterations have been identified as drivers in subsets of lung adenocarcinoma, they do not fully explain tumor development. Epigenetic alterations have been implicated in the pathogenesis of tumors. To identify epigenetic alterations driving lung adenocarcinoma, we used an improved version of the Tracing Enhancer Networks using Epigenetic Traits method (TENET 2.0) in primary normal lung and lung adenocarcinoma cells. We found over 32,000 enhancers that appear differentially activated between normal lung and lung adenocarcinoma. Among the identified transcriptional regulators inactivated in lung adenocarcinoma vs. normal lung, NKX2-1 was linked to a large number of silenced enhancers. Among the activated transcriptional regulators identified, CENPA, FOXM1, and MYBL2 were linked to numerous cancer-specific enhancers. High expression of CENPA, FOXM1, and MYBL2 is particularly observed in a subgroup of lung adenocarcinomas and is associated with poor patient survival. Notably, CENPA, FOXM1, and MYBL2 are also key regulators of cancer-specific enhancers in breast adenocarcinoma of the basal subtype, but they are associated with distinct sets of activated enhancers. We identified individual lung adenocarcinoma enhancers linked to CENPA, FOXM1, or MYBL2 that were associated with poor patient survival. Knockdown experiments of FOXM1 and MYBL2 suggest that these factors regulate genes involved in controlling cell cycle progression and cell division. For example, we found that expression of TK1, a potential target gene of a MYBL2-linked enhancer, is associated with poor patient survival. Identification and characterization of key transcriptional regulators and associated enhancers in lung adenocarcinoma provides important insights into the deregulation of lung adenocarcinoma epigenomes, highlighting novel potential targets for clinical intervention.
Assuntos
Adenocarcinoma de Pulmão/genética , Epigênese Genética/genética , Elementos Reguladores de Transcrição/genética , Adenocarcinoma/genética , Adulto , Idoso , Proteínas de Ciclo Celular/genética , Epigenômica , Proteína Forkhead Box M1/genética , Regulação Neoplásica da Expressão Gênica/genética , Genes Homeobox , Humanos , Pulmão/metabolismo , Neoplasias Pulmonares/genética , Masculino , Pessoa de Meia-Idade , Sequências Reguladoras de Ácido Nucleico/genéticaRESUMO
High expression of the transcription factor ZFX is correlated with proliferation, tumorigenesis, and patient survival in multiple types of human cancers. However, the mechanism by which ZFX influences transcriptional regulation has not been determined. We performed ChIP-seq in four cancer cell lines (representing kidney, colon, prostate, and breast cancers) to identify ZFX binding sites throughout the human genome. We identified ~9,000 ZFX binding sites and found that the majority of the sites are in CpG island promoters. Moreover, genes with promoters bound by ZFX are expressed at higher levels than genes with promoters not bound by ZFX. To determine if ZFX contributes to regulation of the promoters to which it is bound, we performed RNA-seq analysis after knockdown of ZFX by siRNA in prostate and breast cancer cells. Many genes with promoters bound by ZFX were downregulated upon ZFX knockdown, supporting the hypothesis that ZFX acts as a transcriptional activator. Surprisingly, ZFX binds at +240 bp downstream of the TSS of the responsive promoters. Using Nucleosome Occupancy and Methylome Sequencing (NOMe-seq), we show that ZFX binds between the open chromatin region at the TSS and the first downstream nucleosome, suggesting that ZFX may play a critical role in promoter architecture. We have also shown that a closely related zinc finger protein ZNF711 has a similar binding pattern at CpG island promoters, but ZNF711 may play a subordinate role to ZFX. This functional characterization of ZFX provides important new insights into transcription, chromatin structure, and the regulation of the cancer transcriptome.
RESUMO
Highly tumorigenic, drug-resistant cancer stem-like cells drive cancer progression. These aggressive cells can arise repeatedly from bulk tumor cells independently of mutational events, suggesting an epigenetic mechanism. To test this possibility, we studied bladder cancer cells as they cyclically shifted to and from a cancer stem-like phenotype, and we discovered that these two states exhibit distinct DNA methylation and chromatin accessibility. Most differential chromatin accessibility was independent of methylation and affected the expression of driver genes such as E2F3, a cell cycle regulator associated with aggressive bladder cancer. Cancer stem-like cells exhibited increased E2F3 promoter accessibility and increased E2F3 expression that drove cell migration, invasiveness and drug resistance. Epigenetic interference using a DNA methylation inhibitor blocked the transition to a cancer stem-like state and reduced E2F3 expression. Our findings indicate that epigenetic plasticity plays a key role in the transition to and from an aggressive, drug-resistant phenotype.
Assuntos
Plasticidade Celular/genética , Metilação de DNA/genética , Fator de Transcrição E2F3/genética , Células-Tronco Neoplásicas/patologia , Neoplasias da Bexiga Urinária/genética , Linhagem Celular Tumoral , Movimento Celular/genética , Cromatina/metabolismo , Resistencia a Medicamentos Antineoplásicos/genética , Fator de Transcrição E2F3/metabolismo , Regulação Neoplásica da Expressão Gênica/genética , Humanos , Invasividade Neoplásica/genética , Células-Tronco Neoplásicas/efeitos dos fármacos , Regiões Promotoras Genéticas/genética , Neoplasias da Bexiga Urinária/patologiaRESUMO
Genome-wide association studies (GWAS) have revolutionized the field of cancer genetics, but the causal links between increased genetic risk and onset/progression of disease processes remain to be identified. Here we report the first step in such an endeavor for prostate cancer. We provide a comprehensive annotation of the 77 known risk loci, based upon highly correlated variants in biologically relevant chromatin annotations--we identified 727 such potentially functional SNPs. We also provide a detailed account of possible protein disruption, microRNA target sequence disruption and regulatory response element disruption of all correlated SNPs at r(2) ≥ 0.88%. 88% of the 727 SNPs fall within putative enhancers, and many alter critical residues in the response elements of transcription factors known to be involved in prostate biology. We define as risk enhancers those regions with enhancer chromatin biofeatures in prostate-derived cell lines with prostate-cancer correlated SNPs. To aid the identification of these enhancers, we performed genomewide ChIP-seq for H3K27-acetylation, a mark of actively engaged enhancers, as well as the transcription factor TCF7L2. We analyzed in depth three variants in risk enhancers, two of which show significantly altered androgen sensitivity in LNCaP cells. This includes rs4907792, that is in linkage disequilibrium (r(2) = 0.91) with an eQTL for NUDT11 (on the X chromosome) in prostate tissue, and rs10486567, the index SNP in intron 3 of the JAZF1 gene on chromosome 7. Rs4907792 is within a critical residue of a strong consensus androgen response element that is interrupted in the protective allele, resulting in a 56% decrease in its androgen sensitivity, whereas rs10486567 affects both NKX3-1 and FOXA-AR motifs where the risk allele results in a 39% increase in basal activity and a 28% fold-increase in androgen stimulated enhancer activity. Identification of such enhancer variants and their potential target genes represents a preliminary step in connecting risk to disease process.
Assuntos
Elementos Facilitadores Genéticos , Anotação de Sequência Molecular/classificação , Neoplasias da Próstata/genética , Elementos de Resposta/genética , Alelos , Cromatina/genética , Regulação Neoplásica da Expressão Gênica , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Desequilíbrio de Ligação , Masculino , Análise de Sequência com Séries de Oligonucleotídeos , Polimorfismo de Nucleotídeo Único/genética , Neoplasias da Próstata/metabolismo , Neoplasias da Próstata/patologia , Fatores de Risco , Fatores de Transcrição/genéticaRESUMO
Genome-wide association studies have identified 73 breast cancer risk variants mainly in European populations. Given considerable differences in linkage disequilibrium structure between populations of European and African ancestry, the known risk variants may not be informative for risk in African ancestry populations. In a previous fine-mapping investigation of 19 breast cancer loci, we were able to identify SNPs in four regions that better captured risk associations in African American women. In this study of breast cancer in African American women (3016 cases, 2745 controls), we tested an additional 54 novel breast cancer risk variants. Thirty-eight variants (70%) were found to have an association with breast cancer in the same direction as previously reported, with eight (15%) replicating at P < 0.05. Through fine-mapping, in three regions (1q32, 3p24, 10q25), we identified variants that better captured associations with overall breast cancer or estrogen receptor positive disease. We also observed suggestive associations with variants (at P < 5 × 10(-6)) in three separate regions (6q25, 14q13, 22q12) that may represent novel risk variants. Directional consistency of association observed for â¼65-70% of currently known genetic variants for breast cancer in women of African ancestry implies a shared functional common variant at most loci. To validate and enhance the spectrum of alleles that define associations at the known breast cancer risk loci, as well as genome-wide, will require even larger collaborative efforts in women of African ancestry.
Assuntos
Negro ou Afro-Americano/genética , Neoplasias da Mama/genética , Predisposição Genética para Doença , Feminino , Loci Gênicos , Variação Genética , Estudo de Associação Genômica Ampla , Humanos , Polimorfismo de Nucleotídeo Único , Receptores de Estrogênio/genéticaRESUMO
BACKGROUND: The precise nature of how cell type specific chromatin structures at enhancer sites affect gene expression is largely unknown. Here we identified cell type specific enhancers coupled with gene expression in two different types of breast epithelial cells, HMEC (normal breast epithelial cells) and MDAMB231 (triple negative breast cancer cell line). RESULTS: Enhancers were defined by modified neighboring histones [using chromatin immunoprecipitation followed by sequencing (ChIP-seq)] and nucleosome depletion [using formaldehyde-assisted isolation of regulatory elements followed by sequencing (FAIRE-seq)]. Histone modifications at enhancers were related to the expression levels of nearby genes up to 750 kb away. These expression levels were correlated with enhancer status (poised or active), defined by surrounding histone marks. Furthermore, about fifty percent of poised and active enhancers contained nucleosome-depleted regions. We also identified response element motifs enriched at these enhancer sites that revealed key transcription factors (e.g. TP63) likely involved in regulating breast epithelial enhancer-mediated gene expression. By utilizing expression data, potential target genes of more than 600 active enhancers were identified. These genes were involved in proteolysis, epidermis development, cell adhesion, mitosis, cell cycle, and DNA replication. CONCLUSIONS: These findings facilitate the understanding of epigenetic regulation specifically, such as the relationships between regulatory elements and gene expression and generally, how breast epithelial cellular phenotypes are determined by cell type specific enhancers.
Assuntos
Histonas/metabolismo , Glândulas Mamárias Humanas/metabolismo , Nucleossomos , Sequências Reguladoras de Ácido Nucleico , Linhagem Celular Tumoral , Feminino , Humanos , Glândulas Mamárias Humanas/patologia , Fatores de Transcrição/metabolismoRESUMO
Genome-wide association studies (GWAS) of breast cancer defined by hormone receptor status have revealed loci contributing to susceptibility of estrogen receptor (ER)-negative subtypes. To identify additional genetic variants for ER-negative breast cancer, we conducted the largest meta-analysis of ER-negative disease to date, comprising 4754 ER-negative cases and 31 663 controls from three GWAS: NCI Breast and Prostate Cancer Cohort Consortium (BPC3) (2188 ER-negative cases; 25 519 controls of European ancestry), Triple Negative Breast Cancer Consortium (TNBCC) (1562 triple negative cases; 3399 controls of European ancestry) and African American Breast Cancer Consortium (AABC) (1004 ER-negative cases; 2745 controls). We performed in silico replication of 86 SNPs at P ≤ 1 × 10(-5) in an additional 11 209 breast cancer cases (946 with ER-negative disease) and 16 057 controls of Japanese, Latino and European ancestry. We identified two novel loci for breast cancer at 20q11 and 6q14. SNP rs2284378 at 20q11 was associated with ER-negative breast cancer (combined two-stage OR = 1.16; P = 1.1 × 10(-8)) but showed a weaker association with overall breast cancer (OR = 1.08, P = 1.3 × 10(-6)) based on 17 869 cases and 43 745 controls and no association with ER-positive disease (OR = 1.01, P = 0.67) based on 9965 cases and 22 902 controls. Similarly, rs17530068 at 6q14 was associated with breast cancer (OR = 1.12; P = 1.1 × 10(-9)), and with both ER-positive (OR = 1.09; P = 1.5 × 10(-5)) and ER-negative (OR = 1.16, P = 2.5 × 10(-7)) disease. We also confirmed three known loci associated with ER-negative (19p13) and both ER-negative and ER-positive breast cancer (6q25 and 12p11). Our results highlight the value of large-scale collaborative studies to identify novel breast cancer risk loci.
Assuntos
Neoplasias da Mama/genética , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla , Feminino , Humanos , Polimorfismo de Nucleotídeo Único/genética , Receptores de Estrogênio/genéticaRESUMO
To better understand the impact of chromatin structure on regulation of the prostate cancer transcriptome, we develop high-resolution chromatin interaction maps in normal and prostate cancer cells using in situ Hi-C. By combining the in situ Hi-C data with active and repressive histone marks, CTCF binding sites, nucleosome-depleted regions, and transcriptome profiling, we identify topologically associating domains (TADs) that change in size and epigenetic states between normal and prostate cancer cells. Moreover, we identify normal and prostate cancer-specific enhancer-promoter loops and involved transcription factors. For example, we show that FOXA1 is enriched in prostate cancer-specific enhancer-promoter loop anchors. We also find that the chromatin structure surrounding the androgen receptor (AR) locus is altered in the prostate cancer cells with many cancer-specific enhancer-promoter loops. This creation of 3D epigenomic maps enables a better understanding of prostate cancer biology and mechanisms of gene regulation.
Assuntos
Epigenômica , Neoplasias da Próstata/genética , Transcriptoma/genética , Linhagem Celular Tumoral , Cromatina/metabolismo , Elementos Facilitadores Genéticos , Epigênese Genética , Loci Gênicos , Código das Histonas/genética , Humanos , Masculino , Regiões Promotoras Genéticas , Receptores Androgênicos/genéticaRESUMO
Genome-wide association studies identified single-nucleotide polymorphism (SNP) rs55958994 as a significant variant associated with increased susceptibility to prostate cancer. However, the mechanisms by which this SNP mediates increased risk to cancer are still unknown. In this study, we show that this variant is located in an enhancer active in prostate cancer cells. Deletion of this enhancer from prostate tumor cells resulted in decreased tumor initiation, tumor growth, and invasive migration, as well as a loss of stem-like cells. Using a combination of capture chromosome conformation capture (Capture-C) and RNA sequencing, we identified genes on the same and different chromosomes as targets regulated by the enhancer. Furthermore, we show that expression of individual candidate target genes in an enhancer-deleted cell line rescued different aspects of tumorigenesis. Our data suggest that the rs55958994-associated enhancer affects prostate cancer progression by influencing expression of multiple genes via long-range chromatin interactions.
Assuntos
Cromatina/metabolismo , Progressão da Doença , Regulação Neoplásica da Expressão Gênica , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único/genética , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Alelos , Sequência de Bases , Carcinogênese/genética , Carcinogênese/patologia , Linhagem Celular Tumoral , Elementos Facilitadores Genéticos , Humanos , Masculino , Fenótipo , Fatores de Risco , Deleção de Sequência/genética , Transcriptoma/genéticaRESUMO
NOMe-seq (nucleosome occupancy and methylome sequencing) identifies nucleosome-depleted regions that correspond to promoters, enhancers, and insulators. The NOMe-seq method is based on the treatment of chromatin with the M.CviPI methyltransferase, which methylates GpC dinucleotides that are not protected by nucleosomes or other proteins that are tightly bound to the chromatin (GpCm does not occur in the human genome and therefore there is no endogenous background of GpCm). Following bisulfite treatment of the M.CviPI-methylated chromatin (which converts unmethylated Cs to Ts and thus allows the distinction of GpC from GpCm) and subsequent genomic sequencing, nucleosome-depleted regions can be ascertained on a genome-wide scale. The bisulfite treatment also allows the distinction of CpG from CmpG (most endogenous methylation occurs at CpG dinucleotides) and thus the endogenous methylation status of the genome can also be obtained in the same sequencing reaction. Importantly, open chromatin is expected to have high levels of GpCm but low levels of CmpG; thus, each of the two separate methylation analyses serve as independent (but opposite) measures which provide matching chromatin designations for each regulatory element.NOMe-seq has advantages over ChIP-seq for identification of regulatory elements because it is not reliant upon knowing the exact modifications on the surrounding nucleosomes. Also, NOMe-seq has advantages over DHS (DNase hypersensitive site)-seq, FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements)-seq, and ATAC (Assay for Transposase-Accessible Chromatin)-seq because it also gives positioning information for several nucleosomes on either side of each open regulatory element. Here, we provide a detailed protocol for NOMe-seq that begins with the isolation of chromatin, followed by methylation of GpCs with M.CviPI and treatment with bisulfite, and ending with the creation of next generation sequencing libraries. We also include sequencing QC analysis metrics and bioinformatics steps that can be used to identify nucleosome-depleted regions throughout the genome.
Assuntos
Metilação de DNA , Elementos Facilitadores Genéticos/genética , Genoma Humano/genética , Elementos Isolantes/genética , Nucleossomos/genética , Núcleo Celular/química , Núcleo Celular/genética , Cromatina/química , Cromatina/genética , Ilhas de CpG , Humanos , Nucleossomos/química , Regiões Promotoras Genéticas , Análise de Sequência de DNA , Sulfitos/químicaRESUMO
BACKGROUND: Recent genome-wide association studies (GWAS) have identified more than 100 loci associated with increased risk of prostate cancer, most of which are in non-coding regions of the genome. Understanding the function of these non-coding risk loci is critical to elucidate the genetic susceptibility to prostate cancer. RESULTS: We generate genome-wide regulatory element maps and performed genome-wide chromosome confirmation capture assays (in situ Hi-C) in normal and tumorigenic prostate cells. Using this information, we annotate the regulatory potential of 2,181 fine-mapped prostate cancer risk-associated SNPs and predict a set of target genes that are regulated by prostate cancer risk-related H3K27Ac-mediated loops. We next identify prostate cancer risk-associated CTCF sites involved in long-range chromatin loops. We use CRISPR-mediated deletion to remove prostate cancer risk-associated CTCF anchor regions and the CTCF anchor regions looped to the prostate cancer risk-associated CTCF sites, and we observe up to 100-fold increases in expression of genes within the loops when the prostate cancer risk-associated CTCF anchor regions are deleted. CONCLUSIONS: We identify GWAS risk loci involved in long-range loops that function to repress gene expression within chromatin loops. Our studies provide new insights into the genetic susceptibility to prostate cancer.
Assuntos
Fator de Ligação a CCCTC/metabolismo , Cromatina/metabolismo , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas/genética , Deleção de Genes , Neoplasias da Próstata/genética , Acetilação , Linhagem Celular Tumoral , Elementos Facilitadores Genéticos/genética , Histonas/metabolismo , Humanos , Lisina/metabolismo , Masculino , Polimorfismo de Nucleotídeo Único/genética , Fatores de Risco , Canais de Potássio Ativados por Cálcio de Condutância Baixa/genética , Regulação para Cima/genéticaRESUMO
Despite progress in defining genetic risk for psychiatric disorders, their molecular mechanisms remain elusive. Addressing this, the PsychENCODE Consortium has generated a comprehensive online resource for the adult brain across 1866 individuals. The PsychENCODE resource contains ~79,000 brain-active enhancers, sets of Hi-C linkages, and topologically associating domains; single-cell expression profiles for many cell types; expression quantitative-trait loci (QTLs); and further QTLs associated with chromatin, splicing, and cell-type proportions. Integration shows that varying cell-type proportions largely account for the cross-population variation in expression (with >88% reconstruction accuracy). It also allows building of a gene regulatory network, linking genome-wide association study variants to genes (e.g., 321 for schizophrenia). We embed this network into an interpretable deep-learning model, which improves disease prediction by ~6-fold versus polygenic risk scores and identifies key genes and pathways in psychiatric disorders.
Assuntos
Encéfalo/metabolismo , Regulação da Expressão Gênica , Transtornos Mentais/genética , Conjuntos de Dados como Assunto , Aprendizado Profundo , Elementos Facilitadores Genéticos , Epigênese Genética , Epigenômica , Redes Reguladoras de Genes , Estudo de Associação Genômica Ampla , Humanos , Locos de Características Quantitativas , Análise de Célula Única , TranscriptomaRESUMO
Prostate cancer (PCa) is the leading cancer among men in the United States, with genetic factors contributing to â¼42% of the susceptibility to PCa. We analyzed a PCa risk region located at 7p15.2 to gain insight into the mechanisms by which this noncoding region may affect gene regulation and contribute to PCa risk. We performed Hi-C analysis and demonstrated that this region has long-range interactions with the HOXA locus, located â¼873 kb away. Using the CRISPR/Cas9 system, we deleted a 4-kb region encompassing several PCa risk-associated SNPs and performed RNA-seq to investigate transcriptomic changes in prostate cells lacking the regulatory element. Our results suggest that the risk element affects the expression of HOXA13 and HOTTIP, but not other genes in the HOXA locus, via a repressive loop. Forced expression of HOXA13 was performed to gain further insight into the mechanisms by which this risk element affects PCa risk.
Assuntos
Proteínas de Homeodomínio/metabolismo , Sistemas CRISPR-Cas/genética , Linhagem Celular Tumoral , Proteínas Correpressoras , Proteínas de Ligação a DNA , Loci Gênicos , Proteínas de Homeodomínio/genética , Humanos , Masculino , Proteínas de Neoplasias/genética , Polimorfismo de Nucleotídeo Único , Neoplasias da Próstata/genética , Neoplasias da Próstata/metabolismo , Neoplasias da Próstata/patologia , RNA/química , RNA/isolamento & purificação , RNA/metabolismo , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Risco , Análise de Sequência de RNA , TranscriptomaRESUMO
Background: Genome-wide association studies have identified approximately 100 common genetic variants associated with breast cancer risk, the majority of which were discovered in women of European ancestry. Because of different patterns of linkage disequilibrium, many of these genetic markers may not represent signals in populations of African ancestry.Methods: We tested 74 breast cancer risk variants and conducted fine-mapping of these susceptibility regions in 6,522 breast cancer cases and 7,643 controls of African ancestry from three genetic consortia (AABC, AMBER, and ROOT).Results: Fifty-four of the 74 variants (73%) were found to have ORs that were directionally consistent with those previously reported, of which 12 were nominally statistically significant (P < 0.05). Through fine-mapping, in six regions (3p24, 12p11, 14q13, 16q12/FTO, 16q23, 19p13), we observed seven markers that better represent the underlying risk variant for overall breast cancer or breast cancer subtypes, whereas in another two regions (11q13, 16q12/TOX3), we identified suggestive evidence of signals that are independent of the reported index variant. Overlapping chromatin features and regulatory elements suggest that many of the risk alleles lie in regions with biological functionality.Conclusions: Through fine-mapping of known susceptibility regions, we have revealed alleles that better characterize breast cancer risk in women of African ancestry.Impact: The risk alleles identified represent genetic markers for modeling and stratifying breast cancer risk in women of African ancestry. Cancer Epidemiol Biomarkers Prev; 26(7); 1016-26. ©2017 AACR.
Assuntos
Biomarcadores Tumorais/genética , Negro ou Afro-Americano/genética , Neoplasias da Mama/genética , Predisposição Genética para Doença , Alelos , Neoplasias da Mama/patologia , Estudos de Casos e Controles , Mapeamento Cromossômico , Feminino , Loci Gênicos , Humanos , Polimorfismo de Nucleotídeo Único , Receptores de Estrogênio/metabolismo , Fatores de RiscoRESUMO
BACKGROUND: Although technological advances now allow increased tumor profiling, a detailed understanding of the mechanisms leading to the development of different cancers remains elusive. Our approach toward understanding the molecular events that lead to cancer is to characterize changes in transcriptional regulatory networks between normal and tumor tissue. Because enhancer activity is thought to be critical in regulating cell fate decisions, we have focused our studies on distal regulatory elements and transcription factors that bind to these elements. RESULTS: Using DNA methylation data, we identified more than 25,000 enhancers that are differentially activated in breast, prostate, and kidney tumor tissues, as compared to normal tissues. We then developed an analytical approach called Tracing Enhancer Networks using Epigenetic Traits that correlates DNA methylation levels at enhancers with gene expression to identify more than 800,000 genome-wide links from enhancers to genes and from genes to enhancers. We found more than 1200 transcription factors to be involved in these tumor-specific enhancer networks. We further characterized several transcription factors linked to a large number of enhancers in each tumor type, including GATA3 in non-basal breast tumors, HOXC6 and DLX1 in prostate tumors, and ZNF395 in kidney tumors. We showed that HOXC6 and DLX1 are associated with different clusters of prostate tumor-specific enhancers and confer distinct transcriptomic changes upon knockdown in C42B prostate cancer cells. We also discovered de novo motifs enriched in enhancers linked to ZNF395 in kidney tumors. CONCLUSIONS: Our studies characterized tumor-specific enhancers and revealed key transcription factors involved in enhancer networks for specific tumor types and subgroups. Our findings, which include a large set of identified enhancers and transcription factors linked to those enhancers in breast, prostate, and kidney cancers, will facilitate understanding of enhancer networks and mechanisms leading to the development of these cancers.
Assuntos
Elementos Facilitadores Genéticos/genética , Epigenômica , Fatores de Transcrição/genética , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Neoplasias da Mama/patologia , Linhagem Celular Tumoral , Metilação de DNA , Proteínas de Ligação a DNA/genética , Feminino , Fator de Transcrição GATA3/genética , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Histonas/genética , Histonas/metabolismo , Proteínas de Homeodomínio/antagonistas & inibidores , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/metabolismo , Humanos , Neoplasias Renais/genética , Neoplasias Renais/metabolismo , Neoplasias Renais/patologia , Masculino , Neoplasias da Próstata/genética , Neoplasias da Próstata/metabolismo , Neoplasias da Próstata/patologia , Interferência de RNA , RNA Interferente Pequeno/metabolismo , Fatores de Transcrição/antagonistas & inibidores , Fatores de Transcrição/metabolismoRESUMO
Breast Cancer (BCa) genome-wide association studies revealed allelic frequency differences between cases and controls at index single nucleotide polymorphisms (SNPs). To date, 71 loci have thus been identified and replicated. More than 320,000 SNPs at these loci define BCa risk due to linkage disequilibrium (LD). We propose that BCa risk resides in a subgroup of SNPs that functionally affects breast biology. Such a shortlist will aid in framing hypotheses to prioritize a manageable number of likely disease-causing SNPs. We extracted all the SNPs, residing in 1 Mb windows around breast cancer risk index SNP from the 1000 genomes project to find correlated SNPs. We used FunciSNP, an R/Bioconductor package developed in-house, to identify potentially functional SNPs at 71 risk loci by coinciding them with chromatin biofeatures. We identified 1,005 SNPs in LD with the index SNPs (r(2)≥0.5) in three categories; 21 in exons of 18 genes, 76 in transcription start site (TSS) regions of 25 genes, and 921 in enhancers. Thirteen SNPs were found in more than one category. We found two correlated and predicted non-benign coding variants (rs8100241 in exon 2 and rs8108174 in exon 3) of the gene, ANKLE1. Most putative functional LD SNPs, however, were found in either epigenetically defined enhancers or in gene TSS regions. Fifty-five percent of these non-coding SNPs are likely functional, since they affect response element (RE) sequences of transcription factors. Functionality of these SNPs was assessed by expression quantitative trait loci (eQTL) analysis and allele-specific enhancer assays. Unbiased analyses of SNPs at BCa risk loci revealed new and overlooked mechanisms that may affect risk of the disease, thereby providing a valuable resource for follow-up studies.
Assuntos
Neoplasias da Mama/genética , Frequência do Gene/genética , Predisposição Genética para Doença/genética , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Alelos , Células Cultivadas , Cromatina/genética , Endonucleases/genética , Epigênese Genética/genética , Éxons/genética , Feminino , Estudo de Associação Genômica Ampla/métodos , Humanos , Desequilíbrio de Ligação/genética , Elementos de Resposta/genética , Risco , Fatores de Risco , Fatores de Transcrição/genética , Sítio de Iniciação de TranscriçãoRESUMO
Estrogen receptor (ER)-negative tumors represent 20-30% of all breast cancers, with a higher proportion occurring in younger women and women of African ancestry. The etiology and clinical behavior of ER-negative tumors are different from those of tumors expressing ER (ER positive), including differences in genetic predisposition. To identify susceptibility loci specific to ER-negative disease, we combined in a meta-analysis 3 genome-wide association studies of 4,193 ER-negative breast cancer cases and 35,194 controls with a series of 40 follow-up studies (6,514 cases and 41,455 controls), genotyped using a custom Illumina array, iCOGS, developed by the Collaborative Oncological Gene-environment Study (COGS). SNPs at four loci, 1q32.1 (MDM4, P = 2.1 × 10(-12) and LGR6, P = 1.4 × 10(-8)), 2p24.1 (P = 4.6 × 10(-8)) and 16q12.2 (FTO, P = 4.0 × 10(-8)), were associated with ER-negative but not ER-positive breast cancer (P > 0.05). These findings provide further evidence for distinct etiological pathways associated with invasive ER-positive and ER-negative breast cancers.