RESUMEN
Cancer development and progression are generally associated with gene dysregulation, often resulting from changes in the transcription factor (TF) sequence or expression. Identifying key TFs involved in cancer gene regulation provides a framework for potential new therapeutics. This study presents a large-scale cancer gene TF-DNA interaction network, as well as an extensive promoter clone resource for future studies. Highly connected TFs bind to promoters of genes associated with either good or poor cancer prognosis, suggesting that strategies aimed at shifting gene expression balance between these two prognostic groups may be inherently complex. However, we identified potential for oncogene-targeted therapeutics, with half of the tested oncogenes being potentially repressed by influencing specific activators or bifunctional TFs. Finally, we investigate the role of intrinsically disordered regions within the key cancer-related TF ESR1 in DNA binding and transcriptional activity, and found that these regions can have complex trade-offs in TF function. Altogether, our study broadens our knowledge of the TFs involved in cancer gene regulation and provides a valuable resource for future studies and therapeutics.
Asunto(s)
ADN , Regulación Neoplásica de la Expresión Génica , Redes Reguladoras de Genes , Neoplasias , Unión Proteica , Factores de Transcripción , Humanos , Neoplasias/genética , Neoplasias/metabolismo , Factores de Transcripción/metabolismo , Factores de Transcripción/genética , ADN/metabolismo , ADN/genética , Regiones Promotoras Genéticas/genética , Oncogenes/genética , Pronóstico , Receptor alfa de Estrógeno/metabolismo , Receptor alfa de Estrógeno/genética , Biología Computacional/métodosRESUMEN
Cancer development and progression are generally associated with dysregulation of gene expression, often resulting from changes in transcription factor (TF) sequence or expression. Identifying key TFs involved in cancer gene regulation provides a framework for potential new therapeutics. This study presents a large-scale cancer gene TF-DNA interaction network as well as an extensive promoter clone resource for future studies. Most highly connected TFs do not show a preference for binding to promoters of genes associated with either good or poor cancer prognosis, suggesting that emerging strategies aimed at shifting gene expression balance between these two prognostic groups may be inherently complex. However, we identified potential for oncogene targeted therapeutics, with half of the tested oncogenes being potentially repressed by influencing specific activator or bifunctional TFs. Finally, we investigate the role of intrinsically disordered regions within the key cancer-related TF estrogen receptor É (ESR1) on DNA binding and transcriptional activity, and found that these regions can have complex trade-offs in TF function. Altogether, our study not only broadens our knowledge of TFs involved in the cancer gene regulatory network but also provides a valuable resource for future studies, laying a foundation for potential therapeutic strategies targeting TFs in cancer.
RESUMEN
What new questions can we ask about transcriptional regulation given recent developments in large-scale approaches?
Asunto(s)
Regulación de la Expresión Génica , Regulación de la Expresión Génica/genéticaRESUMEN
Although >90% of somatic mutations reside in non-coding regions, few have been reported as cancer drivers. To predict driver non-coding variants (NCVs), we present a transcription factor (TF)-aware burden test based on a model of coherent TF function in promoters. We apply this test to NCVs from the Pan-Cancer Analysis of Whole Genomes cohort and predict 2555 driver NCVs in the promoters of 813 genes across 20 cancer types. These genes are enriched in cancer-related gene ontologies, essential genes, and genes associated with cancer prognosis. We find that 765 candidate driver NCVs alter transcriptional activity, 510 lead to differential binding of TF-cofactor regulatory complexes, and that they primarily impact the binding of ETS factors. Finally, we show that different NCVs within a promoter often affect transcriptional activity through shared mechanisms. Our integrated computational and experimental approach shows that cancer NCVs are widespread and that ETS factors are commonly disrupted.
Asunto(s)
Neoplasias , Humanos , Mutación , Neoplasias/genética , Sitios de Unión/genética , Factores de Transcripción/metabolismo , Regulación de la Expresión GénicaRESUMEN
The specificity in gene regulation is controlled by interactions between transcription factors (TFs) and genomic DNA regions such as promoters and enhancers. Enhanced yeast one-hybrid (eY1H) assays are among the methods used for high-throughput detection of transcription factor-DNA interactions. Here, we describe the procedure for screening interactions between DNA regions of interest ("DNA-baits") and an array of transcription factors ("TF-preys"), after DNA-bait and TF-prey yeast strains have been generated. Using a high-density array robotic platform, this method can be used to screen interactions between multiple DNA regions and >1000 TFs within a single experiment.
Asunto(s)
ADN , Saccharomyces cerevisiae , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Técnicas del Sistema de Dos Híbridos , ADN/genética , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Regulación de la Expresión GénicaRESUMEN
Multiple immunoinformatic tools have been developed to predict T-cell epitopes from protein amino acid sequences for different major histocompatibility complex (MHC) alleles. These prediction tools output hundreds of potential peptide candidates which require further processing; however, these tools are either not graphical or not friendly for non-programming users. We present Epitope-Evaluator, a web tool developed in the Shiny/R framework to interactively analyze predicted T-cell epitopes. Epitope-Evaluator contains six tools providing the distribution of epitopes across a selected set of MHC alleles, the promiscuity and conservation of epitopes, and their density and location within antigens. Epitope-Evaluator requires as input the fasta file of protein sequences and the output prediction file coming out from any predictor. By choosing different cutoffs and parameters, users can produce several interactive plots and tables that can be downloaded as JPG and text files, respectively. Using Epitope-Evaluator, we found the HLA-B*40, HLA-B*27:05 and HLA-B*07:02 recognized fewer epitopes from the SARS-CoV-2 proteome than other MHC Class I alleles. We also identified shared epitopes between Delta, Omicron, and Wuhan Spike variants as well as variant-specific epitopes. In summary, Epitope-Evaluator removes the programming barrier and provides intuitive tools, allowing a straightforward interpretation and graphical representations that facilitate the selection of candidate epitopes for experimental evaluation. The web server Epitope-Evaluator is available at https://fuxmanlab.shinyapps.io/Epitope-Evaluator/.
Asunto(s)
COVID-19 , Epítopos de Linfocito T , Antígenos HLA-B , Antígenos de Histocompatibilidad Clase I , Humanos , SARS-CoV-2RESUMEN
Single nucleotide variants (SNVs) located in transcriptional regulatory regions can result in gene expression changes that lead to adaptive or detrimental phenotypic outcomes. Here, we predict gain or loss of binding sites for 741 transcription factors (TFs) across the human genome. We calculated 'gainability' and 'disruptability' scores for each TF that represent the likelihood of binding sites being created or disrupted, respectively. We found that functional cis-eQTL SNVs are more likely to alter TF binding sites than rare SNVs in the human population. In addition, we show that cancer somatic mutations have different effects on TF binding sites from different TF families on a cancer-type basis. Finally, we discuss the relationship between these results and cancer mutational signatures. Altogether, we provide a blueprint to study the impact of SNVs derived from genetic variation or disease association on TF binding to gene regulatory regions.
Asunto(s)
Genoma Humano , Polimorfismo de Nucleótido Simple , Factores de Transcripción/genética , Sitios de Unión , Expresión Génica , Humanos , Neoplasias/genética , Sitios de Carácter Cuantitativo , Factores de Transcripción/metabolismoRESUMEN
Identifying transcription factor (TF) binding to noncoding variants, uncharacterized DNA motifs, and repetitive genomic elements has been technically and computationally challenging. Current experimental methods, such as chromatin immunoprecipitation, generally test one TF at a time, and computational motif algorithms often lead to false-positive and -negative predictions. To address these limitations, we developed an experimental approach based on enhanced yeast one-hybrid assays. The first variation of this approach interrogates the binding of >1000 human TFs to repetitive DNA elements, while the second evaluates TF binding to single nucleotide variants, short insertions and deletions (indels), and novel DNA motifs. Using this approach, we detected the binding of 75 TFs, including several nuclear hormone receptors and ETS factors, to the highly repetitive Alu elements. Further, we identified cancer-associated changes in TF binding, including gain of interactions involving ETS TFs and loss of interactions involving KLF TFs to different mutations in the TERT promoter, and gain of a MYB interaction with an 18-bp indel in the TAL1 superenhancer. Additionally, we identified TFs that bind to three uncharacterized DNA motifs identified in DNase footprinting assays. We anticipate that these enhanced yeast one-hybrid approaches will expand our capabilities to study genetic variation and undercharacterized genomic regions.
Asunto(s)
Biología Computacional/métodos , ADN/química , ADN/metabolismo , Neoplasias/genética , Factores de Transcripción/metabolismo , Algoritmos , Línea Celular Tumoral , Inmunoprecipitación de Cromatina , Regulación de la Expresión Génica , Células Hep G2 , Humanos , Mutación INDEL , Células K562 , Neoplasias/metabolismo , Motivos de Nucleótidos , Polimorfismo de Nucleótido Simple , Regiones Promotoras Genéticas , Secuencias Repetitivas de Ácidos Nucleicos , Factores de Transcripción/química , Técnicas del Sistema de Dos HíbridosRESUMEN
Identifying the sets of transcription factors (TFs) that regulate each human gene is a daunting task that requires integrating numerous experimental and computational approaches. One such method is the yeast one-hybrid (Y1H) assay, in which interactions between TFs and DNA regions are tested in the milieu of the yeast nucleus using reporter genes. Y1H assays involve two components: a 'DNA-bait' (e.g., promoters, enhancers, silencers, etc.) and a 'TF-prey,' which can be screened for reporter gene activation. Most published protocols for performing Y1H screens are based on transforming TF-prey libraries or arrays into DNA-bait yeast strains. Here, we describe a pipeline, called enhanced Y1H (eY1H) assays, where TF-DNA interactions are interrogated by mating DNA-bait strains with an arrayed collection of TF-prey strains using a high density array (HDA) robotic platform that allows screening in a 1,536 colony format. This allows for a dramatic increase in throughput (60 DNA-bait sequences against >1,000 TFs takes two weeks per researcher) and reproducibility. We illustrate the different types of expected results by testing human promoter sequences against an array of 1,086 human TFs, as well as examples of issues that can arise during screens and how to troubleshoot them.
Asunto(s)
Secuencia de Bases/genética , Factores de Transcripción/genética , Técnicas del Sistema de Dos Híbridos/normas , HumanosRESUMEN
Cytokines are cell-to-cell signaling proteins that play a central role in immune development, pathogen responses, and diseases. Cytokines are highly regulated at the transcriptional level by combinations of transcription factors (TFs) that recruit cofactors and the transcriptional machinery. Here, we mined through three decades of studies to generate a comprehensive database, CytReg, reporting 843 and 647 interactions between TFs and cytokine genes, in human and mouse respectively. By integrating CytReg with other functional datasets, we determined general principles governing the transcriptional regulation of cytokine genes. In particular, we show a correlation between TF connectivity and immune phenotype and disease, we discuss the balance between tissue-specific and pathogen-activated TFs regulating each cytokine gene, and cooperativity and plasticity in cytokine regulation. We also illustrate the use of our database as a blueprint to predict TF-disease associations and identify potential TF-cytokine regulatory axes in autoimmune diseases. Finally, we discuss research biases in cytokine regulation studies, and use CytReg to predict novel interactions based on co-expression and motif analyses which we further validated experimentally. Overall, this resource provides a framework for the rational design of future cytokine gene regulation studies.