Your browser doesn't support javascript.
loading
Machine learning enables pan-cancer identification of mutational hotspots at persistent CTCF binding sites.
Chen, Wenhan; Zeng, Yi C; Achinger-Kawecka, Joanna; Campbell, Elyssa; Jones, Alicia K; Stewart, Alastair G; Khoury, Amanda; Clark, Susan J.
Affiliation
  • Chen W; Epigenetics Laboratory, Garvan Institute of Medical Research, Sydney 2010 New South Wales, Australia.
  • Zeng YC; Structural Biology Laboratory, Victor Chang Cardiac Research Institute, Sydney 2010 New South Wales, Australia.
  • Achinger-Kawecka J; St Vincent's Clinical School, UNSW, Sydney 2010 New South Wales, Australia.
  • Campbell E; Epigenetics Laboratory, Garvan Institute of Medical Research, Sydney 2010 New South Wales, Australia.
  • Jones AK; St Vincent's Clinical School, UNSW, Sydney 2010 New South Wales, Australia.
  • Stewart AG; Epigenetics Laboratory, Garvan Institute of Medical Research, Sydney 2010 New South Wales, Australia.
  • Khoury A; Epigenetics Laboratory, Garvan Institute of Medical Research, Sydney 2010 New South Wales, Australia.
  • Clark SJ; Structural Biology Laboratory, Victor Chang Cardiac Research Institute, Sydney 2010 New South Wales, Australia.
Nucleic Acids Res ; 2024 Jul 02.
Article in En | MEDLINE | ID: mdl-38950902
ABSTRACT
CCCTC-binding factor (CTCF) is an insulator protein that binds to a highly conserved DNA motif and facilitates regulation of three-dimensional (3D) nuclear architecture and transcription. CTCF binding sites (CTCF-BSs) reside in non-coding DNA and are frequently mutated in cancer. Our previous study identified a small subclass of CTCF-BSs that are resistant to CTCF knock down, termed persistent CTCF binding sites (P-CTCF-BSs). P-CTCF-BSs show high binding conservation and potentially regulate cell-type constitutive 3D chromatin architecture. Here, using ICGC sequencing data we made the striking observation that P-CTCF-BSs display a highly elevated mutation rate in breast and prostate cancer when compared to all CTCF-BSs. To address whether P-CTCF-BS mutations are also enriched in other cell-types, we developed CTCF-INSITE-a tool utilising machine learning to predict persistence based on genetic and epigenetic features of experimentally-determined P-CTCF-BSs. Notably, predicted P-CTCF-BSs also show a significantly elevated mutational burden in all 12 cancer-types tested. Enrichment was even stronger for P-CTCF-BS mutations with predicted functional impact to CTCF binding and chromatin looping. Using in vitro binding assays we validated that P-CTCF-BS cancer mutations, predicted to be disruptive, indeed reduced CTCF binding. Together this study reveals a new subclass of cancer specific CTCF-BS DNA mutations and provides insights into their importance in genome organization in a pan-cancer setting.

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Nucleic Acids Res Year: 2024 Document type: Article Affiliation country: Australia

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Nucleic Acids Res Year: 2024 Document type: Article Affiliation country: Australia
...