ABSTRACT
Intergenic transcription in normal and cancerous tissues is pervasive but incompletely understood. To investigate this, we constructed an atlas of over 180,000 consensus RNA polymerase II (RNAPII)-bound intergenic regions from 900 RNAPII chromatin immunoprecipitation sequencing (ChIP-seq) experiments in normal and cancer samples. Through unsupervised analysis, we identified 51 RNAPII consensus clusters, many of which mapped to specific biotypes and revealed tissue-specific regulatory signatures. We developed a meta-clustering methodology to integrate our RNAPII atlas with active transcription across 28,797 RNA sequencing (RNA-seq) samples from The Cancer Genome Atlas (TCGA), Genotype-Tissue Expression (GTEx), and Encyclopedia of DNA Elements (ENCODE). This analysis revealed strong tissue- and disease-specific interconnections between RNAPII occupancy and transcriptional activity. We demonstrate that intergenic transcription at RNAPII-bound regions is a novel per-cancer and pan-cancer biomarker. This biomarker displays genomic and clinically relevant characteristics, distinguishing cancer subtypes and linking to overall survival. Our results demonstrate the effectiveness of coherent data integration to uncover intergenic transcriptional activity in normal and cancer tissues.
ABSTRACT
Chagas disease is a parasitical disease caused by Trypanosoma cruzi which affects â¼7 million people worldwide. Per year, â¼10 000 people die from this pathology. Indeed, â¼30% of humans develop severe chronic forms, including cardiac, digestive or neurological disorders, for which there is still no treatment. In order to facilitate research on Chagas disease, a manual curation of all papers corresponding to 'Chagas disease' referenced on PubMed has been performed. All deregulated molecules in hosts (all mammals, humans, mice or others) following T. cruzi infection were retrieved and included in a database, named ChagasDB. A website has been developed to make this database accessible to all. In this article, we detail the construction of this database, its contents and how to use it. Database URL https://chagasdb.tagc.univ-amu.fr.