ABSTRACT
Traditional gene set enrichment analysis falters when applied to large genomic domains, where neighboring genes often share functions. This spatial dependency creates misleading enrichments, in which mere physical proximity is mistaken for a genuine biological connection. Here we present Spatial Adjusted Gene Ontology (SAGO), a novel cyclic permutation-based approach that tackles this challenge. By incorporating the genes' spatial arrangement into the analysis, SAGO separates enrichments due to spatial proximity from genuine biological links. We applied SAGO to various datasets in which the identified genomic intervals are large, including replication timing domains, large H3K9me3 and H3K27me3 domains, Hi-C compartments and lamina-associated domains (LADs). Intriguingly, applying SAGO to prostate cancer samples with large copy number alteration (CNA) domains eliminated most of the enriched GO terms, thus helping to accurately identify biologically relevant gene sets linked to oncogenic processes, free from spatial bias.
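The idea of a cyclic permutation test can be illustrated with a minimal sketch. The abstract does not specify SAGO's test statistic, how chromosomes are handled, or the number of permutations, so everything below is an assumption made for illustration: genes are treated as one ordered circular list, the statistic is the simple overlap count between domain genes and term genes, and the function name `cyclic_shift_pvalue` and its parameters are hypothetical. Shifting the domain labels along the gene order preserves their clustering, so a term whose genes are merely clustered in the genome is not automatically called enriched.

```python
import random

def cyclic_shift_pvalue(in_domain, in_term, n_perm=999, seed=0):
    """Empirical p-value for the overlap between domain genes and term genes.

    in_domain, in_term: 0/1 flags, one per gene, in genomic order.
    Assumption (not from the source): the genome is treated as a single
    circle and the null is built from random non-zero circular shifts
    of the domain labels.
    """
    n = len(in_domain)
    if n != len(in_term):
        raise ValueError("both flag lists must cover the same genes")
    if n < 2:
        raise ValueError("need at least two genes to shift")

    # Observed statistic: number of genes that are in a domain AND in the term.
    observed = sum(1 for d, t in zip(in_domain, in_term) if d and t)

    rng = random.Random(seed)
    at_least_as_large = 0
    for _ in range(n_perm):
        k = rng.randrange(1, n)  # non-zero offset, so labels really move
        # Gene i receives the domain label of gene i - k (wrapping around).
        shifted = sum(1 for i in range(n) if in_domain[(i - k) % n] and in_term[i])
        if shifted >= observed:
            at_least_as_large += 1

    # Add-one correction keeps the empirical p-value strictly above zero.
    return observed, (at_least_as_large + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Toy data: 20 genes, one domain covering the first 5, term on the same 5.
    domain = [1] * 5 + [0] * 15
    term = [1] * 5 + [0] * 15
    print(cyclic_shift_pvalue(domain, term, n_perm=99))
```

A real analysis would more likely shift within chromosomes and correct for multiple testing across terms; both are left out here to keep the sketch short.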
ABSTRACT
Stochastic asynchronous replication timing (AS-RT) is a phenomenon in which the two alleles of a locus replicate at different times, and the identity of the early-replicating allele varies between cells. By taking advantage of stable clonal pre-B cell populations derived from C57BL6/Castaneus mice, we have mapped AS-RT loci genome-wide, independently of genetic differences. These regions are characterized by differential chromatin accessibility and mono-allelic expression, and include new gene families involved in specifying cell identity. By combining population-level mapping with single-cell FISH, our data reveal the existence of a novel regulatory program that coordinates a fixed relationship between AS-RT regions on any given chromosome, with some loci set to replicate in a parallel orientation and others in an anti-parallel orientation. Our results show that AS-RT is a highly regulated epigenetic mark established during early embryogenesis that may be used to facilitate the programming of mono-allelic choice throughout development.