ABSTRACT
The Encyclopedia of DNA Elements (ENCODE) project has established a genomic resource for mammalian development, profiling a diverse panel of mouse tissues at 8 developmental stages from 10.5 days after conception until birth, including transcriptomes, methylomes and chromatin states. Here we systematically examined the state and accessibility of chromatin in the developing mouse fetus. In total we performed 1,128 chromatin immunoprecipitation with sequencing (ChIP-seq) assays for histone modifications and 132 assay for transposase-accessible chromatin using sequencing (ATAC-seq) assays for chromatin accessibility across 72 distinct tissue-stages. We used integrative analysis to develop a unified set of chromatin state annotations, infer the identities of dynamic enhancers and key transcriptional regulators, and characterize the relationship between chromatin state and accessibility during developmental gene regulation. We also leveraged these data to link enhancers to putative target genes and demonstrate tissue-specific enrichments of sequence variants associated with disease in humans. The mouse ENCODE data sets provide a compendium of resources for biomedical researchers and achieve, to our knowledge, the most comprehensive view of chromatin dynamics during mammalian fetal development to date.
Subject(s)
Chromatin/genetics , Chromatin/metabolism , Datasets as Topic , Fetal Development/genetics , Histones/metabolism , Molecular Sequence Annotation , Regulatory Sequences, Nucleic Acid/genetics , Animals , Chromatin/chemistry , Chromatin Immunoprecipitation Sequencing , Disease/genetics , Enhancer Elements, Genetic/genetics , Female , Gene Expression Regulation, Developmental/genetics , Genetic Variation , Histones/chemistry , Humans , Male , Mice , Mice, Inbred C57BL , Organ Specificity/genetics , Reproducibility of Results , Transposases/metabolismABSTRACT
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
ABSTRACT
How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs). Using these motifs to predict biological outputs depending on local epigenetic state such as somatic mutation rates would confirm their functionality. Here, we used DNA motifs including known TF motifs and epi-motifs as a surrogate of epigenetic signals to predict somatic mutation rates in 13 cancers at an average 23kbp resolution. We implemented an interpretable neural network model, called contextual regression, to successfully learn the universal relationship between mutations and DNA motifs, and uncovered motifs that are most impactful on the regional mutation rates such as TP53 and epi-motifs associated with H3K9me3. Furthermore, we identified genomic regions with significantly higher mutation rates than the expected values in each individual tumor and demonstrated that such cancer-related regions can accurately predict cancer types. Interestingly, we found that the same mutation signatures often have different contributions to cancer-related and cancer-independent regions, and we also identified the motifs with the most contribution to each mutation signature.
Subject(s)
Mutation Rate , Neoplasms , Humans , Nucleotide Motifs/genetics , Mutation/genetics , Epigenesis, Genetic/genetics , Neoplasms/geneticsABSTRACT
DNA methylation plays crucial roles in many biological processes and abnormal DNA methylation patterns are often observed in diseases. Recent studies have shed light on cis-acting DNA elements that regulate locus-specific DNA methylation, which involves transcription factors, histone modification and DNA secondary structures. In addition, several recent studies have surveyed DNA motifs that regulate DNA methylation and suggest potential applications in diagnosis and prognosis. Here, we discuss the current biological foundation for the cis-acting genetic code that regulates DNA methylation. We review the computational models that predict DNA methylation with genetic features and discuss the biological insights revealed from these models. We also provide an in-depth discussion on how to leverage such knowledge in clinical applications, particularly in the context of liquid biopsy for early cancer diagnosis and treatment.
Subject(s)
CpG Islands , DNA Methylation , Epigenesis, Genetic , Genetic Code , Models, Genetic , Promoter Regions, Genetic , Animals , HumansABSTRACT
Histones are modified by enzymes that act in a locus, cell-type, and developmental stage-specific manner. The recruitment of enzymes to chromatin is regulated at multiple levels, including interaction with sequence-specific DNA-binding factors. However, the DNA-binding specificity of the regulatory factors that orchestrate specific histone modifications has not been broadly mapped. We have analyzed 6 histone marks (H3K4me1, H3K4me3, H3K27ac, H3K27me3, K3H9me3, H3K36me3) across 121 human cell types and tissues from the NIH Roadmap Epigenomics Project as well as 8 histone marks (with addition of H3K4me2 and H3K9ac) from the mouse ENCODE Consortium. We have identified 361 and 369 DNA motifs in human and mouse, respectively, that are the most predictive of each histone mark. Interestingly, 107 human motifs are conserved between the two species. In human embryonic cell line H1, we mutated only the found DNA motifs at particular loci and the significant reduction of H3K27ac levels validated the regulatory roles of the perturbed motifs. The functionality of these motifs was also supported by the evidence that histone-associated motifs, especially H3K4me3 motifs, significantly overlap with the expression of quantitative trait loci SNPs in cancer patients more than the known and random motifs. Furthermore, we observed possible feedbacks to control chromatin dynamics as the found motifs appear in the promoters or enhancers associated with various histone modification enzymes. These results pave the way toward revealing the molecular mechanisms of epigenetic events, such as histone modification dynamics and epigenetic priming.
Subject(s)
DNA Methylation/genetics , Histone Code/genetics , Nucleotide Motifs/genetics , Regulatory Sequences, Nucleic Acid/genetics , Animals , Chromatin/genetics , DNA-Binding Proteins/genetics , Epigenomics , Humans , Mice , Promoter Regions, Genetic , Protein Processing, Post-Translational/geneticsABSTRACT
DNA methylation is an important epigenetic mark but how its locus-specificity is decided in relation to DNA sequence is not fully understood. Here, we have analyzed 34 diverse whole-genome bisulfite sequencing datasets in human and identified 313 motifs, including 92 and 221 associated with methylation (methylation motifs, MMs) and unmethylation (unmethylation motifs, UMs), respectively. The functionality of these motifs is supported by multiple lines of evidence. First, the methylation levels at the MM and UM motifs are respectively higher and lower than the genomic background. Second, these motifs are enriched at the binding sites of methylation modifying enzymes including DNMT3A and TET1, indicating their possible roles of recruiting these enzymes. Third, these motifs significantly overlap with "somatic QTLs" (quantitative trait loci)Ā of methylation and expression. Fourth, disruption of these motifs by mutation is associated with significantly altered methylation level of the CpGs in the neighbor regions. Furthermore, these motifs together with somatic mutations are predictive of cancer subtypes and patient survival. We revealed some of these motifs were also associated with histone modifications, suggesting a possible interplay between the two types of epigenetic modifications. We also found some motifs form feed forward loops to contribute to DNA methylation dynamics.
Subject(s)
DNA Methylation/genetics , DNA/genetics , Epigenesis, Genetic/genetics , Base Sequence , Binding Sites , CpG Islands , DNA/metabolism , DNA (Cytosine-5-)-Methyltransferases/metabolism , DNA Methyltransferase 3A , DNA, Neoplasm/genetics , Datasets as Topic , Histone Code , Humans , Kaplan-Meier Estimate , Mixed Function Oxygenases/metabolism , Models, Genetic , Neoplasms/genetics , Neoplasms/mortality , Promoter Regions, Genetic/genetics , Proto-Oncogene Proteins/metabolism , Quantitative Trait Loci , Sequence Analysis, DNAABSTRACT
MOTIVATION: Increasing evidence has shown that nucleotide modifications such as methylation and hydroxymethylation on cytosine would greatly impact the binding of transcription factors (TFs). However, there is a lack of motif finding algorithms with the function to search for motifs with modified bases. In this study, we expand on our previous motif finding pipeline Epigram to provide systematic de novo motif discovery and performance evaluation on methylated DNA motifs. RESULTS: mEpigram outperforms both MEME and DREME on finding modified motifs in simulated data that mimics various motif enrichment scenarios. Furthermore we were able to identify methylated motifs in Arabidopsis DNA affinity purification sequencing (DAP-seq) data that were previously demonstrated to contain such motifs. When applied to TF ChIP-seq and DNA methylome data in H1 and GM12878, our method successfully identified novel methylated motifs that can be recognized by the TFs or their co-factors. We also observed spacing constraint between the canonical motif of the TF and the newly discovered methylated motifs, which suggests operative recognition of these cis-elements by collaborative proteins. AVAILABILITY AND IMPLEMENTATION: The mEpigram program is available at http://wanglab.ucsd.edu/star/mEpigram. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Subject(s)
Algorithms , Sequence Analysis, DNA , Binding Sites , Chromatin Immunoprecipitation , Nucleotide Motifs , Transcription FactorsABSTRACT
MOTIVATION: DNA methylation signatures in rheumatoid arthritis (RA) have been identified in fibroblast-like synoviocytes (FLS) with Illumina HumanMethylation450 array. Since <2% of CpG sites are covered by the Illumina 450K array and whole genome bisulfite sequencing is still too expensive for many samples, computationally predicting DNA methylation levels based on 450K data would be valuable to discover more RA-related genes. RESULTS: We developed a computational model that is trained on 14 tissues with both whole genome bisulfite sequencing and 450K array data. This model integrates information derived from the similarity of local methylation pattern between tissues, the methylation information of flanking CpG sites and the methylation tendency of flanking DNA sequences. The predicted and measured methylation values were highly correlated with a Pearson correlation coefficient of 0.9 in leave-one-tissue-out cross-validations. Importantly, the majority (76%) of the top 10% differentially methylated loci among the 14 tissues was correctly detected using the predicted methylation values. Applying this model to 450K data of RA, osteoarthritis and normal FLS, we successfully expanded the coverage of CpG sites 18.5-fold and accounts for about 30% of all the CpGs in the human genome. By integrative omics study, we identified genes and pathways tightly related to RA pathogenesis, among which 12 genes were supported by triple evidences, including 6 genes already known to perform specific roles in RA and 6 genes as new potential therapeutic targets. AVAILABILITY AND IMPLEMENTATION: The source code, required data for prediction, and demo data for test are freely available at: http://wanglab.ucsd.edu/star/LR450K/ CONTACT: wei-wang@ucsd.edu or gfirestein@ucsd.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Subject(s)
Arthritis, Rheumatoid , DNA Methylation , CpG Islands , Fibroblasts , Genome, Human , Humans , Oligonucleotide Array Sequence AnalysisABSTRACT
The Infinium HumanMethylation450 BeadChip array, referred as 450K array hereinafter, has been widely adopted as an affordable technique to determine DNA methylation. Tens of thousands of data have been generated on diverse cell types and patient tissues, which have provided great insight into understanding the crucial roles of epigenetic modifications in many biological processes and diseases. The limitation of this technique is its coverage, which measures methylation levels of about 450,000 CpGs, accounting for about 1.6% of all CpGs in the human genome. In the present study we developed and compared computational models to significantly expand the coverage of Illumina 450K (~11 folds). Using the whole genome bisulfite sequencing and Illumina 450K data in the human H1 embryonic stem cell, we showed that the predicted and measured methylation levels were well correlated. Our proposed model showed superior prediction accuracies compared to the existing methods on the same dataset. When applied to predict the DNA methylome on other cells, our proposed model achieved comparable performance in cross-validations, which indicates the generalizibility of the method. Our method would thus be invaluable to maximize the usage of the existing data.
Subject(s)
CpG Islands , DNA Methylation , Oligonucleotide Array Sequence Analysis/methods , Embryonic Stem Cells/cytology , Epigenesis, Genetic , Genome, Human , Humans , Logistic Models , Support Vector MachineABSTRACT
Diabetic macular edema (DME) has emerged as the foremost cause of vision loss in the population with diabetes. Early detection of DME is paramount, yet the prevailing screening, relying on two-dimensional and labor-intensive fundus photography (FP), results in frequent unwarranted referrals and overlooked diagnoses. Self-imaging optical coherence tomography (SI-OCT), offering fully automated, three-dimensional macular imaging, holds the potential to enhance DR screening. We conducted an observational study within a cohort of 1822 participants with diabetes, who received comprehensive assessments, including visual acuity testing, FP, and SI-OCT examinations. We compared the performance of three screening strategies: the conventional FP-based strategy, a combination strategy of FP and SI-OCT, and a simulated combination strategy of FP and manual SD-OCT. Additionally, we undertook a cost-effectiveness analysis utilizing Markov models to evaluate the costs and benefits of the three strategies for referable DR. We found that the FP + SI-OCT strategy demonstrated superior sensitivity (87.69% vs 61.53%) and specificity (98.29% vs 92.47%) in detecting DME when compared to the FP-based strategy. Importantly, the FP + SI-OCT strategy outperformed the FP-based strategy, with an incremental cost-effectiveness ratio (ICER) of $8016 per quality-adjusted life year (QALY), while the FP + SD-OCT strategy was less cost-effective, with an ICER of $45,754/QALY. Our results were robust to extensive sensitivity analyses, with the FP + SI-OCT strategy standing as the dominant choice in 69.36% of simulations conducted at the current willingness-to-pay threshold. In summary, incorporating SI-OCT into FP-based screening offers substantial enhancements in sensitivity, specificity for detecting DME, and most notably, cost-effectiveness for DR screening.
ABSTRACT
As CRISPR-based therapies enter the clinic, evaluation of safety remains a critical and active area of study. Here, we employ a clinical next generation sequencing (NGS) workflow to achieve high sequencing depth and detect ultra-low frequency variants across exons of genes associated with cancer, all exons, and genome wide. In three separate primary human hematopoietic stem and progenitor cell (HSPC) donors assessed in technical triplicates, we electroporated high-fidelity Cas9 protein targeted to three loci (AAVS1, HBB, and ZFPM2) and harvested genomic DNA at days 4 and 10. Our results demonstrate that clinically relevant delivery of high-fidelity Cas9 to primary HSPCs and ex vivo culture up to 10 days does not introduce or enrich for tumorigenic variants and that even a single SNP in a gRNA spacer sequenceĀ is sufficient to eliminate Cas9 off-target activity in primary, repair-competent human HSPCs.
Subject(s)
CRISPR-Cas Systems , Gene Editing , CRISPR-Associated Protein 9/genetics , CRISPR-Associated Protein 9/metabolism , CRISPR-Cas Systems/genetics , Gene Editing/methods , Hematopoietic Stem Cells/metabolism , High-Throughput Nucleotide Sequencing , Humans , RNA, Guide, Kinetoplastida/geneticsABSTRACT
A scarcity of functionally validated enhancers in the human genome presents a significant hurdle to understanding how these cis-regulatory elements contribute to human diseases. We carry out highly multiplexed CRISPR-based perturbation and sequencing to identify enhancers required for cell proliferation and fitness in 10 human cancer cell lines. Our results suggest that the cell fitness enhancers, unlike their target genes, display high cell-type specificity of chromatin features. They typically adopt a modular structure, comprised of activating elements enriched for motifs of oncogenic transcription factors, surrounded by repressive elements enriched for motifs recognized by transcription factors with tumor suppressor functions. We further identify cell fitness enhancers that are selectively accessible in clinical tumor samples, and the levels of chromatin accessibility are associated with patient survival. These results reveal functional enhancers across multiple cancer cell lines, characterize their context-dependent chromatin organization, and yield insights into altered transcription programs in cancer cells.
Subject(s)
Enhancer Elements, Genetic , Neoplasms , Humans , Enhancer Elements, Genetic/genetics , Chromatin , Genome, Human , Transcription Factors/metabolism , Cell Proliferation/genetics , Neoplasms/geneticsABSTRACT
Current pooled CRISPR screens for cis-regulatory elements (CREs), based on transcriptional output changes, are typically limited to characterizing CREs of only one gene. Here, we describe CRISPRpath, a scalable screening strategy for parallelly characterizing CREs of genes linked to the same biological pathway and converging phenotypes. We demonstrate the ability of CRISPRpath for simultaneously identifying functional enhancers of six genes in the 6-thioguanineĀinduced DNA mismatch repair pathway using both CRISPR interference (CRISPRi) and CRISPR nuclease (CRISPRn) approaches. Sixty percent of the identified enhancers are known promoters with distinct epigenomic features compared to other active promoters, including increased chromatin accessibility and interactivity. Furthermore, by imposing different levels of selection pressure, CRISPRpath can distinguish enhancers exerting strong impact on gene expression from those exerting weak impact. Our results offer a nuanced view of cis-regulation and demonstrate that CRISPRpath can be leveraged for understanding the complex gene regulatory program beyond transcriptional output at scale.
ABSTRACT
Sequence analysis frequently requires intuitive understanding and convenient representation of motifs. Typically, motifs are represented as position weight matrices (PWMs) and visualized using sequence logos. However, in many scenarios, in order to interpret the motif information or search for motif matches, it is compact and sufficient to represent motifs by wildcard-style consensus sequences (such as [GC][AT]GATAAG[GAC]). Based on mutual information theory and Jensen-Shannon divergence, we propose a mathematical framework to minimize the information loss in converting PWMs to consensus sequences. We name this representation as sequence Motto and have implemented an efficient algorithm with flexible options for converting motif PWMs into Motto from nucleotides, amino acids, and customized characters. We show that this representation provides a simple and efficient way to identify the binding sites of 1156 common transcription factors (TFs) in the human genome. The effectiveness of the method was benchmarked by comparing sequence matches found by Motto with PWM scanning results found by FIMO. On average, our method achieves a 0.81 area under the precision-recall curve, significantly (P-value < 0.01) outperforming all existing methods, including maximal positional weight, Cavener's method, and minimal mean square error. We believe this representation provides a distilled summary of a motif, as well as the statistical justification.
Subject(s)
Position-Specific Scoring Matrices , Sequence Analysis, DNA/methods , Algorithms , Genome, Human , Humans , Sequence Analysis, DNA/standards , Transcription Factors/geneticsABSTRACT
BACKGROUNDS: HER-2 positive breast cancer is a subtype of breast cancer with poor clinical outcome. The aim of this study was to identify differentially expressed genes (DEGs) for HER-2 positive breast cancer and elucidate the potential interactions among them. MATERIAL AND METHODS: Three gene expression profiles (GSE29431, GSE45827, and GSE65194) were derived from the Gene Expression Omnibus (GEO) database. GEO2R tool was applied to obtain DEGs between HER-2 positive breast cancer and normal breast tissues. Gene ontology (GO) annotation analysis and Kyoto Encyclopedia of Genes and Genome (KEGG) pathway enrichment analysis was performed by the Database for Annotation, Visualization and Integrated Discovery (David) online tool. Protein-protein interaction (PPI) network, hub gene identification and module analysis was conducted by Cytoscape software. Online Kaplan-Meier plotter survival analysis tool was also used to investigate the prognostic values of hub genes in HER-2 positive breast cancer patients. RESULTS: A total of 54 upregulated DEGs and 269 downregulated DEGs were identified. Among them, 10 hub genes including CCNB1, RAC1, TOP2A, KIF20A, RRM2, ASPM, NUSAP1, BIRC5, BUB1B, and CEP55 demonstrated by connectivity degree in the PPI network were screened out. In Kaplan-Meier plotter survival analysis, the overexpression of RAC1 and RRM2 were shown to be associated with an unfavorable prognosis in HER-2 positive breast cancer patients. CONCLUSIONS: This present study identified a number of potential target genes and pathways which might impact the oncogenesis and progression of HER-2 positive breast cancer. These findings could provide new insights into the detection of novel diagnostic and therapeutic biomarkers for this disease.
Subject(s)
Breast Neoplasms/genetics , Gene Expression Regulation, Neoplastic/genetics , Ribonucleoside Diphosphate Reductase/genetics , rac1 GTP-Binding Protein/genetics , Case-Control Studies , Computational Biology , Down-Regulation , Female , Humans , Receptor, ErbB-2 , Transcriptome/genetics , Up-RegulationABSTRACT
Transcriptional regulation is pivotal to the specification of distinct cell types during embryonic development. However, it still lacks a systematic way to identify key transcription factors (TFs) orchestrating the temporal and tissue specificity of gene expression. Here, we integrated epigenomic and transcriptomic data to reveal key regulators from two cells to postnatal day 0 in mouse embryogenesis. We predicted three-dimensional chromatin interactions in 12 tissues across eight developmental stages, which facilitates linking TFs to their target genes for constructing transcriptional regulatory networks. To identify driver TFs, we developed a new algorithm, dubbed Taiji, to assess the global influence of each TF and systematically uncovered TFs critical for lineage-specific and stage-dependent tissue specification. We have also identified TF combinations that function in spatiotemporal order to form transcriptional waves regulating developmental progress. Furthermore, lacking stage-specific TF combinations suggests a distributed timing strategy to orchestrate the coordination between tissues during embryonic development.
Subject(s)
Algorithms , Embryonic Development/genetics , Gene Expression Regulation, Developmental , Gene Regulatory Networks , Transcription Factors/genetics , Animals , Epigenomics/methods , Gene Expression Profiling/methods , Mice , Organ Specificity/genetics , Transcription Factors/classificationABSTRACT
The integration of genomic and DNA methylation data has been demonstrated as a powerful strategy in understanding cancer mechanisms and identifying therapeutic targets. The TCGA consortium has mapped DNA methylation in thousands of cancer samples using Illumina Infinium Human Methylation 450 K BeadChip (Illumina 450 K array) that only covers about 1.5% of CpGs in the human genome. Therefore, increasing the coverage of the DNA methylome would significantly leverage the usage of the TCGA data. Here, we present a new model called EAGLING that can expand the Illumina 450 K array data 18 times to cover about 30% of the CpGs in the human genome. We applied it to analyze 13 cancers in TCGA. By integrating the expanded methylation, gene expression, and somatic mutation data, we identified the genes showing differential patterns in each of the 13 cancers. Many of the triple-evidenced genes identified in majority of the cancers are biomarkers or potential biomarkers. Pan-cancer analysis also revealed the pathways in which the triple-evidenced genes are enriched, which include well known ones as well as new ones, such as axonal guidance signaling pathway and pathways related to inflammatory processing or inflammation response. Triple-evidenced genes, particularly TNXB, RRM2, CELSR3, SLC16A3, FANCI, MMP9, MMP11, SIK1, and TRIM59 showed superior predictive power in both tumor diagnosis and prognosis. These results have demonstrated that the integrative analysis using the expanded methylation data is powerful in identifying critical genes/pathways that may serve as new therapeutic targets.
ABSTRACT
Epigenetics contributes to the pathogenesis of immune-mediated diseases like rheumatoid arthritis (RA). Here we show the first comprehensive epigenomic characterization of RA fibroblast-like synoviocytes (FLS), including histone modifications (H3K27ac, H3K4me1, H3K4me3, H3K36me3, H3K27me3, and H3K9me3), open chromatin, RNA expression and whole-genome DNA methylation. To address complex multidimensional relationship and reveal epigenetic regulation of RA, we perform integrative analyses using a novel unbiased method to identify genomic regions with similar profiles. Epigenomically similar regions exist in RA cells and are associated with active enhancers and promoters and specific transcription factor binding motifs. Differentially marked genes are enriched for immunological and unexpected pathways, with "Huntington's Disease Signaling" identified as particularly prominent. We validate the relevance of this pathway to RA by showing that Huntingtin-interacting protein-1 regulates FLS invasion into matrix. This work establishes a high-resolution epigenomic landscape of RA and demonstrates the potential for integrative analyses to identify unanticipated therapeutic targets.
Subject(s)
Arthritis, Rheumatoid/genetics , Epigenesis, Genetic , Fibroblasts/metabolism , Synoviocytes/metabolism , Adult , Aged , Arthritis, Rheumatoid/metabolism , Chromatin/genetics , Chromatin/metabolism , DNA Methylation , Female , Histone Code , Histones/genetics , Histones/metabolism , Humans , Male , Methylation , Middle Aged , Promoter Regions, GeneticABSTRACT
The human genome is tightly packaged into chromatin whose functional output depends on both one-dimensional (1D) local chromatin states and three-dimensional (3D) genome organization. Currently, chromatin modifications and 3D genome organization are measured by distinct assays. An emerging question is whether it is possible to deduce 3D interactions by integrative analysis of 1D epigenomic data and associate 3D contacts to functionality of the interacting loci. Here we present EpiTensor, an algorithm to identify 3D spatial associations within topologically associating domains (TADs) from 1D maps of histone modifications, chromatin accessibility and RNA-seq. We demonstrate that active promoter-promoter, promoter-enhancer and enhancer-enhancer associations identified by EpiTensor are highly concordant with those detected by Hi-C, ChIA-PET and eQTL analyses at 200 bp resolution. Moreover, EpiTensor has identified a set of interaction hotspots, characterized by higher chromatin and transcriptional activity as well as enriched TF and ncRNA binding across diverse cell types, which may be critical for stabilizing the local 3D interactions.