Your browser doesn't support javascript.
loading
CASPIAN: A method to identify chromatin topological associated domains based on spatial density cluster.
Gong, Haiyan; Yang, Yi; Zhang, Xiaotong; Li, Minghong; Zhang, Sichen; Chen, Yang.
Affiliation
  • Gong H; School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China.
  • Yang Y; School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China.
  • Zhang X; School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China.
  • Li M; Shunde Graduate School, University of Science and Technology Beijing, Foshan 528399, Guangdong, China.
  • Zhang S; School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China.
  • Chen Y; School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China.
Comput Struct Biotechnol J ; 20: 4816-4824, 2022.
Article de En | MEDLINE | ID: mdl-36147659
ABSTRACT
With the development of Hi-C technology, the detection of topologically associated domains (TADs) boundaries plays an important role in exploring the relationship between gene structure and expression. However, a method that can identify accurate TAD boundaries from the Hi-C contact matrix with different resolutions is currently lacking. We proposed a method named CASPIAN that can identify chromatin TAD boundaries based on the spatial density clustering algorithm. CASPIAN requires few parameters to call TADs. This method is realized using the hierarchical density-based clustering method HDBSCAN, where the distance of pairwise bins is calculated based on three distance metrics (Euclidean, Manhattan, and Chebyshev distance metric) to adapt to the characteristics of the Hi-C contact matrix generated from simulation experiments or normalized methods. Our results show that, same as standard methods (e.g., Insulation Score, TopDom), CASPIAN can enrich factors related to promoting the gene expression, such as CTCF, H3K4me1, H3K4me3, RAD21, POLR2A, and SMC3. We also calculated the approximate proportion of various factors anchored at the TAD boundaries to observe the distribution of these factors surrounding the TAD boundaries. In conclusion, CASPIAN is an easy method to explore the relationship between transcription factors and TAD boundaries. CASPIAN is available online (https//gitee.com/ghaiyan/caspian).
Mots clés

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Type d'étude: Prognostic_studies / Risk_factors_studies Langue: En Journal: Comput Struct Biotechnol J Année: 2022 Type de document: Article Pays d'affiliation: Chine

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Type d'étude: Prognostic_studies / Risk_factors_studies Langue: En Journal: Comput Struct Biotechnol J Année: 2022 Type de document: Article Pays d'affiliation: Chine