Your browser doesn't support javascript.
loading
TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering.
Gong, Haiyan; Zhang, Dawei; Zhang, Xiaotong.
Affiliation
  • Gong H; Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing, 100083, China.
  • Zhang D; School of Computer and Communication Engineering, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing, 100083, China.
  • Zhang X; Shunde innovation School, University of Science and Technology Beijing, Foshan, 528399, Guangdong, China.
Comput Struct Biotechnol J ; 21: 4759-4768, 2023.
Article in En | MEDLINE | ID: mdl-37822562
ABSTRACT
Topologically associated domains (TADs) play a pivotal role in disease detection. This study introduces a novel TADs recognition approach named TOAST, leveraging graph auto-encoders and clustering techniques. TOAST conceptualizes each genomic bin as a node of a graph and employs the Hi-C contact matrix as the graph's adjacency matrix. By employing graph auto-encoders, TOAST generates informative embeddings as features. Subsequently, the unsupervised clustering algorithm HDBSCAN is utilized to assign labels to each genomic bin, facilitating the identification of contiguous regions with the same label as TADs. Our experimental analysis of several simulated Hi-C data sets shows that TOAST can quickly and accurately identify TADs from different types of simulated Hi-C contact matrices, outperforming existing algorithms. We also determined the anchoring ratio of TAD boundaries by analyzing different TAD recognition algorithms, and obtained an average ratio of anchoring CTCF, SMC3, RAD21, POLR2A, H3K36me3, H3K9me3, H3K4me3, H3K4me1, Enhancer, and Promoters of 0.66, 0.47, 0.54, 0.27, 0.24, 0.12, 0.32, 0.41, 0.26, and 0.13, respectively. In conclusion, TOAST is a method that can quickly identify TAD boundary parameters that are easy to understand and have important biological significance. The TOAST web server can be accessed via http//223.223.185.1894005/. The code of TOAST is available online at https//github.com/ghaiyan/TOAST.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Risk_factors_studies Language: En Journal: Comput Struct Biotechnol J Year: 2023 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Risk_factors_studies Language: En Journal: Comput Struct Biotechnol J Year: 2023 Document type: Article Affiliation country: