Your browser doesn't support javascript.
loading
MotifHub: Detection of trans-acting DNA motif group with probabilistic modeling algorithm.
Liu, Zhe; Wong, Hiu-Man; Chen, Xingjian; Lin, Jiecong; Zhang, Shixiong; Yan, Shankai; Wang, Fuzhou; Li, Xiangtao; Wong, Ka-Chun.
Afiliação
  • Liu Z; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong, China.
  • Wong HM; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong, China.
  • Chen X; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong, China.
  • Lin J; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong, China.
  • Zhang S; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong, China.
  • Yan S; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong, China.
  • Wang F; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong, China.
  • Li X; School of Artificial Intelligence, Jilin University, Jilin, China.
  • Wong KC; Department of Computer Science, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong, China. Electronic address: kc.w@cityu.edu.hk.
Comput Biol Med ; 168: 107753, 2024 01.
Article em En | MEDLINE | ID: mdl-38039889
ABSTRACT

BACKGROUND:

Trans-acting factors are of special importance in transcription regulation, which is a group of proteins that can directly or indirectly recognize or bind to the 8-12 bp core sequence of cis-acting elements and regulate the transcription efficiency of target genes. The progressive development in high-throughput chromatin capture technology (e.g., Hi-C) enables the identification of chromatin-interacting sequence groups where trans-acting DNA motif groups can be discovered. The problem difficulty lies in the combinatorial nature of DNA sequence pattern matching and its underlying sequence pattern search space.

METHOD:

Here, we propose to develop MotifHub for trans-acting DNA motif group discovery on grouped sequences. Specifically, the main approach is to develop probabilistic modeling for accommodating the stochastic nature of DNA motif patterns.

RESULTS:

Based on the modeling, we develop global sampling techniques based on EM and Gibbs sampling to address the global optimization challenge for model fitting with latent variables. The results reflect that our proposed approaches demonstrate promising performance with linear time complexities.

CONCLUSION:

MotifHub is a novel algorithm considering the identification of both DNA co-binding motif groups and trans-acting TFs. Our study paves the way for identifying hub TFs of stem cell development (OCT4 and SOX2) and determining potential therapeutic targets of prostate cancer (FOXA1 and MYC). To ensure scientific reproducibility and long-term impact, its matrix-algebra-optimized source code is released at http//bioinfo.cs.cityu.edu.hk/MotifHub.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Software Idioma: En Revista: Comput Biol Med Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Software Idioma: En Revista: Comput Biol Med Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China