Your browser doesn't support javascript.
loading
Multiomics-integrated deep language model enables in silico genome-wide detection of transcription factor binding site in unexplored biosamples.
Yang, Zikun; Li, Xin; Sheng, Lele; Zhu, Ming; Lan, Xun; Gu, Fei.
Afiliación
  • Yang Z; Damo Academy, Alibaba Group, Hangzhou 310023, China.
  • Li X; Hupan Lab, Hangzhou 310023, China.
  • Sheng L; Damo Academy, Alibaba Group, Hangzhou 310023, China.
  • Zhu M; Hupan Lab, Hangzhou 310023, China.
  • Lan X; Damo Academy, Alibaba Group, Hangzhou 310023, China.
  • Gu F; Hupan Lab, Hangzhou 310023, China.
Bioinformatics ; 40(1)2024 01 02.
Article en En | MEDLINE | ID: mdl-38216534
ABSTRACT
MOTIVATION Transcription factor binding sites (TFBS) are regulatory elements that have significant impact on transcription regulation and cell fate determination. Canonical motifs, biological experiments, and computational methods have made it possible to discover TFBS. However, most existing in silico TFBS prediction models are solely DNA-based, and are trained and utilized within the same biosample, which fail to infer TFBS in experimentally unexplored biosamples.

RESULTS:

Here, we propose TFBS prediction by modified TransFormer (TFTF), a multimodal deep language architecture which integrates multiomics information in epigenetic studies. In comparison to existing computational techniques, TFTF has state-of-the-art accuracy, and is also the first approach to accurately perform genome-wide detection for cell-type and species-specific TFBS in experimentally unexplored biosamples. Compared to peak calling methods, TFTF consistently discovers true TFBS in threshold tuning-free way, with higher recalled rates. The underlying mechanism of TFTF reveals greater attention to the targeted TF's motif region in TFBS, and general attention to the entire peak region in non-TFBS. TFTF can benefit from the integration of broader and more diverse data for improvement and can be applied to multiple epigenetic scenarios. AVAILABILITY AND IMPLEMENTATION We provide a web server (https//tftf.ibreed.cn/) for users to utilize TFTF model. Users can train TFTF model and discover TFBS with their own data.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Genoma / Multiómica Tipo de estudio: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article País de afiliación: China Pais de publicación: Reino Unido

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Genoma / Multiómica Tipo de estudio: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article País de afiliación: China Pais de publicación: Reino Unido