Your browser doesn't support javascript.
loading
Modulation of DNA-protein Interactions by Proximal Genetic Elements as Uncovered by Interpretable Deep Learning.
Kalakoti, Yogesh; Peter, Swathik Clarancia; Gawande, Swaraj; Sundar, Durai.
Afiliação
  • Kalakoti Y; Department of Biochemical Engineering & Biotechnology, Indian Institute of Technology (IIT) Delhi, New Delhi 110016, India. Electronic address: yogesh.kalakoti@dbeb.iitd.ac.in.
  • Peter SC; Department of Biochemical Engineering & Biotechnology, Indian Institute of Technology (IIT) Delhi, New Delhi 110016, India; Regional Centre for Biotechnology (RCB), Faridabad, Haryana 121001, India. Electronic address: pswathik.clarancia@rcb.res.in.
  • Gawande S; Department of Computer Science and Engineering, Indian Institute of Technology (IIT) Delhi, New Delhi 110016, India. Electronic address: cs1190406@cse.iitd.ac.in.
  • Sundar D; Department of Biochemical Engineering & Biotechnology, Indian Institute of Technology (IIT) Delhi, New Delhi 110016, India; Yardi School of Artificial Intelligence, Indian Institute of Technology (IIT) Delhi, New Delhi 110016, India. Electronic address: sundar@dbeb.iitd.ac.in.
J Mol Biol ; 435(13): 168121, 2023 07 01.
Article em En | MEDLINE | ID: mdl-37100167
Transcription factors (TF) recognize specific motifs in the genome that are typically 6-12 bp long to regulate various aspects of the cellular machinery. Presence of binding motifs and favorable genome accessibility are key drivers for a consistent TF-DNA interaction. Although these pre-requisites may occur thousands of times in the genome, there seems to be a high degree of selectivity for the sites that are actually bound. Here, we present a deep-learning framework that identifies and characterizes the upstream and downstream genetic elements to the binding motif, for their role in enforcing the mentioned selectivity. The proposed framework is based on an interpretable recurrent neural network architecture that enables for the relative analysis of sequence context features. We apply the framework to model twenty-six transcription factors and score the TF-DNA binding at a base-pair resolution. We find significant differences in activations of DNA context features for bound and unbound sequences. In addition to standardized evaluation protocols, we offer outstanding interpretability that enables us to identify and annotate DNA sequence with possible elements that modulate TF-DNA binding. Also, differences in data processing have a huge influence on the overall model performance. Overall, the proposed framework allows for novel insights on the non-coding genetic elements and their role in facilitating a stable TF-DNA interaction.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fatores de Transcrição / DNA / Aprendizado Profundo Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fatores de Transcrição / DNA / Aprendizado Profundo Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article