Your browser doesn't support javascript.
loading
Identification of DNA-protein binding residues through integration of Transformer encoder and Bi-directional Long Short-Term Memory.
Zhao, Haipeng; Zhu, Baozhong; Jiang, Tengsheng; Cui, Zhiming; Wu, Hongjie.
Afiliação
  • Zhao H; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou, China.
  • Zhu B; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou, China.
  • Jiang T; Gusu School, Nanjing Medical University, Suzhou, China.
  • Cui Z; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou, China.
  • Wu H; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou, China.
Math Biosci Eng ; 21(1): 170-185, 2024 Jan.
Article em En | MEDLINE | ID: mdl-38303418
ABSTRACT
DNA-protein binding is crucial for the normal development and function of organisms. The significance of accurately identifying DNA-protein binding sites lies in its role in disease prevention and the development of innovative approaches to disease treatment. In the present study, we introduce a precise and robust identifier for DNA-protein binding residues. In the context of protein representation, we combine the evolutionary information of the protein, represented by its position-specific scoring matrix, with the spatial information of the protein's secondary structure, enriching the overall informational content. This approach initially employs a combination of Bi-directional Long Short-Term Memory and Transformer encoder to jointly extract the interdependencies among residues within the protein sequence. Subsequently, convolutional operations are applied to the resulting feature matrix to capture local features of the residues. Experimental results on the benchmark dataset demonstrate that our method exhibits a higher level of competitiveness when compared to contemporary classifiers. Specifically, our method achieved an MCC of 0.349, SP of 96.50%, SN of 44.03% and ACC of 94.59% on the PDNA-41 dataset.
Assuntos
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Proteínas / Memória de Curto Prazo Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Math Biosci Eng Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Proteínas / Memória de Curto Prazo Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Math Biosci Eng Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China