Your browser doesn't support javascript.
loading
THPLM: a sequence-based deep learning framework for protein stability changes prediction upon point variations using pretrained protein language model.
Gong, Jianting; Jiang, Lili; Chen, Yongbing; Zhang, Yixiang; Li, Xue; Ma, Zhiqiang; Fu, Zhiguo; He, Fei; Sun, Pingping; Ren, Zilin; Tian, Mingyao.
Afiliação
  • Gong J; School of Information Science and Technology, Institution of Computational Biology, Northeast Normal University, Changchun 130117, China.
  • Jiang L; Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Changchun 130122, China.
  • Chen Y; School of Information Science and Technology, Institution of Computational Biology, Northeast Normal University, Changchun 130117, China.
  • Zhang Y; Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Changchun 130122, China.
  • Li X; School of Information Science and Technology, Institution of Computational Biology, Northeast Normal University, Changchun 130117, China.
  • Ma Z; Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Changchun 130122, China.
  • Fu Z; School of Information Science and Technology, Institution of Computational Biology, Northeast Normal University, Changchun 130117, China.
  • He F; Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Changchun 130122, China.
  • Sun P; Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Changchun 130122, China.
  • Ren Z; School of Information Science and Technology, Institution of Computational Biology, Northeast Normal University, Changchun 130117, China.
  • Tian M; Department of Computer Science, College of Humanities and Sciences of Northeast Normal University, Changchun 130117, China.
Bioinformatics ; 39(11)2023 11 01.
Article em En | MEDLINE | ID: mdl-37874953
MOTIVATION: Quantitative determination of protein thermodynamic stability is a critical step in protein and drug design. Reliable prediction of protein stability changes caused by point variations contributes to developing-related fields. Over the past decades, dozens of structure-based and sequence-based methods have been proposed, showing good prediction performance. Despite the impressive progress, it is necessary to explore wild-type and variant protein representations to address the problem of how to represent the protein stability change in view of global sequence. With the development of structure prediction using learning-based methods, protein language models (PLMs) have shown accurate and high-quality predictions of protein structure. Because PLM captures the atomic-level structural information, it can help to understand how single-point variations cause functional changes. RESULTS: Here, we proposed THPLM, a sequence-based deep learning model for stability change prediction using Meta's ESM-2. With ESM-2 and a simple convolutional neural network, THPLM achieved comparable or even better performance than most methods, including sequence-based and structure-based methods. Furthermore, the experimental results indicate that the PLM's ability to generate representations of sequence can effectively improve the ability of protein function prediction. AVAILABILITY AND IMPLEMENTATION: The source code of THPLM and the testing data can be accessible through the following links: https://github.com/FPPGroup/THPLM.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Aprendizado Profundo Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: China País de publicação: Reino Unido

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Aprendizado Profundo Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: China País de publicação: Reino Unido