Your browser doesn't support javascript.
loading
PEL-PVP: Application of plant vacuolar protein discriminator based on PEFT ESM-2 and bilayer LSTM in an unbalanced dataset.
Xiao, Cuilin; Zhou, Zheyu; She, Jiayi; Yin, Jinfen; Cui, Feifei; Zhang, Zilong.
Affiliation
  • Xiao C; School of Computer Science and Technology, Hainan University, Haikou 570228, China.
  • Zhou Z; School of Computer Science and Technology, Hainan University, Haikou 570228, China.
  • She J; School of Computer Science and Technology, Hainan University, Haikou 570228, China.
  • Yin J; School of Computer Science and Technology, Hainan University, Haikou 570228, China.
  • Cui F; School of Computer Science and Technology, Hainan University, Haikou 570228, China.
  • Zhang Z; School of Computer Science and Technology, Hainan University, Haikou 570228, China. Electronic address: zhangzilong@hainanu.edu.cn.
Int J Biol Macromol ; 277(Pt 3): 134317, 2024 Jul 31.
Article in En | MEDLINE | ID: mdl-39094861
ABSTRACT
Plant vacuoles, play a crucial role in maintaining cellular stability, adapting to environmental changes, and responding to external pressures. The accurate identification of vacuolar proteins (PVPs) is crucial for understanding the biosynthetic mechanisms of intracellular vacuoles and the adaptive mechanisms of plants. In order to more accurately identify vacuole proteins, this study developed a new predictive model PEL-PVP based on ESM-2. Through this study, the feasibility and effectiveness of using advanced pre-training models and fine-tuning techniques for bioinformatics tasks were demonstrated, providing new methods and ideas for plant vacuolar protein research. In addition, previous datasets for vacuolar proteins were balanced, but imbalance is more closely related to the actual situation. Therefore, this study constructed an imbalanced dataset UB-PVP from the UniProt database,helping the model better adapt to the complexity and uncertainty in real environments, thereby improving the model's generalization ability and practicality. The experimental results show that compared with existing recognition techniques, achieving significant improvements in multiple indicators, with 6.08 %, 13.51 %, 11.9 %, and 5 % improvements in ACC, SP, MCC, and AUC, respectively. The accuracy reaches 94.59 %, significantly higher than the previous best model GraphIdn. This provides an efficient and precise tool for the study of plant vacuole proteins.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Int J Biol Macromol Year: 2024 Document type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Int J Biol Macromol Year: 2024 Document type: Article Affiliation country: China