Your browser doesn't support javascript.
loading
An Ensemble Classifiers for Improved Prediction of Native-Non-Native Protein-Protein Interaction.
Pratiwi, Nor Kumalasari Caecar; Tayara, Hilal; Chong, Kil To.
Afiliação
  • Pratiwi NKC; Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea.
  • Tayara H; Department of Electrical Engineering, Telkom University, Bandung 40257, West Java, Indonesia.
  • Chong KT; School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea.
Int J Mol Sci ; 25(11)2024 May 29.
Article em En | MEDLINE | ID: mdl-38892144
ABSTRACT
In this study, we present an innovative approach to improve the prediction of protein-protein interactions (PPIs) through the utilization of an ensemble classifier, specifically focusing on distinguishing between native and non-native interactions. Leveraging the strengths of various base models, including random forest, gradient boosting, extreme gradient boosting, and light gradient boosting, our ensemble classifier integrates these diverse predictions using a logistic regression meta-classifier. Our model was evaluated using a comprehensive dataset generated from molecular dynamics simulations. While the gains in AUC and other metrics might seem modest, they contribute to a model that is more robust, consistent, and adaptable. To assess the effectiveness of various approaches, we compared the performance of logistic regression to four baseline models. Our results indicate that logistic regression consistently underperforms across all evaluated metrics. This suggests that it may not be well-suited to capture the complex relationships within this dataset. Tree-based models, on the other hand, appear to be more effective for problems involving molecular dynamics simulations. Extreme gradient boosting (XGBoost) and light gradient boosting (LightGBM) are optimized for performance and speed, handling datasets effectively and incorporating regularizations to avoid over-fitting. Our findings indicate that the ensemble method enhances the predictive capability of PPIs, offering a promising tool for computational biology and drug discovery by accurately identifying potential interaction sites and facilitating the understanding of complex protein functions within biological systems.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Mapeamento de Interação de Proteínas / Simulação de Dinâmica Molecular Idioma: En Revista: Int J Mol Sci Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Mapeamento de Interação de Proteínas / Simulação de Dinâmica Molecular Idioma: En Revista: Int J Mol Sci Ano de publicação: 2024 Tipo de documento: Article