Your browser doesn't support javascript.
loading
PLAS-20k: Extended Dataset of Protein-Ligand Affinities from MD Simulations for Machine Learning Applications.
Korlepara, Divya B; C S, Vasavi; Srivastava, Rakesh; Pal, Pradeep Kumar; Raza, Saalim H; Kumar, Vishal; Pandit, Shivam; Nair, Aathira G; Pandey, Sanjana; Sharma, Shubham; Jeurkar, Shruti; Thakran, Kavita; Jaglan, Reena; Verma, Shivangi; Ramachandran, Indhu; Chatterjee, Prathit; Nayar, Divya; Priyakumar, U Deva.
Afiliación
  • Korlepara DB; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • C S V; Divison of Physics, School of Advanced Sciences, Vellore Institute of Technology, Chennai, 600127, India.
  • Srivastava R; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Pal PK; Department of Artificial Intelligence, School of Artificial Intelligence, Amrita Vishwa Vidyapeetham, Bengaluru, 560035, India.
  • Raza SH; Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, 500032, India.
  • Kumar V; Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, 500032, India.
  • Pandit S; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Nair AG; Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, 500032, India.
  • Pandey S; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Sharma S; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Jeurkar S; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Thakran K; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Jaglan R; Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, 500032, India.
  • Verma S; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Ramachandran I; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Chatterjee P; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Nayar D; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
  • Priyakumar UD; IHub-Data, International Institute of Information Technology, Hyderabad, 500032, India.
Sci Data ; 11(1): 180, 2024 Feb 09.
Article en En | MEDLINE | ID: mdl-38336857
ABSTRACT
Computing binding affinities is of great importance in drug discovery pipeline and its prediction using advanced machine learning methods still remains a major challenge as the existing datasets and models do not consider the dynamic features of protein-ligand interactions. To this end, we have developed PLAS-20k dataset, an extension of previously developed PLAS-5k, with 97,500 independent simulations on a total of 19,500 different protein-ligand complexes. Our results show good correlation with the available experimental values, performing better than docking scores. This holds true even for a subset of ligands that follows Lipinski's rule, and for diverse clusters of complex structures, thereby highlighting the importance of PLAS-20k dataset in developing new ML models. Along with this, our dataset is also beneficial in classifying strong and weak binders compared to docking. Further, OnionNet model has been retrained on PLAS-20k dataset and is provided as a baseline for the prediction of binding affinities. We believe that large-scale MD-based datasets along with trajectories will form new synergy, paving the way for accelerating drug discovery.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Ligandos Tipo de estudio: Prognostic_studies Límite: Animals / Humans Idioma: En Revista: Sci Data Año: 2024 Tipo del documento: Article País de afiliación: India Pais de publicación: Reino Unido

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Ligandos Tipo de estudio: Prognostic_studies Límite: Animals / Humans Idioma: En Revista: Sci Data Año: 2024 Tipo del documento: Article País de afiliación: India Pais de publicación: Reino Unido