Author Correction: PLAS-20k: Extended Dataset of Protein-Ligand Affinities from MD Simulations for Machine Learning Applications.

Korlepara, Divya B; Vasavi, C S; Srivastava, Rakesh; Pal, Pradeep Kumar; Raza, Saalim H; Kumar, Vishal; Pandit, Shivam; Nair, Aathira G; Pandey, Sanjana; Sharma, Shubham; Jeurkar, Shruti; Thakran, Kavita; Jaglan, Reena; Verma, Shivangi; Ramachandran, Indhu; Chatterjee, Prathit; Nayar, Divya; Priyakumar, U Deva.

Sci Data ; 11(1): 730, 2024 Jul 04.

Article in English | MEDLINE | ID: mdl-38965269

PLAS-20k: Extended Dataset of Protein-Ligand Affinities from MD Simulations for Machine Learning Applications.

Korlepara, Divya B; C S, Vasavi; Srivastava, Rakesh; Pal, Pradeep Kumar; Raza, Saalim H; Kumar, Vishal; Pandit, Shivam; Nair, Aathira G; Pandey, Sanjana; Sharma, Shubham; Jeurkar, Shruti; Thakran, Kavita; Jaglan, Reena; Verma, Shivangi; Ramachandran, Indhu; Chatterjee, Prathit; Nayar, Divya; Priyakumar, U Deva.

Sci Data ; 11(1): 180, 2024 Feb 09.

Article in English | MEDLINE | ID: mdl-38336857

ABSTRACT

Computing binding affinities is of great importance in the drug discovery pipeline, and predicting them with advanced machine learning methods remains a major challenge because existing datasets and models do not consider the dynamic features of protein-ligand interactions. To this end, we have developed the PLAS-20k dataset, an extension of the previously developed PLAS-5k, with 97,500 independent simulations on a total of 19,500 different protein-ligand complexes. Our results show good correlation with the available experimental values, performing better than docking scores. This holds true even for a subset of ligands that follows Lipinski's rule, and for diverse clusters of complex structures, thereby highlighting the importance of the PLAS-20k dataset in developing new ML models. Our dataset is also more useful than docking for classifying strong and weak binders. Further, the OnionNet model has been retrained on the PLAS-20k dataset and is provided as a baseline for the prediction of binding affinities. We believe that large-scale MD-based datasets, along with trajectories, will form a new synergy, paving the way for accelerating drug discovery.

Subjects

Ligands, Proteins, Drug Discovery, Machine Learning, Protein Binding, Proteins/chemistry, Humans, Animals
