Best of both worlds: An expansion of the state of the art pKa model with data from three industrial partners.
Mol Inform
; 43(10): e202400088, 2024 Oct.
Article
in En
| MEDLINE
| ID: mdl-39031889
ABSTRACT
In a unique collaboration between Simulations Plus and several industrial partners, we were able to develop a new version 11.0 of the previously published in silico pKa model, S+pKa, with considerably improved prediction accuracy. The model's training set was vastly expanded by large amounts of experimental data obtained from F. Hoffmann-La Roche AG, Genentech Inc., and the Crop Science division of Bayer AG. The previous v7.0 of S+pKa was trained on data from public sources and the Pharmaceutical division of Bayer AG. The model has shown dramatic improvements in predictive accuracy when externally validated on three new contributor compound sets. Less expected was v11.0's improvement in prediction on new compounds developed at Bayer Pharma after v7.0 was released (2013-2023), even without contributing additional data to v11.0. We illustrate chemical space coverage by chemistries encountered in the five domains, public and industrial, outline model construction, and discuss factors contributing to model's success.
Key words
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Models, Chemical
Language:
En
Journal:
Mol Inform
Year:
2024
Document type:
Article
Affiliation country:
United States
Country of publication:
Germany