Your browser doesn't support javascript.
loading
A comparison of molecular representations for lipophilicity quantitative structure-property relationships with results from the SAMPL6 logP Prediction Challenge.
Lui, Raymond; Guan, Davy; Matthews, Slade.
Afiliación
  • Lui R; Pharmacoinformatics Laboratory, Discipline of Pharmacology, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW, 2006, Australia.
  • Guan D; Pharmacoinformatics Laboratory, Discipline of Pharmacology, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW, 2006, Australia.
  • Matthews S; Pharmacoinformatics Laboratory, Discipline of Pharmacology, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW, 2006, Australia. slade.matthews@sydney.edu.au.
J Comput Aided Mol Des ; 34(5): 523-534, 2020 05.
Article en En | MEDLINE | ID: mdl-31933037
ABSTRACT
Effective representation of a molecule is required to develop useful quantitative structure-property relationships (QSPR) for accurate prediction of chemical properties. The octanol-water partition coefficient logP, a measure of lipophilicity, is an important property for pharmacological and toxicological endpoints used in the pharmaceutical and regulatory spheres. We compare physicochemical descriptors, structural keys, and circular fingerprints in their ability to effectively represent a chemical space and characterise molecular features to correlate with lipophilicity. Exploratory landscape continuity analyses revealed that whole-molecule physicochemical descriptors could map together compounds that were similar in both molecular features and logP, indicating higher potential for use in logP QSPRs compared to the substructural approach of structural keys and circular fingerprints. Indeed, logP QSPR models parameterised by physicochemical descriptors consistently performed with the lowest error. Our best performing model was a stochastic gradient descent-optimised multilinear regression with 1438 descriptors, returning an internal benchmark RMSE of 1.03 log units. This corroborates the well-established notion that lipophilicity is an additive, whole-molecule property. We externally tested the model by participating in the 2019 SAMPL6 logP Prediction Challenge and blindly predicting for 11 protein kinase inhibitor fragment-like molecules. Our model returned an RMSE of 0.49 log units, placing eighth overall and third in the empirical methods category (submission ID 'hdpuj'). Permutation feature importance analyses revealed that physicochemical descriptors could characterise predictive molecular features highly relevant to the kinase inhibitor fragment-like molecules.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Agua / Relación Estructura-Actividad Cuantitativa / Inhibidores de Proteínas Quinasas / Modelos Químicos Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: J Comput Aided Mol Des Asunto de la revista: BIOLOGIA MOLECULAR / ENGENHARIA BIOMEDICA Año: 2020 Tipo del documento: Article País de afiliación: Australia

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Agua / Relación Estructura-Actividad Cuantitativa / Inhibidores de Proteínas Quinasas / Modelos Químicos Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: J Comput Aided Mol Des Asunto de la revista: BIOLOGIA MOLECULAR / ENGENHARIA BIOMEDICA Año: 2020 Tipo del documento: Article País de afiliación: Australia