Predictive Capability of QSAR Models Based on the CompTox Zebrafish Embryo Assays: An Imbalanced Classification Problem.
Lovric, Mario; Malev, Olga; Klobucar, Göran; Kern, Roman; Liu, Jay J; Lucic, Bono.
Affiliation
  • Lovric M; Know-Center, Inffeldgasse 13, 8010 Graz, Austria.
  • Malev O; Ruder Boskovic Institute, P.O. Box 180, 10002 Zagreb, Croatia.
  • Klobucar G; Ruder Boskovic Institute, P.O. Box 180, 10002 Zagreb, Croatia.
  • Kern R; Department of Biology, Faculty of Science, University of Zagreb, Rooseveltov Trg 6, 10000 Zagreb, Croatia.
  • Liu JJ; Department of Biology, Faculty of Science, University of Zagreb, Rooseveltov Trg 6, 10000 Zagreb, Croatia.
  • Lucic B; Know-Center, Inffeldgasse 13, 8010 Graz, Austria.
Molecules; 26(6), 2021 Mar 15.
Article in English | MEDLINE | ID: mdl-33803931
ABSTRACT
The CompTox Chemistry Dashboard (ToxCast) contains one of the largest public databases on zebrafish (Danio rerio) developmental toxicity. The data consist of 19 toxicological endpoints for 1018 unique compounds measured in relatively low concentration ranges. The endpoints relate to developmental effects occurring in dechorionated zebrafish embryos for 120 hours post fertilization, monitored via gross malformations and mortality. We report the predictive capability of 209 quantitative structure-activity relationship (QSAR) models developed by machine learning methods, using penalization techniques and diverse model quality metrics to cope with the imbalanced endpoints. These QSAR models were generated to test how well the imbalanced classification (toxic or non-toxic) endpoints could be predicted, regardless of which of three algorithms is used: logistic regression, multi-layer perceptron, or random forests. Additionally, the QSAR toxicity models were developed starting from sets of classical molecular descriptors, structural fingerprints, and their combinations. Only 8 of the 209 models exceeded a Matthews correlation coefficient of 0.20, the value defined a priori as the threshold for acceptable model quality on the test sets. The best models were obtained for the endpoints mortality (MORT), ActivityScore, and JAW (deformation). The low predictability of the QSAR models developed from the zebrafish embryotoxicity data in the database is mainly due to the higher sensitivity of the 19 endpoint measurements carried out on dechorionated embryos at low concentrations.
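To illustrate why the abstract judges models by the Matthews correlation coefficient rather than by accuracy, the following is a minimal sketch, not the authors' code. The confusion-matrix counts are invented for illustration (a hypothetical test set of 200 compounds with 10% toxic), and the helper names are assumptions. Only the 0.20 threshold comes from the abstract.

```python
import math

MCC_THRESHOLD = 0.20  # acceptance threshold stated in the abstract


def matthews_corrcoef(tp, fp, tn, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when the denominator is zero (for example, when the
    model predicts a single class), a common convention.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


def accuracy(tp, fp, tn, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + fp + tn + fn)


# Hypothetical counts, NOT taken from the paper.
# Model A labels every compound non-toxic.
always_non_toxic = {"tp": 0, "fp": 0, "tn": 180, "fn": 20}
# Model B finds 8 of the 20 toxic compounds at the cost of 20 false alarms.
weak_model = {"tp": 8, "fp": 20, "tn": 160, "fn": 12}

for name, counts in [("always non-toxic", always_non_toxic),
                     ("weak model", weak_model)]:
    mcc = matthews_corrcoef(**counts)
    verdict = "passes" if mcc > MCC_THRESHOLD else "fails"
    print(f"{name}: accuracy={accuracy(**counts):.2f}, "
          f"MCC={mcc:.2f} ({verdict} the {MCC_THRESHOLD} threshold)")
```

In this invented example, the trivial model has the higher accuracy but an MCC of zero, while the model that actually identifies some toxic compounds has lower accuracy and a positive MCC. This is the kind of distortion that motivates imbalance-aware metrics for endpoints where toxic outcomes are rare.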
Full text: 1 Collection: 01-international Database: MEDLINE Main subject: Water Pollutants, Chemical / Quantitative Structure-Activity Relationship / Embryo, Nonmammalian Study type: Prognostic_studies / Risk_factors_studies Limits: Animals Language: En Journal: Molecules Journal subject: BIOLOGY Year: 2021 Document type: Article Affiliation country: Austria