Your browser doesn't support javascript.
loading
Bio-QSARs 2.0: Unlocking a new level of predictive power for machine learning-based ecotoxicity predictions by exploiting chemical and biological information.
Zubrod, Jochen P; Galic, Nika; Vaugeois, Maxime; Dreier, David A.
Afiliación
  • Zubrod JP; Zubrod Environmental Data Science, 76829 Landau, Germany.
  • Galic N; Syngenta Crop Protection AG, 4058 Basel, Switzerland.
  • Vaugeois M; Syngenta Crop Protection, LLC, Greensboro, NC, USA.
  • Dreier DA; Syngenta Crop Protection, LLC, Greensboro, NC, USA. Electronic address: david.dreier@syngenta.com.
Environ Int ; 186: 108607, 2024 Apr.
Article en En | MEDLINE | ID: mdl-38593686
ABSTRACT
Practical, legal, and ethical reasons necessitate the development of methods to replace animal experiments. Computational techniques to acquire information that traditionally relied on animal testing are considered a crucial pillar among these so-called new approach methodologies. In this light, we recently introduced the Bio-QSAR concept for multispecies aquatic toxicity regression tasks. These machine learning models, trained on both chemical and biological information, are capable of both cross-chemical and cross-species predictions. Here, we significantly extend these models' applicability. This was realized by increasing the quantity of training data by a factor of approximately 20, accomplished by considering both additional chemicals and aquatic organisms. Additionally, variable test durations and associated random effects were accommodated by employing a machine learning algorithm that combines tree-boosting with mixed-effects modeling (i.e., Gaussian Process Boosting). We also explored various biological descriptors including Dynamic Energy Budget model parameters, taxonomic distances, as well as genus-specific traits and investigated the inclusion of mode-of-action information. Through these efforts, we developed Bio-QSARs for fish and aquatic invertebrates with exceptional predictive power (R squared of up to 0.92 on independent test sets). Moreover, we made considerable strides to make models applicable for a range of use cases in environmental risk assessment as well as research and development of chemicals. Models were made fully explainable by implementing an algorithmic multicollinearity correction combined with SHapley Additive exPlanations. Furthermore, we devised novel approaches for applicability domain construction that take feature importance into account. We are hence confident these models, which are available via open access, will make a significant contribution towards the implementation of new approach methodologies and ultimately have the potential to support "Green Chemistry" and "Green Toxicology".
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Contexto en salud: 12_ODS3_hazardous_contamination / 15_ODS3_global_health_risks Problema de salud: 12_water_sanitation_hygiene / 15_technological_hazards Asunto principal: Relación Estructura-Actividad Cuantitativa / Peces / Aprendizaje Automático Límite: Animals Idioma: En Revista: Environ Int Año: 2024 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Contexto en salud: 12_ODS3_hazardous_contamination / 15_ODS3_global_health_risks Problema de salud: 12_water_sanitation_hygiene / 15_technological_hazards Asunto principal: Relación Estructura-Actividad Cuantitativa / Peces / Aprendizaje Automático Límite: Animals Idioma: En Revista: Environ Int Año: 2024 Tipo del documento: Article País de afiliación: Alemania
...