Your browser doesn't support javascript.
loading
Curated Data In - Trustworthy In Silico Models Out: The Impact of Data Quality on the Reliability of Artificial Intelligence Models as Alternatives to Animal Testing.
Alves, Vinicius M; Auerbach, Scott S; Kleinstreuer, Nicole; Rooney, John P; Muratov, Eugene N; Rusyn, Ivan; Tropsha, Alexander; Schmitt, Charles.
Afiliação
  • Alves VM; Office of Data Science, Division of the National Toxicology Program (DNTP), 6857National Institute of Environmental Health Sciences (NIEHS), Durham, NC, USA.
  • Auerbach SS; Toxinformatics Group, Predictive Toxicology Branch, DNTP, NIEHS, Durham, NC, USA.
  • Kleinstreuer N; National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Scientific Director's Office, DNTP, NIEHS, Durham, NC, USA.
  • Rooney JP; Integrated Laboratory Systems, LLC, Morrisville, NC, USA.
  • Muratov EN; Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, 2331The University of North Carolina at Chapel Hill, NC, USA.
  • Rusyn I; Department of Pharmaceutical Sciences, Federal University of Paraiba, Joao Pessoa, Paraiba, Brazil.
  • Tropsha A; Department of Veterinary Integrative Biosciences, College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, TX, USA.
  • Schmitt C; Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, 2331The University of North Carolina at Chapel Hill, NC, USA.
Altern Lab Anim ; 49(3): 73-82, 2021 May.
Article em En | MEDLINE | ID: mdl-34233495
ABSTRACT
New Approach Methodologies (NAMs) that employ artificial intelligence (AI) for predicting adverse effects of chemicals have generated optimistic expectations as alternatives to animal testing. However, the major underappreciated challenge in developing robust and predictive AI models is the impact of the quality of the input data on the model accuracy. Indeed, poor data reproducibility and quality have been frequently cited as factors contributing to the crisis in biomedical research, as well as similar shortcomings in the fields of toxicology and chemistry. In this article, we review the most recent efforts to improve confidence in the robustness of toxicological data and investigate the impact that data curation has on the confidence in model predictions. We also present two case studies demonstrating the effect of data curation on the performance of AI models for predicting skin sensitisation and skin irritation. We show that, whereas models generated with uncurated data had a 7-24% higher correct classification rate (CCR), the perceived performance was, in fact, inflated owing to the high number of duplicates in the training set. We assert that data curation is a critical step in building computational models, to help ensure that reliable predictions of chemical toxicity are achieved through use of the models.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Alternativas aos Testes com Animais Tipo de estudo: Prognostic_studies Limite: Animals Idioma: En Revista: Altern Lab Anim Ano de publicação: 2021 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Alternativas aos Testes com Animais Tipo de estudo: Prognostic_studies Limite: Animals Idioma: En Revista: Altern Lab Anim Ano de publicação: 2021 Tipo de documento: Article País de afiliação: Estados Unidos