Artificial intelligence-based data extraction for next generation risk assessment: Is fine-tuning of a large language model worth the effort?

Sonnenburg, Anna; van der Lugt, Benthe; Rehn, Johannes; Wittkowski, Paul; Bech, Karsten; Padberg, Florian; Eleftheriadou, Dimitra; Dobrikov, Todor; Bouwmeester, Hans; Mereu, Carla; Graf, Ferdinand; Kneuer, Carsten; Kramer, Nynke I; Blümmel, Tilmann

Sonnenburg, Anna; van der Lugt, Benthe; Rehn, Johannes; Wittkowski, Paul; Bech, Karsten; Padberg, Florian; Eleftheriadou, Dimitra; Dobrikov, Todor; Bouwmeester, Hans; Mereu, Carla; Graf, Ferdinand; Kneuer, Carsten; Kramer, Nynke I; Blümmel, Tilmann.

Afiliação

Sonnenburg A; Department of Pesticides Safety, German Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, Berlin 10589, Germany. Electronic address: anna.sonnenburg@bfr.bund.de.
van der Lugt B; Division of Toxicology, Wageningen University & Research, Stippeneng 4, Wageningen 6708 WE, the Netherlands; Wageningen Food Safety Research, Wageningen University & Research, Akkermaalsbos 2 6708WB Wageningen, The Netherlands.
Rehn J; d-fine GmbH, An der Hauptwache 7, Frankfurt am Main 60313, Germany.
Wittkowski P; Department of Pesticides Safety, German Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, Berlin 10589, Germany.
Bech K; Department of Pesticides Safety, German Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, Berlin 10589, Germany.
Padberg F; Department of Pesticides Safety, German Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, Berlin 10589, Germany.
Eleftheriadou D; Department of Pesticides Safety, German Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, Berlin 10589, Germany.
Dobrikov T; d-fine GmbH, An der Hauptwache 7, Frankfurt am Main 60313, Germany.
Bouwmeester H; Division of Toxicology, Wageningen University & Research, Stippeneng 4, Wageningen 6708 WE, the Netherlands.
Mereu C; d-fine GmbH, An der Hauptwache 7, Frankfurt am Main 60313, Germany.
Graf F; d-fine GmbH, An der Hauptwache 7, Frankfurt am Main 60313, Germany.
Kneuer C; Department of Pesticides Safety, German Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, Berlin 10589, Germany.
Kramer NI; Division of Toxicology, Wageningen University & Research, Stippeneng 4, Wageningen 6708 WE, the Netherlands.
Blümmel T; d-fine GmbH, An der Hauptwache 7, Frankfurt am Main 60313, Germany.

Toxicology ; 508: 153933, 2024 Nov.

Article em En | MEDLINE | ID: mdl-39181527

ABSTRACT

ABSTRACT

To underpin scientific evaluations of chemical risks, agencies such as the European Food Safety Authority (EFSA) heavily rely on the outcome of systematic reviews, which currently require extensive manual effort. One specific challenge constitutes the meaningful use of vast amounts of valuable data from new approach methodologies (NAMs) which are mostly reported in an unstructured way in the scientific literature. In the EFSA-initiated project 'AI4NAMS', the potential of large language models (LLMs) was explored. Models from the GPT family, where GPT refers to Generative Pre-trained Transformer, were used for searching, extracting, and integrating data from scientific publications for NAM-based risk assessment. A case study on bisphenol A (BPA), a substance of very high concern due to its adverse effects on human health, focused on the structured extraction of information on test systems measuring biologic activities of BPA. Fine-tuning of a GPT-3 model (Curie base model) for extraction tasks was tested and the performance of the fine-tuned model was compared to the performance of a ready-to-use model (text-davinci-002). To update findings from the AI4NAMS project and to check for technical progress, the fine-tuning exercise was repeated and a newer ready-to-use model (text-davinci-003) served as comparison. In both cases, the fine-tuned Curie model was found to be superior to the ready-to-use model. Performance improvement was also obvious between text-davinci-002 and the newer text-davinci-003. Our findings demonstrate how fine-tuning and the swift general technical development improve model performance and contribute to the growing number of investigations on the use of AI in scientific and regulatory tasks.

Assuntos

Inteligência Artificial; Compostos Benzidrílicos; Fenóis; Medição de Risco/métodos; Compostos Benzidrílicos/toxicidade; Humanos; Fenóis/toxicidade; Mineração de Dados/métodos

Palavras-chave

Artificial intelligence; Automated data extraction; Fine-tuning; Large Language models; Risk Assessment; Systematic literature review

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fenóis / Compostos Benzidrílicos / Inteligência Artificial Limite: Humans Idioma: En Revista: Toxicology Ano de publicação: 2024 Tipo de documento: Article País de publicação: Irlanda

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google