Your browser doesn't support javascript.
loading
Using large language models for safety-related table summarization in clinical study reports.
Landman, Rogier; Healey, Sean P; Loprinzo, Vittorio; Kochendoerfer, Ulrike; Winnier, Angela Russell; Henstock, Peter V; Lin, Wenyi; Chen, Aqiu; Rajendran, Arthi; Penshanwar, Sushant; Khan, Sheraz; Madhavan, Subha.
Afiliação
  • Landman R; Pfizer Research and Development, New York, NY 10001, United States.
  • Healey SP; Pfizer Research and Development, New York, NY 10001, United States.
  • Loprinzo V; Pfizer Research and Development, New York, NY 10001, United States.
  • Kochendoerfer U; Pfizer Research and Development, New York, NY 10001, United States.
  • Winnier AR; Pfizer Research and Development, New York, NY 10001, United States.
  • Henstock PV; Pfizer Research and Development, New York, NY 10001, United States.
  • Lin W; Pfizer Research and Development, New York, NY 10001, United States.
  • Chen A; Pfizer Research and Development, New York, NY 10001, United States.
  • Rajendran A; Pfizer Research and Development, New York, NY 10001, United States.
  • Penshanwar S; Pfizer Research and Development, New York, NY 10001, United States.
  • Khan S; Pfizer Research and Development, New York, NY 10001, United States.
  • Madhavan S; Pfizer Research and Development, New York, NY 10001, United States.
JAMIA Open ; 7(2): ooae043, 2024 Jul.
Article em En | MEDLINE | ID: mdl-38818116
ABSTRACT

Objectives:

The generation of structured documents for clinical trials is a promising application of large language models (LLMs). We share opportunities, insights, and challenges from a competitive challenge that used LLMs for automating clinical trial documentation. Materials and

Methods:

As part of a challenge initiated by Pfizer (organizer), several teams (participant) created a pilot for generating summaries of safety tables for clinical study reports (CSRs). Our evaluation framework used automated metrics and expert reviews to assess the quality of AI-generated documents.

Results:

The comparative analysis revealed differences in performance across solutions, particularly in factual accuracy and lean writing. Most participants employed prompt engineering with generative pre-trained transformer (GPT) models.

Discussion:

We discuss areas for improvement, including better ingestion of tables, addition of context and fine-tuning.

Conclusion:

The challenge results demonstrate the potential of LLMs in automating table summarization in CSRs while also revealing the importance of human involvement and continued research to optimize this technology.
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Idioma: En Revista: JAMIA Open Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Bases de dados: MEDLINE Idioma: En Revista: JAMIA Open Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Estados Unidos