Using Large Language Models to Support Content Analysis: A Case Study of ChatGPT for Adverse Event Detection.

Leas, Eric C; Ayers, John W; Desai, Nimit; Dredze, Mark; Hogarth, Michael; Smith, Davey M

Leas, Eric C; Ayers, John W; Desai, Nimit; Dredze, Mark; Hogarth, Michael; Smith, Davey M.

Afiliación

Leas EC; Herbert Wertheim School of Public Health and Human Longevity Science, University of California San Diego, La Jolla, CA, United States.
Ayers JW; Qualcomm Institute, University of California San Diego, La Jolla, CA, United States.
Desai N; Qualcomm Institute, University of California San Diego, La Jolla, CA, United States.
Dredze M; Division of Infectious Diseases and Global Public Health, Department of Medicine, University of California San Diego, La Jolla, CA, United States.
Hogarth M; Altman Clinical Translational Research Institute, University of California San Diego, La Jolla, CA, United States.
Smith DM; Qualcomm Institute, University of California San Diego, La Jolla, CA, United States.

J Med Internet Res ; 26: e52499, 2024 May 02.

Article en En | MEDLINE | ID: mdl-38696245

ABSTRACT

ABSTRACT

This study explores the potential of using large language models to assist content analysis by conducting a case study to identify adverse events (AEs) in social media posts. The case study compares ChatGPT's performance with human annotators' in detecting AEs associated with delta-8-tetrahydrocannabinol, a cannabis-derived product. Using the identical instructions given to human annotators, ChatGPT closely approximated human results, with a high degree of agreement noted 94.4% (9436/10,000) for any AE detection (Fleiss κ=0.95) and 99.3% (9931/10,000) for serious AEs (κ=0.96). These findings suggest that ChatGPT has the potential to replicate human annotation accurately and efficiently. The study recognizes possible limitations, including concerns about the generalizability due to ChatGPT's training data, and prompts further research with different models, data sources, and content analysis tasks. The study highlights the promise of large language models for enhancing the efficiency of biomedical research.

Asunto(s)

Medios de Comunicación Sociales; Humanos; Medios de Comunicación Sociales/estadística & datos numéricos; Dronabinol/efectos adversos; Procesamiento de Lenguaje Natural

Palabras clave

AI; ChatGPT; LLM; adverse events; annotation; artificial intelligence; cannabis; delta-8-THC; delta-8-tetrahydrocannabiol; large language model; text analysis

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Medios de Comunicación Sociales Límite: Humans Idioma: En Revista: J Med Internet Res Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google