Large Language Model Influence on Management Reasoning: A Randomized Controlled Trial.

Goh, Ethan; Gallo, Robert; Strong, Eric; Weng, Yingjie; Kerman, Hannah; Freed, Jason; Cool, Joséphine A; Kanjee, Zahir; Lane, Kathleen P; Parsons, Andrew S; Ahuja, Neera; Horvitz, Eric; Yang, Daniel; Milstein, Arnold; Olson, Andrew P J; Hom, Jason; Chen, Jonathan H; Rodman, Adam

Goh, Ethan; Gallo, Robert; Strong, Eric; Weng, Yingjie; Kerman, Hannah; Freed, Jason; Cool, Joséphine A; Kanjee, Zahir; Lane, Kathleen P; Parsons, Andrew S; Ahuja, Neera; Horvitz, Eric; Yang, Daniel; Milstein, Arnold; Olson, Andrew P J; Hom, Jason; Chen, Jonathan H; Rodman, Adam.

Afiliação

Goh E; Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA.
Gallo R; Stanford Clinical Excellence Research Center, Stanford University, Stanford, CA.
Strong E; Center for Innovation to Implementation, VA Palo Alto Health Care System, PA, CA.
Weng Y; Stanford University School of Medicine, Stanford, CA.
Kerman H; Quantitative Sciences Unit, Stanford University School of Medicine, Stanford, CA.
Freed J; Beth Israel Deaconess Medical Center, Boston, MA.
Cool JA; Harvard Medical School, Boston, MA.
Kanjee Z; Beth Israel Deaconess Medical Center, Boston, MA.
Lane KP; Beth Israel Deaconess Medical Center, Boston, MA.
Parsons AS; Harvard Medical School, Boston, MA.
Ahuja N; Beth Israel Deaconess Medical Center, Boston, MA.
Horvitz E; Harvard Medical School, Boston, MA.
Yang D; University of Minnesota Medical School, Minneapolis, MN.
Milstein A; University of Virginia, School of Medicine, Charlottesville, VA.
Olson APJ; Stanford University School of Medicine, Stanford, CA.
Hom J; Microsoft, Redmond, WA.
Chen JH; Stanford Institute for Human-Centered Artificial Intelligence, Stanford, CA.
Rodman A; Kaiser Permanente, Oakland, CA.

medRxiv ; 2024 Aug 07.

Article em En | MEDLINE | ID: mdl-39148822

ABSTRACT

ABSTRACT

Importance Large language model (LLM) artificial intelligence (AI) systems have shown promise in diagnostic reasoning, but their utility in management reasoning with no clear right answers is unknown.

Objective:

To determine whether LLM assistance improves physician performance on open-ended management reasoning tasks compared to conventional resources.

Design:

Prospective, randomized controlled trial conducted from 30 November 2023 to 21 April 2024.

Setting:

Multi-institutional study from Stanford University, Beth Israel Deaconess Medical Center, and the University of Virginia involving physicians from across the United States.

Participants:

92 practicing attending physicians and residents with training in internal medicine, family medicine, or emergency medicine. Intervention Five expert-developed clinical case vignettes were presented with multiple open-ended management questions and scoring rubrics created through a Delphi process. Physicians were randomized to use either GPT-4 via ChatGPT Plus in addition to conventional resources (e.g., UpToDate, Google), or conventional resources alone. Main Outcomes and

Measures:

The primary outcome was difference in total score between groups on expert-developed scoring rubrics. Secondary outcomes included domain-specific scores and time spent per case.

Results:

Physicians using the LLM scored higher compared to those using conventional resources (mean difference 6.5 %, 95% CI 2.7-10.2, p<0.001). Significant improvements were seen in management decisions (6.1%, 95% CI 2.5-9.7, p=0.001), diagnostic decisions (12.1%, 95% CI 3.1-21.0, p=0.009), and case-specific (6.2%, 95% CI 2.4-9.9, p=0.002) domains. GPT-4 users spent more time per case (mean difference 119.3 seconds, 95% CI 17.4-221.2, p=0.02). There was no significant difference between GPT-4-augmented physicians and GPT-4 alone (-0.9%, 95% CI -9.0 to 7.2, p=0.8). Conclusions and Relevance LLM assistance improved physician management reasoning compared to conventional resources, with particular gains in contextual and patient-specific decision-making. These findings indicate that LLMs can augment management decision-making in complex cases. Trial registration ClinicalTrials.gov Identifier NCT06208423; https//classic.clinicaltrials.gov/ct2/show/NCT06208423.

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article