Evaluating ChatGPT's Capabilities on Orthopedic Training Examinations: An Analysis of New Image Processing Features.

Posner, Kevin M; Bakus, Cassandra; Basralian, Grace; Chester, Grace; Zeiman, Mallery; O'Malley, Geoffrey R; Klein, Gregg R

Posner, Kevin M; Bakus, Cassandra; Basralian, Grace; Chester, Grace; Zeiman, Mallery; O'Malley, Geoffrey R; Klein, Gregg R.

Afiliação

Posner KM; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA.
Bakus C; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA.
Basralian G; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA.
Chester G; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA.
Zeiman M; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA.
O'Malley GR; Department of Orthopedic Surgery, Hackensack University Medical Center, Hackensack, USA.
Klein GR; Department of Orthopedic Surgery, Hackensack University Medical Center, Hackensack, USA.

Cureus ; 16(3): e55945, 2024 Mar.

Article em En | MEDLINE | ID: mdl-38601421

ABSTRACT

ABSTRACT

Introduction The efficacy of integrating artificial intelligence (AI) models like ChatGPT into the medical field, specifically orthopedic surgery, has yet to be fully determined. The most recent adaptation of ChatGPT that has yet to be explored is its image analysis capabilities. This study assesses ChatGPT's performance in answering Orthopedic In-Training Examination (OITE) questions, including those that require image analysis. Methods Questions from the 2014, 2015, 2021, and 2022 AAOS OITE were screened for inclusion. All questions without images were entered into ChatGPT 3.5 and 4.0 twice. Questions that necessitated the use of images were only entered into ChatGPT 4.0 twice, as this is the only version of the system that can analyze images. The responses were recorded and compared to AAOS's correct answers, evaluating the AI's accuracy and precision. Results A total of 940 questions were included in the final analysis (457 questions with images and 483 questions without images). ChatGPT 4.0 performed significantly better on questions that did not require image analysis (67.81% vs 47.59%, p<0.001). Discussion While the use of AI in orthopedics is an intriguing possibility, this evaluation demonstrates how, even with the addition of image processing capabilities, ChatGPT still falls short in terms of its accuracy. As AI technology evolves, ongoing research is vital to harness AI's potential effectively, ensuring it complements rather than attempts to replace the nuanced skills of orthopedic surgeons.

Palavras-chave

artificial intelligence and education; orthopedic clinical practice; orthopedic education; orthopedic examinations; resident education

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Cureus Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Cureus Ano de publicação: 2024 Tipo de documento: Article