Your browser doesn't support javascript.
loading
Will ChatGPT be Able to Replace a Spine Surgeon in the Clinical Setting?
Chalhoub, Ralph; Mouawad, Antoine; Aoun, Marven; Daher, Mohammad; El-Sett, Pierre; Kreichati, Gaby; Kharrat, Khalil; Sebaaly, Amer.
Afiliação
  • Chalhoub R; Saint Joseph University, Faculty of medicine, Beirut, Lebanon.
  • Mouawad A; Saint Joseph University, Faculty of medicine, Beirut, Lebanon.
  • Aoun M; Saint Joseph University, Faculty of medicine, Beirut, Lebanon.
  • Daher M; Saint Joseph University, Faculty of medicine, Beirut, Lebanon; Department of Orthopedic Surgery, Brown University, Providence, Rhode Island, USA.
  • El-Sett P; Saint Joseph University, Faculty of medicine, Beirut, Lebanon; Department of Orthopedic Surgery, Hotel Dieu de France Hospital, Beirut, Lebanon.
  • Kreichati G; Saint Joseph University, Faculty of medicine, Beirut, Lebanon; Department of Orthopedic Surgery, Hotel Dieu de France Hospital, Beirut, Lebanon.
  • Kharrat K; Saint Joseph University, Faculty of medicine, Beirut, Lebanon; Department of Orthopedic Surgery, Hotel Dieu de France Hospital, Beirut, Lebanon.
  • Sebaaly A; Saint Joseph University, Faculty of medicine, Beirut, Lebanon; Department of Orthopedic Surgery, Hotel Dieu de France Hospital, Beirut, Lebanon. Electronic address: amersebaaly@hotmail.com.
World Neurosurg ; 185: e648-e652, 2024 05.
Article em En | MEDLINE | ID: mdl-38417624
ABSTRACT

OBJECTIVE:

This study evaluates ChatGPT's performance in diagnosing and managing spinal pathologies.

METHODS:

Patients underwent evaluation by two spine surgeons (and the case was discussed and a consensus was reached) and ChatGPT. Patient data, including demographics, symptoms, and available imaging reports, were collected using a standardized form. This information was then processed by ChatGPT for diagnosis and management recommendations. The study assessed ChatGPT's diagnostic and management accuracy through descriptive statistics, comparing its performance to that of experienced spine specialists.

RESULTS:

A total of 97 patients with various spinal pathologies participated in the study, with a gender distribution of 40 males and 57 females. ChatGPT achieved a 70% diagnostic accuracy rate and provided suitable management recommendations for 95% of patients. However, it struggled with certain pathologies, misdiagnosing 100% of vertebral trauma and facet joint syndrome, 40% of spondylolisthesis, stenosis, and scoliosis, and 22% of disc-related pathologies. Furthermore, ChatGPT's management recommendations were poor in 53% of cases, often failing to suggest the most appropriate treatment options and occasionally providing incomplete advice.

CONCLUSIONS:

While helpful in the medical field, ChatGPT falls short in providing reliable management recommendations, with a 30% misdiagnosis rate and 53% mismanagement rate in our study. Its limitations, including reliance on outdated data and the inability to interactively gather patient information, must be acknowledged. Surgeons should use ChatGPT cautiously as a supplementary tool rather than a substitute for their clinical expertise, as the complexities of healthcare demand human judgment and interaction.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Doenças da Coluna Vertebral Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Doenças da Coluna Vertebral Idioma: En Ano de publicação: 2024 Tipo de documento: Article