Your browser doesn't support javascript.
loading
Chat-GPT on brain tumors: An examination of Artificial Intelligence/Machine Learning's ability to provide diagnoses and treatment plans for example neuro-oncology cases.
Kozel, Giovanni; Gurses, Muhammet Enes; Gecici, Neslihan Nisa; Gökalp, Elif; Bahadir, Siyar; Merenzon, Martin A; Shah, Ashish H; Komotar, Ricardo J; Ivan, Michael E.
Afiliação
  • Kozel G; Warren Alpert Medical School, Brown University, Providence, RI, USA.
  • Gurses ME; Department of Neurosurgery, Miller School of Medicine, University of Miami, Miami, FL, USA. Electronic address: megursesmd@gmail.com.
  • Gecici NN; Hacettepe University School of Medicine, Ankara, Turkey.
  • Gökalp E; Ankara University School of Medicine, Ankara, Turkey.
  • Bahadir S; Feinstein Institute, New York, USA.
  • Merenzon MA; Department of Neurosurgery, Yale University School of Medicine, New Haven, CT, USA.
  • Shah AH; Department of Neurosurgery, Miller School of Medicine, University of Miami, Miami, FL, USA.
  • Komotar RJ; Department of Neurosurgery, Miller School of Medicine, University of Miami, Miami, FL, USA.
  • Ivan ME; Department of Neurosurgery, Miller School of Medicine, University of Miami, Miami, FL, USA.
Clin Neurol Neurosurg ; 239: 108238, 2024 04.
Article em En | MEDLINE | ID: mdl-38507989
ABSTRACT

OBJECTIVE:

Assess the capabilities of ChatGPT-3.5 and 4 to provide accurate diagnoses, treatment options, and treatment plans for brain tumors in example neuro-oncology cases.

METHODS:

ChatGPT-3.5 and 4 were provided with twenty example neuro-oncology cases of brain tumors, all selected from medical textbooks. The artificial intelligence programs were asked to give a diagnosis, treatment option, and treatment plan for each of these twenty example cases. Team members first determined in which cases ChatGPT-3.5 and 4 provided the correct diagnosis or treatment plan. Twenty neurosurgeons from the researchers' institution then independently rated the diagnoses, treatment options, and treatment plans provided by both artificial intelligence programs for each of the twenty example cases, on a scale of one to ten, with ten being the highest score. To determine whether the difference between the scores of ChatGPT-3.5 and 4 was statistically significant, a paired t-test was conducted for the average scores given to the programs for each example case.

RESULTS:

In the initial analysis of correct responses, ChatGPT-4 had an accuracy of 85% for its diagnoses of example brain tumors and an accuracy of 75% for its provided treatment plans, while ChatGPT-3.5 only had an accuracy of 65% and 10%, respectively. The average scores given by the twenty independent neurosurgeons to ChatGPT-4 for its accuracy of diagnosis, provided treatment options, and provided treatment plan were 8.3, 8.4, and 8.5 out of 10, respectively, while ChatGPT-3.5's average scores for these categories of assessment were 5.9, 5.7, and 5.7. These differences in average score are statistically significant on a paired t-test, with a p-value of less than 0.001 for each difference.

CONCLUSIONS:

ChatGPT-4 demonstrates great promise as a diagnostic tool for brain tumors in neuro-oncology, as attested to by the program's performance in this study and its assessment by surveyed neurosurgeon reviewers.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Encefálicas / Inteligência Artificial Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Encefálicas / Inteligência Artificial Idioma: En Ano de publicação: 2024 Tipo de documento: Article