Your browser doesn't support javascript.
loading
Assessing the performance of ChatGPT's responses to questions related to epilepsy: A cross-sectional study on natural language processing and medical information retrieval.
Kim, Hyun-Woo; Shin, Dong-Hyeon; Kim, Jiyoung; Lee, Gha-Hyun; Cho, Jae Wook.
Afiliação
  • Kim HW; Department of Neurology, Pusan National University Yangsan Hospital, 50612 Geumoro 20, Yangsan, South Korea.
  • Shin DH; Department of Neurology, Pusan National University Yangsan Hospital, 50612 Geumoro 20, Yangsan, South Korea.
  • Kim J; Department of Neurology, Pusan National University Hospital, Busan, South Korea; Pusan National University School of Medicine, Research Institute for Convergence of Biomedical Science and Technology, Yangsan, South Korea.
  • Lee GH; Department of Neurology, Pusan National University Hospital, Busan, South Korea; Pusan National University School of Medicine, Research Institute for Convergence of Biomedical Science and Technology, Yangsan, South Korea.
  • Cho JW; Department of Neurology, Pusan National University Yangsan Hospital, 50612 Geumoro 20, Yangsan, South Korea; Pusan National University School of Medicine, Research Institute for Convergence of Biomedical Science and Technology, Yangsan, South Korea. Electronic address: sleepcho@pusan.ac.kr.
Seizure ; 114: 1-8, 2024 Jan.
Article em En | MEDLINE | ID: mdl-38007922
ABSTRACT

BACKGROUND:

Epilepsy is a neurological condition marked by frequent seizures and various cognitive and psychological effects. Reliable information is essential for effective treatment. Natural language processing models like ChatGPT are increasingly used in healthcare for information access and data analysis, making it crucial to assess their accuracy.

OBJECTIVE:

This study aimed to investigate the accuracy of ChatGPT in providing educational information related to epilepsy.

METHODS:

We compared the answers from ChatGPT-4 and ChatGPT-3.5 to 57 common epilepsy questions based on the Korean Epilepsy Society's "Epilepsy Patient and Caregiver Guide." Two epileptologists reviewed the responses, with a third serving as an arbiter in cases of disagreement.

RESULTS:

Out of 57 questions, 40 responses from ChatGPT-4 had "sufficient educational value," 16 were "correct but inadequate," and one was "mixed with correct and incorrect" information. No answers were entirely incorrect. GPT-4 generally outperformed GPT-3.5 and was often on par with or better than the official guide.

CONCLUSIONS:

ChatGPT-4 shows promise as a tool for delivering reliable epilepsy-related information and could help alleviate the educational burden on healthcare professionals. Further research is needed to explore the benefits and limitations of using such models in medical contexts.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Processamento de Linguagem Natural / Epilepsia Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Processamento de Linguagem Natural / Epilepsia Idioma: En Ano de publicação: 2024 Tipo de documento: Article