Your browser doesn't support javascript.
loading
Evaluating the Performance of Different Large Language Models on Health Consultation and Patient Education in Urolithiasis.
Song, Haifeng; Xia, Yi; Luo, Zhichao; Liu, Hui; Song, Yan; Zeng, Xue; Li, Tianjie; Zhong, Guangxin; Li, Jianxing; Chen, Ming; Zhang, Guangyuan; Xiao, Bo.
Afiliación
  • Song H; Department of Urology, Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, 168 Litang Rd, Beijing, 102218, China.
  • Xia Y; Institute of Urology, School of Clinical Medicine, Tsinghua University, Beijing, 102218, China.
  • Luo Z; Department of Urology, Zhongda Hospital, Southeast University, 87 Dingjiaqiao, Nanjing, 210009, China.
  • Liu H; School of Medicine, Southeast University, Nanjing, 210009, China.
  • Song Y; Department of Urology, Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, 168 Litang Rd, Beijing, 102218, China.
  • Zeng X; Institute of Urology, School of Clinical Medicine, Tsinghua University, Beijing, 102218, China.
  • Li T; Department of Urology, Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, 168 Litang Rd, Beijing, 102218, China.
  • Zhong G; Institute of Urology, School of Clinical Medicine, Tsinghua University, Beijing, 102218, China.
  • Li J; Department of Urology, Sheng Jing Hospital of China Medical University, Shenyang, 110000, China.
  • Chen M; Department of Urology, Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, 168 Litang Rd, Beijing, 102218, China.
  • Zhang G; Institute of Urology, School of Clinical Medicine, Tsinghua University, Beijing, 102218, China.
  • Xiao B; Department of Urology, Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, 168 Litang Rd, Beijing, 102218, China.
J Med Syst ; 47(1): 125, 2023 Nov 24.
Article en En | MEDLINE | ID: mdl-37999899
OBJECTIVES: To evaluate the effectiveness of four large language models (LLMs) (Claude, Bard, ChatGPT4, and New Bing) that have large user bases and significant social attention, in the context of medical consultation and patient education in urolithiasis. MATERIALS AND METHODS: In this study, we developed a questionnaire consisting of 21 questions and 2 clinical scenarios related to urolithiasis. Subsequently, clinical consultations were simulated for each of the four models to assess their responses to the questions. Urolithiasis experts then evaluated the model responses in terms of accuracy, comprehensiveness, ease of understanding, human care, and clinical case analysis ability based on a predesigned 5-point Likert scale. Visualization and statistical analyses were then employed to compare the four models and evaluate their performance. RESULTS: All models yielded satisfying performance, except for Bard, who failed to provide a valid response to Question 13. Claude consistently scored the highest in all dimensions compared with the other three models. ChatGPT4 ranked second in accuracy, with a relatively stable output across multiple tests, but shortcomings were observed in empathy and human caring. Bard exhibited the lowest accuracy and overall performance. Claude and ChatGPT4 both had a high capacity to analyze clinical cases of urolithiasis. Overall, Claude emerged as the best performer in urolithiasis consultations and education. CONCLUSION: Claude demonstrated superior performance compared with the other three in urolithiasis consultation and education. This study highlights the remarkable potential of LLMs in medical health consultations and patient education, although professional review, further evaluation, and modifications are still required.
Asunto(s)
Palabras clave

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Educación del Paciente como Asunto / Urolitiasis Límite: Humans Idioma: En Revista: J Med Syst Año: 2023 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Educación del Paciente como Asunto / Urolitiasis Límite: Humans Idioma: En Revista: J Med Syst Año: 2023 Tipo del documento: Article País de afiliación: China