Your browser doesn't support javascript.
loading
[Efficiency of different large language models in China in response to consultations about PCa-related perioperative nursing and health education].
Tan, Xiao-Wen; Chen, Wen-Fang; Wang, Na-Na; Li, Hui-Yu; Li, Juan; Cao, Yu-Mei; Zhu, Meng-Qi; Li, Kun; Zhang, Ting-Ling; Fu, Dian.
Afiliación
  • Tan XW; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Chen WF; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Wang NN; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Li HY; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Li J; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Cao YM; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Zhu MQ; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Li K; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Zhang TL; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
  • Fu D; Department of Urology, General Hospital of Eastern Theater Command, Nanjing, Jiangsu 210002, China.
Zhonghua Nan Ke Xue ; 30(2): 151-156, 2024 Feb.
Article en Zh | MEDLINE | ID: mdl-39177349
ABSTRACT

OBJECTIVE:

To evaluate the efficiency of the four domestic language models, ERNIE Bot, ChatGLM2, Spark Desk and Qwen-14B-Chat, all with a massive user base and significant social attention, in response to consultations about PCa-related perioperative nursing and health education.

METHODS:

We designed a questionnaire that includes 15 questions commonly concerned by patients undergoing radical prostatectomy and 2 common nursing cases, and inputted the questions into each of the four language models for simulation consultation. Three nursing experts assessed the model responses based on a pre-designed Likert 5-point scale in terms of accuracy, comprehensiveness, understandability, humanistic care, and case analysis. We evaluated and compared the performance of the four models using visualization tools and statistical analyses.

RESULTS:

All the models generated high-quality texts with no misleading information and exhibited satisfactory performance. Qwen-14B-Chat scored the highest in all aspects and showed relatively stable outputs in multiple tests compared with ChatGLM2. Spark Desk performed well in terms of understandability but lacked comprehensiveness and humanistic care. Both Qwen-14B-Chat and ChatGLM2 demonstrated excellent performance in case analysis. The overall performance of ERNIE Bot was slightly inferior. All things considered, Qwen-14B-Chat was superior to the other three models in consultations about PCa-related perioperative nursing and health education.

CONCLUSION:

In PCa-related perioperative nursing, large language models represented by Qwen-14B-Chat are expected to become powerful auxiliary tools to provide patients with more medical expertise and information support, so as to improve the patient compliance and the quality of clinical treatment and nursing.
Asunto(s)
Palabras clave
Buscar en Google
Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Enfermería Perioperatoria Límite: Humans / Male País/Región como asunto: Asia Idioma: Zh Revista: Zhonghua Nan Ke Xue Asunto de la revista: MEDICINA REPRODUTIVA Año: 2024 Tipo del documento: Article País de afiliación: China
Buscar en Google
Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Enfermería Perioperatoria Límite: Humans / Male País/Región como asunto: Asia Idioma: Zh Revista: Zhonghua Nan Ke Xue Asunto de la revista: MEDICINA REPRODUTIVA Año: 2024 Tipo del documento: Article País de afiliación: China