Your browser doesn't support javascript.
loading
Enhancement of Large Language Models' Performance in Diabetes Education: Retrieval-Augmented Generation Approach.
Wang, Dingqiao; Liang, Jiangbo; Ye, Jinguo; Li, Jingni; Li, Jingpeng; Zhang, Qikai; Hu, Qiuling; Pan, Caineng; Wang, Dongliang; Liu, Zhong; Shi, Wen; Shi, Danli; Li, Fei; Qu, Bo; Zheng, Yingfeng.
Affiliation
  • Wang D; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Liang J; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Ye J; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Li J; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Li J; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Zhang Q; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Hu Q; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Pan C; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Wang D; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Liu Z; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Shi W; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Shi D; Research Centre for SHARP Vision, The Hong Kong Polytechnic University, Hong Kong, CN.
  • Li F; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Qu B; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
  • Zheng Y; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, GuangZhou, CN.
J Med Internet Res ; 2024 Jul 15.
Article in En | MEDLINE | ID: mdl-39046096
ABSTRACT

BACKGROUND:

Large language models (LLMs) demonstrated advanced performance in processing clinical information. However, commercially available LLMs lack specialized medical knowledge and remain susceptible to generating inaccurate information. Given the need for self-management in diabetes, patients commonly seek information online. We introduce the RISE framework and evaluate its performance in enhancing LLMs to provide accurate responses to diabetes-related inquiries.

OBJECTIVE:

This study aimed to evaluate the potential of RISE framework, an information retrieval and augmentation tool, to improve the LLM's performance to accurately and safely respond to diabetes-related inquiries.

METHODS:

The RISE, an innovative retrieval augmentation framework, comprises four

steps:

Rewriting Query, Information Retrieval, Summarization, and Execution. Using a set of 43 common diabetes-related questions, we evaluated three base LLMs (GPT-4, Anthropic Claude 2, Google Bard) and their RISE-enhanced versions. Assessments were conducted by clinicians for accuracy and comprehensiveness, and by patients for understandability.

RESULTS:

The integration of RISE significantly improved the accuracy and comprehensiveness of responses from all three based LLMs. On average, the percentage of accurate responses increased by 12% (122 - 107/129) with RISE. Specifically, the rates of accurate responses increased by 7% (42 - 39/43) for GPT-4, 19% (39 - 31/43) for Claude 2, and 9% (41 - 37/43) for Google Bard. The framework also enhanced response comprehensiveness, with mean scores improving by 0.44. Understandability was also enhanced by 0.19 on average. Data collection was conducted from Sept. 30, 2023, to Feb. 05, 2024.

CONCLUSIONS:

RISE significantly improves LLMs' performance in responding to diabetes-related inquiries, enhancing accuracy, comprehensiveness, and understandability. These improvements have crucial implications for RISE's future role in patient education and chronic illness self-management, which contributes to relieving medical resource pressures and raising public awareness of medical knowledge.

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: J Med Internet Res Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Country of publication: Canada

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: J Med Internet Res Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Country of publication: Canada