Your browser doesn't support javascript.
loading
Disambiguating Clinical Abbreviations by One-to-All Classification: Algorithm Development and Validation Study.
Sung, Sheng-Feng; Hu, Ya-Han; Chen, Chong-Yan.
Affiliation
  • Sung SF; Division of Neurology, Department of Internal Medicine, Ditmanson Medical Foundation Chia-Yi Christian Hospital, Chiayi City, Taiwan.
  • Hu YH; Department of Nursing, Fooyin University, Kaohsiung, Taiwan.
  • Chen CY; Department of Information Management, National Central University, 300 Zhongda Rd, Zhongli District, Taoyuan City, 32001, Taiwan, 886 34227151 ext 66560.
JMIR Med Inform ; 12: e56955, 2024 Oct 01.
Article in En | MEDLINE | ID: mdl-39352715
ABSTRACT

Background:

Electronic medical records store extensive patient data and serve as a comprehensive repository, including textual medical records like surgical and imaging reports. Their utility in clinical decision support systems is substantial, but the widespread use of ambiguous and unstandardized abbreviations in clinical documents poses challenges for natural language processing in clinical decision support systems. Efficient abbreviation disambiguation methods are needed for effective information extraction.

Objective:

This study aims to enhance the one-to-all (OTA) framework for clinical abbreviation expansion, which uses a single model to predict multiple abbreviation meanings. The objective is to improve OTA by developing context-candidate pairs and optimizing word embeddings in Bidirectional Encoder Representations From Transformers (BERT), evaluating the model's efficacy in expanding clinical abbreviations using real data.

Methods:

Three datasets were used Medical Subject Headings Word Sense Disambiguation, University of Minnesota, and Chia-Yi Christian Hospital from Ditmanson Medical Foundation Chia-Yi Christian Hospital. Texts containing polysemous abbreviations were preprocessed and formatted for BERT. The study involved fine-tuning pretrained models, ClinicalBERT and BlueBERT, generating dataset pairs for training and testing based on Huang et al's method.

Results:

BlueBERT achieved macro- and microaccuracies of 95.41% and 95.16%, respectively, on the Medical Subject Headings Word Sense Disambiguation dataset. It improved macroaccuracy by 0.54%-1.53% compared to two baselines, long short-term memory and deepBioWSD with random embedding. On the University of Minnesota dataset, BlueBERT recorded macro- and microaccuracies of 98.40% and 98.22%, respectively. Against the baselines of Word2Vec + support vector machine and BioWordVec + support vector machine, BlueBERT demonstrated a macroaccuracy improvement of 2.61%-4.13%.

Conclusions:

This research preliminarily validated the effectiveness of the OTA method for abbreviation disambiguation in medical texts, demonstrating the potential to enhance both clinical staff efficiency and research effectiveness.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Natural Language Processing / Abbreviations as Topic / Electronic Health Records Limits: Humans Language: En Journal: JMIR Med Inform Year: 2024 Document type: Article Affiliation country: Taiwan Country of publication: Canada

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Natural Language Processing / Abbreviations as Topic / Electronic Health Records Limits: Humans Language: En Journal: JMIR Med Inform Year: 2024 Document type: Article Affiliation country: Taiwan Country of publication: Canada