ABSTRACT
BACKGROUND: The accuracy of spelling in Electronic Health Records (EHRs) is critical for efficient clinical care, research, and patient safety. The Persian language, with its rich vocabulary and complex characteristics, poses unique challenges for real-word error correction. This research aimed to develop an approach for detecting and correcting spelling errors in Persian clinical text. METHODS: Our approach employs a state-of-the-art pre-trained model fine-tuned specifically for spelling correction in the Persian clinical domain. This model is complemented by PERTO, a novel orthographic similarity matching algorithm that uses the visual similarity of characters to rank correction candidates. RESULTS: The evaluation demonstrated the robustness and precision of our approach in detecting and correcting word errors in Persian clinical text. For non-word error correction, the model achieved an F1-Score of 90.0% when the PERTO algorithm was employed. For real-word error detection, it achieved its best performance, an F1-Score of 90.6%. For real-word error correction, it reached its highest F1-Score of 91.5% when the PERTO algorithm was employed. CONCLUSIONS: Despite certain limitations, our method represents a substantial advancement in spelling error detection and correction for Persian clinical text. By addressing the unique challenges posed by the Persian language, it paves the way for more accurate and efficient clinical documentation, contributing to improved patient care and safety. Future research could explore its use in other areas of the Persian medical domain, enhancing its impact and utility.
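The abstract does not disclose PERTO's internals, so the sketch below only illustrates the general idea of orthographic-similarity ranking: an edit distance in which substitutions between visually confusable Persian letters are cheaper, used to order correction candidates. The character groups and the 0.4 discount are illustrative assumptions, not the published algorithm.

```python
# Minimal sketch of visual-similarity candidate ranking (assumed details,
# not the actual PERTO implementation).
from functools import lru_cache

# Persian letters sharing a base glyph and differing mainly in dots/strokes
# (illustrative subset).
VISUAL_GROUPS = [
    set("بپتث"), set("جچحخ"), set("دذ"), set("رزژ"),
    set("سش"), set("صض"), set("طظ"), set("عغ"), set("کگ"),
]

def substitution_cost(a: str, b: str) -> float:
    """Cheaper substitutions for visually confusable characters."""
    if a == b:
        return 0.0
    if any(a in g and b in g for g in VISUAL_GROUPS):
        return 0.4  # assumed discount for visually similar letters
    return 1.0

def visual_edit_distance(source: str, target: str) -> float:
    """Levenshtein distance with visually weighted substitution costs."""
    @lru_cache(maxsize=None)
    def d(i: int, j: int) -> float:
        if i == 0:
            return float(j)
        if j == 0:
            return float(i)
        return min(
            d(i - 1, j) + 1.0,      # deletion
            d(i, j - 1) + 1.0,      # insertion
            d(i - 1, j - 1) + substitution_cost(source[i - 1], target[j - 1]),
        )
    return d(len(source), len(target))

def rank_candidates(misspelled: str, candidates: list[str]) -> list[str]:
    """Order candidate corrections from most to least visually plausible."""
    return sorted(candidates, key=lambda c: visual_edit_distance(misspelled, c))
```

In such a setup, candidates could come from any generator (for example, a fine-tuned language model) and then be re-ranked by visual plausibility.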
Subjects
Electronic Health Records, Natural Language Processing, Humans, Electronic Health Records/standards, Algorithms, Iran (Geographic)

ABSTRACT
Real-word errors are misspellings that result in valid dictionary words, so they can only be detected by taking context into account. Traditional methods for detecting and correcting such errors mostly count the frequency of short word sequences in a corpus and then compute the probability of a word being a real-word error. State-of-the-art approaches, on the other hand, use deep learning models that learn context by extracting semantic features from text. In this work, a deep learning model was implemented for correcting real-word errors in clinical text. Specifically, a Seq2seq Neural Machine Translation model maps erroneous sentences to their corrected versions. To obtain training data, different types of errors were generated in correct sentences using rules. Different Seq2seq models were trained and evaluated on two corpora: the Wikicorpus and a collection of three clinical datasets. The medicine corpus was much smaller than the Wikicorpus because of privacy restrictions on patient information. In addition, pretrained GloVe and Word2Vec word embeddings were used to study their effect on performance. Despite the smaller size of the medicine corpus, Seq2seq models trained on it outperformed those trained on the Wikicorpus. Nevertheless, a larger amount of clinical text is required to improve the results.
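To illustrate the rule-based error generation used to build (erroneous, correct) sentence pairs for Seq2seq training, the sketch below swaps one word for a confusable real word drawn from a small confusion set. The confusion sets and the single-substitution rule are illustrative assumptions, not the authors' actual rules.

```python
# Hedged sketch: injecting real-word errors into correct sentences to create
# Seq2seq training pairs. Confusion sets and rules are assumed for illustration.
import random

# Example confusion sets of real words that are easy to mix up (assumed).
CONFUSION_SETS = [
    {"there", "their"},
    {"dose", "does"},
    {"affect", "effect"},
]

def swap_confusable(tokens: list[str], rng: random.Random) -> list[str]:
    """Replace one token with another real word from its confusion set."""
    positions = [i for i, t in enumerate(tokens)
                 if any(t in s for s in CONFUSION_SETS)]
    if not positions:
        return tokens
    i = rng.choice(positions)
    group = next(s for s in CONFUSION_SETS if tokens[i] in s)
    tokens = list(tokens)
    tokens[i] = rng.choice(sorted(group - {tokens[i]}))
    return tokens

def make_training_pair(sentence: str, seed: int = 0) -> tuple[str, str]:
    """Return (erroneous_sentence, correct_sentence) for Seq2seq training."""
    rng = random.Random(seed)
    noisy = swap_confusable(sentence.split(), rng)
    return " ".join(noisy), sentence

# Example usage:
# make_training_pair("the patient does not report pain")
# -> ("the patient dose not report pain", "the patient does not report pain")
```

The Seq2seq corrector would then be trained to translate the noisy side of each pair back into the clean side.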