Your browser doesn't support javascript.

Repositório BVS

Documentos sobre a Biblioteca Virtual em Saúde

> Pesquisa > ()
Imprimir Exportar

Formato de exportação:


Adicionar mais destinatários
| |

Word Sense Disambiguation of Medical Terms via Recurrent Convolutional Neural Networks.

Festag, Sven; Spreckelsen, Cord.
Stud Health Technol Inform; 236: 8-15, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28508773


Tagging text data with codes representing biomedical concepts plays an important role in medical data management and analysis. A problem occurs if there are ambiguous words linked to several concepts.


This study aims at investigating word sense disambiguation based on word embedding and recurrent convolutional neural networks. The study focuses on terms mapped to multiple concepts of the Unified Medical Language System (UMLS).


We created 20 text processing pipelines trained on a subset of the MeSH Word Sense Disambiguation (MSH WSD) data set, each pipeline disambiguating the sense of one word. The pipelines were then tested on a disjoint subset of MSH WSD data. Most pipelines achieved good or even excellent results (70% of the pipelines achieved at least 90% accuracy, 40% achieved at least 98% accuracy). One poor-performing outlier was detected.


The proposed approach can serve as a basis for an up-scaled system combining pipelines for many ambiguous words. The methods used here recently proved very successful in other fields of text understanding and can be expected to scale-up with improved availability of training data.