Word sense disambiguation across two domains: biomedical literature and clinical notes.
Savova, Guergana K; Coden, Anni R; Sominsky, Igor L; Johnson, Rie; Ogren, Philip V; de Groen, Piet C; Chute, Christopher G.
Affiliation
  • Savova GK; Division of Biomedical Informatics, Mayo Clinic College of Medicine, 150 Third Street SW, Rochester, MN 55902, USA. savova.guergana@mayo.edu
J Biomed Inform; 41(6): 1088-100, 2008 Dec.
Article in English | MEDLINE | ID: mdl-18375190
ABSTRACT
The aim of this study is to explore the word sense disambiguation (WSD) problem across two biomedical domains: biomedical literature and clinical notes. A supervised machine learning technique was used for the WSD task. One of the challenges addressed is the creation of a suitable clinical corpus with manual sense annotations. This corpus, in conjunction with the WSD set from the National Library of Medicine, provided the basis for evaluating our method across multiple domains and for comparing our results to published ones. Notably, only 20% of the most relevant ambiguous terms within a domain overlap between the two domains, and these overlapping terms have more senses associated with them in the clinical space than in the biomedical literature space. Experimentation with 28 different feature sets yielded a system achieving an average F-score of 0.82 on the clinical data and 0.86 on the biomedical literature.
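As a rough illustration of the reported metric, the sketch below computes an F-score per ambiguous term and then averages across terms. It rests on assumptions not stated in the abstract: that precision is the fraction of attempted instances labeled correctly, that recall is the fraction of all instances labeled correctly, and that the average is an unweighted mean over terms. The function names, terms, and sense labels are hypothetical toy data, not taken from the study's corpus.

```python
from statistics import mean


def f_score(gold, predicted):
    """F-score for one ambiguous term.

    gold      -- list of manually annotated senses, one per instance
    predicted -- list of system senses; None means the system gave no answer
    Assumption: precision = correct / attempted, recall = correct / total.
    """
    attempted = [(g, p) for g, p in zip(gold, predicted) if p is not None]
    correct = sum(1 for g, p in attempted if g == p)
    if correct == 0:
        return 0.0
    precision = correct / len(attempted)
    recall = correct / len(gold)
    return 2 * precision * recall / (precision + recall)


def average_f_score(results):
    """Unweighted mean of per-term F-scores (an assumed averaging scheme)."""
    return mean(f_score(gold, pred) for gold, pred in results.values())


# Hypothetical toy annotations: term -> (gold senses, predicted senses)
results = {
    "cold": (
        ["temperature", "illness", "illness", "illness"],
        ["temperature", "illness", "temperature", "illness"],
    ),
    "discharge": (
        ["release", "fluid", "release", "fluid"],
        ["release", "fluid", "release", None],
    ),
}

for term, (gold, pred) in results.items():
    print(term, round(f_score(gold, pred), 3))
print("average", round(average_f_score(results), 3))
```

When the system answers every instance, precision and recall coincide and the F-score reduces to plain accuracy for that term.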
Subject(s)

Full text: 1 | Database: MEDLINE | Main subject: Language | Study type: Guideline | Language: En | Year: 2008 | Document type: Article
