Detecting Protected Health Information in Heterogeneous Clinical Notes.
Stud Health Technol Inform
; 245: 393-397, 2017.
Article
em En
| MEDLINE
| ID: mdl-29295123
To enable secondary use of healthcare data in a privacy-preserving manner, there is a need for methods capable of automatically identifying protected health information (PHI) in clinical text. To that end, learning predictive models from labeled examples has emerged as a promising alternative to rule-based systems. However, little is known about differences with respect to PHI prevalence in different types of clinical notes and how potential domain differences may affect the performance of predictive models trained on one particular type of note and applied to another. In this study, we analyze the performance of a predictive model trained on an existing PHI corpus of Swedish clinical notes and applied to a variety of clinical notes: written (i) in different clinical specialties, (ii) under different headings, and (iii) by persons in different professions. The results indicate that domain adaption is needed for effective detection of PHI in heterogeneous clinical notes.
Palavras-chave
Buscar no Google
Coleções:
01-internacional
Contexto em Saúde:
1_ASSA2030
Base de dados:
MEDLINE
Assunto principal:
Privacidade
/
Registros Eletrônicos de Saúde
Tipo de estudo:
Prevalence_studies
/
Prognostic_studies
/
Risk_factors_studies
Limite:
Humans
País/Região como assunto:
Europa
Idioma:
En
Revista:
Stud Health Technol Inform
Ano de publicação:
2017
Tipo de documento:
Article