A Comparative Analysis of Speed and Accuracy for Three Off-the-Shelf De-Identification Tools.
Heider, Paul M; Obeid, Jihad S; Meystre, Stéphane M.
Affiliation
  • Heider PM; Biomedical Informatics Center, Medical University of South Carolina, Charleston, SC.
  • Obeid JS; Biomedical Informatics Center, Medical University of South Carolina, Charleston, SC.
  • Meystre SM; Biomedical Informatics Center, Medical University of South Carolina, Charleston, SC.
AMIA Jt Summits Transl Sci Proc ; 2020: 241-250, 2020.
Article in En | MEDLINE | ID: mdl-32477643
ABSTRACT
A growing quantity of health data is being stored in Electronic Health Records (EHR). The free-text sections of the clinical notes in these records contain important patient and treatment information for research but also contain Personally Identifiable Information (PII), which cannot be freely shared within the research community without compromising patient confidentiality and privacy rights. Significant work has been invested in automated approaches to text de-identification, the process of removing or redacting PII. Few studies have examined the performance of existing de-identification pipelines in a controlled comparative analysis. In this study, we use publicly available corpora to analyze speed and accuracy differences between three de-identification systems that can be run off-the-shelf: Amazon Comprehend Medical PHId, Clinacuity's CliniDeID, and the National Library of Medicine's Scrubber. No single system dominated all the compared metrics. NLM Scrubber was the fastest, while CliniDeID generally had the highest accuracy.

Full text: 1 Collections: 01-international Database: MEDLINE Study type: Diagnostic_studies Language: En Publication year: 2020 Document type: Article
