Your browser doesn't support javascript.
loading
[The risk of re-identification when analyzing electronic health records: a critical appraisal and possible solutions]. / Das Risiko von Re-Identifizierung bei der Auswertung medizinischer Routinedaten ­ Kritische Bewertung und Lösungsansätze.
Hauswaldt, Johannes; Demmer, Iris; Heinemann, Stephanie; Himmel, Wolfgang; Hummers, Eva; Pung, Johannes; Schlegelmilch, Falk; Drepper, Johannes.
Afiliação
  • Hauswaldt J; Institut für Allgemeinmedizin, Universitätsmedizin Göttingen, Göttingen, Deutschland. Electronic address: johannes.hauswaldt@med.uni-goettingen.de.
  • Demmer I; Institut für Allgemeinmedizin, Universitätsmedizin Göttingen, Göttingen, Deutschland.
  • Heinemann S; Institut für Allgemeinmedizin, Universitätsmedizin Göttingen, Göttingen, Deutschland.
  • Himmel W; Institut für Allgemeinmedizin, Universitätsmedizin Göttingen, Göttingen, Deutschland.
  • Hummers E; Institut für Allgemeinmedizin, Universitätsmedizin Göttingen, Göttingen, Deutschland.
  • Pung J; Institut für Medizinische Informatik, Universitätsmedizin Göttingen, Göttingen, Deutschland.
  • Schlegelmilch F; Institut für Allgemeinmedizin, Universitätsmedizin Göttingen, Göttingen, Deutschland.
  • Drepper J; TMF - Technologie- und Methodenplattform für die vernetzte medizinische Forschung e.V., Berlin (TMF), Deutschland.
Z Evid Fortbild Qual Gesundhwes ; 149: 22-31, 2019 Dec.
Article em De | MEDLINE | ID: mdl-32165110
BACKGROUND AND OBJECTIVES: The use of primary care data gathered from electronic health records in local practices could be an important building block for the future of health services research. However, the risks and reservations associated with using this data for research purposes should not be underestimated. We show the data protection and privacy problems that may arise through secondary analysis of routine primary care data and describe the technical solutions that are available to address these concerns - as a trust-building measure. METHODS: We screened 40 variables that are deemed important for documentation in the electronic health records of primary care physicians and rated the risk of patient re-identification when using these records from routine medical data for research purposes. The criteria used to rate the risk of re-identification were "expert perception" (inferences of a professional observer of phenotypical characteristics which are documented in the 40 variables), "researchable additional knowledge" (knowledge of characteristics of a person through publicly available information and social media networks), and "statistic frequency" according to diagnosis and medication statistics. RESULTS: Diagnoses and reasons for contacting a general practitioner can contain particularly identifiable characteristics such as "obesity" (ICD-10 E66) and "nicotine dependence" (F17). About half of all ICD codes documented in primary care fall below a critical threshold value in their absolute frequency; this is all the more problematic if diagnoses allow for re-identification due to phenotypical characteristics. Medication information holds little potential risk of re-identification of a person. However, the application of medications could be a source of re-identification, e. g., self-injections of insulin or use of inhalators. Information about times and dates are especially sensitive for the re-identification of a person. Sex and age of a patient generally pose no problems, except in the case of very young or very old individuals when these age groups are seldom represented in the practice. DISCUSSION: Routine health data are, in principle, sensitive data. Knowledge about the variables in primary care data gathered from electronic health records in local practices and the evaluation of this data allow us to more accurately estimate the risk of re-identification for the persons concerned. In particular, chronic diagnoses and/or diagnoses in long text, calendar dates for patient contacts and therapies bear a high risk of re-identification. Technical measures such as removing data, masking values and coding should make re-identification considerably more difficult. There will always be a remaining risk of re-identification which should be openly discussed to counteract concerns about a lack of data protection or a sweeping critique of digitization in healthcare.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Contexto em Saúde: 11_ODS3_cobertura_universal / 1_ASSA2030 Problema de saúde: 11_delivery_arrangements / 1_sistemas_informacao_saude Assunto principal: Atenção Primária à Saúde / Projetos de Pesquisa / Registros Eletrônicos de Saúde / Clínicos Gerais Tipo de estudo: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies Limite: Humans País/Região como assunto: Europa Idioma: De Revista: Z Evid Fortbild Qual Gesundhwes Assunto da revista: MEDICINA Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Contexto em Saúde: 11_ODS3_cobertura_universal / 1_ASSA2030 Problema de saúde: 11_delivery_arrangements / 1_sistemas_informacao_saude Assunto principal: Atenção Primária à Saúde / Projetos de Pesquisa / Registros Eletrônicos de Saúde / Clínicos Gerais Tipo de estudo: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies Limite: Humans País/Região como assunto: Europa Idioma: De Revista: Z Evid Fortbild Qual Gesundhwes Assunto da revista: MEDICINA Ano de publicação: 2019 Tipo de documento: Article
...