Statistical biases due to anonymization evaluated in an open clinical dataset from COVID-19 patients.

Koll, Carolin E M; Hopff, Sina M; Meurers, Thierry; Lee, Chin Huang; Kohls, Mirjam; Stellbrink, Christoph; Thibeault, Charlotte; Reinke, Lennart; Steinbrecher, Sarah; Schreiber, Stefan; Mitrov, Lazar; Frank, Sandra; Miljukov, Olga; Erber, Johanna; Hellmuth, Johannes C; Reese, Jens-Peter; Steinbeis, Fridolin; Bahmer, Thomas; Hagen, Marina; Meybohm, Patrick; Hansch, Stefan; Vadász, István; Krist, Lilian; Jiru-Hillmann, Steffi; Prasser, Fabian; Vehreschild, Jörg Janne

Koll, Carolin E M; Hopff, Sina M; Meurers, Thierry; Lee, Chin Huang; Kohls, Mirjam; Stellbrink, Christoph; Thibeault, Charlotte; Reinke, Lennart; Steinbrecher, Sarah; Schreiber, Stefan; Mitrov, Lazar; Frank, Sandra; Miljukov, Olga; Erber, Johanna; Hellmuth, Johannes C; Reese, Jens-Peter; Steinbeis, Fridolin; Bahmer, Thomas; Hagen, Marina; Meybohm, Patrick; Hansch, Stefan; Vadász, István; Krist, Lilian; Jiru-Hillmann, Steffi; Prasser, Fabian; Vehreschild, Jörg Janne.

Afiliación

Koll CEM; University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany. carolin.koll@uk-koeln.de.
Hopff SM; University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany.
Meurers T; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Charitéplatz 1, 10117, Berlin, Germany.
Lee CH; University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany.
Kohls M; University of Wuerzburg, Faculty of Medicine, Institute for Clinical Epidemiology and Biometry, Wuerzburg, Germany.
Stellbrink C; Department of Cardiology and Intensive Care Medicine, Bielefeld Medical Centre, Medical Faculty OWL, University of Bielefeld, Bielefeld, Germany.
Thibeault C; Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany.
Reinke L; Internal Medicine Department I, University Medical Center Schleswig-Holstein Campus Kiel, Kiel, Germany.
Steinbrecher S; Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany.
Schreiber S; Internal Medicine Department I, University Medical Center Schleswig-Holstein Campus Kiel, Kiel, Germany.
Mitrov L; University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany.
Frank S; Department of Anesthesiology, University Hospital of Ludwig-Maximilians-University (LMU), Munich, Germany.
Miljukov O; Department of Medicine III, University Hospital, LMU Munich, Munich, Germany.
Erber J; University of Wuerzburg, Faculty of Medicine, Institute for Clinical Epidemiology and Biometry, Wuerzburg, Germany.
Hellmuth JC; Technical University of Munich, School of Medicine, University Hospital rechts der Isar, Department of Internal Medicine II, Munich, Germany.
Reese JP; Department of Medicine III, University Hospital, LMU Munich, Munich, Germany.
Steinbeis F; COVID-19 Registry of the LMU Munich (CORKUM), University Hospital, LMU Munich, Munich, Germany.
Bahmer T; University of Wuerzburg, Faculty of Medicine, Institute for Clinical Epidemiology and Biometry, Wuerzburg, Germany.
Hagen M; Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany.
Meybohm P; Internal Medicine Department I, University Medical Center Schleswig-Holstein Campus Kiel, Kiel, Germany.
Hansch S; Airway Research Center North (ARCN), German Center for Lung Research (DZL), Großhansdorf, Germany.
Vadász I; Department II for Internal Medicine, Hematology/Oncology, University Hospital Frankfurt, Frankfurt am Main, Germany.
Krist L; Department of Anaesthesiology, Intensive Care, Emergency and Pain Medicine, University Hospital Wuerzburg, Wuerzburg, Germany.
Jiru-Hillmann S; Department of Infection Prevention and Infectious Diseases, University Hospital Regensburg, Regensburg, Germany.
Prasser F; Department of Internal Medicine, Justus Liebig University, Universities of Giessen and Marburg Lung Center (UGMLC), Member of the German Center for Lung Research (DZL), Giessen, Germany.
Vehreschild JJ; The Cardio-Pulmonary Institute (CPI), Giessen, Germany.

Sci Data ; 9(1): 776, 2022 12 21.

Article en En | MEDLINE | ID: mdl-36543828

RESUMEN

Anonymization has the potential to foster the sharing of medical data. State-of-the-art methods use mathematical models to modify data to reduce privacy risks. However, the degree of protection must be balanced against the impact on statistical properties. We studied an extreme case of this trade-off: the statistical validity of an open medical dataset based on the German National Pandemic Cohort Network (NAPKON), which was prepared for publication using a strong anonymization procedure. Descriptive statistics and results of regression analyses were compared before and after anonymization of multiple variants of the original dataset. Despite significant differences in value distributions, the statistical bias was found to be small in all cases. In the regression analyses, the median absolute deviations of the estimated adjusted odds ratios for different sample sizes ranged from 0.01 [minimum = 0, maximum = 0.58] to 0.52 [minimum = 0.25, maximum = 0.91]. Disproportionate impact on the statistical properties of data is a common argument against the use of anonymization. Our analysis demonstrates that anonymization can actually preserve validity of statistical results in relatively low-dimensional data.

Asunto(s)

COVID-19; Humanos; Sesgo; Anonimización de la Información; Modelos Teóricos; Privacidad; Interpretación Estadística de Datos; Conjuntos de Datos como Asunto

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Contexto en salud: 4_TD Problema de salud: 4_pneumonia Asunto principal: COVID-19 Límite: Humans Idioma: En Revista: Sci Data Año: 2022 Tipo del documento: Article País de afiliación: Alemania

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google