Your browser doesn't support javascript.
loading
Open tools for quantitative anonymization of tabular phenotype data: literature review.
Haber, Anna C; Sax, Ulrich; Prasser, Fabian.
Afiliación
  • Haber AC; Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.
  • Sax U; Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany.
  • Prasser F; Campus-Institute Data Science, Georg-August-University Göttingen.
Brief Bioinform ; 23(6)2022 11 19.
Article en En | MEDLINE | ID: mdl-36215114
ABSTRACT
Precision medicine relies on molecular and systems biology methods as well as bidirectional association studies of phenotypes and (high-throughput) genomic data. However, the integrated use of such data often faces obstacles, especially in regards to data protection. An important prerequisite for research data processing is usually informed consent. But collecting consent is not always feasible, in particular when data are to be analyzed retrospectively. For phenotype data, anonymization, i.e. the altering of data in such a way that individuals cannot be identified, can provide an alternative. Several re-identification attacks have shown that this is a complex task and that simply removing directly identifying attributes such as names is usually not enough. More formal approaches are needed that use mathematical models to quantify risks and guide their reduction. Due to the complexity of these techniques, it is challenging and not advisable to implement them from scratch. Open software libraries and tools can provide a robust alternative. However, also the range of available anonymization tools is heterogeneous and obtaining an overview of their strengths and weaknesses is difficult due to the complexity of the problem space. We therefore performed a systematic review of open anonymization tools for structured phenotype data described in the literature between 1990 and 2021. Through a two-step eligibility assessment process, we selected 13 tools for an in-depth analysis. By comparing the supported anonymization techniques and further aspects, such as maturity, we derive recommendations for tools to use for anonymizing phenotype datasets with different properties.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Privacidad / Investigación Biomédica Tipo de estudio: Observational_studies / Risk_factors_studies / Systematic_reviews Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Privacidad / Investigación Biomédica Tipo de estudio: Observational_studies / Risk_factors_studies / Systematic_reviews Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: Alemania