Your browser doesn't support javascript.
loading
Data encoding for healthcare data democratization and information leakage prevention.
Thakur, Anshul; Zhu, Tingting; Abrol, Vinayak; Armstrong, Jacob; Wang, Yujiang; Clifton, David A.
Afiliação
  • Thakur A; Department of Engineering Science, University of Oxford, OX3 7DQ, Oxfordshire, UK. anshul.thakur@eng.ox.ac.uk.
  • Zhu T; Department of Engineering Science, University of Oxford, OX3 7DQ, Oxfordshire, UK.
  • Abrol V; Infosys Centre for AI, IIIT Delhi, Delhi, India.
  • Armstrong J; Department of Engineering Science, University of Oxford, OX3 7DQ, Oxfordshire, UK.
  • Wang Y; Department of Engineering Science, University of Oxford, OX3 7DQ, Oxfordshire, UK. yujiang.wang@oscar.ox.ac.uk.
  • Clifton DA; Oxford Suzhou Centre for Advanced Research, Suzhou, China. yujiang.wang@oscar.ox.ac.uk.
Nat Commun ; 15(1): 1582, 2024 Feb 21.
Article em En | MEDLINE | ID: mdl-38383571
ABSTRACT
The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that irreversible data encoding can provide an effective solution to achieve data democratization without violating the privacy constraints imposed on healthcare data and clinical models. An ideal encoding framework transforms the data into a new space where it is imperceptible to a manual or computational inspection. However, encoded data should preserve the semantics of the original data such that deep learning models can be trained effectively. This paper hypothesizes the characteristics of the desired encoding framework and then exploits random projections and random quantum encoding to realize this framework for dense and longitudinal or time-series data. Experimental evaluation highlights that models trained on encoded time-series data effectively uphold the information bottleneck principle and hence, exhibit lesser information leakage from trained models.

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article