Pesquisa | Secretaria de Estado da Saúde

Confidence-ranked reconstruction of census microdata from published statistics.

Dick, Travis; Dwork, Cynthia; Kearns, Michael; Liu, Terrance; Roth, Aaron; Vietri, Giuseppe; Wu, Zhiwei Steven.

Proc Natl Acad Sci U S A ; 120(8): e2218605120, 2023 Feb 21.

Artigo em Inglês | MEDLINE | ID: mdl-36800385

RESUMO

A reconstruction attack on a private dataset D takes as input some publicly accessible information about the dataset and produces a list of candidate elements of D. We introduce a class of data reconstruction attacks based on randomized methods for nonconvex optimization. We empirically demonstrate that our attacks can not only reconstruct full rows of D from aggregate query statistics Q(D)∈âm but can do so in a way that reliably ranks reconstructed rows by their odds of appearing in the private data, providing a signature that could be used for prioritizing reconstructed rows for further actions such as identity theft or hate crime. We also design a sequence of baselines for evaluating reconstruction attacks. Our attacks significantly outperform those that are based only on access to a public distribution or population from which the private dataset D was sampled, demonstrating that they are exploiting information in the aggregate statistics Q(D) and not simply the overall structure of the distribution. In other words, the queries Q(D) are permitting reconstruction of elements of this dataset, not the distribution from which D was drawn. These findings are established both on 2010 US decennial Census data and queries and Census-derived American Community Survey datasets. Taken together, our methods and experiments illustrate the risks in releasing numerically precise aggregate statistics of a large dataset and provide further motivation for the careful application of provably private techniques such as differential privacy.

Reply to Sanchéz et al.: Multiplicity does not protect privacy.

Dick, Travis; Dwork, Cynthia; Kearns, Michael; Liu, Terrance; Roth, Aaron; Vietri, Giuseppe; Wu, Zhiwei Steven.

Proc Natl Acad Sci U S A ; 120(18): e2304263120, 2023 05 02.

Artigo em Inglês | MEDLINE | ID: mdl-37094130

Assuntos

Confidencialidade , Privacidade

STATISTICS. The reusable holdout: Preserving validity in adaptive data analysis.

Dwork, Cynthia; Feldman, Vitaly; Hardt, Moritz; Pitassi, Toniann; Reingold, Omer; Roth, Aaron.

Science ; 349(6248): 636-8, 2015 Aug 07.

Artigo em Inglês | MEDLINE | ID: mdl-26250683

RESUMO

Misapplication of statistical data analysis is a common cause of spurious discoveries in scientific research. Existing approaches to ensuring the validity of inferences drawn from data assume a fixed procedure to be performed, selected before the data are examined. In common practice, however, data analysis is an intrinsically adaptive process, with new analyses generated on the basis of data exploration, as well as the results of previous analyses on the same data. We demonstrate a new approach for addressing the challenges of adaptivity based on insights from privacy-preserving data analysis. As an application, we show how to safely reuse a holdout data set many times to validate the results of adaptively chosen analyses.

Toward practicing privacy.

Dwork, Cynthia; Pottenger, Rebecca.

J Am Med Inform Assoc ; 20(1): 102-8, 2013 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-23243088

RESUMO

Private data analysis-the useful analysis of confidential data-requires a rigorous and practicable definition of privacy. Differential privacy, an emerging standard, is the subject of intensive investigation in several diverse research communities. We review the definition, explain its motivation, and discuss some of the challenges to bringing this concept to practice.

Assuntos

Segurança Computacional , Confidencialidade , Registros Eletrônicos de Saúde , Disseminação de Informação , Algoritmos , Crime/psicologia , Humanos , Motivação , Terminologia como Assunto

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

Detalhe da pesquisa