Your browser doesn't support javascript.
loading
Data Sanitization to Reduce Private Information Leakage from Functional Genomics.
Gürsoy, Gamze; Emani, Prashant; Brannon, Charlotte M; Jolanki, Otto A; Harmanci, Arif; Strattan, J Seth; Cherry, J Michael; Miranker, Andrew D; Gerstein, Mark.
Afiliação
  • Gürsoy G; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
  • Emani P; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
  • Brannon CM; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
  • Jolanki OA; Stanford University School of Medicine, Department of Genetics, Stanford, CA 94305, USA.
  • Harmanci A; School of Biomedical Informatics, Center for Precision Health, University of Texas Health Sciences Center, Houston, TX 77030, USA.
  • Strattan JS; Stanford University School of Medicine, Department of Genetics, Stanford, CA 94305, USA.
  • Cherry JM; Stanford University School of Medicine, Department of Genetics, Stanford, CA 94305, USA.
  • Miranker AD; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA; Department of Chemical and Environmental Engineering, Yale University, New Haven, CT 06520, USA.
  • Gerstein M; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA; Department of Computer Science, Yale University, New Haven, CT 06520, USA; Department of Statistics and Data
Cell ; 183(4): 905-917.e16, 2020 11 12.
Article em En | MEDLINE | ID: mdl-33186529
ABSTRACT
The generation of functional genomics datasets is surging, because they provide insight into gene regulation and organismal phenotypes (e.g., genes upregulated in cancer). The intent behind functional genomics experiments is not necessarily to study genetic variants, yet they pose privacy concerns due to their use of next-generation sequencing. Moreover, there is a great incentive to broadly share raw reads for better statistical power and general research reproducibility. Thus, we need new modes of sharing beyond traditional controlled-access models. Here, we develop a data-sanitization procedure allowing raw functional genomics reads to be shared while minimizing privacy leakage, enabling principled privacy-utility trade-offs. Our protocol works with traditional Illumina-based assays and newer technologies such as 10x single-cell RNA sequencing. It involves quantifying the privacy leakage in reads by statistically linking study participants to known individuals. We carried out these linkages using data from highly accurate reference genomes and more realistic environmental samples.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Segurança Computacional / Privacidade / Genômica Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Cell Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Segurança Computacional / Privacidade / Genômica Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Cell Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos