Your browser doesn't support javascript.
loading
Privacy-preserving federated genome-wide association studies via dynamic sampling.
Wang, Xinyue; Dervishi, Leonard; Li, Wentao; Ayday, Erman; Jiang, Xiaoqian; Vaidya, Jaideep.
Afiliação
  • Wang X; Management Science and Information Systems Department, Rutgers University, New Brunswick, NJ 07102, United States.
  • Dervishi L; Department of Computer and Data Sciences, Cleveland, OH 44106, United States.
  • Li W; Department of Health Data Science and Artificial Intelligence, Houston, TX 77030, United States.
  • Ayday E; Department of Computer and Data Sciences, Cleveland, OH 44106, United States.
  • Jiang X; Department of Health Data Science and Artificial Intelligence, Houston, TX 77030, United States.
  • Vaidya J; Management Science and Information Systems Department, Rutgers University, New Brunswick, NJ 07102, United States.
Bioinformatics ; 39(10)2023 10 03.
Article em En | MEDLINE | ID: mdl-37856329
ABSTRACT
MOTIVATION Genome-wide association studies (GWAS) benefit from the increasing availability of genomic data and cross-institution collaborations. However, sharing data across institutional boundaries jeopardizes medical data confidentiality and patient privacy. While modern cryptographic techniques provide formal secure guarantees, the substantial communication and computational overheads hinder the practical application of large-scale collaborative GWAS.

RESULTS:

This work introduces an efficient framework for conducting collaborative GWAS on distributed datasets, maintaining data privacy without compromising the accuracy of the results. We propose a novel two-step strategy aimed at reducing communication and computational overheads, and we employ iterative and sampling techniques to ensure accurate results. We instantiate our approach using logistic regression, a commonly used statistical method for identifying associations between genetic markers and the phenotype of interest. We evaluate our proposed methods using two real genomic datasets and demonstrate their robustness in the presence of between-study heterogeneity and skewed phenotype distributions using a variety of experimental settings. The empirical results show the efficiency and applicability of the proposed method and the promise for its application for large-scale collaborative GWAS. AVAILABILITY AND IMPLEMENTATION The source code and data are available at https//github.com/amioamo/TDS.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Privacidade / Estudo de Associação Genômica Ampla Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Privacidade / Estudo de Associação Genômica Ampla Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article