Public platform with 39,472 exome control samples enables association studies without genotype sharing.
Nat Genet
; 56(2): 327-335, 2024 Feb.
Article
en En
| MEDLINE
| ID: mdl-38200129
ABSTRACT
Acquiring a sufficiently powered cohort of control samples matched to a case sample can be time-consuming or, in some cases, impossible. Accordingly, an ability to leverage genetic data from control samples that were already collected elsewhere could dramatically improve power in genetic association studies. Sharing of control samples can pose significant challenges, since most human genetic data are subject to strict sharing regulations. Here, using the properties of singular value decomposition and subsampling algorithm, we developed a method allowing selection of the best-matching controls in an external pool of samples compliant with personal data protection and eliminating the need for genotype sharing. We provide access to a library of 39,472 exome sequencing controls at http//dnascore.net enabling association studies for case cohorts lacking control subjects. Using this approach, control sets can be selected from this online library with a prespecified matching accuracy, ensuring well-calibrated association analysis for both rare and common variants.
Texto completo:
1
Banco de datos:
MEDLINE
Asunto principal:
Algoritmos
/
Exoma
Tipo de estudio:
Risk_factors_studies
Límite:
Humans
Idioma:
En
Revista:
Nat Genet
Asunto de la revista:
GENETICA MEDICA
Año:
2024
Tipo del documento:
Article
País de afiliación:
Estados Unidos