Your browser doesn't support javascript.
loading
Second-order group knockoffs with applications to genome-wide association studies.
Chu, Benjamin B; Gu, Jiaqi; Chen, Zhaomeng; Morrison, Tim; Candès, Emmanuel; He, Zihuai; Sabatti, Chiara.
Afiliación
  • Chu BB; Department of Biomedical Data Science, Stanford University, Stanford, CA, 94305, USA.
  • Gu J; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, 94035, USA.
  • Chen Z; Department of Statistics, Stanford University, Stanford, CA, 94035, USA.
  • Morrison T; Department of Statistics, Stanford University, Stanford, CA, 94035, USA.
  • Candès E; Department of Statistics, Stanford University, Stanford, CA, 94035, USA.
  • He Z; Department of Mathematics, Stanford University, Stanford, CA, 94035, USA.
  • Sabatti C; Department of Biomedical Data Science, Stanford University, Stanford, CA, 94305, USA.
Bioinformatics ; 40(10)2024 Oct 01.
Article en En | MEDLINE | ID: mdl-39340798
ABSTRACT
MOTIVATION Conditional testing via the knockoff framework allows one to identify-among a large number of possible explanatory variables-those that carry unique information about an outcome of interest and also provides a false discovery rate guarantee on the selection. This approach is particularly well suited to the analysis of genome-wide association studies (GWAS), which have the goal of identifying genetic variants that influence traits of medical relevance.

RESULTS:

While conditional testing can be both more powerful and precise than traditional GWAS analysis methods, its vanilla implementation encounters a difficulty common to all multivariate analysis

methods:

it is challenging to distinguish among multiple, highly correlated regressors. This impasse can be overcome by shifting the object of inference from single variables to groups of correlated variables. To achieve this, it is necessary to construct "group knockoffs." While successful examples are already documented in the literature, this paper substantially expands the set of algorithms and software for group knockoffs. We focus in particular on second-order knockoffs, for which we describe correlation matrix approximations that are appropriate for GWAS data and that result in considerable computational savings. We illustrate the effectiveness of the proposed methods with simulations and with the analysis of albuminuria data from the UK Biobank. AVAILABILITY AND IMPLEMENTATION The described algorithms are implemented in an open-source Julia package Knockoffs.jl. R and Python wrappers are available as knockoffsr and knockoffspy packages.
Asunto(s)

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Algoritmos / Programas Informáticos / Estudio de Asociación del Genoma Completo Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Algoritmos / Programas Informáticos / Estudio de Asociación del Genoma Completo Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article