Your browser doesn't support javascript.
loading
Fast and accurate Bayesian polygenic risk modeling with variational inference.
Zabad, Shadi; Gravel, Simon; Li, Yue.
Afiliação
  • Zabad S; School of Computer Science, McGill University, Montreal, QC, Canada.
  • Gravel S; Department of Human Genetics, McGill University, Montreal, QC, Canada. Electronic address: simon.gravel@mcgill.ca.
  • Li Y; School of Computer Science, McGill University, Montreal, QC, Canada. Electronic address: yueli@cs.mcgill.ca.
Am J Hum Genet ; 110(5): 741-761, 2023 05 04.
Article em En | MEDLINE | ID: mdl-37030289
ABSTRACT
The advent of large-scale genome-wide association studies (GWASs) has motivated the development of statistical methods for phenotype prediction with single-nucleotide polymorphism (SNP) array data. These polygenic risk score (PRS) methods use a multiple linear regression framework to infer joint effect sizes of all genetic variants on the trait. Among the subset of PRS methods that operate on GWAS summary statistics, sparse Bayesian methods have shown competitive predictive ability. However, most existing Bayesian approaches employ Markov chain Monte Carlo (MCMC) algorithms, which are computationally inefficient and do not scale favorably to higher dimensions, for posterior inference. Here, we introduce variational inference of polygenic risk scores (VIPRS), a Bayesian summary statistics-based PRS method that utilizes variational inference techniques to approximate the posterior distribution for the effect sizes. Our experiments with 36 simulation configurations and 12 real phenotypes from the UK Biobank dataset demonstrated that VIPRS is consistently competitive with the state-of-the-art in prediction accuracy while being more than twice as fast as popular MCMC-based approaches. This performance advantage is robust across a variety of genetic architectures, SNP heritabilities, and independent GWAS cohorts. In addition to its competitive accuracy on the "White British" samples, VIPRS showed improved transferability when applied to other ethnic groups, with up to 1.7-fold increase in R2 among individuals of Nigerian ancestry for low-density lipoprotein (LDL) cholesterol. To illustrate its scalability, we applied VIPRS to a dataset of 9.6 million genetic markers, which conferred further improvements in prediction accuracy for highly polygenic traits, such as height.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Herança Multifatorial / Estudo de Associação Genômica Ampla Tipo de estudo: Etiology_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Am J Hum Genet Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Canadá

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Herança Multifatorial / Estudo de Associação Genômica Ampla Tipo de estudo: Etiology_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Am J Hum Genet Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Canadá