Your browser doesn't support javascript.
loading
A flexible and parallelizable approach to genome-wide polygenic risk scores.
Newcombe, Paul J; Nelson, Christopher P; Samani, Nilesh J; Dudbridge, Frank.
Afiliação
  • Newcombe PJ; MRC Biostatistics Unit, School of Clinical Medicine, Cambridge Institute of Public Health, Cambridge Biomedical Campus, Cambridge, UK.
  • Nelson CP; Department of Cardiovascular Sciences, Cardiovascular Research Centre, Glenfield Hospital, University of Leicester, Leicester, UK.
  • Samani NJ; NIHR Leicester Biomedical Research Centre, Glenfield Hospital, Leicester, UK.
  • Dudbridge F; Department of Cardiovascular Sciences, Cardiovascular Research Centre, Glenfield Hospital, University of Leicester, Leicester, UK.
Genet Epidemiol ; 43(7): 730-741, 2019 10.
Article em En | MEDLINE | ID: mdl-31328830
The heritability of most complex traits is driven by variants throughout the genome. Consequently, polygenic risk scores, which combine information on multiple variants genome-wide, have demonstrated improved accuracy in genetic risk prediction. We present a new two-step approach to constructing genome-wide polygenic risk scores from meta-GWAS summary statistics. Local linkage disequilibrium (LD) is adjusted for in Step 1, followed by, uniquely, long-range LD in Step 2. Our algorithm is highly parallelizable since block-wise analyses in Step 1 can be distributed across a high-performance computing cluster, and flexible, since sparsity and heritability are estimated within each block. Inference is obtained through a formal Bayesian variable selection framework, meaning final risk predictions are averaged over competing models. We compared our method to two alternative approaches: LDPred and lassosum using all seven traits in the Welcome Trust Case Control Consortium as well as meta-GWAS summaries for type 1 diabetes (T1D), coronary artery disease, and schizophrenia. Performance was generally similar across methods, although our framework provided more accurate predictions for T1D, for which there are multiple heterogeneous signals in regions of both short- and long-range LD. With sufficient compute resources, our method also allows the fastest runtimes.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Predisposição Genética para Doença / Herança Multifatorial / Estudo de Associação Genômica Ampla Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Predisposição Genética para Doença / Herança Multifatorial / Estudo de Associação Genômica Ampla Idioma: En Ano de publicação: 2019 Tipo de documento: Article