Your browser doesn't support javascript.
loading
Beyond guilty by association at scale: searching for causal variants on the basis of genome-wide summary statistics.
He, Zihuai; Chu, Benjamin; Yang, James; Gu, Jiaqi; Chen, Zhaomeng; Liu, Linxi; Morrison, Tim; Belloy, Michael E; Qi, Xinran; Hejazi, Nima; Mathur, Maya; Le Guen, Yann; Tang, Hua; Hastie, Trevor; Ionita-Laza, Iuliana; Sabatti, Chiara; Candès, Emmanuel.
  • He Z; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA.
  • Chu B; Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA.
  • Yang J; Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA.
  • Gu J; Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA.
  • Chen Z; Department of Statistics, Stanford University, Stanford, CA 94305, USA.
  • Liu L; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA.
  • Morrison T; Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA.
  • Belloy ME; Department of Statistics, Stanford University, Stanford, CA 94305, USA.
  • Qi X; Department of Statistics, University of Pittsburgh, Pittsburgh, PA 15260, USA.
  • Hejazi N; Department of Statistics, Stanford University, Stanford, CA 94305, USA.
  • Mathur M; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA.
  • Le Guen Y; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA.
  • Tang H; Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA.
  • Hastie T; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA.
  • Ionita-Laza I; Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA.
  • Sabatti C; Department of Pediatrics, Stanford University, Stanford, CA 94305, USA.
  • Candès E; Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA.
bioRxiv ; 2024 May 02.
Article en En | MEDLINE | ID: mdl-38464202
ABSTRACT
Understanding the causal genetic architecture of complex phenotypes is essential for future research into disease mechanisms and potential therapies. Here, we present a novel framework for genome-wide detection of sets of variants that carry non-redundant information on the phenotypes and are therefore more likely to be causal in a biological sense. Crucially, our framework requires only summary statistics obtained from standard genome-wide marginal association testing. The described approach, implemented in open-source software, is also computationally efficient, requiring less than 15 minutes on a single CPU to perform genome-wide analysis. Through extensive genome-wide simulation studies, we show that the method can substantially outperform usual two-stage marginal association testing and fine-mapping procedures in precision and recall. In applications to a meta-analysis of ten large-scale genetic studies of Alzheimer's disease (AD), we identified 82 loci associated with AD, including 37 additional loci missed by conventional GWAS pipeline. The identified putative causal variants achieve state-of-the-art agreement with massively parallel reporter assays and CRISPR-Cas9 experiments. Additionally, we applied the method to a retrospective analysis of 67 large-scale GWAS summary statistics since 2013 for a variety of phenotypes. Results reveal the method's capacity to robustly discover additional loci for polygenic traits and pinpoint potential causal variants underpinning each locus beyond conventional GWAS pipeline, contributing to a deeper understanding of complex genetic architectures in post-GWAS analyses.