Your browser doesn't support javascript.
loading
Priors, population sizes, and power in genome-wide hypothesis tests.
Cai, Jitong; Zhan, Jianan; Arking, Dan E; Bader, Joel S.
Afiliação
  • Cai J; Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA.
  • Zhan J; Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA.
  • Arking DE; Department of Genetic Medicine, Johns Hopkins University, Baltimore, MD, 21218, USA.
  • Bader JS; Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA. joel.bader@jhu.edu.
BMC Bioinformatics ; 24(1): 170, 2023 Apr 26.
Article em En | MEDLINE | ID: mdl-37101120
BACKGROUND: Genome-wide tests, including genome-wide association studies (GWAS) of germ-line genetic variants, driver tests of cancer somatic mutations, and transcriptome-wide association tests of RNAseq data, carry a high multiple testing burden. This burden can be overcome by enrolling larger cohorts or alleviated by using prior biological knowledge to favor some hypotheses over others. Here we compare these two methods in terms of their abilities to boost the power of hypothesis testing. RESULTS: We provide a quantitative estimate for progress in cohort sizes and present a theoretical analysis of the power of oracular hard priors: priors that select a subset of hypotheses for testing, with an oracular guarantee that all true positives are within the tested subset. This theory demonstrates that for GWAS, strong priors that limit testing to 100-1000 genes provide less power than typical annual 20-40% increases in cohort sizes. Furthermore, non-oracular priors that exclude even a small fraction of true positives from the tested set can perform worse than not using a prior at all. CONCLUSION: Our results provide a theoretical explanation for the continued dominance of simple, unbiased univariate hypothesis tests for GWAS: if a statistical question can be answered by larger cohort sizes, it should be answered by larger cohort sizes rather than by more complicated biased methods involving priors. We suggest that priors are better suited for non-statistical aspects of biology, such as pathway structure and causality, that are not yet easily captured by standard hypothesis tests.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Polimorfismo de Nucleotídeo Único / Estudo de Associação Genômica Ampla Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Polimorfismo de Nucleotídeo Único / Estudo de Associação Genômica Ampla Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article