Your browser doesn't support javascript.
loading
An efficient resampling method for calibrating single and gene-based rare variant association analysis in case-control studies.
Lee, Seunggeun; Fuchsberger, Christian; Kim, Sehee; Scott, Laura.
Afiliação
  • Lee S; Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA leeshawn@umich.edu.
  • Fuchsberger C; Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.
  • Kim S; Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.
  • Scott L; Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.
Biostatistics ; 17(1): 1-15, 2016 Jan.
Article em En | MEDLINE | ID: mdl-26363037
ABSTRACT
For aggregation tests of genes or regions, the set of included variants often have small total minor allele counts (MACs), and this is particularly true when the most deleterious sets of variants are considered. When MAC is low, commonly used asymptotic tests are not well calibrated for binary phenotypes and can have conservative or anti-conservative results and potential power loss. Empirical p-values obtained via resampling methods are computationally costly for highly significant p-values and the results can be conservative due to the discrete nature of resampling tests. Based on the observation that only the individuals containing minor alleles contribute to the score statistics, we develop an efficient resampling method for single and multiple variant score-based tests that can adjust for covariates. Our method can improve computational efficiency >1000-fold over conventional resampling for low MAC variant sets. We ameliorate the conservativeness of results through the use of mid-p-values. Using the estimated minimum achievable p-value for each test, we calibrate QQ plots and provide an effective number of tests. In analysis of a case-control study with deep exome sequence, we demonstrate that our methods are both well calibrated and also reduce computation time significantly compared with resampling methods.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Variação Genética / Modelos Estatísticos / Análise de Sequência de DNA / Estudos de Associação Genética Tipo de estudo: Observational_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2016 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Variação Genética / Modelos Estatísticos / Análise de Sequência de DNA / Estudos de Associação Genética Tipo de estudo: Observational_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2016 Tipo de documento: Article