Statistical selection of biological models for genome-wide association analyses.
Bi, Wenjian; Kang, Guolian; Pounds, Stanley B.
Affiliation
  • Bi W; Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, TN 38105, USA.
  • Kang G; Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, TN 38105, USA.
  • Pounds SB; Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, TN 38105, USA. Electronic address: stanley.pounds@stjude.org.
Methods; 145: 67-75, 2018-08-01.
Article in English | MEDLINE | ID: mdl-29803781
ABSTRACT
Genome-wide association studies have discovered many biologically important associations of genes with phenotypes. Typically, genome-wide association analyses formally test the association of each genetic feature (SNP, CNV, etc.) with the phenotype of interest and summarize the results with multiplicity-adjusted p-values. However, very small p-values only provide evidence against the null hypothesis of no association without indicating which biological model best explains the observed data. Correctly identifying a specific biological model may improve the scientific interpretation and can be used to select and design a follow-up validation study more effectively. Thus, statistical methodology to identify the correct biological model for a particular genotype-phenotype association can be very useful to investigators. Here, we propose a general statistical method to summarize how accurately each of five biological models (null, additive, dominant, recessive, co-dominant) represents the data observed for each variant in a GWAS. We show that the new method stringently controls the false discovery rate and asymptotically selects the correct biological model. Simulations of two-stage discovery-validation studies show that the new method has these properties and that its validation power is similar to or exceeds that of simple methods that use the same statistical model for all SNPs. Example analyses of three data sets also highlight these advantages of the new method. An R package is freely available at www.stjuderesearch.org/site/depts/biostats/maew.
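
To make the five models named above concrete, the sketch below shows how a genotype (0, 1, or 2 copies of the minor allele) is conventionally turned into regression covariates under each model. This is only an illustration of standard textbook codings, not the authors' method or the interface of their R package. The function names `encode_genotype` and `coding_table` are hypothetical, and treating "co-dominant" as a two-indicator genotypic coding is an assumption, since the abstract does not define it.

```python
# Illustrative sketch: conventional genotype codings for five genetic models.
# Names and the co-dominant coding are assumptions, not taken from the paper.

MODELS = ["null", "additive", "dominant", "recessive", "co-dominant"]


def encode_genotype(minor_allele_count, model):
    """Return the covariate list for one genotype under the given model.

    minor_allele_count is the number of minor-allele copies: 0, 1, or 2.
    """
    if minor_allele_count not in (0, 1, 2):
        raise ValueError("minor_allele_count must be 0, 1, or 2")
    g = minor_allele_count
    if model == "null":
        return []  # genotype has no effect, so no covariate is needed
    if model == "additive":
        return [g]  # effect scales with the number of minor alleles
    if model == "dominant":
        return [1 if g >= 1 else 0]  # one copy is enough for the full effect
    if model == "recessive":
        return [1 if g == 2 else 0]  # two copies are required for the effect
    if model == "co-dominant":
        # separate indicators for heterozygotes and minor-allele homozygotes
        return [1 if g == 1 else 0, 1 if g == 2 else 0]
    raise ValueError(f"unknown model: {model}")


def coding_table():
    """Map each model to its covariates for genotypes 0, 1, and 2."""
    return {m: [encode_genotype(g, m) for g in (0, 1, 2)] for m in MODELS}


if __name__ == "__main__":
    for model, rows in coding_table().items():
        print(model, rows)
```

Under these codings the additive, dominant, and recessive models each use one covariate, while the co-dominant coding uses two, which is why a model-selection step has to account for differing model complexity.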

Full text: 1 | Database: MEDLINE | Main subject: Polymorphism, Genetic / Statistics as Topic / Genome-Wide Association Study / Models, Genetic | Type of study: Risk_factors_studies | Limit: Humans | Language: En | Journal: Methods | Journal subject: BIOCHEMISTRY | Year of publication: 2018 | Document type: Article | Country of affiliation: United States
