Your browser doesn't support javascript.
loading
Enrichment or depletion of a GO category within a class of genes: which test?
Rivals, Isabelle; Personnaz, Léon; Taing, Lieng; Potier, Marie-Claude.
Afiliación
  • Rivals I; Equipe de Statistique Appliquée, 10 rue Vauquelin, 75005 Paris, France. isabelle.rivals@espci.fr
Bioinformatics ; 23(4): 401-7, 2007 Feb 15.
Article en En | MEDLINE | ID: mdl-17182697
MOTIVATION: A number of available program packages determine the significant enrichments and/or depletions of GO categories among a class of genes of interest. Whereas a correct formulation of the problem leads to a single exact null distribution, these GO tools use a large variety of statistical tests whose denominations often do not clarify the underlying P-value computations. SUMMARY: We review the different formulations of the problem and the tests they lead to: the binomial, chi2, equality of two probabilities, Fisher's exact and hypergeometric tests. We clarify the relationships existing between these tests, in particular the equivalence between the hypergeometric test and Fisher's exact test. We recall that the other tests are valid only for large samples, the test of equality of two probabilities and the chi2-test being equivalent. We discuss the appropriateness of one- and two-sided P-values, as well as some discreteness and conservatism issues. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Buscar en Google
Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Familia de Multigenes / Almacenamiento y Recuperación de la Información / Análisis de Secuencia por Matrices de Oligonucleótidos / Perfilación de la Expresión Génica / Bases de Datos de Proteínas Tipo de estudio: Evaluation_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2007 Tipo del documento: Article País de afiliación: Francia
Buscar en Google
Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Familia de Multigenes / Almacenamiento y Recuperación de la Información / Análisis de Secuencia por Matrices de Oligonucleótidos / Perfilación de la Expresión Génica / Bases de Datos de Proteínas Tipo de estudio: Evaluation_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2007 Tipo del documento: Article País de afiliación: Francia
...