Use of the p-values as a size-dependent function to address practical differences when analyzing large datasets.

Gómez-de-Mariscal, Estibaliz; Guerrero, Vanesa; Sneider, Alexandra; Jayatilaka, Hasini; Phillip, Jude M; Wirtz, Denis; Muñoz-Barrutia, Arrate

Gómez-de-Mariscal, Estibaliz; Guerrero, Vanesa; Sneider, Alexandra; Jayatilaka, Hasini; Phillip, Jude M; Wirtz, Denis; Muñoz-Barrutia, Arrate.

Afiliação

Gómez-de-Mariscal E; Bioengineering and Aerospace Engineering Department, Universidad Carlos III de Madrid, 28911, Leganés, Spain.
Guerrero V; Instituto de Investigación Sanitaria Gregorio Marañón, 28007, Madrid, Spain.
Sneider A; Statistics Department, Universidad Carlos III de Madrid, 28903, Getafe, Spain.
Jayatilaka H; Department of Chemical and Biomolecular Engineering, Institute for Nanobiotechnology, The Johns Hopkins University, Baltimore, MD, 21218, USA.
Phillip JM; AtlasXomics Inc., New Haven, CT, 06511, USA.
Wirtz D; Department of Biomedical Engineering, The Johns Hopkins University, Baltimore, MD, 21218, USA.
Muñoz-Barrutia A; Department of Chemical and Biomolecular Engineering, Institute for Nanobiotechnology, The Johns Hopkins University, Baltimore, MD, 21218, USA.

Sci Rep ; 11(1): 20942, 2021 10 22.

Article em En | MEDLINE | ID: mdl-34686696

RESUMO

Biomedical research has come to rely on p-values as a deterministic measure for data-driven decision-making. In the largely extended null hypothesis significance testing for identifying statistically significant differences among groups of observations, a single p-value is computed from sample data. Then, it is routinely compared with a threshold, commonly set to 0.05, to assess the evidence against the hypothesis of having non-significant differences among groups, or the null hypothesis. Because the estimated p-value tends to decrease when the sample size is increased, applying this methodology to datasets with large sample sizes results in the rejection of the null hypothesis, making it not meaningful in this specific situation. We propose a new approach to detect differences based on the dependence of the p-value on the sample size. We introduce new descriptive parameters that overcome the effect of the size in the p-value interpretation in the framework of datasets with large sample sizes, reducing the uncertainty in the decision about the existence of biological differences between the compared experiments. The methodology enables the graphical and quantitative characterization of the differences between the compared experiments guiding the researchers in the decision process. An in-depth study of the methodology is carried out on simulated and experimental data. Code availability at https://github.com/BIIG-UC3M/pMoSS .

Assuntos

Pesquisa Biomédica/métodos; Linhagem Celular Tumoral; Interpretação Estatística de Dados; Conjuntos de Dados como Assunto; Humanos; Probabilidade; Projetos de Pesquisa; Tamanho da Amostra; Incerteza

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Pesquisa Biomédica Idioma: En Ano de publicação: 2021 Tipo de documento: Article País de afiliação: Espanha

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google