Online multiple hypothesis testing.

Robertson, David S; Wason, James M S; Ramdas, Aaditya

Robertson, David S; Wason, James M S; Ramdas, Aaditya.

Afiliação

Robertson DS; MRC Biostatistics Unit, University of Cambridge, Cambridge, UK.
Wason JMS; Population Health Sciences Institute, Newcastle University, Newcastle, UK.
Ramdas A; Departments of Statistics and Machine Learning, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA.

Stat Sci ; 38(4): 557-575, 2023 Nov 01.

Article em En | MEDLINE | ID: mdl-38223302

ABSTRACT

ABSTRACT

Modern data analysis frequently involves large-scale hypothesis testing, which naturally gives rise to the problem of maintaining control of a suitable type I error rate, such as the false discovery rate (FDR). In many biomedical and technological applications, an additional complexity is that hypotheses are tested in an online manner, one-by-one over time. However, traditional procedures that control the FDR, such as the Benjamini-Hochberg procedure, assume that all p-values are available to be tested at a single time point. To address these challenges, a new field of methodology has developed over the past 15 years showing how to control error rates for online multiple hypothesis testing. In this framework, hypotheses arrive in a stream, and at each time point the analyst decides whether to reject the current hypothesis based both on the evidence against it, and on the previous rejection decisions. In this paper, we present a comprehensive exposition of the literature on online error rate control, with a review of key theory as well as a focus on applied examples. We also provide simulation results comparing different online testing algorithms and an up-to-date overview of the many methodological extensions that have been proposed.

Palavras-chave

A/B testing; data repositories; platform trials; type I error rate

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google