Your browser doesn't support javascript.
loading
Better-than-chance classification for signal detection.
Rosenblatt, Jonathan D; Benjamini, Yuval; Gilron, Roee; Mukamel, Roy; Goeman, Jelle J.
  • Rosenblatt JD; Department of IE&M and Zlotowsky Center for Neuroscience, Ben Gurion University of the Negev, P.O. 653, Beer Sheva, 84105 Israel.
  • Benjamini Y; Department of Statistics, Hebrew University, Mount Scopus, Jerusalem 9190501, Israel.
  • Gilron R; Movement Disorders and Neuromodulation Center, University of California, 1635 Divisadero St, San Francisco, CA 94115, USA.
  • Mukamel R; School of Psychological Sciences, and Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv 69978, Israel.
  • Goeman JJ; Department of Biomedical Data Sciences, Leiden University Medical Center, Postbus 9600, 2300 RC Leiden, The Netherlands.
Biostatistics ; 22(2): 365-380, 2021 04 10.
Article en En | MEDLINE | ID: mdl-31612223
ABSTRACT
The estimated accuracy of a classifier is a random quantity with variability. A common practice in supervised machine learning, is thus to test if the estimated accuracy is significantly better than chance level. This method of signal detection is particularly popular in neuroimaging and genetics. We provide evidence that using a classifier's accuracy as a test statistic can be an underpowered strategy for finding differences between populations, compared to a bona fide statistical test. It is also computationally more demanding than a statistical test. Via simulation, we compare test statistics that are based on classification accuracy, to others based on multivariate test statistics. We find that the probability of detecting differences between two distributions is lower for accuracy-based statistics. We examine several candidate causes for the low power of accuracy-tests. These causes include the discrete nature of the accuracy-test statistic, the type of signal accuracy-tests are designed to detect, their inefficient use of the data, and their suboptimal regularization. When the purpose of the analysis is the evaluation of a particular classifier, not signal detection, we suggest several improvements to increase power. In particular, to replace V-fold cross-validation with the Leave-One-Out Bootstrap.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Neuroimagen / Aprendizaje Automático Supervisado Tipo de estudio: Diagnostic_studies / Prognostic_studies Límite: Humans Idioma: En Año: 2021 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Neuroimagen / Aprendizaje Automático Supervisado Tipo de estudio: Diagnostic_studies / Prognostic_studies Límite: Humans Idioma: En Año: 2021 Tipo del documento: Article