Systematic misestimation of machine learning performance in neuroimaging studies of depression.

Flint, Claas; Cearns, Micah; Opel, Nils; Redlich, Ronny; Mehler, David M A; Emden, Daniel; Winter, Nils R; Leenings, Ramona; Eickhoff, Simon B; Kircher, Tilo; Krug, Axel; Nenadic, Igor; Arolt, Volker; Clark, Scott; Baune, Bernhard T; Jiang, Xiaoyi; Dannlowski, Udo; Hahn, Tim

Flint, Claas; Cearns, Micah; Opel, Nils; Redlich, Ronny; Mehler, David M A; Emden, Daniel; Winter, Nils R; Leenings, Ramona; Eickhoff, Simon B; Kircher, Tilo; Krug, Axel; Nenadic, Igor; Arolt, Volker; Clark, Scott; Baune, Bernhard T; Jiang, Xiaoyi; Dannlowski, Udo; Hahn, Tim.

Affiliation

Flint C; Department of Psychiatry, University of Münster, Münster, Germany.
Cearns M; Faculty of Mathematics and Computer Science, University of Münster, Münster, Germany.
Opel N; Discipline of Psychiatry, School of Medicine, University of Adelaide, Adelaide, SA, Australia.
Redlich R; Department of Psychiatry, Melbourne Medical School, The University of Melbourne, Parkville, VIC, Australia.
Mehler DMA; Department of Psychiatry, University of Münster, Münster, Germany.
Emden D; Department of Psychiatry, University of Münster, Münster, Germany.
Winter NR; Department of Psychiatry, University of Münster, Münster, Germany.
Leenings R; Department of Psychiatry, University of Münster, Münster, Germany.
Eickhoff SB; Department of Psychiatry, University of Münster, Münster, Germany.
Kircher T; Department of Psychiatry, University of Münster, Münster, Germany.
Krug A; Institute of Neuroscience and Medicine (INM-7) Research Center Jülich, Jülich, Germany.
Nenadic I; Institute of Systems Neuroscience, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
Arolt V; Department of Psychiatry and Psychotherapy, University of Marburg, Marburg, Germany.
Clark S; Department of Psychiatry and Psychotherapy, University of Marburg, Marburg, Germany.
Baune BT; Department of Psychiatry and Psychotherapy, University of Marburg, Marburg, Germany.
Jiang X; Department of Psychiatry, University of Münster, Münster, Germany.
Dannlowski U; Discipline of Psychiatry, School of Medicine, University of Adelaide, Adelaide, SA, Australia.
Hahn T; Department of Psychiatry, University of Münster, Münster, Germany.

Neuropsychopharmacology ; 46(8): 1510-1517, 2021 07.

Article in En | MEDLINE | ID: mdl-33958703

ABSTRACT

We currently observe a disconcerting phenomenon in machine learning studies in psychiatry: While we would expect larger samples to yield better results due to the availability of more data, larger machine learning studies consistently show much weaker performance than the numerous small-scale studies. Here, we systematically investigated this effect focusing on one of the most heavily studied questions in the field, namely the classification of patients suffering from Major Depressive Disorder (MDD) and healthy controls based on neuroimaging data. Drawing upon structural MRI data from a balanced sample of N = 1868 MDD patients and healthy controls from our recent international Predictive Analytics Competition (PAC), we first trained and tested a classification model on the full dataset which yielded an accuracy of 61%. Next, we mimicked the process by which researchers would draw samples of various sizes (N = 4 to N = 150) from the population and showed a strong risk of misestimation. Specifically, for small sample sizes (N = 20), we observe accuracies of up to 95%. For medium sample sizes (N = 100) accuracies up to 75% were found. Importantly, further investigation showed that sufficiently large test sets effectively protect against performance misestimation whereas larger datasets per se do not. While these results question the validity of a substantial part of the current literature, we outline the relatively low-cost remedy of larger test sets, which is readily available in most cases.

Subject(s)

Depressive Disorder, Major; Depression; Depressive Disorder, Major/diagnostic imaging; Humans; Machine Learning; Magnetic Resonance Imaging; Neuroimaging

Fulltext

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Depressive Disorder, Major Type of study: Prognostic_studies Limits: Humans Language: En Journal: Neuropsychopharmacology Journal subject: NEUROLOGIA / PSICOFARMACOLOGIA Year: 2021 Type: Article Affiliation country: Germany

Fulltext

XML

PubMed Links

Search on Google