Biased accuracy in multisite machine-learning studies due to incomplete removal of the effects of the site.
Solanes, Aleix; Palau, Pol; Fortea, Lydia; Salvador, Raymond; González-Navarro, Laura; Llach, Cristian Daniel; Valentí, Marc; Vieta, Eduard; Radua, Joaquim.
Affiliation
  • Solanes A; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain; Department of Psychiatry and Forensic Medicine, Autonomous University of Barcelona, Barcelona, Spain.
  • Palau P; FIDMAG Research Foundation, Barcelona, Spain; CASM Benito Menni Granollers-Hospital General de Granollers, Barcelona, Spain.
  • Fortea L; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain; Biomedical Network Research Centre on Mental Health (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain; Institute of Neurosciences, University of Barcelona, Barcelona, Spain.
  • Salvador R; FIDMAG Research Foundation, Barcelona, Spain; Biomedical Network Research Centre on Mental Health (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain.
  • González-Navarro L; Faculty of Biology, University of Barcelona, Barcelona, Spain.
  • Llach CD; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain; Biomedical Network Research Centre on Mental Health (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain; Institute of Neurosciences, University of Barcelona, Barcelona, Spain; Barcelona Bipolar Disorders and D
  • Valentí M; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain; Biomedical Network Research Centre on Mental Health (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain; Institute of Neurosciences, University of Barcelona, Barcelona, Spain; Barcelona Bipolar Disorders and D
  • Vieta E; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain; Biomedical Network Research Centre on Mental Health (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain; Institute of Neurosciences, University of Barcelona, Barcelona, Spain; Barcelona Bipolar Disorders and D
  • Radua J; Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain; Biomedical Network Research Centre on Mental Health (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain; Department of Psychosis Studies, Institute of Psychiatry, Psychology, and Neuroscience, King's College L
Psychiatry Res Neuroimaging; 314: 111313, 2021 Aug 30.
Article in English | MEDLINE | ID: mdl-34098248
ABSTRACT
Brain MRI researchers conducting multisite studies, such as those within the ENIGMA Consortium, are well aware of the importance of controlling for the effects of the site (EoS) in the statistical analysis. In contrast, authors of recent machine-learning MRI studies may remove the EoS when training the machine-learning models but fail to control for them when estimating the models' accuracy, potentially leading to severely biased estimates. We show examples from a toy simulation study and from real MRI data in which we remove the EoS from both the "training set" and the "test set" during the training and application of the model, yet the accuracy is still inflated (or occasionally deflated) unless we further control for the EoS during the estimation of the accuracy. We also provide several methods for controlling for the EoS during the estimation of the accuracy, and a simple R package ("multisite.accuracy") that performs this task for several accuracy estimates (e.g., sensitivity/specificity, area under the curve, correlation, and hazard ratio).
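
The general problem described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' simulation and it does not use or reproduce the "multisite.accuracy" R package; the two sites, their patient proportions, the additive site offset, and all function names are assumptions made for illustration. The predictor carries no true signal, only a residual site offset, and the two sites differ in the proportion of patients. The sketch compares an area under the curve (AUC) computed on the pooled sample with one simple way of controlling for site, namely computing the AUC within each site and averaging.

```python
import random


def auc(scores, labels):
    """Area under the ROC curve as the proportion of (patient, control)
    pairs in which the patient has the higher score (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def simulate(n_per_site=400, seed=0):
    """Return (site, score, label) rows for two hypothetical sites.

    Assumed settings: site A has 80% patients and a +1 offset in the
    score; site B has 20% patients and no offset. The score is otherwise
    pure noise, so it has no true diagnostic value.
    """
    rng = random.Random(seed)
    sites = [("A", 0.8, 1.0), ("B", 0.2, 0.0)]
    rows = []
    for name, prevalence, offset in sites:
        for _ in range(n_per_site):
            label = 1 if rng.random() < prevalence else 0
            score = rng.gauss(0.0, 1.0) + offset
            rows.append((name, score, label))
    return rows


def pooled_auc(rows):
    """AUC ignoring site: every patient is compared with every control."""
    return auc([r[1] for r in rows], [r[2] for r in rows])


def site_stratified_auc(rows):
    """Sample-size-weighted mean of the AUCs computed within each site,
    so patients are only compared with controls from the same site."""
    weighted_sum = 0.0
    total = 0
    for site in sorted({r[0] for r in rows}):
        subset = [r for r in rows if r[0] == site]
        site_auc = auc([r[1] for r in subset], [r[2] for r in subset])
        weighted_sum += len(subset) * site_auc
        total += len(subset)
    return weighted_sum / total


rows = simulate()
print("pooled AUC:         ", round(pooled_auc(rows), 3))
print("site-stratified AUC:", round(site_stratified_auc(rows), 3))
```

Under these assumed settings the pooled AUC lies well above 0.5 even though the score is uninformative within each site, whereas the site-stratified AUC stays close to chance. Within-site averaging is only one possible form of control, chosen here for brevity.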

Full text: 1 | Collection: 01-internacional | Database: MEDLINE | Main subject: Magnetic Resonance Imaging / Machine Learning | Type of study: Diagnostic_studies / Prognostic_studies | Limits: Humans | Language: En | Journal: Psychiatry Res Neuroimaging | Year: 2021 | Document type: Article | Affiliation country: Spain