Your browser doesn't support javascript.
loading
The reliability of a deep learning model in clinical out-of-distribution MRI data: A multicohort study.
Mårtensson, Gustav; Ferreira, Daniel; Granberg, Tobias; Cavallin, Lena; Oppedal, Ketil; Padovani, Alessandro; Rektorova, Irena; Bonanni, Laura; Pardini, Matteo; Kramberger, Milica G; Taylor, John-Paul; Hort, Jakub; Snædal, Jón; Kulisevsky, Jaime; Blanc, Frederic; Antonini, Angelo; Mecocci, Patrizia; Vellas, Bruno; Tsolaki, Magda; Kloszewska, Iwona; Soininen, Hilkka; Lovestone, Simon; Simmons, Andrew; Aarsland, Dag; Westman, Eric.
Afiliación
  • Mårtensson G; Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden. Electronic address: gustav.martensson@ki.se.
  • Ferreira D; Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden.
  • Granberg T; Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden; Department of Radiology, Karolinska University Hospital, Stockholm, Sweden.
  • Cavallin L; Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden; Department of Radiology, Karolinska University Hospital, Stockholm, Sweden.
  • Oppedal K; Centre for Age-Related Medicine, Stavanger University Hospital, Stavanger, Norway; Stavanger Medical Imaging Laboratory (SMIL), Department of Radiology, Stavanger University Hospital, Stavanger, Norway; Department of Electrical Engineering and Computer Science, University of Stavanger, Stavanger, No
  • Padovani A; Neurology Unit, Department of Clinical and Experimental Sciences, University of Brescia, Brescia, Italy.
  • Rektorova I; 1st Department of Neurology, Medical Faculty, St. Anne's Hospital and CEITEC, Masaryk University, Brno, Czech Republic.
  • Bonanni L; Department of Neuroscience Imaging and Clinical Sciences and CESI, University G d'Annunzio of Chieti-Pescara, Chieti, Italy.
  • Pardini M; Department of Neuroscience (DINOGMI), University of Genoa and Neurology Clinics, Polyclinic San Martino Hospital, Genoa, Italy.
  • Kramberger MG; Department of Neurology, University Medical Centre Ljubljana, Medical faculty, University of Ljubljana, Slovenia.
  • Taylor JP; Institute of Neuroscience, Newcastle University, Newcastle upon Tyne, UK.
  • Hort J; Memory Clinic, Department of Neurology, Charles University, 2nd Faculty of Medicine and Motol University Hospital, Prague, Czech Republic.
  • Snædal J; Landspitali University Hospital, Reykjavik, Iceland.
  • Kulisevsky J; Movement Disorders Unit, Neurology Department, Sant Pau Hospital, Barcelona, Spain; Institut d'Investigacions Biomédiques Sant Pau (IIB-Sant Pau), Barcelona, Spain; Centro de Investigación en Red-Enfermedades Neurodegenerativas (CIBERNED), Barcelona, Spain; Universitat Autónoma de Barcelona (U.A.B.)
  • Blanc F; Day Hospital of Geriatrics, Memory Resource and Research Centre (CM2R) of Strasbourg, Department of Geriatrics, Hôpitaux Universitaires de Strasbourg, Strasbourg, France; University of Strasbourg and French National Centre for Scientific Research (CNRS), ICube Laboratory and Fédération de Médecine T
  • Antonini A; Department of Neuroscience, University of Padua, Padua & Fondazione Ospedale San Camillo, Venezia, Venice, Italy.
  • Mecocci P; Institute of Gerontology and Geriatrics, University of Perugia, Perugia, Italy.
  • Vellas B; UMR INSERM 1027, gerontopole, CHU, University of Toulouse, France.
  • Tsolaki M; 3rd Department of Neurology, Memory and Dementia Unit, Aristotle University of Thessaloniki, Thessaloniki, Greece.
  • Kloszewska I; Medical University of Lodz, Lodz, Poland.
  • Soininen H; Institute of Clinical Medicine, Neurology, University of Eastern Finland, Finland; Neurocenter, Neurology, Kuopio University Hospital, Kuopio, Finland.
  • Lovestone S; Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK.
  • Simmons A; NIHR Biomedical Research Centre for Mental Health, London, UK; NIHR Biomedical Research Unit for Dementia, London, UK; Department of Neuroimaging, Centre for Neuroimaging Sciences, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
  • Aarsland D; Centre for Age-Related Medicine, Stavanger University Hospital, Stavanger, Norway; Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
  • Westman E; Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden; Department of Neuroimaging, Centre for Neuroimaging Sciences, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
Med Image Anal ; 66: 101714, 2020 12.
Article en En | MEDLINE | ID: mdl-33007638
Deep learning (DL) methods have in recent years yielded impressive results in medical imaging, with the potential to function as clinical aid to radiologists. However, DL models in medical imaging are often trained on public research cohorts with images acquired with a single scanner or with strict protocol harmonization, which is not representative of a clinical setting. The aim of this study was to investigate how well a DL model performs in unseen clinical datasets-collected with different scanners, protocols and disease populations-and whether more heterogeneous training data improves generalization. In total, 3117 MRI scans of brains from multiple dementia research cohorts and memory clinics, that had been visually rated by a neuroradiologist according to Scheltens' scale of medial temporal atrophy (MTA), were included in this study. By training multiple versions of a convolutional neural network on different subsets of this data to predict MTA ratings, we assessed the impact of including images from a wider distribution during training had on performance in external memory clinic data. Our results showed that our model generalized well to datasets acquired with similar protocols as the training data, but substantially worse in clinical cohorts with visibly different tissue contrasts in the images. This implies that future DL studies investigating performance in out-of-distribution (OOD) MRI data need to assess multiple external cohorts for reliable results. Further, by including data from a wider range of scanners and protocols the performance improved in OOD data, which suggests that more heterogeneous training data makes the model generalize better. To conclude, this is the most comprehensive study to date investigating the domain shift in deep learning on MRI data, and we advocate rigorous evaluation of DL models on clinical data prior to being certified for deployment.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Aprendizaje Profundo Tipo de estudio: Guideline / Prognostic_studies Límite: Humans Idioma: En Revista: Med Image Anal Asunto de la revista: DIAGNOSTICO POR IMAGEM Año: 2020 Tipo del documento: Article Pais de publicación: Países Bajos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Aprendizaje Profundo Tipo de estudio: Guideline / Prognostic_studies Límite: Humans Idioma: En Revista: Med Image Anal Asunto de la revista: DIAGNOSTICO POR IMAGEM Año: 2020 Tipo del documento: Article Pais de publicación: Países Bajos