Your browser doesn't support javascript.
loading
Beyond rankings: Learning (more) from algorithm validation.
Roß, Tobias; Bruno, Pierangela; Reinke, Annika; Wiesenfarth, Manuel; Koeppel, Lisa; Full, Peter M; Pekdemir, Bünyamin; Godau, Patrick; Trofimova, Darya; Isensee, Fabian; Adler, Tim J; Tran, Thuy N; Moccia, Sara; Calimeri, Francesco; Müller-Stich, Beat P; Kopp-Schneider, Annette; Maier-Hein, Lena.
Afiliação
  • Roß T; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany; Medical Faculty, Heidelberg University, Heidelberg, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany. Electronic address: t.ross@dkfz-heidelberg.de.
  • Bruno P; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany; Department of Mathematics and Computer Science, University of Calabria, Rende, Italy.
  • Reinke A; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Mathematics and Computer Science, Heidelberg University, Germany.
  • Wiesenfarth M; Division of Biostatistics, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Koeppel L; Section Clinical Tropical Medicine, Heidelberg University, Heidelberg, Germany.
  • Full PM; Medical Faculty, Heidelberg University, Heidelberg, Germany; Division of Medical Image Computing (MIC), German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Pekdemir B; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Godau P; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Mathematics and Computer Science, Heidelberg University, Germany.
  • Trofimova D; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany; HIP Applied Computer Vision Lab, MIC, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Isensee F; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; Division of Medical Image Computing (MIC), German Cancer Research Center (DKFZ), Heidelberg, Germany; HIP Applied Computer Vision Lab, MIC, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Adler TJ; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Tran TN; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Moccia S; The BioRobotics Institute and Department of Excellence in Robotics and AI, Scuola Superiore Sant'Anna, Italy.
  • Calimeri F; Department of Mathematics and Computer Science, University of Calabria, Rende, Italy.
  • Müller-Stich BP; Department for General, Visceral and Transplantation Surgery, Heidelberg University Hospital, Heidelberg, Germany.
  • Kopp-Schneider A; Division of Biostatistics, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Maier-Hein L; Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ), Heidelberg, Germany; Medical Faculty, Heidelberg University, Heidelberg, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Mathematics and Computer Science, Heidelberg University
Med Image Anal ; 86: 102765, 2023 05.
Article em En | MEDLINE | ID: mdl-36965252
ABSTRACT
Challenges have become the state-of-the-art approach to benchmark image analysis algorithms in a comparative manner. While the validation on identical data sets was a great step forward, results analysis is often restricted to pure ranking tables, leaving relevant questions unanswered. Specifically, little effort has been put into the systematic investigation on what characterizes images in which state-of-the-art algorithms fail. To address this gap in the literature, we (1) present a statistical framework for learning from challenges and (2) instantiate it for the specific task of instrument instance segmentation in laparoscopic videos. Our framework relies on the semantic meta data annotation of images, which serves as foundation for a General Linear Mixed Models (GLMM) analysis. Based on 51,542 meta data annotations performed on 2,728 images, we applied our approach to the results of the Robust Medical Instrument Segmentation Challenge (ROBUST-MIS) challenge 2019 and revealed underexposure, motion and occlusion of instruments as well as the presence of smoke or other objects in the background as major sources of algorithm failure. Our subsequent method development, tailored to the specific remaining issues, yielded a deep learning model with state-of-the-art overall performance and specific strengths in the processing of images in which previous methods tended to fail. Due to the objectivity and generic applicability of our approach, it could become a valuable tool for validation in the field of medical image analysis and beyond.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Laparoscopia Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Laparoscopia Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article