Your browser doesn't support javascript.
loading
Qualitative Evaluation of Common Quantitative Metrics for Clinical Acceptance of Automatic Segmentation: a Case Study on Heart Contouring from CT Images by Deep Learning Algorithms.
van den Oever, L B; van Veldhuizen, W A; Cornelissen, L J; Spoor, D S; Willems, T P; Kramer, G; Stigter, T; Rook, M; Crijns, A P G; Oudkerk, M; Veldhuis, R N J; de Bock, G H; van Ooijen, P M A.
Afiliación
  • van den Oever LB; Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713GZ, Groningen, The Netherlands.
  • van Veldhuizen WA; Department of Surgery, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713GZ, Groningen, The Netherlands.
  • Cornelissen LJ; Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713GZ, Groningen, The Netherlands.
  • Spoor DS; Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713GZ, Groningen, The Netherlands.
  • Willems TP; Department of Radiology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713GZ, Groningen, The Netherlands.
  • Kramer G; Department of Radiology, Martini Hospital, Van Swietenplein 1, 9728 NT, Groningen, The Netherlands.
  • Stigter T; Department of Radiology, Martini Hospital, Van Swietenplein 1, 9728 NT, Groningen, The Netherlands.
  • Rook M; Department of Radiology, Martini Hospital, Van Swietenplein 1, 9728 NT, Groningen, The Netherlands.
  • Crijns APG; Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713GZ, Groningen, The Netherlands.
  • Oudkerk M; Faculty of Medical Sciences, University of Groningen, Groningen, The Netherlands.
  • Veldhuis RNJ; Department of Electrical Engineering, Computer Science and Mathematics, University of Twente, Drienerlolaan 5, 7522 NB, Enschede, The Netherlands.
  • de Bock GH; Department of Epidemiology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713GZ, Groningen, The Netherlands.
  • van Ooijen PMA; Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713GZ, Groningen, The Netherlands. p.m.a.van.ooijen@umcg.nl.
J Digit Imaging ; 35(2): 240-247, 2022 04.
Article en En | MEDLINE | ID: mdl-35083620
ABSTRACT
Organs-at-risk contouring is time consuming and labour intensive. Automation by deep learning algorithms would decrease the workload of radiotherapists and technicians considerably. However, the variety of metrics used for the evaluation of deep learning algorithms make the results of many papers difficult to interpret and compare. In this paper, a qualitative evaluation is done on five established metrics to assess whether their values correlate with clinical usability. A total of 377 CT volumes with heart delineations were randomly selected for training and evaluation. A deep learning algorithm was used to predict the contours of the heart. A total of 101 CT slices from the validation set with the predicted contours were shown to three experienced radiologists. They examined each slice independently whether they would accept or adjust the prediction and if there were (small) mistakes. For each slice, the scores of this qualitative evaluation were then compared with the Sørensen-Dice coefficient (DC), the Hausdorff distance (HD), pixel-wise accuracy, sensitivity and precision. The statistical analysis of the qualitative evaluation and metrics showed a significant correlation. Of the slices with a DC over 0.96 (N = 20) or a 95% HD under 5 voxels (N = 25), no slices were rejected by the readers. Contours with lower DC or higher HD were seen in both rejected and accepted contours. Qualitative evaluation shows that it is difficult to use common quantification metrics as indicator for use in clinic. We might need to change the reporting of quantitative metrics to better reflect clinical acceptance.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Aprendizaje Profundo Tipo de estudio: Etiology_studies / Prognostic_studies / Qualitative_research Límite: Humans Idioma: En Revista: J Digit Imaging Asunto de la revista: DIAGNOSTICO POR IMAGEM / INFORMATICA MEDICA / RADIOLOGIA Año: 2022 Tipo del documento: Article País de afiliación: Países Bajos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Aprendizaje Profundo Tipo de estudio: Etiology_studies / Prognostic_studies / Qualitative_research Límite: Humans Idioma: En Revista: J Digit Imaging Asunto de la revista: DIAGNOSTICO POR IMAGEM / INFORMATICA MEDICA / RADIOLOGIA Año: 2022 Tipo del documento: Article País de afiliación: Países Bajos