Your browser doesn't support javascript.
loading
Comparison of Prostate MRI Lesion Segmentation Agreement Between Multiple Radiologists and a Fully Automatic Deep Learning System.
Schelb, Patrick; Tavakoli, Anoshirwan Andrej; Tubtawee, Teeravut; Hielscher, Thomas; Radtke, Jan-Philipp; Görtz, Magdalena; Schütz, Viktoria; Kuder, Tristan Anselm; Schimmöller, Lars; Stenzinger, Albrecht; Hohenfellner, Markus; Schlemmer, Heinz-Peter; Bonekamp, David.
Afiliação
  • Schelb P; Division of Radiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Tavakoli AA; Division of Radiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Tubtawee T; Division of Radiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Hielscher T; Division of Biostatistics, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Radtke JP; Department of Urology, University of Heidelberg Medical Center, Heidelberg, Germany.
  • Görtz M; Department of Urology, University of Heidelberg Medical Center, Heidelberg, Germany.
  • Schütz V; Department of Urology, University of Heidelberg Medical Center, Heidelberg, Germany.
  • Kuder TA; Division of Medical Physics, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Schimmöller L; University Dusseldorf, Medical Faculty, Department of Diagnostic and Interventional Radiology, Dusseldorf, Germany.
  • Stenzinger A; Institute of Pathology, University of Heidelberg Medical Center, Heidelberg, Germany.
  • Hohenfellner M; Department of Urology, University of Heidelberg Medical Center, Heidelberg, Germany.
  • Schlemmer HP; Division of Radiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Bonekamp D; Division of Radiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.
Rofo ; 193(5): 559-573, 2021 May.
Article em En | MEDLINE | ID: mdl-33212541
ABSTRACT

PURPOSE:

A recently developed deep learning model (U-Net) approximated the clinical performance of radiologists in the prediction of clinically significant prostate cancer (sPC) from prostate MRI. Here, we compare the agreement between lesion segmentations by U-Net with manual lesion segmentations performed by different radiologists. MATERIALS AND

METHODS:

165 patients with suspicion for sPC underwent targeted and systematic fusion biopsy following 3 Tesla multiparametric MRI (mpMRI). Five sets of segmentations were generated retrospectively segmentations of clinical lesions, independent segmentations by three radiologists, and fully automated bi-parametric U-Net segmentations. Per-lesion agreement was calculated for each rater by averaging Dice coefficients with all overlapping lesions from other raters. Agreement was compared using descriptive statistics and linear mixed models.

RESULTS:

The mean Dice coefficient for manual segmentations showed only moderate agreement at 0.48-0.52, reflecting the difficult visual task of determining the outline of otherwise jointly detected lesions. U-net segmentations were significantly smaller than manual segmentations (p < 0.0001) and exhibited a lower mean Dice coefficient of 0.22, which was significantly lower compared to manual segmentations (all p < 0.0001). These differences remained after correction for lesion size and were unaffected between sPC and non-sPC lesions and between peripheral and transition zone lesions.

CONCLUSION:

Knowledge of the order of agreement of manual segmentations of different radiologists is important to set the expectation value for artificial intelligence (AI) systems in the task of prostate MRI lesion segmentation. Perfect agreement (Dice coefficient of one) should not be expected for AI. Lower Dice coefficients of U-Net compared to manual segmentations are only partially explained by smaller segmentation sizes and may result from a focus on the lesion core and a small relative lesion center shift. Although it is primarily important that AI detects sPC correctly, the Dice coefficient for overlapping lesions from multiple raters can be used as a secondary measure for segmentation quality in future studies. KEY POINTS · Intermediate human Dice coefficients reflect the difficulty of outlining jointly detected lesions.. · Lower Dice coefficients of deep learning motivate further research to approximate human perception.. · Comparable predictive performance of deep learning appears independent of Dice agreement.. · Dice agreement independent of significant cancer presence indicates indistinguishability of some benign imaging findings.. · Improving DWI to T2 registration may improve the observed U-Net Dice coefficients.. CITATION FORMAT · Schelb P, Tavakoli AA, Tubtawee T et al. Comparison of Prostate MRI Lesion Segmentation Agreement Between Multiple Radiologists and a Fully Automatic Deep Learning System. Fortschr Röntgenstr 2021; 193 559 - 573.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Próstata / Imageamento por Ressonância Magnética / Radiologistas / Aprendizado Profundo Tipo de estudo: Guideline / Observational_studies / Prognostic_studies / Risk_factors_studies Limite: Humans / Male Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Próstata / Imageamento por Ressonância Magnética / Radiologistas / Aprendizado Profundo Tipo de estudo: Guideline / Observational_studies / Prognostic_studies / Risk_factors_studies Limite: Humans / Male Idioma: En Ano de publicação: 2021 Tipo de documento: Article