Your browser doesn't support javascript.
loading
Evaluation of Artificial Intelligence-Based Gleason Grading Algorithms "in the Wild".
Faryna, Khrystyna; Tessier, Leslie; Retamero, Juan; Bonthu, Saikiran; Samanta, Pranab; Singhal, Nitin; Kammerer-Jacquet, Solene-Florence; Radulescu, Camelia; Agosti, Vittorio; Collin, Alexandre; Farre, Xavier; Fontugne, Jacqueline; Grobholz, Rainer; Hoogland, Agnes Marije; Moreira Leite, Katia Ramos; Oktay, Murat; Polonia, Antonio; Roy, Paromita; Salles, Paulo Guilherme; van der Kwast, Theodorus H; van Ipenburg, Jolique; van der Laak, Jeroen; Litjens, Geert.
Afiliación
  • Faryna K; Radboud University Medical Center, Computational Pathology Group, Nijmegen, The Netherlands. Electronic address: khrystyna.faryna@radboudumc.nl.
  • Tessier L; Radboud University Medical Center, Computational Pathology Group, Nijmegen, The Netherlands.
  • Retamero J; Paige, New York, New York.
  • Bonthu S; AIRA Matrix, Thane, India.
  • Samanta P; AIRA Matrix, Thane, India.
  • Singhal N; AIRA Matrix, Thane, India.
  • Kammerer-Jacquet SF; Department of Pathology, Rennes University Hospital, Rennes, France.
  • Radulescu C; Department of Pathological Anatomy and Cytology, Hopital Foch, Suresnes, France.
  • Agosti V; Department of Medicine and Surgery, University of Brescia, Brescia, Italy.
  • Collin A; Department of Cell and Tissue Pathology, Angers University Hospital Center, Angers, France.
  • Farre X; Public Health Agency of Catalonia, Lleida, Spain.
  • Fontugne J; Department of Pathology, Institut Curie, Saint-Cloud, France.
  • Grobholz R; Institute of Pathology, Cantonal Hospital Aarau, Aarau, Switzerland.
  • Hoogland AM; Department of Pathology, Isala Zwolle, Zwolle, The Netherlands.
  • Moreira Leite KR; Department of Surgery, Faculty of Medicine of the University of Sao Paulo, Sao Paulo, Brazil.
  • Oktay M; Department of Pathology, Memorial Hospitals Group, Istanbul, Turkey.
  • Polonia A; Department of Pathology, Ipatimup, Porto, Portugal.
  • Roy P; Department of Pathology, Tata Medical Center, Kolkata, India.
  • Salles PG; Teaching and Research Center, Instituto Mario Penna, Belo Horizonte, Brazil.
  • van der Kwast TH; Department of Anatomic Pathology, University Health Network and Princess Margaret Cancer Center, Toronto, Canada.
  • van Ipenburg J; Radboud University Medical Center, Computational Pathology Group, Nijmegen, The Netherlands.
  • van der Laak J; Radboud University Medical Center, Computational Pathology Group, Nijmegen, The Netherlands; Center for Medical Image Science and Visualization, Linkoping University, Linkoping, Sweden.
  • Litjens G; Radboud University Medical Center, Computational Pathology Group, Nijmegen, The Netherlands.
Mod Pathol ; 37(11): 100563, 2024 Jul 16.
Article en En | MEDLINE | ID: mdl-39025402
ABSTRACT
The biopsy Gleason score is an important prognostic marker for prostate cancer patients. It is, however, subject to substantial variability among pathologists. Artificial intelligence (AI)-based algorithms employing deep learning have shown their ability to match pathologists' performance in assigning Gleason scores, with the potential to enhance pathologists' grading accuracy. The performance of Gleason AI algorithms in research is mostly reported on common benchmark data sets or within public challenges. In contrast, many commercial algorithms are evaluated in clinical studies, for which data are not publicly released. As commercial AI vendors typically do not publish performance on public benchmarks, comparison between research and commercial AI is difficult. The aims of this study are to evaluate and compare the performance of top-ranked public and commercial algorithms using real-world data. We curated a diverse data set of whole-slide prostate biopsy images through crowdsourcing containing images with a range of Gleason scores and from diverse sources. Predictions were obtained from 5 top-ranked public algorithms from the Prostate cANcer graDe Assessment (PANDA) challenge and 2 commercial Gleason grading algorithms. Additionally, 10 pathologists (A.C., C.R., J.v.I., K.R.M.L., P.R., P.G.S., R.G., S.F.K.J., T.v.d.K., X.F.) evaluated the data set in a reader study. Overall, the pairwise quadratic weighted kappa among pathologists ranged from 0.777 to 0.916. Both public and commercial algorithms showed high agreement with pathologists, with quadratic kappa ranging from 0.617 to 0.900. Commercial algorithms performed on par or outperformed top public algorithms.
Palabras clave

Texto completo: 1 Bases de datos: MEDLINE Idioma: En Revista: Mod Pathol Asunto de la revista: PATOLOGIA Año: 2024 Tipo del documento: Article

Texto completo: 1 Bases de datos: MEDLINE Idioma: En Revista: Mod Pathol Asunto de la revista: PATOLOGIA Año: 2024 Tipo del documento: Article