Machine learning can aid in prediction of IDH mutation from H&E-stained histology slides in infiltrating gliomas.

Liechty, Benjamin; Xu, Zhuoran; Zhang, Zhilu; Slocum, Cheyanne; Bahadir, Cagla D; Sabuncu, Mert R; Pisapia, David J

Liechty, Benjamin; Xu, Zhuoran; Zhang, Zhilu; Slocum, Cheyanne; Bahadir, Cagla D; Sabuncu, Mert R; Pisapia, David J.

Afiliação

Liechty B; Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA.
Xu Z; Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA.
Zhang Z; School of Electrical and Computer Engineering, Cornell University and Cornell Tech, New York, NY, USA.
Slocum C; School of Medicine, Weill Cornell Medicine, New York, NY, USA.
Bahadir CD; Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA.
Sabuncu MR; School of Electrical and Computer Engineering, Cornell University and Cornell Tech, New York, NY, USA. msabuncu@cornell.edu.
Pisapia DJ; Department of Radiology, Weill Cornell Medicine, New York, NY, USA. msabuncu@cornell.edu.

Sci Rep ; 12(1): 22623, 2022 12 31.

Article em En | MEDLINE | ID: mdl-36587030

ABSTRACT

ABSTRACT

While Machine Learning (ML) models have been increasingly applied to a range of histopathology tasks, there has been little emphasis on characterizing these models and contrasting them with human experts. We present a detailed empirical analysis comparing expert neuropathologists and ML models at predicting IDH mutation status in H&E-stained histology slides of infiltrating gliomas, both independently and synergistically. We find that errors made by neuropathologists and ML models trained using the TCGA dataset are distinct, representing modest agreement between predictions (human-vs.-human κ = 0.656; human-vs.-ML model κ = 0.598). While no ML model surpassed human performance on an independent institutional test dataset (human AUC = 0.901, max ML AUC = 0.881), a hybrid model aggregating human and ML predictions demonstrates predictive performance comparable to the consensus of two expert neuropathologists (hybrid classifier AUC = 0.921 vs. two-neuropathologist consensus AUC = 0.920). We also show that models trained at different levels of magnification exhibit different types of errors, supporting the value of aggregation across spatial scales in the ML approach. Finally, we present a detailed interpretation of our multi-scale ML ensemble model which reveals that predictions are driven by human-identifiable features at the patch-level.

Assuntos

Neoplasias Encefálicas; Glioma; Humanos; Neoplasias Encefálicas/genética; Neoplasias Encefálicas/patologia; Imageamento por Ressonância Magnética; Mutação; Isocitrato Desidrogenase/genética; Glioma/genética; Glioma/patologia; Aprendizado de Máquina

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Encefálicas / Glioma Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google