Search | VHL Regional Portal

1.

Using histopathology latent diffusion models as privacy-preserving dataset augmenters improves downstream classification performance.

Niehues, Jan M; Müller-Franzes, Gustav; Schirris, Yoni; Wagner, Sophia Janine; Jendrusch, Michael; Kloor, Matthias; Pearson, Alexander T; Muti, Hannah Sophie; Hewitt, Katherine J; Veldhuizen, Gregory P; Zigutyte, Laura; Truhn, Daniel; Kather, Jakob Nikolas.

Comput Biol Med ; 175: 108410, 2024 Jun.

Article in English | MEDLINE | ID: mdl-38678938

ABSTRACT

Latent diffusion models (LDMs) have emerged as a state-of-the-art image generation method, outperforming previous Generative Adversarial Networks (GANs) in terms of training stability and image quality. In computational pathology, generative models are valuable for data sharing and data augmentation. However, the impact of LDM-generated images on histopathology tasks compared to traditional GANs has not been systematically studied. We trained three LDMs and a styleGAN2 model on histology tiles from nine colorectal cancer (CRC) tissue classes. The LDMs include 1) a fine-tuned version of stable diffusion v1.4, 2) a Kullback-Leibler (KL)-autoencoder (KLF8-DM), and 3) a vector quantized (VQ)-autoencoder deploying LDM (VQF8-DM). We assessed image quality through expert ratings, dimensional reduction methods, distribution similarity measures, and their impact on training a multiclass tissue classifier. Additionally, we investigated image memorization in the KLF8-DM and styleGAN2 models. All models provided a high image quality, with the KLF8-DM achieving the best Frechet Inception Distance (FID) and expert rating scores for complex tissue classes. For simpler classes, the VQF8-DM and styleGAN2 models performed better. Image memorization was negligible for both styleGAN2 and KLF8-DM models. Classifiers trained on a mix of KLF8-DM generated and real images achieved a 4% improvement in overall classification accuracy, highlighting the usefulness of these images for dataset augmentation. Our systematic study of generative methods showed that KLF8-DM produces the highest quality images with negligible image memorization. The higher classifier performance in the generatively augmented dataset suggests that this augmentation technique can be employed to enhance histopathology classifiers for various tasks.

Subject(s)

Colorectal Neoplasms , Humans , Colorectal Neoplasms/pathology , Colorectal Neoplasms/diagnostic imaging , Image Interpretation, Computer-Assisted/methods , Image Processing, Computer-Assisted/methods , Algorithms

2.

End-to-end prognostication in colorectal cancer by deep learning: a retrospective, multicentre study.

Jiang, Xiaofeng; Hoffmeister, Michael; Brenner, Hermann; Muti, Hannah Sophie; Yuan, Tanwei; Foersch, Sebastian; West, Nicholas P; Brobeil, Alexander; Jonnagaddala, Jitendra; Hawkins, Nicholas; Ward, Robyn L; Brinker, Titus J; Saldanha, Oliver Lester; Ke, Jia; Müller, Wolfram; Grabsch, Heike I; Quirke, Philip; Truhn, Daniel; Kather, Jakob Nikolas.

Lancet Digit Health ; 6(1): e33-e43, 2024 Jan.

Article in English | MEDLINE | ID: mdl-38123254

ABSTRACT

BACKGROUND: Precise prognosis prediction in patients with colorectal cancer (ie, forecasting survival) is pivotal for individualised treatment and care. Histopathological tissue slides of colorectal cancer specimens contain rich prognostically relevant information. However, existing studies do not have multicentre external validation with real-world sample processing protocols, and algorithms are not yet widely used in clinical routine. METHODS: In this retrospective, multicentre study, we collected tissue samples from four groups of patients with resected colorectal cancer from Australia, Germany, and the USA. We developed and externally validated a deep learning-based prognostic-stratification system for automatic prediction of overall and cancer-specific survival in patients with resected colorectal cancer. We used the model-predicted risk scores to stratify patients into different risk groups and compared survival outcomes between these groups. Additionally, we evaluated the prognostic value of these risk groups after adjusting for established prognostic variables. FINDINGS: We trained and validated our model on a total of 4428 patients. We found that patients could be divided into high-risk and low-risk groups on the basis of the deep learning-based risk score. On the internal test set, the group with a high-risk score had a worse prognosis than the group with a low-risk score, as reflected by a hazard ratio (HR) of 4·50 (95% CI 3·33-6·09) for overall survival and 8·35 (5·06-13·78) for disease-specific survival (DSS). We found consistent performance across three large external test sets. In a test set of 1395 patients, the high-risk group had a lower DSS than the low-risk group, with an HR of 3·08 (2·44-3·89). In two additional test sets, the HRs for DSS were 2·23 (1·23-4·04) and 3·07 (1·78-5·3). We showed that the prognostic value of the deep learning-based risk score is independent of established clinical risk factors. INTERPRETATION: Our findings indicate that attention-based self-supervised deep learning can robustly offer a prognosis on clinical outcomes in patients with colorectal cancer, generalising across different populations and serving as a potentially new prognostic tool in clinical decision making for colorectal cancer management. We release all source codes and trained models under an open-source licence, allowing other researchers to reuse and build upon our work. FUNDING: The German Federal Ministry of Health, the Max-Eder-Programme of German Cancer Aid, the German Federal Ministry of Education and Research, the German Academic Exchange Service, and the EU.

Subject(s)

Colorectal Neoplasms , Deep Learning , Humans , Retrospective Studies , Prognosis , Risk Factors , Colorectal Neoplasms/diagnosis , Colorectal Neoplasms/pathology

3.

Direct image to subtype prediction for brain tumors using deep learning.

Hewitt, Katherine J; Löffler, Chiara M L; Muti, Hannah Sophie; Berghoff, Anna Sophie; Eisenlöffel, Christian; van Treeck, Marko; Carrero, Zunamys I; El Nahhas, Omar S M; Veldhuizen, Gregory P; Weil, Sophie; Saldanha, Oliver Lester; Bejan, Laura; Millner, Thomas O; Brandner, Sebastian; Brückmann, Sascha; Kather, Jakob Nikolas.

Neurooncol Adv ; 5(1): vdad139, 2023.

Article in English | MEDLINE | ID: mdl-38106649

ABSTRACT

Background: Deep Learning (DL) can predict molecular alterations of solid tumors directly from routine histopathology slides. Since the 2021 update of the World Health Organization (WHO) diagnostic criteria, the classification of brain tumors integrates both histopathological and molecular information. We hypothesize that DL can predict molecular alterations as well as WHO subtyping of brain tumors from hematoxylin and eosin-stained histopathology slides. Methods: We used weakly supervised DL and applied it to three large cohorts of brain tumor samples, comprising Nâ=â2845 patients. Results: We found that the key molecular alterations for subtyping, IDH and ATRX, as well as 1p19q codeletion, were predictable from histology with an area under the receiver operating characteristic curve (AUROC) of 0.95, 0.90, and 0.80 in the training cohort, respectively. These findings were upheld in external validation cohorts with AUROCs of 0.90, 0.79, and 0.87 for prediction of IDH, ATRX, and 1p19q codeletion, respectively. Conclusions: In the future, such DL-based implementations could ease diagnostic workflows, particularly for situations in which advanced molecular testing is not readily available.

4.

The future landscape of large language models in medicine.

Clusmann, Jan; Kolbinger, Fiona R; Muti, Hannah Sophie; Carrero, Zunamys I; Eckardt, Jan-Niklas; Laleh, Narmin Ghaffari; Löffler, Chiara Maria Lavinia; Schwarzkopf, Sophie-Caroline; Unger, Michaela; Veldhuizen, Gregory P; Wagner, Sophia J; Kather, Jakob Nikolas.

Commun Med (Lond) ; 3(1): 141, 2023 Oct 10.

Article in English | MEDLINE | ID: mdl-37816837

ABSTRACT

Large language models (LLMs) are artificial intelligence (AI) tools specifically trained to process and generate text. LLMs attracted substantial public attention after OpenAI's ChatGPT was made publicly available in November 2022. LLMs can often answer questions, summarize, paraphrase and translate text on a level that is nearly indistinguishable from human capabilities. The possibility to actively interact with models like ChatGPT makes LLMs attractive tools in various fields, including medicine. While these models have the potential to democratize medical knowledge and facilitate access to healthcare, they could equally distribute misinformation and exacerbate scientific misconduct due to a lack of accountability and transparency. In this article, we provide a systematic and comprehensive overview of the potentials and limitations of LLMs in clinical practice, medical research and medical education.

5.

Deep learning-based subtyping of gastric cancer histology predicts clinical outcome: a multi-institutional retrospective study.

Veldhuizen, Gregory Patrick; Röcken, Christoph; Behrens, Hans-Michael; Cifci, Didem; Muti, Hannah Sophie; Yoshikawa, Takaki; Arai, Tomio; Oshima, Takashi; Tan, Patrick; Ebert, Matthias P; Pearson, Alexander T; Calderaro, Julien; Grabsch, Heike I; Kather, Jakob Nikolas.

Gastric Cancer ; 26(5): 708-720, 2023 09.

Article in English | MEDLINE | ID: mdl-37269416

ABSTRACT

INTRODUCTION: The Laurén classification is widely used for Gastric Cancer (GC) histology subtyping. However, this classification is prone to interobserver variability and its prognostic value remains controversial. Deep Learning (DL)-based assessment of hematoxylin and eosin (H&E) stained slides is a potentially useful tool to provide an additional layer of clinically relevant information, but has not been systematically assessed in GC. OBJECTIVE: We aimed to train, test and externally validate a deep learning-based classifier for GC histology subtyping using routine H&E stained tissue sections from gastric adenocarcinomas and to assess its potential prognostic utility. METHODS: We trained a binary classifier on intestinal and diffuse type GC whole slide images for a subset of the TCGA cohort (N = 166) using attention-based multiple instance learning. The ground truth of 166 GC was obtained by two expert pathologists. We deployed the model on two external GC patient cohorts, one from Europe (N = 322) and one from Japan (N = 243). We assessed classification performance using the Area Under the Receiver Operating Characteristic Curve (AUROC) and prognostic value (overall, cancer specific and disease free survival) of the DL-based classifier with uni- and multivariate Cox proportional hazard models and Kaplan-Meier curves with log-rank test statistics. RESULTS: Internal validation using the TCGA GC cohort using five-fold cross-validation achieved a mean AUROC of 0.93 ± 0.07. External validation showed that the DL-based classifier can better stratify GC patients' 5-year survival compared to pathologist-based Laurén classification for all survival endpoints, despite frequently divergent model-pathologist classifications. Univariate overall survival Hazard Ratios (HRs) of pathologist-based Laurén classification (diffuse type versus intestinal type) were 1.14 (95% Confidence Interval (CI) 0.66-1.44, p-value = 0.51) and 1.23 (95% CI 0.96-1.43, p-value = 0.09) in the Japanese and European cohorts, respectively. DL-based histology classification resulted in HR of 1.46 (95% CI 1.18-1.65, p-value < 0.005) and 1.41 (95% CI 1.20-1.57, p-value < 0.005), in the Japanese and European cohorts, respectively. In diffuse type GC (as defined by the pathologist), classifying patients using the DL diffuse and intestinal classifications provided a superior survival stratification, and demonstrated statistically significant survival stratification when combined with pathologist classification for both the Asian (overall survival log-rank test p-value < 0.005, HR 1.43 (95% CI 1.05-1.66, p-value = 0.03) and European cohorts (overall survival log-rank test p-value < 0.005, HR 1.56 (95% CI 1.16-1.76, p-value < 0.005)). CONCLUSION: Our study shows that gastric adenocarcinoma subtyping using pathologist's Laurén classification as ground truth can be performed using current state of the art DL techniques. Patient survival stratification seems to be better by DL-based histology typing compared with expert pathologist histology typing. DL-based GC histology typing has potential as an aid in subtyping. Further investigations are warranted to fully understand the underlying biological mechanisms for the improved survival stratification despite apparent imperfect classification by the DL algorithm.

Subject(s)

Adenocarcinoma , Deep Learning , Stomach Neoplasms , Humans , Stomach Neoplasms/pathology , Retrospective Studies , Prognosis , Proportional Hazards Models , Adenocarcinoma/pathology

6.

Direct prediction of Homologous Recombination Deficiency from routine histology in ten different tumor types with attention-based Multiple Instance Learning: a development and validation study.

Loeffler, Chiara Maria Lavinia; El Nahhas, Omar S M; Muti, Hannah Sophie; Seibel, Tobias; Cifci, Didem; van Treeck, Marko; Gustav, Marco; Carrero, Zunamys I; Gaisa, Nadine T; Lehmann, Kjong-Van; Leary, Alexandra; Selenica, Pier; Reis-Filho, Jorge S; Bruechle, Nadina Ortiz; Kather, Jakob Nikolas.

medRxiv ; 2023 Mar 10.

Article in English | MEDLINE | ID: mdl-36945540

ABSTRACT

Background: Homologous Recombination Deficiency (HRD) is a pan-cancer predictive biomarker that identifies patients who benefit from therapy with PARP inhibitors (PARPi). However, testing for HRD is highly complex. Here, we investigated whether Deep Learning can predict HRD status solely based on routine Hematoxylin & Eosin (H&E) histology images in ten cancer types. Methods: We developed a fully automated deep learning pipeline with attention-weighted multiple instance learning (attMIL) to predict HRD status from histology images. A combined genomic scar HRD score, which integrated loss of heterozygosity (LOH), telomeric allelic imbalance (TAI) and large-scale state transitions (LST) was calculated from whole genome sequencing data for n=4,565 patients from two independent cohorts. The primary statistical endpoint was the Area Under the Receiver Operating Characteristic curve (AUROC) for the prediction of genomic scar HRD with a clinically used cutoff value. Results: We found that HRD status is predictable in tumors of the endometrium, pancreas and lung, reaching cross-validated AUROCs of 0.79, 0.58 and 0.66. Predictions generalized well to an external cohort with AUROCs of 0.93, 0.81 and 0.73 respectively. Additionally, an HRD classifier trained on breast cancer yielded an AUROC of 0.78 in internal validation and was able to predict HRD in endometrial, prostate and pancreatic cancer with AUROCs of 0.87, 0.84 and 0.67 indicating a shared HRD-like phenotype is across tumor entities. Conclusion: In this study, we show that HRD is directly predictable from H&E slides using attMIL within and across ten different tumor types.

7.

Direct prediction of genetic aberrations from pathology images in gastric cancer with swarm learning.

Saldanha, Oliver Lester; Muti, Hannah Sophie; Grabsch, Heike I; Langer, Rupert; Dislich, Bastian; Kohlruss, Meike; Keller, Gisela; van Treeck, Marko; Hewitt, Katherine Jane; Kolbinger, Fiona R; Veldhuizen, Gregory Patrick; Boor, Peter; Foersch, Sebastian; Truhn, Daniel; Kather, Jakob Nikolas.

Gastric Cancer ; 26(2): 264-274, 2023 03.

Article in English | MEDLINE | ID: mdl-36264524

ABSTRACT

BACKGROUND: Computational pathology uses deep learning (DL) to extract biomarkers from routine pathology slides. Large multicentric datasets improve performance, but such datasets are scarce for gastric cancer. This limitation could be overcome by Swarm Learning (SL). METHODS: Here, we report the results of a multicentric retrospective study of SL for prediction of molecular biomarkers in gastric cancer. We collected tissue samples with known microsatellite instability (MSI) and Epstein-Barr Virus (EBV) status from four patient cohorts from Switzerland, Germany, the UK and the USA, storing each dataset on a physically separate computer. RESULTS: On an external validation cohort, the SL-based classifier reached an area under the receiver operating curve (AUROC) of 0.8092 (± 0.0132) for MSI prediction and 0.8372 (± 0.0179) for EBV prediction. The centralized model, which was trained on all datasets on a single computer, reached a similar performance. CONCLUSIONS: Our findings demonstrate the feasibility of SL-based molecular biomarkers in gastric cancer. In the future, SL could be used for collaborative training and, thus, improve the performance of these biomarkers. This may ultimately result in clinical-grade performance and generalizability.

Subject(s)

Epstein-Barr Virus Infections , Stomach Neoplasms , Humans , Herpesvirus 4, Human/genetics , Retrospective Studies , Stomach Neoplasms/pathology , Microsatellite Instability , Biomarkers, Tumor/genetics

8.

Erratum to 'Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology' Medical Image Analysis, Volume 79, July 2022, 102474.

Ghaffari Laleh, Narmin; Muti, Hannah Sophie; Loeffler, Chiara Maria Lavinia; Echle, Amelie; Saldanha, Oliver Lester; Mahmood, Faisal; Lu, Ming Y; Trautwein, Christian; Langer, Rupert; Dislich, Bastian; Buelow, Roman D; Grabsch, Heike Irmgard; Brenner, Hermann; Chang-Claude, Jenny; Alwers, Elizabeth; Brinker, Titus J; Khader, Firas; Truhn, Daniel; Gaisa, Nadine T; Boor, Peter; Hoffmeister, Michael; Schulz, Volkmar; Kather, Jakob Nikolas.

Med Image Anal ; 82: 102622, 2022 Nov.

Article in English | MEDLINE | ID: mdl-36130464

9.

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology.

Ghaffari Laleh, Narmin; Muti, Hannah Sophie; Loeffler, Chiara Maria Lavinia; Echle, Amelie; Saldanha, Oliver Lester; Mahmood, Faisal; Lu, Ming Y; Trautwein, Christian; Langer, Rupert; Dislich, Bastian; Buelow, Roman D; Grabsch, Heike Irmgard; Brenner, Hermann; Chang-Claude, Jenny; Alwers, Elizabeth; Brinker, Titus J; Khader, Firas; Truhn, Daniel; Gaisa, Nadine T; Boor, Peter; Hoffmeister, Michael; Schulz, Volkmar; Kather, Jakob Nikolas.

Med Image Anal ; 79: 102474, 2022 07.

Article in English | MEDLINE | ID: mdl-35588568

ABSTRACT

Artificial intelligence (AI) can extract visual information from histopathological slides and yield biological insight and clinical biomarkers. Whole slide images are cut into thousands of tiles and classification problems are often weakly-supervised: the ground truth is only known for the slide, not for every single tile. In classical weakly-supervised analysis pipelines, all tiles inherit the slide label while in multiple-instance learning (MIL), only bags of tiles inherit the label. However, it is still unclear how these widely used but markedly different approaches perform relative to each other. We implemented and systematically compared six methods in six clinically relevant end-to-end prediction tasks using data from N=2980 patients for training with rigorous external validation. We tested three classical weakly-supervised approaches with convolutional neural networks and vision transformers (ViT) and three MIL-based approaches with and without an additional attention module. Our results empirically demonstrate that histological tumor subtyping of renal cell carcinoma is an easy task in which all approaches achieve an area under the receiver operating curve (AUROC) of above 0.9. In contrast, we report significant performance differences for clinically relevant tasks of mutation prediction in colorectal, gastric, and bladder cancer. In these mutation prediction tasks, classical weakly-supervised workflows outperformed MIL-based weakly-supervised methods for mutation prediction, which is surprising given their simplicity. This shows that new end-to-end image analysis pipelines in computational pathology should be compared to classical weakly-supervised methods. Also, these findings motivate the development of new methods which combine the elegant assumptions of MIL with the empirically observed higher performance of classical weakly-supervised approaches. We make all source codes publicly available at https://github.com/KatherLab/HIA, allowing easy application of all methods to any similar task.

Subject(s)

Deep Learning , Artificial Intelligence , Benchmarking , Humans , Neural Networks, Computer , Supervised Machine Learning

10.

Swarm learning for decentralized artificial intelligence in cancer histopathology.

Saldanha, Oliver Lester; Quirke, Philip; West, Nicholas P; James, Jacqueline A; Loughrey, Maurice B; Grabsch, Heike I; Salto-Tellez, Manuel; Alwers, Elizabeth; Cifci, Didem; Ghaffari Laleh, Narmin; Seibel, Tobias; Gray, Richard; Hutchins, Gordon G A; Brenner, Hermann; van Treeck, Marko; Yuan, Tanwei; Brinker, Titus J; Chang-Claude, Jenny; Khader, Firas; Schuppert, Andreas; Luedde, Tom; Trautwein, Christian; Muti, Hannah Sophie; Foersch, Sebastian; Hoffmeister, Michael; Truhn, Daniel; Kather, Jakob Nikolas.

Nat Med ; 28(6): 1232-1239, 2022 06.

Article in English | MEDLINE | ID: mdl-35469069

ABSTRACT

Artificial intelligence (AI) can predict the presence of molecular alterations directly from routine histopathology slides. However, training robust AI systems requires large datasets for which data collection faces practical, ethical and legal obstacles. These obstacles could be overcome with swarm learning (SL), in which partners jointly train AI models while avoiding data transfer and monopolistic data governance. Here, we demonstrate the successful use of SL in large, multicentric datasets of gigapixel histopathology images from over 5,000 patients. We show that AI models trained using SL can predict BRAF mutational status and microsatellite instability directly from hematoxylin and eosin (H&E)-stained pathology slides of colorectal cancer. We trained AI models on three patient cohorts from Northern Ireland, Germany and the United States, and validated the prediction performance in two independent datasets from the United Kingdom. Our data show that SL-trained AI models outperform most locally trained models, and perform on par with models that are trained on the merged datasets. In addition, we show that SL-based AI models are data efficient. In the future, SL can be used to train distributed AI models for any histopathology image analysis task, eliminating the need for data transfer.

Subject(s)

Artificial Intelligence , Neoplasms , Humans , Image Processing, Computer-Assisted , Neoplasms/genetics , Staining and Labeling , United Kingdom

11.

Classical mathematical models for prediction of response to chemotherapy and immunotherapy.

Ghaffari Laleh, Narmin; Loeffler, Chiara Maria Lavinia; Grajek, Julia; Stanková, Katerina; Pearson, Alexander T; Muti, Hannah Sophie; Trautwein, Christian; Enderling, Heiko; Poleszczuk, Jan; Kather, Jakob Nikolas.

PLoS Comput Biol ; 18(2): e1009822, 2022 02.

Article in English | MEDLINE | ID: mdl-35120124

ABSTRACT

Classical mathematical models of tumor growth have shaped our understanding of cancer and have broad practical implications for treatment scheduling and dosage. However, even the simplest textbook models have been barely validated in real world-data of human patients. In this study, we fitted a range of differential equation models to tumor volume measurements of patients undergoing chemotherapy or cancer immunotherapy for solid tumors. We used a large dataset of 1472 patients with three or more measurements per target lesion, of which 652 patients had six or more data points. We show that the early treatment response shows only moderate correlation with the final treatment response, demonstrating the need for nuanced models. We then perform a head-to-head comparison of six classical models which are widely used in the field: the Exponential, Logistic, Classic Bertalanffy, General Bertalanffy, Classic Gompertz and General Gompertz model. Several models provide a good fit to tumor volume measurements, with the Gompertz model providing the best balance between goodness of fit and number of parameters. Similarly, when fitting to early treatment data, the general Bertalanffy and Gompertz models yield the lowest mean absolute error to forecasted data, indicating that these models could potentially be effective at predicting treatment outcome. In summary, we provide a quantitative benchmark for classical textbook models and state-of-the art models of human tumor growth. We publicly release an anonymized version of our original data, providing the first benchmark set of human tumor growth data for evaluation of mathematical models.

Subject(s)

Models, Biological , Neoplasms , Humans , Immunotherapy , Models, Theoretical , Neoplasms/drug therapy , Neoplasms/pathology , Tumor Burden

12.

Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study.

Muti, Hannah Sophie; Heij, Lara Rosaline; Keller, Gisela; Kohlruss, Meike; Langer, Rupert; Dislich, Bastian; Cheong, Jae-Ho; Kim, Young-Woo; Kim, Hyunki; Kook, Myeong-Cherl; Cunningham, David; Allum, William H; Langley, Ruth E; Nankivell, Matthew G; Quirke, Philip; Hayden, Jeremy D; West, Nicholas P; Irvine, Andrew J; Yoshikawa, Takaki; Oshima, Takashi; Huss, Ralf; Grosser, Bianca; Roviello, Franco; d'Ignazio, Alessia; Quaas, Alexander; Alakus, Hakan; Tan, Xiuxiang; Pearson, Alexander T; Luedde, Tom; Ebert, Matthias P; Jäger, Dirk; Trautwein, Christian; Gaisa, Nadine Therese; Grabsch, Heike I; Kather, Jakob Nikolas.

Lancet Digit Health ; 3(10): e654-e664, 2021 10.

Article in English | MEDLINE | ID: mdl-34417147

ABSTRACT

BACKGROUND: Response to immunotherapy in gastric cancer is associated with microsatellite instability (or mismatch repair deficiency) and Epstein-Barr virus (EBV) positivity. We therefore aimed to develop and validate deep learning-based classifiers to detect microsatellite instability and EBV status from routine histology slides. METHODS: In this retrospective, multicentre study, we collected tissue samples from ten cohorts of patients with gastric cancer from seven countries (South Korea, Switzerland, Japan, Italy, Germany, the UK and the USA). We trained a deep learning-based classifier to detect microsatellite instability and EBV positivity from digitised, haematoxylin and eosin stained resection slides without annotating tumour containing regions. The performance of the classifier was assessed by within-cohort cross-validation in all ten cohorts and by external validation, for which we split the cohorts into a five-cohort training dataset and a five-cohort test dataset. We measured the area under the receiver operating curve (AUROC) for detection of microsatellite instability and EBV status. Microsatellite instability and EBV status were determined to be detectable if the lower bound of the 95% CI for the AUROC was above 0·5. FINDINGS: Across the ten cohorts, our analysis included 2823 patients with known microsatellite instability status and 2685 patients with known EBV status. In the within-cohort cross-validation, the deep learning-based classifier could detect microsatellite instability status in nine of ten cohorts, with AUROCs ranging from 0·597 (95% CI 0·522-0·737) to 0·836 (0·795-0·880) and EBV status in five of eight cohorts, with AUROCs ranging from 0·819 (0·752-0·841) to 0·897 (0·513-0·966). Training a classifier on the pooled training dataset and testing it on the five remaining cohorts resulted in high classification performance with AUROCs ranging from 0·723 (95% CI 0·676-0·794) to 0·863 (0·747-0·969) for detection of microsatellite instability and from 0·672 (0·403-0·989) to 0·859 (0·823-0·919) for detection of EBV status. INTERPRETATION: Classifiers became increasingly robust when trained on pooled cohorts. After prospective validation, this deep learning-based tissue classification system could be used as an inexpensive predictive biomarker for immunotherapy in gastric cancer. FUNDING: German Cancer Aid and German Federal Ministry of Health.

Subject(s)

Deep Learning , Epstein-Barr Virus Infections/complications , Epstein-Barr Virus Infections/diagnosis , Microsatellite Instability , Stomach Neoplasms/complications , Stomach Neoplasms/genetics , Aged , Cohort Studies , Female , Germany , Histological Techniques/methods , Humans , Italy , Japan , Male , Middle Aged , Reproducibility of Results , Republic of Korea , Retrospective Studies , Switzerland , United Kingdom , United States

13.

Predicting Mutational Status of Driver and Suppressor Genes Directly from Histopathology With Deep Learning: A Systematic Study Across 23 Solid Tumor Types.

Loeffler, Chiara Maria Lavinia; Gaisa, Nadine T; Muti, Hannah Sophie; van Treeck, Marko; Echle, Amelie; Ghaffari Laleh, Narmin; Trautwein, Christian; Heij, Lara R; Grabsch, Heike I; Ortiz Bruechle, Nadina; Kather, Jakob Nikolas.

Front Genet ; 12: 806386, 2021.

Article in English | MEDLINE | ID: mdl-35251119

ABSTRACT

In the last four years, advances in Deep Learning technology have enabled the inference of selected mutational alterations directly from routine histopathology slides. In particular, recent studies have shown that genetic changes in clinically relevant driver genes are reflected in the histological phenotype of solid tumors and can be inferred by analysing routine Haematoxylin and Eosin (H&E) stained tissue sections with Deep Learning. However, these studies mostly focused on selected individual genes in selected tumor types. In addition, genetic changes in solid tumors primarily act by changing signaling pathways that regulate cell behaviour. In this study, we hypothesized that Deep Learning networks can be trained to directly predict alterations of genes and pathways across a spectrum of solid tumors. We manually outlined tumor tissue in H&E-stained tissue sections from 7,829 patients with 23 different tumor types from The Cancer Genome Atlas. We then trained convolutional neural networks in an end-to-end way to detect alterations in the most clinically relevant pathways or genes, directly from histology images. Using this automatic approach, we found that alterations in 12 out of 14 clinically relevant pathways and numerous single gene alterations appear to be detectable in tissue sections, many of which have not been reported before. Interestingly, we show that the prediction performance for single gene alterations is better than that for pathway alterations. Collectively, these data demonstrate the predictability of genetic alterations directly from routine cancer histology images and show that individual genes leave a stronger morphological signature than genetic pathways.

14.

Pan-cancer image-based detection of clinically actionable genetic alterations.

Kather, Jakob Nikolas; Heij, Lara R; Grabsch, Heike I; Loeffler, Chiara; Echle, Amelie; Muti, Hannah Sophie; Krause, Jeremias; Niehues, Jan M; Sommer, Kai A J; Bankhead, Peter; Kooreman, Loes F S; Schulte, Jefree J; Cipriani, Nicole A; Buelow, Roman D; Boor, Peter; Ortiz-Brüchle, Nadi-Na; Hanby, Andrew M; Speirs, Valerie; Kochanny, Sara; Patnaik, Akash; Srisuwananukorn, Andrew; Brenner, Hermann; Hoffmeister, Michael; van den Brandt, Piet A; Jäger, Dirk; Trautwein, Christian; Pearson, Alexander T; Luedde, Tom.

Nat Cancer ; 1(8): 789-799, 2020 08.

Article in English | MEDLINE | ID: mdl-33763651

ABSTRACT

Molecular alterations in cancer can cause phenotypic changes in tumor cells and their micro-environment. Routine histopathology tissue slides - which are ubiquitously available - can reflect such morphological changes. Here, we show that deep learning can consistently infer a wide range of genetic mutations, molecular tumor subtypes, gene expression signatures and standard pathology biomarkers directly from routine histology. We developed, optimized, validated and publicly released a one-stop-shop workflow and applied it to tissue slides of more than 5000 patients across multiple solid tumors. Our findings show that a single deep learning algorithm can be trained to predict a wide range of molecular alterations from routine, paraffin-embedded histology slides stained with hematoxylin and eosin. These predictions generalize to other populations and are spatially resolved. Our method can be implemented on mobile hardware, potentially enabling point-of-care diagnostics for personalized cancer treatment. More generally, this approach could elucidate and quantify genotype-phenotype links in cancer.

Subject(s)

Deep Learning , Neoplasms , Eosine Yellowish-(YS) , Hematoxylin , Humans , Mutation , Neoplasms/diagnosis

15.

Author Correction: Pan-cancer image-based detection of clinically actionable genetic alterations.

Kather, Jakob Nikolas; Heij, Lara R; Grabsch, Heike I; Loeffler, Chiara; Echle, Amelie; Muti, Hannah Sophie; Krause, Jeremias; Niehues, Jan M; Sommer, Kai A J; Bankhead, Peter; Kooreman, Loes F S; Schulte, Jefree J; Cipriani, Nicole A; Buelow, Roman D; Boor, Peter; Ortiz-Brüchle, Nadina; Hanby, Andrew M; Speirs, Valerie; Kochanny, Sara; Patnaik, Akash; Srisuwananukorn, Andrew; Brenner, Hermann; Hoffmeister, Michael; van den Brandt, Piet A; Jäger, Dirk; Trautwein, Christian; Pearson, Alexander T; Luedde, Tom.

Nat Cancer ; 1(11): 1129, 2020 Nov.

Article in English | MEDLINE | ID: mdl-35122072

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL