RESUMEN
Tumors of the major and minor salivary glands histologically encompass a diverse and partly overlapping spectrum of frequent diagnostically challenging neoplasms. Despite recent advances in molecular testing and the identification of tumor-specific mutations or gene fusions, there is an unmet need to identify additional diagnostic biomarkers for entities lacking specific alterations. In this study, we collected a comprehensive cohort of 363 cases encompassing 20 different salivary gland tumor entities and explored the potential of DNA methylation to classify these tumors. We were able to show that most entities show specific epigenetic signatures and present a machine learning algorithm that achieved a mean balanced accuracy of 0.991. Of note, we showed that cribriform adenocarcinoma is epigenetically distinct from classical polymorphous adenocarcinoma, which could support risk stratification of these tumors. Myoepithelioma and pleomorphic adenoma form a uniform epigenetic class, supporting the theory of a single entity with a broad but continuous morphologic spectrum. Furthermore, we identified a histomorphologically heterogeneous but epigenetically distinct class that could represent a novel tumor entity. In conclusion, our study provides a comprehensive resource of the DNA methylation landscape of salivary gland tumors. Our data provide novel insight into disputed entities and show the potential of DNA methylation to identify new tumor classes. Furthermore, in future, our machine learning classifier could support the histopathologic diagnosis of salivary gland tumors.
RESUMEN
The diagnosis of ependymoma has moved from a purely histopathological review with limited prognostic value to an integrated diagnosis, relying heavily on molecular information. However, as the integrated approach is still novel and some molecular ependymoma subtypes are quite rare, few studies have correlated integrated pathology and clinical outcome, often focusing on small series of single molecular types. We collected data from 2023 ependymomas as classified by DNA methylation profiling, consisting of 1736 previously published and 287 unpublished methylation profiles. Methylation data and clinical information were correlated, and an integrated model was developed to predict progression-free survival. Patients with EPN-PFA, EPN-ZFTA, and EPN-MYCN tumors showed the worst outcome with 10-year overall survival rates of 56%, 62%, and 32%, respectively. EPN-PFA harbored chromosome 1q gains and/or 6q losses as markers for worse survival. In supratentorial EPN-ZFTA, a combined loss of CDKN2A and B indicated worse survival, whereas a single loss did not. Twelve out of 200 EPN-ZFTA (6%) were located in the posterior fossa, and these tumors relapsed or progressed even earlier than supratentorial tumors with a combined loss of CDKN2A/B. Patients with MPE and PF-SE, generally regarded as non-aggressive tumors, only had a 10-year progression-free survival of 59% and 65%, respectively. For the prediction of the 5-year progression-free survival, Kaplan-Meier estimators based on the molecular subtype, a Support Vector Machine based on methylation, and an integrated model based on clinical factors, CNV data, and predicted methylation scores achieved balanced accuracies of 66%, 68%, and 73%, respectively. Excluding samples with low prediction scores resulted in balanced accuracies of over 80%. In sum, our large-scale analysis of ependymomas provides robust information about molecular features and their clinical meaning. Our data are particularly relevant for rare and hardly explored tumor subtypes and seemingly benign variants that display higher recurrence rates than previously believed.
Asunto(s)
Ependimoma , Humanos , Supervivencia sin Progresión , Procesamiento Proteico-PostraduccionalRESUMEN
The diagnosis of sinonasal tumors is challenging due to a heterogeneous spectrum of various differential diagnoses as well as poorly defined, disputed entities such as sinonasal undifferentiated carcinomas (SNUCs). In this study, we apply a machine learning algorithm based on DNA methylation patterns to classify sinonasal tumors with clinical-grade reliability. We further show that sinonasal tumors with SNUC morphology are not as undifferentiated as their current terminology suggests but rather reassigned to four distinct molecular classes defined by epigenetic, mutational and proteomic profiles. This includes two classes with neuroendocrine differentiation, characterized by IDH2 or SMARCA4/ARID1A mutations with an overall favorable clinical course, one class composed of highly aggressive SMARCB1-deficient carcinomas and another class with tumors that represent potentially previously misclassified adenoid cystic carcinomas. Our findings can aid in improving the diagnostic classification of sinonasal tumors and could help to change the current perception of SNUCs.
Asunto(s)
Carcinoma , Metilación de ADN , Humanos , Metilación de ADN/genética , Proteómica , Reproducibilidad de los Resultados , ADN Helicasas/genética , Proteínas Nucleares/genética , Factores de TranscripciónRESUMEN
OBJECTIVES: Our goal was to evaluate the diagnostic value of DNA methylation analysis in combination with machine learning to differentiate pleural mesothelioma (PM) from important histopathological mimics. MATERIAL AND METHODS: DNA methylation data of PM, lung adenocarcinomas, lung squamous cell carcinomas and chronic pleuritis was used to train a random forest as well as a support vector machine. These classifiers were validated using an independent validation cohort including pleural carcinosis and pleomorphic variants of lung adeno- and squamous cell carcinomas. Furthermore, we performed differential methylation analysis and used a deconvolution method to estimate the composition of the tumor microenvironment. RESULTS: T-distributed stochastic neighbor embedding clearly separated PM from lung adenocarcinomas and squamous cell carcinomas, but there was a considerable overlap between chronic pleuritis specimens and PM with low tumor cell content. In a nested cross validation on the training cohort, both machine learning algorithms achieved the same accuracies (94.8%). On the validation cohort, we observed high accuracies for the support vector machine (97.8%) while the random forest performed considerably worse (89.5%), especially in distinguishing PM from chronic pleuritis. Differential methylation analysis revealed promoter hypermethylation in PM specimens, including the tumor suppressor genes BCL11B, EBF1, FOXA1, and WNK2. Deconvolution of the stromal and immune cell composition revealed higher rates of regulatory T-cells and endothelial cells in tumor specimens and a heterogenous inflammation including macrophages, B-cells and natural killer cells in chronic pleuritis. CONCLUSION: DNA methylation in combination with machine learning classifiers is a promising tool to reliably differentiate PM from chronic pleuritis and lung cancer, including pleomorphic carcinomas. Furthermore, our study highlights new candidate genes for PM carcinogenesis and shows that deconvolution of DNA methylation data can provide reasonable insights into the composition of the tumor microenvironment.
Asunto(s)
Adenocarcinoma del Pulmón , Carcinoma de Pulmón de Células no Pequeñas , Carcinoma de Células Escamosas , Neoplasias Pulmonares , Mesotelioma Maligno , Mesotelioma , Neoplasias Pleurales , Pleuresia , Adenocarcinoma del Pulmón/genética , Carcinoma de Pulmón de Células no Pequeñas/genética , Carcinoma de Células Escamosas/genética , Metilación de ADN , Células Endoteliales/patología , Humanos , Pulmón/patología , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patología , Aprendizaje Automático , Mesotelioma/diagnóstico , Mesotelioma/genética , Mesotelioma/patología , Mesotelioma Maligno/genética , Neoplasias Pleurales/diagnóstico , Neoplasias Pleurales/genética , Neoplasias Pleurales/patología , Pleuresia/diagnóstico , Pleuresia/genética , Proteínas Serina-Treonina Quinasas , Microambiente Tumoral/genéticaRESUMEN
In head and neck squamous cell cancers (HNSCs) that present as metastases with an unknown primary (HNSC-CUPs), the identification of a primary tumor improves therapy options and increases patient survival. However, the currently available diagnostic methods are laborious and do not offer a sufficient detection rate. Predictive machine learning models based on DNA methylation profiles have recently emerged as a promising technique for tumor classification. We applied this technique to HNSC to develop a tool that can improve the diagnostic work-up for HNSC-CUPs. On a reference cohort of 405 primary HNSC samples, we developed four classifiers based on different machine learning models [random forest (RF), neural network (NN), elastic net penalized logistic regression (LOGREG), and support vector machine (SVM)] that predict the primary site of HNSC tumors from their DNA methylation profile. The classifiers achieved high classification accuracies (RF = 83%, NN = 88%, LOGREG = SVM = 89%) on an independent cohort of 64 HNSC metastases. Further, the NN, LOGREG, and SVM models significantly outperformed p16 status as a marker for an origin in the oropharynx. In conclusion, the DNA methylation profiles of HNSC metastases are characteristic for their primary sites, and the classifiers developed in this study, which are made available to the scientific community, can provide valuable information to guide the diagnostic work-up of HNSC-CUP. © 2021 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of The Pathological Society of Great Britain and Ireland.
Asunto(s)
Metilación de ADN , Neoplasias de Cabeza y Cuello , Neoplasias de Cabeza y Cuello/genética , Humanos , Aprendizaje Automático , Redes Neurales de la Computación , Carcinoma de Células Escamosas de Cabeza y Cuello/genéticaRESUMEN
PURPOSE: Phenotype information is crucial for the interpretation of genomic variants. So far it has only been accessible for bioinformatics workflows after encoding into clinical terms by expert dysmorphologists. METHODS: Here, we introduce an approach driven by artificial intelligence that uses portrait photographs for the interpretation of clinical exome data. We measured the value added by computer-assisted image analysis to the diagnostic yield on a cohort consisting of 679 individuals with 105 different monogenic disorders. For each case in the cohort we compiled frontal photos, clinical features, and the disease-causing variants, and simulated multiple exomes of different ethnic backgrounds. RESULTS: The additional use of similarity scores from computer-assisted analysis of frontal photos improved the top 1 accuracy rate by more than 20-89% and the top 10 accuracy rate by more than 5-99% for the disease-causing gene. CONCLUSION: Image analysis by deep-learning algorithms can be used to quantify the phenotypic similarity (PP4 criterion of the American College of Medical Genetics and Genomics guidelines) and to advance the performance of bioinformatics pipelines for exome analysis.