RESUMO
MicroRNAs (miRNAs) are important regulators of cell fate decisions in immune responses. They act by coordinate repression of multiple target genes, a property that we exploited to uncover regulatory networks that govern T helper-2 (Th2) cells. A functional screen of individual miRNAs in primary T cells uncovered multiple miRNAs that inhibited Th2 cell differentiation. Among these were miR-24 and miR-27, miRNAs coexpressed from two genomic clusters, which each functioned independently to limit interleukin-4 (IL-4) production. Mice lacking both clusters in T cells displayed increased Th2 cell responses and tissue pathology in a mouse model of asthma. Gene expression and pathway analyses placed miR-27 upstream of genes known to regulate Th2 cells. They also identified targets not previously associated with Th2 cell biology which regulated IL-4 production in unbiased functional testing. Thus, elucidating the biological function and target repertoire of miR-24 and miR-27 reveals regulators of Th2 cell biology.
Assuntos
Asma/imunologia , Interleucina-4/biossíntese , MicroRNAs/genética , Células Th2/imunologia , Animais , Sequência de Bases , Diferenciação Celular/genética , Diferenciação Celular/imunologia , Células Cultivadas , Modelos Animais de Doenças , Feminino , Inflamação/imunologia , Interleucina-4/imunologia , Ativação Linfocitária/imunologia , Masculino , Camundongos , Camundongos Knockout , Família Multigênica/genética , Análise de Sequência de RNA , Células Th2/citologiaRESUMO
The tissue diagnosis of adenocarcinoma and intraductal carcinoma of the prostate includes Gleason grading of tumor morphology on the hematoxylin and eosin stain and immunohistochemistry markers on the prostatic intraepithelial neoplasia-4 stain (CK5/6, P63, and AMACR). In this work, we create an automated system for producing both virtual hematoxylin and eosin and prostatic intraepithelial neoplasia-4 immunohistochemistry stains from unstained prostate tissue using a high-throughput hyperspectral fluorescence microscope and artificial intelligence and machine learning. We demonstrate that the virtual stainer models can produce high-quality images suitable for diagnosis by genitourinary pathologists. Specifically, we validate our system through extensive human review and computational analysis, using a previously validated Gleason scoring model, and an expert panel, on a large data set of test slides. This study extends our previous work on virtual staining from autofluorescence, demonstrates the clinical utility of this technology for prostate cancer, and exemplifies a rigorous standard of qualitative and quantitative evaluation for digital pathology.
RESUMO
The comprehensive genomic analysis of endometrial carcinoma (EC) by The Cancer Genome Atlas (TCGA) led to the discovery of four distinct and prognostically significant molecular subgroups. Molecular classification has the potential to improve risk-stratification when integrated with clinicopathologic features and has recently been included in national and international patient management EC guidelines. Thus, the adoption of molecular classification into routine pathologic and clinical practice is likely to grow significantly in the upcoming years. Establishing an efficient and standardized workflow for performing molecular classification on ECs, and reporting both the molecular and histologic findings in an integrative manner, is imperative. Here we describe our effort to implement rapid and routine molecular classification on all ECs diagnosed at our institution. To this effect, we performed immunohistochemistry as a surrogate marker for identifying genetic and/or epigenetic alterations in DNA mismatch repair (e.g., MLH1, PMS2, MSH6, MSH2), and TP53 genes. In addition, we have developed and employed a single-gene POLE SNaPshot assay, which is a rapid and analytically sensitive method for detecting select POLE exonuclease domain mutations (EDMs). We report our molecular testing workflow and integrative reporting system as well as the clinicopathologic and molecular features of 310 ECs that underwent routine molecular classification at our institution. The 310 ECs were molecularly classified as follows: 15 (5%) POLE mutant (POLEmut), 79 (25%) mismatch repair-deficient (MMRd), 135 (44%) no specific molecular profile (NSMP), and 81 (26%) p53 abnormal (p53abnl). This work provides an initial framework for implementing routine molecular classification of ECs.
Assuntos
Neoplasias do Endométrio , Biomarcadores Tumorais/genética , Reparo de Erro de Pareamento de DNA , Neoplasias do Endométrio/patologia , Feminino , Genes p53 , Humanos , Imuno-Histoquímica , Mutação , Estudos ProspectivosRESUMO
Determining the replicative DNA polymerase epsilon ( POLE) mutation status in endometrial carcinomas (ECs) has important clinical implications given that the majority of "ultramutated" tumors harboring pathogenic exonuclease domain mutations in POLE ( POLE mut) have a favorable prognosis, even among high-grade histotypes. Currently, there are no specific morphologic or immunophenotypic features that allow accurate detection of POLE mut tumors without molecular testing. Consequently, identifying POLE mut tumors has been challenging without employing costly and/or time-consuming DNA sequencing approaches. Here we developed a novel SNaPshot assay to facilitate routine and efficient POLE mutation testing in EC. The SNaPshot assay interrogates 15 nucleotide sites within exons 9, 11, 13, and 14 encoding the POLE exonuclease domain. The variant sites were selected based on recurrence, evidence of functional impact, association with high tumor mutation burden and/or detection in EC clinical outcome studies. Based on the pathogenic somatic variants reported in the literature, the assay is predicted to have a clinical sensitivity of 90% to 95% for ECs. Validation studies showed 100% specificity and sensitivity for the variants covered, with expected genotypic results for both the positive (n=11) and negative (n=20) patient controls on multiple repeat tests and dilution series. Analytic sensitivity was conservatively approximated at a 10% variant allele fraction (VAF), with documented detection as low as 5% VAF. As expected, the SNaPshot assay demonstrated greater sensitivity than Sanger sequencing for VAFs below 20%, an important characteristic for somatic mutation detection. Here we have developed and validated the first SNaPshot assay to detect hotspot POLE mutations. While next-generation sequencing and Sanger sequencing-based approaches have also been used to detect POLE mutations, a SNaPshot approach provides useful balance of analytical sensitivity, cost-effectiveness, and efficiency in a high-volume case load setting.
Assuntos
Carcinoma Endometrioide , Neoplasias do Endométrio , Feminino , Humanos , Carcinoma Endometrioide/patologia , Análise Custo-Benefício , Exonucleases/genética , Proteínas de Ligação a Poli-ADP-Ribose/genética , Neoplasias do Endométrio/diagnóstico , Neoplasias do Endométrio/genética , Neoplasias do Endométrio/patologia , MutaçãoRESUMO
BackgroundDeep learning has the potential to augment the use of chest radiography in clinical radiology, but challenges include poor generalizability, spectrum bias, and difficulty comparing across studies.PurposeTo develop and evaluate deep learning models for chest radiograph interpretation by using radiologist-adjudicated reference standards.Materials and MethodsDeep learning models were developed to detect four findings (pneumothorax, opacity, nodule or mass, and fracture) on frontal chest radiographs. This retrospective study used two data sets. Data set 1 (DS1) consisted of 759 611 images from a multicity hospital network and ChestX-ray14 is a publicly available data set with 112 120 images. Natural language processing and expert review of a subset of images provided labels for 657 954 training images. Test sets consisted of 1818 and 1962 images from DS1 and ChestX-ray14, respectively. Reference standards were defined by radiologist-adjudicated image review. Performance was evaluated by area under the receiver operating characteristic curve analysis, sensitivity, specificity, and positive predictive value. Four radiologists reviewed test set images for performance comparison. Inverse probability weighting was applied to DS1 to account for positive radiograph enrichment and estimate population-level performance.ResultsIn DS1, population-adjusted areas under the receiver operating characteristic curve for pneumothorax, nodule or mass, airspace opacity, and fracture were, respectively, 0.95 (95% confidence interval [CI]: 0.91, 0.99), 0.72 (95% CI: 0.66, 0.77), 0.91 (95% CI: 0.88, 0.93), and 0.86 (95% CI: 0.79, 0.92). With ChestX-ray14, areas under the receiver operating characteristic curve were 0.94 (95% CI: 0.93, 0.96), 0.91 (95% CI: 0.89, 0.93), 0.94 (95% CI: 0.93, 0.95), and 0.81 (95% CI: 0.75, 0.86), respectively.ConclusionExpert-level models for detecting clinically relevant chest radiograph findings were developed for this study by using adjudicated reference standards and with population-level performance estimation. Radiologist-adjudicated labels for 2412 ChestX-ray14 validation set images and 1962 test set images are provided.© RSNA, 2019Online supplemental material is available for this article.See also the editorial by Chang in this issue.
Assuntos
Interpretação de Imagem Radiográfica Assistida por Computador/métodos , Radiografia Torácica/métodos , Doenças Respiratórias/diagnóstico por imagem , Traumatismos Torácicos/diagnóstico por imagem , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Criança , Pré-Escolar , Aprendizado Profundo , Feminino , Humanos , Lactente , Masculino , Pessoa de Meia-Idade , Pneumotórax , Radiologistas , Padrões de Referência , Reprodutibilidade dos Testes , Estudos Retrospectivos , Sensibilidade e Especificidade , Adulto JovemRESUMO
MicroRNA (miRNA)-deficient helper T cells exhibit abnormal IFN-γ production and decreased proliferation. However, the contributions of individual miRNAs to this phenotype remain poorly understood. We conducted a screen for miRNA function in primary T cells and identified individual miRNAs that rescue the defects associated with miRNA deficiency. Multiple members of the miR-17 and miR-92 families enhanced miRNA-deficient T cell proliferation whereas miR-29 largely corrected their aberrant interferon-γ (IFN-γ) expression. Repression of IFN-γ production by miR-29 involved direct targeting of both T-bet and Eomes, two transcription factors known to induce IFN-γ production. Although not usually expressed at functionally relevant amounts in helper T cells, Eomes was abundant in miRNA-deficient cells and was upregulated after miR-29 inhibition in wild-type cells. These results demonstrate that miR-29 regulates helper T cell differentiation by repressing multiple target genes, including at least two that are independently capable of inducing the T helper 1 (Th1) cell gene expression program.
Assuntos
Interferon gama/metabolismo , MicroRNAs/metabolismo , Proteínas com Domínio T/metabolismo , Linfócitos T Auxiliares-Indutores/metabolismo , Animais , Diferenciação Celular , Proliferação de Células , Células Cultivadas , Citocinas/genética , Citocinas/imunologia , Citocinas/metabolismo , Regulação da Expressão Gênica/imunologia , Interferon gama/genética , Interferon gama/imunologia , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Knockout , MicroRNAs/genética , MicroRNAs/imunologia , Proteínas/genética , Proteínas de Ligação a RNA , Proteínas com Domínio T/imunologia , Linfócitos T Auxiliares-Indutores/citologia , Linfócitos T Auxiliares-Indutores/imunologiaRESUMO
Low-grade serous carcinomas only rarely coexist with or progress to high-grade tumors. We present a case of low-grade serous carcinoma with transformation to carcinosarcoma on recurrence in the lymph node. Identical BRAF V600E and telomerase reverse transcriptase promoter mutations were identified in both the original and recurrent tumor. Given that telomerase reverse transcriptase promotor mutations are thought to play a role in progression of other tumor types, the function of telomerase reverse transcriptase mutations in BRAF mutated low-grade serous carcinoma deserves investigation.
Assuntos
Carcinossarcoma/diagnóstico , Neoplasias Ovarianas/diagnóstico , Regiões Promotoras Genéticas/genética , Proteínas Proto-Oncogênicas B-raf/genética , Telomerase/genética , Idoso , Carcinossarcoma/genética , Carcinossarcoma/patologia , Progressão da Doença , Feminino , Humanos , Linfonodos/patologia , Mutação , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/patologia , Ovário/patologiaRESUMO
The past two decades have seen unprecedented advances in the field of oncogenomics. The ongoing characterization of neoplastic tissues through genomic techniques has transformed many aspects of cancer research, diagnosis, and treatment. However, identifying sequence variants with biological and clinical significance is a challenging endeavor. In order to accomplish this task, variants must be annotated and interpreted using various online resources. Data on protein structure, functional prediction, variant frequency in relevant populations, and multiple other factors have been compiled in useful databases for this purpose. Thus, understanding the available online resources for the annotation and interpretation of sequence variants is critical to aid molecular pathologists and researchers working in this space.
Assuntos
Bases de Dados Genéticas , Privacidade Genética , Neoplasias , Farmacogenética , Privacidade Genética/tendências , Variação Genética , Recursos em Saúde , Humanos , Internet , Neoplasias/fisiopatologia , Neoplasias/terapia , Análise de Sequência de DNA/normas , Análise de Sequência de DNA/tendênciasRESUMO
BACKGROUND: Tumor-Adipose-Feature (TAF) as well as SARIFA (Stroma AReactive Invasion Front Areas) are two histologic features/biomarkers linking tumor-associated adipocytes to poor outcomes in colorectal cancer (CRC) patients. Whereas TAF was identified by deep learning (DL) algorithms, SARIFA was established as a human-observed histopathologic biomarker. METHODS: To study the overlap between TAF and SARIFA, we performed a systematic pathological review of TAF based on all published image tiles. Additionally, we analyzed the presence/absence of TAF in SARIFA-negative CRC cases to elucidate the biologic and prognostic role of a direct tumor-adipocyte contact. TCGA-CRC gene expression data is investigated to assess the association of FABP4 (fatty-acid binding protein 4) and CD36 (fatty-acid translocase) with both TAF and CRC prognosis. RESULTS: By investigating the TAF/SARIFA overlap, we show that many TAF patches correspond to the recently described SARIFA-phenomenon. Even though there is a pronounced morphological and biological overlap, there are differences in the concepts. The presence of TAF in SARIFA-negative CRCs is not associated with poor outcomes in this cohort, potentially highlighting the importance of a direct tumor-adipocyte interaction. Upregulation of FABP4 and CD36 gene expression seem both linked to a poor prognosis in CRC. CONCLUSIONS: By proving the substantial overlap between human-observed SARIFA and DL-based TAF as morphologic biomarkers, we demonstrate that linking DL-based image features to independently developed histopathologic biomarkers is a promising tool in the identification of clinically and biologically meaningful biomarkers. Adipocyte-tumor-cell interactions seem to be crucial in CRC, which should be considered as biomarkers for further investigations.
Different methods exist in assessing samples removed from cancer patients during surgery. We linked two independently established tissue-based methods for determining the outcome of colorectal cancer patients together: tumor adipose feature (TAF) and Stroma AReactive Invasion Front Areas (SARIFA). SARIFA as biological feature was observed solely by humans and TAF was identified by the help of a computer algorithm. We examined TAF in many cancer slides and looked at whether they showed similarities to SARIFA. TAF often matched SARIFA, but not always. Interestingly, these methods could be used to predict outcomes for patients and are associated with specific gene expression involved in tumor and fat cell interaction. Our study shows that combining computer algorithms with human expertize in evaluating tissue samples can identify meaningful features in patient samples, which may help to predict the best treatment options.
RESUMO
BACKGROUND: Presence of lymph node metastasis (LNM) influences prognosis and clinical decision-making in colorectal cancer. However, detection of LNM is variable and depends on a number of external factors. Deep learning has shown success in computational pathology, but has struggled to boost performance when combined with known predictors. METHODS: Machine-learned features are created by clustering deep learning embeddings of small patches of tumor in colorectal cancer via k-means, and then selecting the top clusters that add predictive value to a logistic regression model when combined with known baseline clinicopathological variables. We then analyze performance of logistic regression models trained with and without these machine-learned features in combination with the baseline variables. RESULTS: The machine-learned extracted features provide independent signal for the presence of LNM (AUROC: 0.638, 95% CI: [0.590, 0.683]). Furthermore, the machine-learned features add predictive value to the set of 6 clinicopathologic variables in an external validation set (likelihood ratio test, p < 0.00032; AUROC: 0.740, 95% CI: [0.701, 0.780]). A model incorporating these features can also further risk-stratify patients with and without identified metastasis (p < 0.001 for both stage II and stage III). CONCLUSION: This work demonstrates an effective approach to combine deep learning with established clinicopathologic factors in order to identify independently informative features associated with LNM. Further work building on these specific results may have important impact in prognostication and therapeutic decision making for LNM. Additionally, this general computational approach may prove useful in other contexts.
When colorectal cancers spread to the lymph nodes, it can indicate a poorer prognosis. However, detecting lymph node metastasis (spread) can be difficult and depends on a number of factors such as how samples are taken and processed. Here, we show that machine learning, which involves computer software learning from patterns in data, can predict lymph node metastasis in patients with colorectal cancer from the microscopic appearance of their primary tumor and the clinical characteristics of the patients. We also show that the same approach can predict patient survival. With further work, our approach may help clinicians to inform patients about their prognosis and decide on appropriate treatments.
RESUMO
Importance: Identifying new prognostic features in colon cancer has the potential to refine histopathologic review and inform patient care. Although prognostic artificial intelligence systems have recently demonstrated significant risk stratification for several cancer types, studies have not yet shown that the machine learning-derived features associated with these prognostic artificial intelligence systems are both interpretable and usable by pathologists. Objective: To evaluate whether pathologist scoring of a histopathologic feature previously identified by machine learning is associated with survival among patients with colon cancer. Design, Setting, and Participants: This prognostic study used deidentified, archived colorectal cancer cases from January 2013 to December 2015 from the University of Milano-Bicocca. All available histologic slides from 258 consecutive colon adenocarcinoma cases were reviewed from December 2021 to February 2022 by 2 pathologists, who conducted semiquantitative scoring for tumor adipose feature (TAF), which was previously identified via a prognostic deep learning model developed with an independent colorectal cancer cohort. Main Outcomes and Measures: Prognostic value of TAF for overall survival and disease-specific survival as measured by univariable and multivariable regression analyses. Interpathologist agreement in TAF scoring was also evaluated. Results: A total of 258 colon adenocarcinoma histopathologic cases from 258 patients (138 men [53%]; median age, 67 years [IQR, 65-81 years]) with stage II (n = 119) or stage III (n = 139) cancer were included. Tumor adipose feature was identified in 120 cases (widespread in 63 cases, multifocal in 31, and unifocal in 26). For overall survival analysis after adjustment for tumor stage, TAF was independently prognostic in 2 ways: TAF as a binary feature (presence vs absence: hazard ratio [HR] for presence of TAF, 1.55 [95% CI, 1.07-2.25]; P = .02) and TAF as a semiquantitative categorical feature (HR for widespread TAF, 1.87 [95% CI, 1.23-2.85]; P = .004). Interpathologist agreement for widespread TAF vs lower categories (absent, unifocal, or multifocal) was 90%, corresponding to a κ metric at this threshold of 0.69 (95% CI, 0.58-0.80). Conclusions and Relevance: In this prognostic study, pathologists were able to learn and reproducibly score for TAF, providing significant risk stratification on this independent data set. Although additional work is warranted to understand the biological significance of this feature and to establish broadly reproducible TAF scoring, this work represents the first validation to date of human expert learning from machine learning in pathology. Specifically, this validation demonstrates that a computationally identified histologic feature can represent a human-identifiable, prognostic feature with the potential for integration into pathology practice.
Assuntos
Adenocarcinoma , Neoplasias do Colo , Masculino , Humanos , Idoso , Neoplasias do Colo/diagnóstico , Patologistas , Inteligência Artificial , Aprendizado de Máquina , Medição de RiscoRESUMO
MicroRNAs (miRNAs) are small noncoding RNAs that have recently emerged as critical regulators of gene expression within the immune system. In this study, we used mice with conditional deletion of Dicer and DiGeorge syndrome critical region 8 (Dgcr8) to dissect the roles of miRNAs in NK cell activation, survival, and function during viral infection. We developed a system for deletion of either Dicer or Dgcr8 in peripheral NK cells via drug-induced Cre activity. We found that Dicer- and Dgcr8-deficient NK cells were significantly impaired in survival and turnover, and had impaired function of the ITAM-containing activating NK cell receptors. We further demonstrated that both Dicer- and Dgcr8-dependent pathways were indispensable for the expansion of Ly49H(+) NK cells during mouse cytomegalovirus infection. Our data indicate similar phenotypes for Dicer- and Dgcr8-deficient NK cells, which strongly suggest that these processes are regulated by miRNAs. Thus, our findings indicate a critical role for miRNAs in controlling NK cell homeostasis and effector function, with implications for miRNAs regulating diverse aspects of NK cell biology.
Assuntos
Perfilação da Expressão Gênica , Células Matadoras Naturais/metabolismo , Ativação Linfocitária/genética , MicroRNAs/metabolismo , Transferência Adotiva , Animais , Separação Celular , Sobrevivência Celular , RNA Helicases DEAD-box/genética , Endorribonucleases/genética , Citometria de Fluxo , Expressão Gênica , Técnicas de Inativação de Genes , Humanos , Células Matadoras Naturais/imunologia , Ativação Linfocitária/imunologia , Camundongos , Camundongos Endogâmicos C57BL , MicroRNAs/genética , Análise de Sequência com Séries de Oligonucleotídeos , Proteínas/genética , Proteínas de Ligação a RNA , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Ribonuclease IIIRESUMO
Histologic grading of breast cancer involves review and scoring of three well-established morphologic features: mitotic count, nuclear pleomorphism, and tubule formation. Taken together, these features form the basis of the Nottingham Grading System which is used to inform breast cancer characterization and prognosis. In this study, we develop deep learning models to perform histologic scoring of all three components using digitized hematoxylin and eosin-stained slides containing invasive breast carcinoma. We first evaluate model performance using pathologist-based reference standards for each component. To complement this typical approach to evaluation, we further evaluate the deep learning models via prognostic analyses. The individual component models perform at or above published benchmarks for algorithm-based grading approaches, achieving high concordance rates with pathologist grading. Further, prognostic performance using deep learning-based grading is on par with that of pathologists performing review of matched slides. By providing scores for each component feature, the deep-learning based approach also provides the potential to identify the grading components contributing most to prognostic value. This may enable optimized prognostic models, opportunities to improve access to consistent grading, and approaches to better understand the links between histologic features and clinical outcomes in breast cancer.
RESUMO
Artificial intelligence (AI) has shown promise for diagnosing prostate cancer in biopsies. However, results have been limited to individual studies, lacking validation in multinational settings. Competitions have been shown to be accelerators for medical imaging innovations, but their impact is hindered by lack of reproducibility and independent validation. With this in mind, we organized the PANDA challenge-the largest histopathology competition to date, joined by 1,290 developers-to catalyze development of reproducible AI algorithms for Gleason grading using 10,616 digitized prostate biopsies. We validated that a diverse set of submitted algorithms reached pathologist-level performance on independent cross-continental cohorts, fully blinded to the algorithm developers. On United States and European external validation sets, the algorithms achieved agreements of 0.862 (quadratically weighted κ, 95% confidence interval (CI), 0.840-0.884) and 0.868 (95% CI, 0.835-0.900) with expert uropathologists. Successful generalization across different patient populations, laboratories and reference standards, achieved by a variety of algorithmic approaches, warrants evaluating AI-based Gleason grading in prospective clinical trials.
Assuntos
Gradação de Tumores , Neoplasias da Próstata/patologia , Algoritmos , Biópsia , Estudos de Coortes , Humanos , Masculino , Neoplasias da Próstata/diagnóstico , Reprodutibilidade dos TestesRESUMO
Recent advances in artificial intelligence show tremendous promise to improve the accuracy, reproducibility, and availability of medical diagnostics across a number of medical subspecialities. This is especially true in the field of digital pathology, which has recently witnessed a surge in publications describing state-of-the-art performance for machine learning models across a wide range of diagnostic applications. Nonetheless, despite this promise, there remain significant gaps in translating applications for any of these technologies into actual clinical practice. In this review, we will first give a brief overview of the recent progress in applying AI to digitized pathology images, focusing on how these tools might be applied in clinical workflows in the near term to improve the accuracy and efficiency of pathologists. Then we define and describe in detail the various factors that need to be addressed in order to successfully close the "translation gap" for AI applications in digital pathology.
Assuntos
Inteligência Artificial/tendências , Diagnóstico , Técnicas e Procedimentos Diagnósticos/tendências , Aprendizado de Máquina/tendências , HumanosRESUMO
Both histologic subtypes and tumor mutation burden (TMB) represent important biomarkers in lung cancer, with implications for patient prognosis and treatment decisions. Typically, TMB is evaluated by comprehensive genomic profiling but this requires use of finite tissue specimens and costly, time-consuming laboratory processes. Histologic subtype classification represents an established component of lung adenocarcinoma histopathology, but can be challenging and is associated with substantial inter-pathologist variability. Here we developed a deep learning system to both classify histologic patterns in lung adenocarcinoma and predict TMB status using de-identified Hematoxylin and Eosin (H&E) stained whole slide images. We first trained a convolutional neural network to map histologic features across whole slide images of lung cancer resection specimens. On evaluation using an external data source, this model achieved patch-level area under the receiver operating characteristic curve (AUC) of 0.78-0.98 across nine histologic features. We then integrated the output of this model with clinico-demographic data to develop an interpretable model for TMB classification. The resulting end-to-end system was evaluated on 172 held out cases from TCGA, achieving an AUC of 0.71 (95% CI 0.63-0.80). The benefit of using histologic features in predicting TMB is highlighted by the significant improvement this approach offers over using the clinical features alone (AUC of 0.63 [95% CI 0.53-0.72], p = 0.002). Furthermore, we found that our histologic subtype-based approach achieved performance similar to that of a weakly supervised approach (AUC of 0.72 [95% CI 0.64-0.80]). Together these results underscore that incorporating histologic patterns in biomarker prediction for lung cancer provides informative signals, and that interpretable approaches utilizing these patterns perform comparably with less interpretable, weakly supervised approaches.
Assuntos
Adenocarcinoma de Pulmão/genética , Carcinoma Pulmonar de Células não Pequenas/genética , Aprendizado Profundo , Neoplasias Pulmonares/genética , Mutação , Adenocarcinoma de Pulmão/patologia , Adulto , Fatores Etários , Idoso , Idoso de 80 Anos ou mais , Área Sob a Curva , Carcinoma Pulmonar de Células não Pequenas/patologia , Corantes , Conjuntos de Dados como Assunto , Amarelo de Eosina-(YS) , Feminino , Hematoxilina , Humanos , Neoplasias Pulmonares/patologia , Masculino , Pessoa de Meia-Idade , Curva ROC , Fatores Sexuais , Fumar , Coloração e RotulagemRESUMO
Deriving interpretable prognostic features from deep-learning-based prognostic histopathology models remains a challenge. In this study, we developed a deep learning system (DLS) for predicting disease-specific survival for stage II and III colorectal cancer using 3652 cases (27,300 slides). When evaluated on two validation datasets containing 1239 cases (9340 slides) and 738 cases (7140 slides), respectively, the DLS achieved a 5-year disease-specific survival AUC of 0.70 (95% CI: 0.66-0.73) and 0.69 (95% CI: 0.64-0.72), and added significant predictive value to a set of nine clinicopathologic features. To interpret the DLS, we explored the ability of different human-interpretable features to explain the variance in DLS scores. We observed that clinicopathologic features such as T-category, N-category, and grade explained a small fraction of the variance in DLS scores (R2 = 18% in both validation sets). Next, we generated human-interpretable histologic features by clustering embeddings from a deep-learning-based image-similarity model and showed that they explained the majority of the variance (R2 of 73-80%). Furthermore, the clustering-derived feature most strongly associated with high DLS scores was also highly prognostic in isolation. With a distinct visual appearance (poorly differentiated tumor cell clusters adjacent to adipose tissue), this feature was identified by annotators with 87.0-95.5% accuracy. Our approach can be used to explain predictions from a prognostic deep learning model and uncover potentially-novel prognostic features that can be reliably identified by people for future validation studies.
RESUMO
Background: Gleason grading of prostate cancer is an important prognostic factor, but suffers from poor reproducibility, particularly among non-subspecialist pathologists. Although artificial intelligence (A.I.) tools have demonstrated Gleason grading on-par with expert pathologists, it remains an open question whether and to what extent A.I. grading translates to better prognostication. Methods: In this study, we developed a system to predict prostate cancer-specific mortality via A.I.-based Gleason grading and subsequently evaluated its ability to risk-stratify patients on an independent retrospective cohort of 2807 prostatectomy cases from a single European center with 5-25 years of follow-up (median: 13, interquartile range 9-17). Results: Here, we show that the A.I.'s risk scores produced a C-index of 0.84 (95% CI 0.80-0.87) for prostate cancer-specific mortality. Upon discretizing these risk scores into risk groups analogous to pathologist Grade Groups (GG), the A.I. has a C-index of 0.82 (95% CI 0.78-0.85). On the subset of cases with a GG provided in the original pathology report (n = 1517), the A.I.'s C-indices are 0.87 and 0.85 for continuous and discrete grading, respectively, compared to 0.79 (95% CI 0.71-0.86) for GG obtained from the reports. These represent improvements of 0.08 (95% CI 0.01-0.15) and 0.07 (95% CI 0.00-0.14), respectively. Conclusions: Our results suggest that A.I.-based Gleason grading can lead to effective risk stratification, and warrants further evaluation for improving disease management.
RESUMO
Background: Breast cancer management depends on biomarkers including estrogen receptor, progesterone receptor, and human epidermal growth factor receptor 2 (ER/PR/HER2). Though existing scoring systems are widely used and well-validated, they can involve costly preparation and variable interpretation. Additionally, discordances between histology and expected biomarker findings can prompt repeat testing to address biological, interpretative, or technical reasons for unexpected results. Methods: We developed three independent deep learning systems (DLS) to directly predict ER/PR/HER2 status for both focal tissue regions (patches) and slides using hematoxylin-and-eosin-stained (H&E) images as input. Models were trained and evaluated using pathologist annotated slides from three data sources. Areas under the receiver operator characteristic curve (AUCs) were calculated for test sets at both a patch-level (>135 million patches, 181 slides) and slide-level (n = 3274 slides, 1249 cases, 37 sites). Interpretability analyses were performed using Testing with Concept Activation Vectors (TCAV), saliency analysis, and pathologist review of clustered patches. Results: The patch-level AUCs are 0.939 (95%CI 0.936-0.941), 0.938 (0.936-0.940), and 0.808 (0.802-0.813) for ER/PR/HER2, respectively. At the slide level, AUCs are 0.86 (95%CI 0.84-0.87), 0.75 (0.73-0.77), and 0.60 (0.56-0.64) for ER/PR/HER2, respectively. Interpretability analyses show known biomarker-histomorphology associations including associations of low-grade and lobular histology with ER/PR positivity, and increased inflammatory infiltrates with triple-negative staining. Conclusions: This study presents rapid breast cancer biomarker estimation from routine H&E slides and builds on prior advances by prioritizing interpretability of computationally learned features in the context of existing pathological knowledge.
RESUMO
Providing prognostic information at the time of cancer diagnosis has important implications for treatment and monitoring. Although cancer staging, histopathological assessment, molecular features, and clinical variables can provide useful prognostic insights, improving risk stratification remains an active research area. We developed a deep learning system (DLS) to predict disease specific survival across 10 cancer types from The Cancer Genome Atlas (TCGA). We used a weakly-supervised approach without pixel-level annotations, and tested three different survival loss functions. The DLS was developed using 9,086 slides from 3,664 cases and evaluated using 3,009 slides from 1,216 cases. In multivariable Cox regression analysis of the combined cohort including all 10 cancers, the DLS was significantly associated with disease specific survival (hazard ratio of 1.58, 95% CI 1.28-1.70, p<0.0001) after adjusting for cancer type, stage, age, and sex. In a per-cancer adjusted subanalysis, the DLS remained a significant predictor of survival in 5 of 10 cancer types. Compared to a baseline model including stage, age, and sex, the c-index of the model demonstrated an absolute 3.7% improvement (95% CI 1.0-6.5) in the combined cohort. Additionally, our models stratified patients within individual cancer stages, particularly stage II (p = 0.025) and stage III (p<0.001). By developing and evaluating prognostic models across multiple cancer types, this work represents one of the most comprehensive studies exploring the direct prediction of clinical outcomes using deep learning and histopathology images. Our analysis demonstrates the potential for this approach to provide significant prognostic information in multiple cancer types, and even within specific pathologic stages. However, given the relatively small number of cases and observed clinical events for a deep learning task of this type, we observed wide confidence intervals for model performance, thus highlighting that future work will benefit from larger datasets assembled for the purposes for survival modeling.