RESUMO
The application of deep learning to spatial transcriptomics (ST) can reveal relationships between gene expression and tissue architecture. Prior work has demonstrated that inferring gene expression from tissue histomorphology can discern these spatial molecular markers to enable population scale studies, reducing the fiscal barriers associated with large-scale spatial profiling. However, while most improvements in algorithmic performance have focused on improving model architectures, little is known about how the quality of tissue preparation and imaging can affect deep learning model training for spatial inference from morphology and its potential for widespread clinical adoption. Prior studies for ST inference from histology typically utilize manually stained frozen sections with imaging on non-clinical grade scanners. Training such models on ST cohorts is also costly. We hypothesize that adopting tissue processing and imaging practices that mirror standards for clinical implementation (permanent sections, automated tissue staining, and clinical grade scanning) can significantly improve model performance. An enhanced specimen processing and imaging protocol was developed for deep learning-based ST inference from morphology. This protocol featured the Visium CytAssist assay to permit automated hematoxylin and eosin staining (e.g. Leica Bond), 40×-resolution imaging, and joining of multiple patients' tissue sections per capture area prior to ST profiling. Using a cohort of 13 pathologic T Stage-III stage colorectal cancer patients, we compared the performance of models trained on slide prepared using enhanced versus traditional (i.e. manual staining and low-resolution imaging) protocols. Leveraging Inceptionv3 neural networks, we predicted gene expression across serial, histologically-matched tissue sections using whole slide images (WSI) from both protocols. The data Shapley was used to quantify and compare marginal performance gains on a patient-by-patient basis attributed to using the enhanced protocol versus the actual costs of spatial profiling. Findings indicate that training and validating on WSI acquired through the enhanced protocol as opposed to the traditional method resulted in improved performance at lower fiscal cost. In the realm of ST, the enhancement of deep learning architectures frequently captures the spotlight; however, the significance of specimen processing and imaging is often understated. This research, informed through a game-theoretic lens, underscores the substantial impact that specimen preparation/imaging can have on spatial transcriptomic inference from morphology. It is essential to integrate such optimized processing protocols to facilitate the identification of prognostic markers at a larger scale.
Assuntos
Aprendizado Profundo , Transcriptoma , Humanos , Perfilação da Expressão Gênica/métodos , Processamento de Imagem Assistida por Computador/métodos , Algoritmos , Neoplasias Colorretais/genética , Neoplasias Colorretais/patologia , Neoplasias Colorretais/diagnóstico por imagemRESUMO
Aim: This study addresses the challenge of predicting the response of head and neck squamous cell carcinoma (HNSCC) patients to immunotherapy.Methods: Using DNA methylation cytometry, we analyzed the immune profiles of six HNSCC patients who showed a positive response to immunotherapy over a year without disease progression.Results: There was an initial increase in CD8 T memory cells and natural killer cells during the first four cycles of immunotherapy, which then returned to baseline levels after a year. Baseline CD8 T cell levels were lower in HNSCC immunotherapy responders but became similar to those in healthy subjects after immunotherapy.Conclusion: These findings suggest that monitoring fluctuations in immune profiles could potentially identify biomarkers for immunotherapy response in HNSCC patients.
[Box: see text].
Assuntos
Metilação de DNA , Neoplasias de Cabeça e Pescoço , Inibidores de Checkpoint Imunológico , Carcinoma de Células Escamosas de Cabeça e Pescoço , Humanos , Inibidores de Checkpoint Imunológico/uso terapêutico , Inibidores de Checkpoint Imunológico/farmacologia , Carcinoma de Células Escamosas de Cabeça e Pescoço/tratamento farmacológico , Carcinoma de Células Escamosas de Cabeça e Pescoço/imunologia , Carcinoma de Células Escamosas de Cabeça e Pescoço/genética , Carcinoma de Células Escamosas de Cabeça e Pescoço/terapia , Masculino , Pessoa de Meia-Idade , Feminino , Neoplasias de Cabeça e Pescoço/tratamento farmacológico , Neoplasias de Cabeça e Pescoço/imunologia , Neoplasias de Cabeça e Pescoço/genética , Idoso , Linfócitos T CD8-Positivos/imunologia , Linfócitos T CD8-Positivos/metabolismo , Linfócitos T CD8-Positivos/efeitos dos fármacos , Células Matadoras Naturais/imunologia , Células Matadoras Naturais/metabolismo , Sobreviventes de CâncerRESUMO
BACKGROUND: High-grade serous carcinoma (HGSC) gene expression subtypes are associated with differential survival. We characterized HGSC gene expression in Black individuals and considered whether gene expression differences by self-identified race may contribute to poorer HGSC survival among Black versus White individuals. METHODS: We included newly generated RNA sequencing data from Black and White individuals and array-based genotyping data from four existing studies of White and Japanese individuals. We used K-means clustering, a method with no predefined number of clusters or dataset-specific features, to assign subtypes. Cluster- and dataset-specific gene expression patterns were summarized by moderated t-scores. We compared cluster-specific gene expression patterns across datasets by calculating the correlation between the summarized vectors of moderated t-scores. After mapping to The Cancer Genome Atlas-derived HGSC subtypes, we used Cox proportional hazards models to estimate subtype-specific survival by dataset. RESULTS: Cluster-specific gene expression was similar across gene expression platforms and racial groups. Comparing the Black population with the White and Japanese populations, the immunoreactive subtype was more common (39% vs. 23%-28%) and the differentiated subtype was less common (7% vs. 22%-31%). Patterns of subtype-specific survival were similar between the Black and White populations with RNA sequencing data; compared with mesenchymal cases, the risk of death was similar for proliferative and differentiated cases and suggestively lower for immunoreactive cases [Black population HR = 0.79 (0.55, 1.13); White population HR = 0.86 (0.62, 1.19)]. CONCLUSIONS: Although the prevalence of HGSC subtypes varied by race, subtype-specific survival was similar. IMPACT: HGSC subtypes can be consistently assigned across platforms and self-identified racial groups.
Assuntos
Cistadenocarcinoma Seroso , Neoplasias Ovarianas , Humanos , Feminino , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/etnologia , Neoplasias Ovarianas/patologia , Neoplasias Ovarianas/mortalidade , Cistadenocarcinoma Seroso/genética , Cistadenocarcinoma Seroso/patologia , Cistadenocarcinoma Seroso/etnologia , Cistadenocarcinoma Seroso/mortalidade , Pessoa de Meia-Idade , População Branca/genética , População Branca/estatística & dados numéricos , Gradação de Tumores , Idoso , Negro ou Afro-Americano/genética , Negro ou Afro-Americano/estatística & dados numéricosRESUMO
Although intratumoral heterogeneity has been established in pediatric central nervous system tumors, epigenomic alterations at the cell type level have largely remained unresolved. To identify cell type-specific alterations to cytosine modifications in pediatric central nervous system tumors, we utilize a multi-omic approach that integrated bulk DNA cytosine modification data (methylation and hydroxymethylation) with both bulk and single-cell RNA-sequencing data. We demonstrate a large reduction in the scope of significantly differentially modified cytosines in tumors when accounting for tumor cell type composition. In the progenitor-like cell types of tumors, we identify a preponderance differential Cytosine-phosphate-Guanine site hydroxymethylation rather than methylation. Genes with differential hydroxymethylation, like histone deacetylase 4 and insulin-like growth factor 1 receptor, are associated with cell type-specific changes in gene expression in tumors. Our results highlight the importance of epigenomic alterations in the progenitor-like cell types and its role in cell type-specific transcriptional regulation in pediatric central nervous system tumors.
Assuntos
Neoplasias do Sistema Nervoso Central , Metilação de DNA , Epigênese Genética , Regulação Neoplásica da Expressão Gênica , Humanos , Neoplasias do Sistema Nervoso Central/genética , Neoplasias do Sistema Nervoso Central/metabolismo , Neoplasias do Sistema Nervoso Central/patologia , Criança , Histona Desacetilases/metabolismo , Histona Desacetilases/genética , Epigenômica/métodos , Proteínas Repressoras/metabolismo , Proteínas Repressoras/genética , Análise de Célula Única , Transcrição Gênica , Citosina/metabolismoRESUMO
Background: Bladder cancer and therapy responses hinge on immune profiles in the tumor microenvironment (TME) and blood, yet studies linking tumor-infiltrating immune cells to peripheral immune profiles are limited. Methods: DNA methylation cytometry quantified TME and matched peripheral blood immune cell proportions. With tumor immune profile data as the input, subjects were grouped by immune infiltration status and consensus clustering. Results: Immune hot and cold groups had different immune compositions in the TME but not in circulating blood. Two clusters of patients identified with consensus clustering had different immune compositions not only in the TME but also in blood. Conclusion: Detailed immune profiling via methylation cytometry reveals the significance of understanding tumor and systemic immune relationships in cancer patients.
Bladder cancer and treatment outcomes depend on the immune profiles in the tumor and blood. Our study, using DNA methylation cytometry, measured immune cell proportions in both areas. Patients were grouped based on immune status and consensus clustering. Results showed distinct immune compositions in the tumor, but not in blood, for hot and cold groups. Consensus clustering revealed two patient clusters with differing immune compositions in both tumor and blood. This detailed immune profiling highlights the importance of understanding the complex interplay between tumor and systemic immunity in bladder cancer patients.
Assuntos
Microambiente Tumoral , Neoplasias da Bexiga Urinária , Humanos , Neoplasias da Bexiga Urinária/genética , Análise por Conglomerados , Metilação de DNA , Processamento de Proteína Pós-Traducional , PrognósticoRESUMO
BACKGROUND: Among men, prostate cancer (PCa) is the second most common cancer and the second leading cause of cancer death. Etiologic factors associated with both prostate carcinogenesis and somatic alterations in tumors are incompletely understood. While genetic variants associated with PCa have been identified, epigenetic alterations in PCa are relatively understudied. To date, DNA methylation (DNAm) and gene expression (GE) in PCa have been investigated; however, these studies did not correct for cell-type proportions of the tumor microenvironment (TME), which could confound results. METHODS: The data (GSE183040) consisted of DNAm and GE data from both tumor and adjacent non-tumor prostate tissue of 56 patients who underwent radical prostatectomies prior to any treatment. This study builds upon previous studies that examined methylation patterns and GE in PCa patients by using a novel tumor deconvolution approach to identify and correct for cell-type proportions of the TME in its epigenome-wide association study (EWAS) and differential expression analysis (DEA). RESULTS: The inclusion of cell-type proportions in EWASs and DEAs reduced the scope of significant alterations associated with PCa. We identified 2,093 significantly differentially methylated CpGs (DMC), and 51 genes associated with PCa, including PCA3, SPINK1, and AMACR. CONCLUSIONS: This work illustrates the importance of correcting for cell types of the TME when performing EWASs and DEAs on PCa samples, and establishes a more confounding-adverse methodology. We identified a more tumor-cell-specific set of altered genes and epigenetic marks that can be further investigated as potential biomarkers of disease or potential therapeutic targets.
Assuntos
Metilação de DNA , Neoplasias da Próstata , Masculino , Humanos , Epigênese Genética , Microambiente Tumoral/genética , Ilhas de CpG , Neoplasias da Próstata/patologia , Expressão Gênica , Inibidor da Tripsina Pancreática de Kazal/genética , Inibidor da Tripsina Pancreática de Kazal/metabolismoRESUMO
The advent of spatial transcriptomics technologies has heralded a renaissance in research to advance our understanding of the spatial cellular and transcriptional heterogeneity within tissues. Spatial transcriptomics allows investigation of the interplay between cells, molecular pathways, and the surrounding tissue architecture and can help elucidate developmental trajectories, disease pathogenesis, and various niches in the tumor microenvironment. Photoaging is the histological and molecular skin damage resulting from chronic/acute sun exposure and is a major risk factor for skin cancer. Spatial transcriptomics technologies hold promise for improving the reliability of evaluating photoaging and developing new therapeutics. Challenges to current methods include limited focus on dermal elastosis variations and reliance on self-reported measures, which can introduce subjectivity and inconsistency. Spatial transcriptomics offers an opportunity to assess photoaging objectively and reproducibly in studies of carcinogenesis and discern the effectiveness of therapies that intervene in photoaging and preventing cancer. Evaluation of distinct histological architectures using highly-multiplexed spatial technologies can identify specific cell lineages that have been understudied due to their location beyond the depth of UV penetration. However, the cost and interpatient variability using state-of-the-art assays such as the 10x Genomics Spatial Transcriptomics assays limits the scope and scale of large-scale molecular epidemiologic studies. Here, we investigate the inference of spatial transcriptomics information from routine hematoxylin and eosin-stained (H&E) tissue slides. We employed the Visium CytAssist spatial transcriptomics assay to analyze over 18,000 genes at a 50-micron resolution for four patients from a cohort of 261 skin specimens collected adjacent to surgical resection sites for basal cell and squamous cell keratinocyte tumors. The spatial transcriptomics data was co-registered with 40x resolution whole slide imaging (WSI) information. We developed machine learning models that achieved a macro-averaged median AUC and F1 score of 0.80 and 0.61 and Spearman coefficient of 0.60 in inferring transcriptomic profiles across the slides, and accurately captured biological pathways across various tissue architectures.
Assuntos
Envelhecimento da Pele , Humanos , Envelhecimento da Pele/genética , Reprodutibilidade dos Testes , Biologia Computacional , Perfilação da Expressão Gênica , Amarelo de Eosina-(YS) , TranscriptomaRESUMO
Graph-based deep learning has shown great promise in cancer histopathology image analysis by contextualizing complex morphology and structure across whole slide images to make high quality downstream outcome predictions (ex: prognostication). These methods rely on informative representations (i.e., embeddings) of image patches comprising larger slides, which are used as node attributes in slide graphs. Spatial omics data, including spatial transcriptomics, is a novel paradigm offering a wealth of detailed information. Pairing this data with corresponding histological imaging localized at 50-micron resolution, may facilitate the development of algorithms which better appreciate the morphological and molecular underpinnings of carcinogenesis. Here, we explore the utility of leveraging spatial transcriptomics data with a contrastive crossmodal pretraining mechanism to generate deep learning models that can extract molecular and histological information for graph-based learning tasks. Performance on cancer staging, lymph node metastasis prediction, survival prediction, and tissue clustering analyses indicate that the proposed methods bring improvement to graph based deep learning models for histopathological slides compared to leveraging histological information from existing schemes, demonstrating the promise of mining spatial omics data to enhance deep learning for pathology workflows.
Assuntos
Aprendizado Profundo , Neoplasias , Humanos , Biologia Computacional , Neoplasias/genética , Algoritmos , Análise por ConglomeradosRESUMO
DNA methylation has been extensively utilized to study epigenetic patterns across many diseases as well as to deconvolve blood cell type proportions. This study builds upon previous studies examining methylation patterns in paediatric patients with varying stages of Crohn's disease to extend the immune profiling of these patients using a novel deconvolution approach. Compared with control subjects, we observed significantly decreased levels of CD4 memory and naive, CD8 naive, and natural killer cells and elevated neutrophil levels in Crohn's disease. In addition, Crohn's patients had a significantly elevated neutrophil-to-lymphocyte ratio. Using an epigenome-wide association approach and adjusting for potential confounders, including cell type, we observed 397 differentially methylated CpG (DMC) sites associated with Crohn's disease. The top genetic pathway associated with the DMCs was the regulation of arginine metabolic processes which are involved in the regulation of T cells.
Assuntos
Doença de Crohn , Humanos , Criança , Doença de Crohn/genética , Metilação de DNARESUMO
Introduction: High-grade serous carcinoma (HGSC) gene expression subtypes are associated with differential survival. We characterized HGSC gene expression in Black individuals and considered whether gene expression differences by race may contribute to poorer HGSC survival among Black versus non-Hispanic White individuals. Methods: We included newly generated RNA-Seq data from Black and White individuals, and array-based genotyping data from four existing studies of White and Japanese individuals. We assigned subtypes using K-means clustering. Cluster- and dataset-specific gene expression patterns were summarized by moderated t-scores. We compared cluster-specific gene expression patterns across datasets by calculating the correlation between the summarized vectors of moderated t-scores. Following mapping to The Cancer Genome Atlas (TCGA)-derived HGSC subtypes, we used Cox proportional hazards models to estimate subtype-specific survival by dataset. Results: Cluster-specific gene expression was similar across gene expression platforms. Comparing the Black study population to the White and Japanese study populations, the immunoreactive subtype was more common (39% versus 23%-28%) and the differentiated subtype less common (7% versus 22%-31%). Patterns of subtype-specific survival were similar between the Black and White populations with RNA-Seq data; compared to mesenchymal cases, the risk of death was similar for proliferative and differentiated cases and suggestively lower for immunoreactive cases (Black population HR=0.79 [0.55, 1.13], White population HR=0.86 [0.62, 1.19]). Conclusions: A single, platform-agnostic pipeline can be used to assign HGSC gene expression subtypes. While the observed prevalence of HGSC subtypes varied by race, subtype-specific survival was similar.
RESUMO
Background: Spatial transcriptomics involves studying the spatial organization of gene expression within tissues, offering insights into the molecular diversity of tumors. While spatial gene expression is commonly amalgamated from 1-10 cells across 50-micron spots, recent methods have demonstrated the capability to disaggregate this information at subspot resolution by leveraging both expression and histological patterns. However, elucidating such information from histology alone presents a significant challenge but if solved can better permit spatial molecular analysis at cellular resolution for instances where Visium data is not available, reducing study costs. This study explores integrating single-cell histological and transcriptomic data to infer spatial mRNA expression patterns in whole slide images collected from a cohort of stage pT3 colorectal cancer patients. A cell graph neural network algorithm was developed to align histological information extracted from detected cells with single cell RNA patterns through optimal transport methods, facilitating the analysis of cellular groupings and gene relationships. This approach leveraged spot-level expression as an intermediary to co-map histological and transcriptomic information at the single-cell level. Results: Our study demonstrated that single-cell transcriptional heterogeneity within a spot could be predicted from histological markers extracted from cells detected within a spot. Furthermore, our model exhibited proficiency in delineating overarching gene expression patterns across whole-slide images. This approach compared favorably to traditional patch-based computer vision methods as well as other methods which did not incorporate single cell expression during the model fitting procedures. Topological nuances of single-cell expression within a Visium spot were preserved using the developed methodology. Conclusion: This innovative approach augments the resolution of spatial molecular assays utilizing histology as a sole input through synergistic co-mapping of histological and transcriptomic datasets at the single-cell level, anchored by spatial transcriptomics. While initial results are promising, they warrant rigorous validation. This includes collaborating with pathologists for precise spatial identification of distinct cell types and utilizing sophisticated assays, such as Xenium, to attain deeper subcellular insights.
RESUMO
The application of deep learning methods to spatial transcriptomics has shown promise in unraveling the complex relationships between gene expression patterns and tissue architecture as they pertain to various pathological conditions. Deep learning methods that can infer gene expression patterns directly from tissue histomorphology can expand the capability to discern spatial molecular markers within tissue slides. However, current methods utilizing these techniques are plagued by substantial variability in tissue preparation and characteristics, which can hinder the broader adoption of these tools. Furthermore, training deep learning models using spatial transcriptomics on small study cohorts remains a costly endeavor. Necessitating novel tissue preparation processes enhance assay reliability, resolution, and scalability. This study investigated the impact of an enhanced specimen processing workflow for facilitating a deep learning-based spatial transcriptomics assessment. The enhanced workflow leveraged the flexibility of the Visium CytAssist assay to permit automated H&E staining (e.g., Leica Bond) of tissue slides, whole-slide imaging at 40x-resolution, and multiplexing of tissue sections from multiple patients within individual capture areas for spatial transcriptomics profiling. Using a cohort of thirteen pT3 stage colorectal cancer (CRC) patients, we compared the efficacy of deep learning models trained on slide prepared using an enhanced workflow as compared to the traditional workflow which leverages manual tissue staining and standard imaging of tissue slides. Leveraging Inceptionv3 neural networks, we aimed to predict gene expression patterns across matched serial tissue sections, each stemming from a distinct workflow but aligned based on persistent histological structures. Findings indicate that the enhanced workflow considerably outperformed the traditional spatial transcriptomics workflow. Gene expression profiles predicted from enhanced tissue slides also yielded expression patterns more topologically consistent with the ground truth. This led to enhanced statistical precision in pinpointing biomarkers associated with distinct spatial structures. These insights can potentially elevate diagnostic and prognostic biomarker detection by broadening the range of spatial molecular markers linked to metastasis and recurrence. Future endeavors will further explore these findings to enrich our comprehension of various diseases and uncover molecular pathways with greater nuance. Combining deep learning with spatial transcriptomics provides a compelling avenue to enrich our understanding of tumor biology and improve clinical outcomes. For results of the highest fidelity, however, effective specimen processing is crucial, and fostering collaboration between histotechnicians, pathologists, and genomics specialists is essential to herald this new era in spatial transcriptomics-driven cancer research.
RESUMO
BACKGROUND: Seasonal variations in environmental exposures at birth or during gestation are associated with numerous adult traits and health outcomes later in life. Whether DNA methylation (DNAm) plays a role in the molecular mechanisms underlying the associations between birth season and lifelong phenotypes remains unclear. METHODS: We carried out epigenome-wide meta-analyses within the Pregnancy And Childhood Epigenetic Consortium to identify associations of DNAm with birth season, both at differentially methylated probes (DMPs) and regions (DMRs). Associations were examined at two time points: at birth (21 cohorts, N = 9358) and in children aged 1-11 years (12 cohorts, N = 3610). We conducted meta-analyses to assess the impact of latitude on birth season-specific associations at both time points. RESULTS: We identified associations between birth season and DNAm (False Discovery Rate-adjusted p values < 0.05) at two CpGs at birth (winter-born) and four in the childhood (summer-born) analyses when compared to children born in autumn. Furthermore, we identified twenty-six differentially methylated regions (DMR) at birth (winter-born: 8, spring-born: 15, summer-born: 3) and thirty-two in childhood (winter-born: 12, spring and summer: 10 each) meta-analyses with few overlapping DMRs between the birth seasons or the two time points. The DMRs were associated with genes of known functions in tumorigenesis, psychiatric/neurological disorders, inflammation, or immunity, amongst others. Latitude-stratified meta-analyses [higher (≥ 50°N), lower (< 50°N, northern hemisphere only)] revealed differences in associations between birth season and DNAm by birth latitude. DMR analysis implicated genes with previously reported links to schizophrenia (LAX1), skin disorders (PSORS1C, LTB4R), and airway inflammation including asthma (LTB4R), present only at birth in the higher latitudes (≥ 50°N). CONCLUSIONS: In this large epigenome-wide meta-analysis study, we provide evidence for (i) associations between DNAm and season of birth that are unique for the seasons of the year (temporal effect) and (ii) latitude-dependent variations in the seasonal associations (spatial effect). DNAm could play a role in the molecular mechanisms underlying the effect of birth season on adult health outcomes.
Assuntos
Asma , Metilação de DNA , Criança , Pré-Escolar , Humanos , Lactente , Recém-Nascido , Carcinogênese , Inflamação , Estações do AnoRESUMO
BACKGROUND: Immune profiles have been associated with bladder cancer outcomes and may have clinical applications for prognosis. However, associations of detailed immune cell subtypes with patient outcomes remain underexplored and may contribute crucial prognostic information for better managing bladder cancer recurrence and survival. METHODS: Bladder cancer case peripheral blood DNA methylation was measured using the Illumina HumanMethylationEPIC array. Extended cell-type deconvolution quantified 12 immune cell-type proportions, including memory, naïve T and B cells, and granulocyte subtypes. DNA methylation clocks determined biological age. Cox proportional hazards models tested associations of immune cell profiles and age acceleration with bladder cancer outcomes. The partDSA algorithm discriminated 10-year overall survival groups from clinical variables and immune cell profiles, and a semi-supervised recursively partitioned mixture model (SS-RPMM) with DNA methylation data was applied to identify a classifier for 10-year overall survival. RESULTS: Higher CD8T memory cell proportions were associated with better overall survival [HR = 0.95, 95% confidence interval (CI) = 0.93-0.98], while higher neutrophil-to-lymphocyte ratio (HR = 1.36, 95% CI = 1.23-1.50), CD8T naïve (HR = 1.21, 95% CI = 1.04-1.41), neutrophil (HR = 1.04, 95% CI = 1.03-1.06) proportions, and age acceleration (HR = 1.06, 95% CI = 1.03-1.08) were associated with worse overall survival in patient with bladder cancer. partDSA and SS-RPMM classified five groups of subjects with significant differences in overall survival. CONCLUSIONS: We identified associations between immune cell subtypes and age acceleration with bladder cancer outcomes. IMPACT: The findings of this study suggest that bladder cancer outcomes are associated with specific methylation-derived immune cell-type proportions and age acceleration, and these factors could be potential prognostic biomarkers.
Assuntos
Recidiva Local de Neoplasia , Neoplasias da Bexiga Urinária , Humanos , Recidiva Local de Neoplasia/genética , Metilação de DNA , Linfócitos , Modelos de Riscos Proporcionais , PrognósticoRESUMO
Background: Gastric adenocarcinomas are a leading cause of global mortality, associated with chronic infection with Helicobacter pylori. The mechanisms by which infection with H. pylori contributes to carcinogenesis are not well understood. Recent studies from subjects with and without gastric cancer have identified significant DNA methylation alterations in normal gastric mucosa associated with H. pylori infection and gastric cancer risk. Here we further investigated DNA methylation alterations in normal gastric mucosa in gastric cancer cases (n = 42) and control subjects (n = 42) with H. pylori infection data. We assessed tissue cell type composition, DNA methylation alterations within cell populations, epigenetic aging, and repetitive element methylation. Results: In normal gastric mucosa of both gastric cancer cases and control subjects, we observed increased epigenetic age acceleration associated with H. pylori infection. We also observed an increased mitotic tick rate associated with H. pylori infection in both gastric cancer cases and controls. Significant differences in immune cell populations associated with H. pylori infection in normal tissue from cancer cases and controls were identified using DNA methylation cell type deconvolution. We also found natural killer cell-specific methylation alterations in normal mucosa from gastric cancer patients with H. pylori infection. Conclusions: Our findings from normal gastric mucosa provide insight into underlying cellular composition and epigenetic aspects of H. pylori associated gastric cancer etiology.
RESUMO
BACKGROUND: Deep learning models can infer cancer patient prognosis from molecular and anatomic pathology information. Recent studies that leveraged information from complementary multimodal data improved prognostication, further illustrating the potential utility of such methods. However, current approaches: 1) do not comprehensively leverage biological and histomorphological relationships and 2) make use of emerging strategies to "pretrain" models (i.e., train models on a slightly orthogonal dataset/modeling objective) which may aid prognostication by reducing the amount of information required for achieving optimal performance. In addition, model interpretation is crucial for facilitating the clinical adoption of deep learning methods by fostering practitioner understanding and trust in the technology. METHODS: Here, we develop an interpretable multimodal modeling framework that combines DNA methylation, gene expression, and histopathology (i.e., tissue slides) data, and we compare performance of crossmodal pretraining, contrastive learning, and transfer learning versus the standard procedure. RESULTS: Our models outperform the existing state-of-the-art method (average 11.54% C-index increase), and baseline clinically driven models (average 11.7% C-index increase). Model interpretations elucidate consideration of biologically meaningful factors in making prognosis predictions. DISCUSSION: Our results demonstrate that the selection of pretraining strategies is crucial for obtaining highly accurate prognostication models, even more so than devising an innovative model architecture, and further emphasize the all-important role of the tumor microenvironment on disease progression.
RESUMO
Over 150â¯000 Americans are diagnosed with colorectal cancer (CRC) every year, and annually over 50â¯000 individuals will die from CRC, necessitating improvements in screening, prognostication, disease management, and therapeutic options. Tumor metastasis is the primary factor related to the risk of recurrence and mortality. Yet, screening for nodal and distant metastasis is costly, and invasive and incomplete resection may hamper adequate assessment. Signatures of the tumor-immune microenvironment (TIME) at the primary site can provide valuable insights into the aggressiveness of the tumor and the effectiveness of various treatment options. Spatially resolved transcriptomics technologies offer an unprecedented characterization of TIME through high multiplexing, yet their scope is constrained by cost. Meanwhile, it has long been suspected that histological, cytological, and macroarchitectural tissue characteristics correlate well with molecular information (e.g., gene expression). Thus, a method for predicting transcriptomics data through inference of RNA patterns from whole slide images (WSI) is a key step in studying metastasis at scale. In this work, we collected tissue from 4 stage-III (pT3) matched colorectal cancer patients for spatial transcriptomics profiling. The Visium spatial transcriptomics (ST) assay was used to measure transcript abundance for 17â¯943 genes at up to 5000 55-micron (i.e., 1-10 cells) spots per patient sampled in a honeycomb pattern, co-registered with hematoxylin and eosin (H&E) stained WSI. The Visium ST assay can measure expression at these spots through tissue permeabilization of mRNAs, which are captured through spatially (i.e., x-y positional coordinates) barcoded, gene specific oligo probes. WSI subimages were extracted around each co-registered Visium spot and were used to predict the expression at these spots using machine learning models. We prototyped and compared several convolutional, transformer, and graph convolutional neural networks to predict spatial RNA patterns at the Visium spots under the hypothesis that the transformer- and graph-based approaches better capture relevant spatial tissue architecture. We further analyzed the model's ability to recapitulate spatial autocorrelation statistics using SPARK and SpatialDE. Overall, the results indicate that the transformer- and graph-based approaches were unable to outperform the convolutional neural network architecture, though they exhibited optimal performance for relevant disease-associated genes. Initial findings suggest that different neural networks that operate on different scales are relevant for capturing distinct disease pathways (e.g., epithelial to mesenchymal transition). We add further evidence that deep learning models can accurately predict gene expression in whole slide images and comment on understudied factors which may increase its external applicability (e.g., tissue context). Our preliminary work will motivate further investigation of inference for molecular patterns from whole slide images as metastasis predictors and in other applications.
RESUMO
Over 150,000 Americans are diagnosed with colorectal cancer (CRC) every year, and annually >50,000 individuals are estimated to die of CRC, necessitating improvements in screening, prognostication, disease management, and therapeutic options. CRC tumors are removed en bloc with surrounding vasculature and lymphatics. Examination of regional lymph nodes at the time of surgical resection is essential for prognostication. Developing alternative approaches to indirectly assess recurrence risk would have utility in cases where lymph node yield is incomplete or inadequate. Spatially dependent, immune cell-specific (eg, tumor-infiltrating lymphocytes), proteomic, and transcriptomic expression patterns inside and around the tumor-the tumor immune microenvironment-can predict nodal/distant metastasis and probe the coordinated immune response from the primary tumor site. The comprehensive characterization of tumor-infiltrating lymphocytes and other immune infiltrates is possible using highly multiplexed spatial omics technologies, such as the GeoMX Digital Spatial Profiler. In this study, machine learning and differential co-expression analyses helped identify biomarkers from Digital Spatial Profiler-assayed protein expression patterns inside, at the invasive margin, and away from the tumor, associated with extracellular matrix remodeling (eg, granzyme B and fibronectin), immune suppression (eg, forkhead box P3), exhaustion and cytotoxicity (eg, CD8), Programmed death ligand 1-expressing dendritic cells, and neutrophil proliferation, among other concomitant alterations. Further investigation of these biomarkers may reveal independent risk factors of CRC metastasis that can be formulated into low-cost, widely available assays.
Assuntos
Neoplasias do Colo , Neoplasias Colorretais , Humanos , Proteômica , Neoplasias Colorretais/metabolismo , Biomarcadores/metabolismo , Linfonodos , Neoplasias do Colo/patologia , Linfócitos do Interstício Tumoral , Microambiente Tumoral , Biomarcadores Tumorais/metabolismoRESUMO
Human cancers are heterogenous by their cell composition and origination site. Cancer metastasis generates the conundrum of the unknown origin of migrated tumor cells. Tracing tissue of origin and tumor type in primary and metastasized cancer is vital for clinical significance. DNA methylation alterations play a crucial role in carcinogenesis and mark cell fate differentiation, thus can be used to trace tumor tissue of origin. In this study, we employed a novel tumor-type-specific hierarchical model using genome-scale DNA methylation data to develop a multilayer perceptron model, HiTAIC, to trace tissue of origin and tumor type in 27 cancers from 23 tissue sites in data from 7735 tumors with high resolution, accuracy, and specificity. In tracing primary cancer origin, HiTAIC accuracy was 99% in the test set and 93% in the external validation data set. Metastatic cancers were identified with a 96% accuracy in the external data set. HiTAIC is a user-friendly web-based application through https://sites.dartmouth.edu/salaslabhitaic/. In conclusion, we developed HiTAIC, a DNA methylation-based algorithm, to trace tumor tissue of origin in primary and metastasized cancers. The high accuracy and resolution of tumor tracing using HiTAIC holds promise for clinical assistance in identifying cancer of unknown origin.
RESUMO
Although intratumoral heterogeneity has been established in pediatric central nervous system tumors, epigenomic alterations at the cell type level have largely remained unresolved. To identify cell type-specific alterations to cytosine modifications in pediatric central nervous system tumors we utilized a multi-omic approach that integrated bulk DNA cytosine modification data (methylation and hydroxymethylation) with both bulk and single-cell RNA-sequencing data. We demonstrate a large reduction in the scope of significantly differentially modified cytosines in tumors when accounting for tumor cell type composition. In the progenitor-like cell types of tumors, we identified a preponderance differential CpG hydroxymethylation rather than methylation. Genes with differential hydroxymethylation, like HDAC4 and IGF1R, were associated with cell type-specific changes in gene expression in tumors. Our results highlight the importance of epigenomic alterations in the progenitor-like cell types and its role in cell type-specific transcriptional regulation in pediatric CNS tumors.