RESUMEN
AIM: Esophageal Squamous Cell Carcinoma (ESCC) is a histological subtype of esophageal cancer that begins in the squamous cells in the esophagus. In only 19% of the ESCC-diagnosed patients, a five-year survival rate has been seen. This necessitates the identification of high-confidence biomarkers for early diagnosis, prognosis, and potential therapeutic targets for the mitigation of ESCC. METHOD: We performed a meta-analysis of 10 mRNA datasets and identified consistently perturbed genes across the studies. Then, integrated with ESCC ATLAS to segregate 'core' genes to identify consequences of primary gene perturbation events leading to gene-gene interactions and dysregulated molecular signaling pathways. Further, by integrating with toxicogenomics data, inferences were drawn for gene interaction with environmental exposures, trace elements, chemical carcinogens, and drug chemicals. We also deduce the clinical outcomes of candidate genes based on survival analysis using the ESCC related dataset in The Cancer Genome Atlas. RESULT: We identified 237 known and 18 novel perturbed candidate genes. Desmoglein 1 (DSG1) is one such gene that we found significantly downregulated (Fold Change =-1.89, p-value = 8.2e-06) in ESCC across six different datasets. Further, we identified 31 'core' genes (that either harbor genetic variants or are regulated by epigenetic modifications) and found regulating key biological pathways via adjoining genes in gene-gene interaction networks. Functional enrichment analysis showed dysregulated biological processes and pathways including "Extracellular matrix", "Collagen trimmer" and "HPV infection" are significantly overrepresented in our candidate genes. Based on the toxicogenomic inferences from Comparative Toxicogenomics Database we report the key genes that interacted with risk factors such as tobacco smoking, zinc, nitroso benzylmethylamine, and drug chemicals such as cisplatin, Fluorouracil, and Mitomycin in relation to ESCC. We also point to the STC2 gene that shows a high risk for mortality in ESCC patients. CONCLUSION: We identified novel perturbed genes in relation to ESCC and explored their interaction network. DSG1 is one such gene, its association with microbiota and a clinical presentation seen commonly with ESCC hints that it is a good candidate for early diagnostic marker. Besides, in this study we highlight candidate genes and their molecular connections to risk factors, biological pathways, drug chemicals, and the survival probability of ESCC patients.
Asunto(s)
Neoplasias Esofágicas , Carcinoma de Células Escamosas de Esófago , Humanos , Carcinoma de Células Escamosas de Esófago/genética , Carcinoma de Células Escamosas de Esófago/patología , Neoplasias Esofágicas/patología , Desmogleína 1/genética , Desmogleína 1/metabolismo , Regulación hacia Abajo , Perfilación de la Expresión Génica , Biología Computacional , Genómica , Pronóstico , ARN Mensajero/genética , Regulación Neoplásica de la Expresión Génica , Biomarcadores de Tumor/genéticaRESUMEN
BACKGROUND: Esophageal squamous cell carcinoma (ESCC) has a poor prognosis and is one of the deadliest gastrointestinal malignancies. Despite numerous transcriptomics studies to understand its molecular basis, the impact of population-specific differences on this disease remains unexplored. AIMS: This study aimed to investigate the population-specific differences in gene expression patterns among ESCC samples obtained from six distinct global populations, identify differentially expressed genes (DEGs) and their associated pathways, and identify potential biomarkers for ESCC diagnosis and prognosis. In addition, this study deciphers population specific microbial and chemical risk factors in ESCC. METHODS: We compared the gene expression patterns of ESCC samples from six different global populations by analyzing microarray datasets. To identify DEGs, we conducted stringent quality control and employed linear modeling. We cross-compared the resulting DEG lists of each populations along with ESCC ATLAS to identify known and novel DEGs. We performed a survival analysis using The Cancer Genome Atlas Program (TCGA) data to identify potential biomarkers for ESCC diagnosis and prognosis among the novel DEGs. Finally, we performed comparative functional enrichment and toxicogenomic analysis. RESULTS: Here we report 19 genes with distinct expression patterns among populations, indicating population-specific variations in ESCC. Additionally, we discovered 166 novel DEGs, such as ENDOU, SLCO1B3, KCNS3, IFI35, among others. The survival analysis identified three novel genes (CHRM3, CREG2, H2AC6) critical for ESCC survival. Notably, our findings showed that ECM-related gene ontology terms and pathways were significantly enriched among the DEGs in ESCC. We also found population-specific variations in immune response and microbial infection-related pathways which included genes enriched for HPV, Ameobiosis, Leishmaniosis, and Human Cytomegaloviruses. Our toxicogenomic analysis identified tobacco smoking as the primary risk factor and cisplatin as the main drug chemical interacting with the maximum number of DEGs across populations. CONCLUSION: This study provides new insights into population-specific differences in gene expression patterns and their associated pathways in ESCC. Our findings suggest that changes in extracellular matrix (ECM) organization may be crucial to the development and progression of this cancer, and that environmental and genetic factors play important roles in the disease. The novel DEGs identified may serve as potential biomarkers for diagnosis, prognosis and treatment.
RESUMEN
Esophageal cancer (EC) is the eighth most aggressive malignancy and its treatment remains a challenge due to the lack of biomarkers that can facilitate early detection. EC is identified in two major histological forms namely - Adenocarcinoma (EAC) and Squamous cell carcinoma (ESCC), each showing differences in the incidence among populations that are geographically separated. Hence the detection of potential drug target and biomarkers demands a population-centric understanding of the molecular and cellular mechanisms of EC. To provide an adequate impetus to the biomarker discovery for ESCC, which is the most prevalent esophageal cancer worldwide, here we have developed ESCC ATLAS, a manually curated database that integrates genetic, epigenetic, transcriptomic, and proteomic ESCC-related genes from the published literature. It consists of 3475 genes associated to molecular signatures such as, altered transcription (2600), altered translation (560), contain copy number variation/structural variations (233), SNPs (102), altered DNA methylation (82), Histone modifications (16) and miRNA based regulation (261). We provide a user-friendly web interface ( http://www.esccatlas.org , freely accessible for academic, non-profit users) that facilitates the exploration and the analysis of genes among different populations. We anticipate it to be a valuable resource for the population specific investigation and biomarker discovery for ESCC.