RESUMO
INTRODUCTION: The differential expression of miRNAs, a key regulator in many cell signaling pathways, has been studied in various malignancies and may have an important role in cancer progression, including colorectal cancer (CRC). METHOD: The present study used machine learning and gene interaction study tools to explore the prognostic and diagnostic value of miRNAs in CRC. Integrative analysis of 353 CRC samples and normal tissue data was obtained from the TCGA database and further analyzed by R packages to define the deferentially expressed miRNAs (DEMs). Furthermore, machine learning and Kaplan Meier survival analysis helped better specify the significant prognostic value of miRNAs. A combination of online databases was then used to evaluate the interactions between target genes, their molecular pathways, and the correlation between the DEMs. RESULT: The results indicated that miR-19b and miR-20a have a significant prognostic role and are associated with CRC progression. The ROC curve analysis discovered that miR-20a alone and combined with other miRNAs, including hsa-mir-21 and hsa-mir-542, are diagnostic biomarkers in CRC. In addition, 12 genes, including NTRK2, CDC42, EGFR, AGO2, PRKCA, HSP90AA1, TLR4, IGF1, ESR1, SMAD2, SMAD4, and NEDD4L, were found to be the highest score targets for these miRNAs. Pathway analysis identified the two correlated tyrosine kinase and MAPK signaling pathways with the key interaction genes, i.e., EGFR, CDC42, and HSP90AA1. CONCLUSION: To better define the role of these miRNAs, the ceRNA network, including lncRNAs, was also prepared. In conclusion, the combination of R data analysis and machine learning provides a robust approach to resolving complicated interactions between miRNAs and their targets.
RESUMO
Non-alcoholic fatty liver disease (NAFLD) comprises a range of chronic liver diseases that result from the accumulation of excess triglycerides in the liver, and which, in its early phases, is categorized NAFLD, or hepato-steatosis with pure fatty liver. The mortality rate of non-alcoholic steatohepatitis (NASH) is more than NAFLD; therefore, diagnosing the disease in its early stages may decrease liver damage and increase the survival rate. In the current study, we screened the gene expression data of NAFLD patients and control samples from the public dataset GEO to detect DEGs. Then, the correlation betweenbetween the top selected DEGs and clinical data was evaluated. In the present study, two GEO datasets (GSE48452, GSE126848) were downloaded. The dysregulated expressed genes (DEGs) were identified by machine learning methods (Penalize regression models). Then, the shared DEGs between the two training datasets were validated using validation datasets. ROC-curve analysis was used to identify diagnostic markers. R software analyzed the interactions between DEGs, clinical data, and fatty liver. Ten novel genes, including ABCF1, SART3, APC5, NONO, KAT7, ZPR1, RABGAP1, SLC7A8, SPAG9, and KAT6A were found to have a differential expression between NAFLD and healthy individuals. Based on validation results and ROC analysis, NR4A2 and IGFBP1b were identified as diagnostic markers. These key genes may be predictive markers for the development of fatty liver. It is recommended that these key genes are assessed further as possible predictive markers during the development of fatty liver.
Assuntos
Hepatopatia Gordurosa não Alcoólica , Humanos , Hepatopatia Gordurosa não Alcoólica/diagnóstico , Hepatopatia Gordurosa não Alcoólica/genética , Cirrose Hepática/diagnóstico , Biologia Computacional , Aprendizado de Máquina , Proteínas Adaptadoras de Transdução de Sinal , Antígenos de Neoplasias , Proteínas de Ligação a RNA , Transportadores de Cassetes de Ligação de ATP , Histona AcetiltransferasesRESUMO
Pancreatic ductal adenocarcinoma (PDAC) is associated with a very poor prognosis. Therefore, there has been a focus on identifying new biomarkers for its early diagnosis and the prediction of patient survival. Genome-wide RNA and microRNA sequencing, bioinformatics and Machine Learning approaches to identify differentially expressed genes (DEGs), followed by validation in an additional cohort of PDAC patients has been undertaken. To identify DEGs, genome RNA sequencing and clinical data from pancreatic cancer patients were extracted from The Cancer Genome Atlas Database (TCGA). We used Kaplan-Meier analysis of survival curves was used to assess prognostic biomarkers. Ensemble learning, Random Forest (RF), Max Voting, Adaboost, Gradient boosting machines (GBM), and Extreme Gradient Boosting (XGB) techniques were used, and Gradient boosting machines (GBM) were selected with 100% accuracy for analysis. Moreover, protein-protein interaction (PPI), molecular pathways, concomitant expression of DEGs, and correlations between DEGs and clinical data were analyzed. We have evaluated candidate genes, miRNAs, and a combination of these obtained from machine learning algorithms and survival analysis. The results of Machine learning identified 23 genes with negative regulation, five genes with positive regulation, seven microRNAs with negative regulation, and 20 microRNAs with positive regulation in PDAC. Key genes BMF, FRMD4A, ADAP2, PPP1R17, and CACNG3 had the highest coefficient in the advanced stages of the disease. In addition, the survival analysis showed decreased expression of hsa.miR.642a, hsa.mir.363, CD22, BTNL9, and CTSW and overexpression of hsa.miR.153.1, hsa.miR.539, hsa.miR.412 reduced survival rate. CTSW was identified as a novel genetic marker and this was validated using RT-PCR. Machine learning algorithms may be used to Identify key dysregulated genes/miRNAs involved in the disease pathogenesis can be used to detect patients in earlier stages. Our data also demonstrated the prognostic and diagnostic value of CTSW in PDAC.
Assuntos
Carcinoma Ductal Pancreático , MicroRNAs , Neoplasias Pancreáticas , Humanos , Catepsina W/genética , Catepsina W/metabolismo , Regulação para Baixo , MicroRNAs/genética , MicroRNAs/metabolismo , Neoplasias Pancreáticas/diagnóstico , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/metabolismo , Carcinoma Ductal Pancreático/diagnóstico , Carcinoma Ductal Pancreático/genética , Carcinoma Ductal Pancreático/patologia , Prognóstico , Biomarcadores , Regulação Neoplásica da Expressão Gênica , Biomarcadores Tumorais/genética , Neoplasias PancreáticasRESUMO
Introduction: Colorectal cancer (CRC) is a common cancer associated with poor outcomes, underscoring a need for the identification of novel prognostic and therapeutic targets to improve outcomes. This study aimed to identify genetic variants and differentially expressed genes (DEGs) using genome-wide DNA and RNA sequencing followed by validation in a large cohort of patients with CRC. Methods: Whole genome and gene expression profiling were used to identify DEGs and genetic alterations in 146 patients with CRC. Gene Ontology, Reactom, GSEA, and Human Disease Ontology were employed to study the biological process and pathways involved in CRC. Survival analysis on dysregulated genes in patients with CRC was conducted using Cox regression and Kaplan-Meier analysis. The STRING database was used to construct a protein-protein interaction (PPI) network. Moreover, candidate genes were subjected to ML-based analysis and the Receiver operating characteristic (ROC) curve. Subsequently, the expression of the identified genes was evaluated by Real-time PCR (RT-PCR) in another cohort of 64 patients with CRC. Gene variants affecting the regulation of candidate gene expressions were further validated followed by Whole Exome Sequencing (WES) in 15 patients with CRC. Results: A total of 3576 DEGs in the early stages of CRC and 2985 DEGs in the advanced stages of CRC were identified. ASPHD1 and ZBTB12 genes were identified as potential prognostic markers. Moreover, the combination of ASPHD and ZBTB12 genes was sensitive, and the two were considered specific markers, with an area under the curve (AUC) of 0.934, 1.00, and 0.986, respectively. The expression levels of these two genes were higher in patients with CRC. Moreover, our data identified two novel genetic variants-the rs925939730 variant in ASPHD1 and the rs1428982750 variant in ZBTB1-as being potentially involved in the regulation of gene expression. Conclusions: Our findings provide a proof of concept for the prognostic values of two novel genes-ASPHD1 and ZBTB12-and their associated variants (rs925939730 and rs1428982750) in CRC, supporting further functional analyses to evaluate the value of emerging biomarkers in colorectal cancer.
RESUMO
Cytochrome P450 (CYP450) enzyme has been shown to be expressed in colorectal cancer (CRC) and its dysregulation is linked to tumor progression and a poor prognosis. Here we investigated the therapeutic potential of targeting CYP450 using lopinavir/ritonavir in CRC. The integrative systems biology method and RNAseq were utilized to investigate the differential levels of genes associated with patients with colorectal cancer. The antiproliferative activity of lopinavir/ritonavir was evaluated in both monolayer and 3-dimensional (3D) models, followed by wound-healing assays. The effectiveness of targeting CYP450 was examined in a mouse model, followed by histopathological analysis, biochemical tests (MDA, SOD, thiol, and CAT), and RT-PCR. The data of dysregulation expressed genes (DEG) revealed 1268 upregulated and 1074 down-regulated genes in CRC. Among the top-score genes and dysregulated pathways, CYPs were detected and associated with poor prognosis of patients with CRC. Inhibition of CYP450 reduced cell proliferation via modulating survivin, Chop, CYP13a, and induction of cell death, as detected by AnnexinV/PI staining. This agent suppressed the migratory behaviors of cells by induction of E-cadherin. Moreover, lopinavir/ritonavir suppressed tumor growth and fibrosis, which correlated with a reduction in SOD/thiol levels and increased MDA levels. Our findings illustrated the therapeutic potential of targeting the CYP450 using lopinavir/ritonavir in colorectal cancer, supporting future investigations on this novel therapeutic approach for the treatment of CRC.
RESUMO
Colorectal cancer (CRC) is the third most common cause of cancer-related deaths. The five-year relative survival rate for CRC is estimated to be approximately 90% for patients diagnosed with early stages and 14% for those diagnosed at an advanced stages of disease, respectively. Hence, the development of accurate prognostic markers is required. Bioinformatics enables the identification of dysregulated pathways and novel biomarkers. RNA expression profiling was performed in CRC patients from the TCGA database using a Machine Learning approach to identify differential expression genes (DEGs). Survival curves were assessed using Kaplan-Meier analysis to identify prognostic biomarkers. Furthermore, the molecular pathways, protein-protein interaction, the co-expression of DEGs, and the correlation between DEGs and clinical data have been evaluated. The diagnostic markers were then determined based on machine learning analysis. The results indicated that key upregulated genes are associated with the RNA processing and heterocycle metabolic process, including C10orf2, NOP2, DKC1, BYSL, RRP12, PUS7, MTHFD1L, and PPAT. Furthermore, the survival analysis identified NOP58, OSBPL3, DNAJC2, and ZMYND19 as prognostic markers. The combineROC curve analysis indicated that the combination of C10orf2 -PPAT- ZMYND19 can be considered as diagnostic markers with sensitivity, specificity, and AUC values of 0.98, 1.00, and 0.99, respectively. Eventually, ZMYND19 gene was validated in CRC patients. In conclusion, novel biomarkers of CRC have been identified that may be a promising strategy for early diagnosis, potential treatment, and better prognosis.
RESUMO
Gastric cancer is the high mortality rate cancers globally, and the current survival rate is 30% even with the use of combination therapies. Recently, mounting evidence indicates the potential role of miRNAs in the diagnosis and assessing the prognosis of cancers. In the state-of-art research in cancer, machine-learning (ML) has gained increasing attention to find clinically useful biomarkers. The present study aimed to identify potential diagnostic and prognostic miRNAs in GC with the application of ML. Using the TCGA database and ML algorithms such as Support Vector Machine (SVM), Random Forest, k-NN, etc., a panel of 29 was obtained. Among the ML algorithms, SVM was chosen (AUC:88.5%, Accuracy:93% in GC). To find common molecular mechanisms of the miRNAs, their common gene targets were predicted using online databases such as miRWalk, miRDB, and Targetscan. Functional and enrichment analyzes were performed using Gene Ontology (GO) and Kyoto Database of Genes and Genomes (KEGG), as well as identification of protein-protein interactions (PPI) using the STRING database. Pathway analysis of the target genes revealed the involvement of several cancer-related pathways including miRNA mediated inhibition of translation, regulation of gene expression by genetic imprinting, and the Wnt signaling pathway. Survival and ROC curve analysis showed that the expression levels of hsa-miR-21, hsa-miR-133a, hsa-miR-146b, and hsa-miR-29c were associated with higher mortality and potentially earlier detection of GC patients. A panel of dysregulated miRNAs that may serve as reliable biomarkers for gastric cancer were identified using machine learning, which represents a powerful tool in biomarker identification.