Your browser doesn't support javascript.
loading
High-Throughput Omics and Statistical Learning Integration for the Discovery and Validation of Novel Diagnostic Signatures in Colorectal Cancer.
Long, Nguyen Phuoc; Park, Seongoh; Anh, Nguyen Hoang; Nghi, Tran Diem; Yoon, Sang Jun; Park, Jeong Hill; Lim, Johan; Kwon, Sung Won.
Afiliação
  • Long NP; College of Pharmacy and Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul 08826, Korea. phuoclong@snu.ac.kr.
  • Park S; Department of Statistics, Seoul National University, Seoul 08826, Korea. inmybrain@snu.ac.kr.
  • Anh NH; College of Pharmacy and Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul 08826, Korea. 2018-23140@snu.ac.kr.
  • Nghi TD; School of Medicine, Vietnam National University, Ho Chi Minh 70000, Vietnam. trandiemnghi@gmail.com.
  • Yoon SJ; College of Pharmacy and Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul 08826, Korea. supercanboy@snu.ac.kr.
  • Park JH; College of Pharmacy and Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul 08826, Korea. hillpark@snu.ac.kr.
  • Lim J; Department of Statistics, Seoul National University, Seoul 08826, Korea. johanlim@snu.ac.kr.
  • Kwon SW; College of Pharmacy and Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul 08826, Korea. swkwon@snu.ac.kr.
Int J Mol Sci ; 20(2)2019 Jan 12.
Article em En | MEDLINE | ID: mdl-30642095
The advancement of bioinformatics and machine learning has facilitated the discovery and validation of omics-based biomarkers. This study employed a novel approach combining multi-platform transcriptomics and cutting-edge algorithms to introduce novel signatures for accurate diagnosis of colorectal cancer (CRC). Different random forests (RF)-based feature selection methods including the area under the curve (AUC)-RF, Boruta, and Vita were used and the diagnostic performance of the proposed biosignatures was benchmarked using RF, logistic regression, naïve Bayes, and k-nearest neighbors models. All models showed satisfactory performance in which RF appeared to be the best. For instance, regarding the RF model, the following were observed: mean accuracy 0.998 (standard deviation (SD) < 0.003), mean specificity 0.999 (SD < 0.003), and mean sensitivity 0.998 (SD < 0.004). Moreover, proposed biomarker signatures were highly associated with multifaceted hallmarks in cancer. Some biomarkers were found to be enriched in epithelial cell signaling in Helicobacter pylori infection and inflammatory processes. The overexpression of TGFBI and S100A2 was associated with poor disease-free survival while the down-regulation of NR5A2, SLC4A4, and CD177 was linked to worse overall survival of the patients. In conclusion, novel transcriptome signatures to improve the diagnostic accuracy in CRC are introduced for further validations in various clinical settings.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Colorretais / Biomarcadores Tumorais / Biologia Computacional / Análise de Sequência com Séries de Oligonucleotídeos / Perfilação da Expressão Gênica Tipo de estudo: Diagnostic_studies / Prognostic_studies Limite: Female / Humans Idioma: En Revista: Int J Mol Sci Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Colorretais / Biomarcadores Tumorais / Biologia Computacional / Análise de Sequência com Séries de Oligonucleotídeos / Perfilação da Expressão Gênica Tipo de estudo: Diagnostic_studies / Prognostic_studies Limite: Female / Humans Idioma: En Revista: Int J Mol Sci Ano de publicação: 2019 Tipo de documento: Article