Your browser doesn't support javascript.
loading
TOD-CUP: a gene expression rank-based majority vote algorithm for tissue origin diagnosis of cancers of unknown primary.
Shen, Yifei; Chu, Qinjie; Yin, Xinxin; He, Yinjun; Bai, Panpan; Wang, Yunfei; Fang, Weijia; Timko, Michael P; Fan, Longjiang; Jiang, Weiqin.
Afiliação
  • Shen Y; Department of Medical Oncology, First Affiliated Hospital, Zhejiang University and the Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, USA.
  • Chu Q; Institute of Bioinformatics, Zhejiang University, China.
  • Yin X; Institute of Bioinformatics, Zhejiang University, China.
  • He Y; College of Medicine, Zhejiang University, China.
  • Bai P; Institute of Bioinformatics, Zhejiang University, China.
  • Wang Y; Zhejiang Sheng Ting Biotechnology Co., China.
  • Fang W; Department of Medical Oncology, First Affiliated Hospital, Zhejiang University, China.
  • Timko MP; Department of Biology & Public Health Sciences, University of Virginia, USA.
  • Fan L; Department of Medical Oncology, First Affiliated Hospital, Zhejiang University, China.
  • Jiang W; Department of Medical Oncology, First Affiliated Hospital, Zhejiang University, China.
Brief Bioinform ; 22(2): 2106-2118, 2021 03 22.
Article em En | MEDLINE | ID: mdl-32266390
ABSTRACT
Gene expression profiling holds great potential as a new approach to histological diagnosis and precision medicine of cancers of unknown primary (CUP). Batch effects and different data types greatly decrease the predictive performance of biomarker-based algorithms, and few methods have been widely applied to identify tissue origin of CUP up to now. To address this problem and assist in more precise diagnosis, we have developed a gene expression rank-based majority vote algorithm for tissue origin diagnosis of CUP (TOD-CUP) of most common cancer types. Based on massive tissue-specific RNA-seq data sets (10 553) found in The Cancer Genome Atlas (TCGA), 538 feature genes (biomarkers) were selected based on their gene expression ranks and used to predict tissue types. The top scoring pairs (TSPs) classifier of the tumor type was optimized by the TCGA training samples. To test the prediction accuracy of our TOD-CUP algorithm, we analyzed (1) two microarray data sets (1029 Agilent and 2277 Affymetrix/Illumina chips) and found 91% and 94% prediction accuracy, respectively, (2) RNA-seq data from five cancer types derived from 141 public metastatic cancer tumor samples and achieved 94% accuracy and (3) a total of 25 clinical cancer samples (including 14 metastatic cancer samples) were able to classify 24/25 samples correctly (96.0% accuracy). Taken together, the TOD-CUP algorithm provides a powerful and robust means to accurately identify the tissue origin of 24 cancer types across different data platforms. To make the TOD-CUP algorithm easily accessible for clinical application, we established a Web-based server for tumor tissue origin diagnosis (http//ibi. zju.edu.cn/todcup/).
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Neoplasias Primárias Desconhecidas / Expressão Gênica Tipo de estudo: Diagnostic_studies / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Neoplasias Primárias Desconhecidas / Expressão Gênica Tipo de estudo: Diagnostic_studies / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article