RESUMO
BACKGROUND: Survival analysis is a cornerstone of medical research, enabling the assessment of clinical outcomes for disease progression and treatment efficiency. Despite its central importance, no commonly used spreadsheet software can handle survival analysis and there is no web server available for its computation. OBJECTIVE: Here, we introduce a web-based tool capable of performing univariate and multivariate Cox proportional hazards survival analysis using data generated by genomic, transcriptomic, proteomic, or metabolomic studies. METHODS: We implemented different methods to establish cut-off values for the trichotomization or dichotomization of continuous data. The false discovery rate is computed to correct for multiple hypothesis testing. A multivariate analysis option enables comparing omics data with clinical variables. RESULTS: We established a registration-free web-based survival analysis tool capable of performing univariate and multivariate survival analysis using any custom-generated data. CONCLUSIONS: This tool fills a gap and will be an invaluable contribution to basic medical and clinical research.
Assuntos
Pesquisa Biomédica , Proteômica , Humanos , Internet , Software , Análise de SobrevidaRESUMO
PURPOSE: The proper validation of prognostic biomarkers is an important clinical issue in breast cancer research. MicroRNAs (miRNAs) have emerged as a new class of promising breast cancer biomarkers. In the present work, we developed an integrated online bioinformatic tool to validate the prognostic relevance of miRNAs in breast cancer. METHODS: A database was set up by searching the GEO, EGA, TCGA, and PubMed repositories to identify datasets with published miRNA expression and clinical data. Kaplan-Meier survival analysis was performed to validate the prognostic value of a set of 41 previously published survival-associated miRNAs. RESULTS: All together 2178 samples from four independent datasets were integrated into the system including the expression of 1052 distinct human miRNAs. In addition, the web-tool allows for the selection of patients, which can be filtered by receptors status, lymph node involvement, histological grade, and treatments. The complete analysis tool can be accessed online at: www.kmplot.com/mirpower . We used this tool to analyze a large number of deregulated miRNAs associated with breast cancer features and outcome, and confirmed the prognostic value of 26 miRNAs. A significant correlation in three out of four datasets was validated only for miR-29c and miR-101. CONCLUSIONS: In summary, we established an integrated platform capable to mine all available miRNA data to perform a survival analysis for the identification and validation of prognostic miRNA markers in breast cancer.
Assuntos
Neoplasias da Mama/genética , Neoplasias da Mama/mortalidade , Bases de Dados Genéticas , Regulação Neoplásica da Expressão Gênica , MicroRNAs/genética , Software , Navegador , Biomarcadores Tumorais , Biologia Computacional/métodos , Feminino , Humanos , Prognóstico , Reprodutibilidade dos Testes , Interface Usuário-ComputadorRESUMO
The estrogen receptor (ER)α drives growth in two-thirds of all breast cancers. Several targeted therapies, collectively termed endocrine therapy, impinge on estrogen-induced ERα activation to block tumor growth. However, half of ERα-positive breast cancers are tolerant or acquire resistance to endocrine therapy. We demonstrate that genome-wide reprogramming of the chromatin landscape, defined by epigenomic maps for regulatory elements or transcriptional activation and chromatin openness, underlies resistance to endocrine therapy. This annotation reveals endocrine therapy-response specific regulatory networks where NOTCH pathway is overactivated in resistant breast cancer cells, whereas classical ERα signaling is epigenetically disengaged. Blocking NOTCH signaling abrogates growth of resistant breast cancer cells. Its activation state in primary breast tumors is a prognostic factor of resistance in endocrine treated patients. Overall, our work demonstrates that chromatin landscape reprogramming underlies changes in regulatory networks driving endocrine therapy resistance in breast cancer.
Assuntos
Neoplasias da Mama/tratamento farmacológico , Montagem e Desmontagem da Cromatina/fisiologia , Epigênese Genética/fisiologia , Receptor alfa de Estrogênio/metabolismo , Redes Reguladoras de Genes/fisiologia , Transdução de Sinais/fisiologia , Western Blotting , Neoplasias da Mama/metabolismo , Imunoprecipitação da Cromatina , Epigênese Genética/genética , Feminino , Redes Reguladoras de Genes/genética , Humanos , Estimativa de Kaplan-Meier , Células MCF-7 , Análise em Microsséries , Receptores Notch/metabolismo , Transdução de Sinais/genéticaRESUMO
BACKGROUND: Primary systemic treatment for ovarian cancer is surgery, followed by platinum based chemotherapy. Platinum resistant cancers progress/recur in approximately 25% of cases within six months. We aimed to identify clinically useful biomarkers of platinum resistance. METHODS: A database of ovarian cancer transcriptomic datasets including treatment and response information was set up by mining the GEO and TCGA repositories. Receiver operator characteristics (ROC) analysis was performed in R for each gene and these were then ranked using their achieved area under the curve (AUC) values. The most significant candidates were selected and in vitro functionally evaluated in four epithelial ovarian cancer cell lines (SKOV-3-, CAOV-3, ES-2 and OVCAR-3), using gene silencing combined with drug treatment in viability and apoptosis assays. We collected 94 tumor samples and the strongest candidate was validated by IHC and qRT-PCR in these. RESULTS: All together 1,452 eligible patients were identified. Based on the ROC analysis the eight most significant genes were JRK, CNOT8, RTF1, CCT3, NFAT2CIP, MEK1, FUBP1 and CSDE1. Silencing of MEK1, CSDE1, CNOT8 and RTF1, and pharmacological inhibition of MEK1 caused significant sensitization in the cell lines. Of the eight genes, JRK (p = 3.2E-05), MEK1 (p = 0.0078), FUBP1 (p = 0.014) and CNOT8 (p = 0.00022) also correlated to progression free survival. The correlation between the best biomarker candidate MEK1 and survival was validated in two independent cohorts by qRT-PCR (n = 34, HR = 5.8, p = 0.003) and IHC (n = 59, HR = 4.3, p = 0.033). CONCLUSION: We identified MEK1 as a promising prognostic biomarker candidate correlated to response to platinum based chemotherapy in ovarian cancer.
Assuntos
Antineoplásicos/farmacologia , Carboplatina/farmacologia , Resistencia a Medicamentos Antineoplásicos/genética , MAP Quinase Quinase 1/genética , Neoplasias Epiteliais e Glandulares/genética , Neoplasias Ovarianas/genética , Antineoplásicos/uso terapêutico , Apoptose/genética , Biomarcadores , Carboplatina/uso terapêutico , Carcinoma Epitelial do Ovário , Linhagem Celular Tumoral , Sobrevivência Celular/efeitos dos fármacos , Sobrevivência Celular/genética , Estudos de Coortes , Biologia Computacional , Bases de Dados de Ácidos Nucleicos , Conjuntos de Dados como Assunto , Feminino , Inativação Gênica , Humanos , MAP Quinase Quinase 1/antagonistas & inibidores , Neoplasias Epiteliais e Glandulares/tratamento farmacológico , Neoplasias Epiteliais e Glandulares/mortalidade , Neoplasias Ovarianas/tratamento farmacológico , Neoplasias Ovarianas/mortalidade , Prognóstico , Inibidores de Proteínas Quinases/farmacologia , RNA Interferente Pequeno/genéticaRESUMO
To date, three molecular markers (ER, PR, and CYP2D6) have been used in clinical setting to predict the benefit of the anti-estrogen tamoxifen therapy. Our aim was to validate new biomarker candidates predicting response to tamoxifen treatment in breast cancer by evaluating these in a meta-analysis of available transcriptomic datasets with known treatment and follow-up. Biomarker candidates were identified in Pubmed and in the 2007-2012 ASCO and 2011-2012 SABCS abstracts. Breast cancer microarray datasets of endocrine therapy-treated patients were downloaded from GEO and EGA and RNAseq datasets from TCGA. Of the biomarker candidates, only those identified or already validated in a clinical cohort were included. Relapse-free survival (RFS) up to 5 years was used as endpoint in a ROC analysis in the GEO and RNAseq datasets. In the EGA dataset, Kaplan-Meier analysis was performed for overall survival. Statistical significance was set at p < 0.005. The transcriptomic datasets included 665 GEO-based and 1,208 EGA-based patient samples. All together 68 biomarker candidates were identified. Of these, the best performing genes were PGR (AUC = 0.64, p = 2.3E-07), MAPT (AUC = 0.62, p = 7.8E-05), and SLC7A5 (AUC = 0.62, p = 9.2E-05). Further genes significantly correlated to RFS include FOS, TP53, BTG2, HOXB7, DRG1, CXCL10, and TPM4. In the RNAseq dataset, only ERBB2, EDF1, and MAPK1 reached statistical significance. We evaluated tamoxifen-resistance genes in three independent platforms and identified PGR, MAPT, and SLC7A5 as the most promising prognostic biomarkers in tamoxifen treated patients.
Assuntos
Biomarcadores Tumorais/biossíntese , Neoplasias da Mama/tratamento farmacológico , Regulação Neoplásica da Expressão Gênica , Tamoxifeno/administração & dosagem , Antineoplásicos Hormonais/administração & dosagem , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Intervalo Livre de Doença , Feminino , Humanos , Estimativa de Kaplan-Meier , Transportador 1 de Aminoácidos Neutros Grandes/metabolismo , Recidiva Local de Neoplasia/tratamento farmacológico , Recidiva Local de Neoplasia/genética , Prognóstico , Piridinas/metabolismo , Resultado do Tratamento , Proteínas tau/metabolismoRESUMO
In the last decades, several gene expression-based predictors of clinical behavior were developed for breast cancer. A common feature of these is the use of multiple genes to predict hormone receptor status and the probability of tumor recurrence, survival or response to chemotherapy. We developed an online analysis tool to compute ER and HER2 status, Oncotype DX 21-gene recurrence score and an independent recurrence risk classification using gene expression data obtained by interrogation of Affymetrix microarray profiles. We implemented rigorous quality control algorithms to promptly exclude any biases related to sample processing, hybridization and scanning. After uploading the raw microarray data, the system performs the complete evaluation automatically and provides a report summarizing the results. The system is accessible online at http://www.recurrenceonline.com . We validated the system using data from 2,472 publicly available microarrays. The validation of the prediction of the 21-gene recurrence score was significant in lymph node negative patients (Cox-Mantel, P = 5.6E-16, HR = 0.4, CI = 0.32-0.5). A correct classification was obtained for 88.5% of ER- and 90.5% of ER + tumors (n = 1,894). The prediction of recurrence risk in all patients by using the mean of the independent six strongest genes (P < 1E-16, HR = 2.9, CI = 2.5-3.3), of the four strongest genes in lymph node negative ER positive patients (P < 1E-16, HR = 2.8, CI = 2.2-3.5) and of the three genes in lymph node positive patients (P = 3.2E-9, HR = 2.5, CI = 1.8-3.4) was highly significant. In summary, we integrated available knowledge in one platform to validate currently used predictors and to provide a global tool for the online determination of different prognostic parameters simultaneously using genome-wide microarrays.
Assuntos
Neoplasias da Mama/genética , Recidiva Local de Neoplasia/genética , Receptor ErbB-2/genética , Receptores de Estrogênio/genética , Algoritmos , Área Sob a Curva , Neoplasias da Mama/metabolismo , Neoplasias da Mama/mortalidade , Simulação por Computador , Intervalo Livre de Doença , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Estimativa de Kaplan-Meier , Modelos Genéticos , Modelos Estatísticos , Análise de Sequência com Séries de Oligonucleotídeos , Sistemas On-Line , Prognóstico , Modelos de Riscos Proporcionais , Curva ROC , Receptor ErbB-2/metabolismo , Receptores de Estrogênio/metabolismo , Fatores de Risco , Interface Usuário-ComputadorRESUMO
Breast cancer clinical treatment selection is based on the immunohistochemical determination of four protein biomarkers: ESR1, PGR, HER2, and MKI67. Our aim was to correlate immunohistochemical results to proteome-level technologies in measuring the expression of these markers. We also aimed to integrate available proteome-level breast cancer datasets to identify and validate new prognostic biomarker candidates. We searched studies involving breast cancer patient cohorts with published survival and proteomic information. Immunohistochemistry and proteomic technologies were compared using the Mann-Whitney test. Receiver operating characteristics (ROC) curves were generated to validate discriminative power. Cox regression and Kaplan-Meier survival analysis were calculated to assess prognostic power. False Discovery Rate was computed to correct for multiple hypothesis testing. We established a database integrating protein expression data and survival information from four independent cohorts for 1229 breast cancer patients. In all four studies combined, a total of 7342 unique proteins were identified, and 1417 of these were identified in at least three datasets. ESR1, PGR, and HER2 protein expression levels determined by RPPA or LC-MS/MS methods showed a significant correlation with the levels determined by immunohistochemistry (p < 0.0001). PGR and ESR1 levels showed a moderate correlation (correlation coefficient = 0.17, p = 0.0399). An additional panel of candidate proteins, including apoptosis-related proteins (BCL2,), adhesion markers (CDH1, CLDN3, CLDN7) and basal markers (cytokeratins), were validated as prognostic biomarkers. Finally, we expanded our previously established web tool designed to validate survival-associated biomarkers by including the proteomic datasets analyzed in this study ( https://kmplot.com/ ). In summary, large proteomic studies now provide sufficient data enabling the validation and ranking of potential protein biomarkers.
Assuntos
Biomarcadores Tumorais/genética , Neoplasias da Mama/genética , Prognóstico , Proteômica , Idoso , Neoplasias da Mama/epidemiologia , Neoplasias da Mama/patologia , Cromatografia Líquida , Bases de Dados Genéticas , Intervalo Livre de Doença , Receptor alfa de Estrogênio/genética , Feminino , Humanos , Antígeno Ki-67/genética , Pessoa de Meia-Idade , Receptor ErbB-2/genética , Receptores de Progesterona/genética , Espectrometria de Massas em TandemRESUMO
Validating prognostic or predictive candidate genes in appropriately powered breast cancer cohorts are of utmost interest. Our aim was to develop an online tool to draw survival plots, which can be used to assess the relevance of the expression levels of various genes on the clinical outcome both in untreated and treated breast cancer patients. A background database was established using gene expression data and survival information of 1,809 patients downloaded from GEO (Affymetrix HGU133A and HGU133+2 microarrays). The median relapse free survival is 6.43 years, 968/1,231 patients are estrogen-receptor (ER) positive, and 190/1,369 are lymph-node positive. After quality control and normalization only probes present on both Affymetrix platforms were retained (n = 22,277). In order to analyze the prognostic value of a particular gene, the cohorts are divided into two groups according to the median (or upper/lower quartile) expression of the gene. The two groups can be compared in terms of relapse free survival, overall survival, and distant metastasis free survival. A survival curve is displayed, and the hazard ratio with 95% confidence intervals and logrank P value are calculated and displayed. Additionally, three subgroups of patients can be assessed: systematically untreated patients, endocrine-treated ER positive patients, and patients with a distribution of clinical characteristics representative of those seen in general clinical practice in the US. Web address: www.kmplot.com . We used this integrative data analysis tool to confirm the prognostic power of the proliferation-related genes TOP2A and TOP2B, MKI67, CCND2, CCND3, CCNDE2, as well as CDKN1A, and TK2. We also validated the capability of microarrays to determine estrogen receptor status in 1,231 patients. The tool is highly valuable for the preliminary assessment of biomarkers, especially for research groups with limited bioinformatic resources.
Assuntos
Biomarcadores Tumorais/genética , Neoplasias da Mama/genética , Neoplasias da Mama/mortalidade , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica , Marcadores Genéticos , Análise de Sequência com Séries de Oligonucleotídeos , Sistemas On-Line , Adulto , Idoso , Biomarcadores Tumorais/análise , Neoplasias da Mama/química , Neoplasias da Mama/patologia , Neoplasias da Mama/terapia , Gráficos por Computador , Intervalo Livre de Doença , Feminino , Humanos , Internet , Estimativa de Kaplan-Meier , Metástase Linfática , Pessoa de Meia-Idade , Estadiamento de Neoplasias , Valor Preditivo dos Testes , Prognóstico , Receptores de Estrogênio/análise , Reprodutibilidade dos Testes , Fatores de TempoRESUMO
A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has not been fixed in the paper.
RESUMO
Multiple studies suggested using different miRNAs as biomarkers for prognosis of hepatocellular carcinoma (HCC). We aimed to assemble a miRNA expression database from independent datasets to enable an independent validation of previously published prognostic biomarkers of HCC. A miRNA expression database was established by searching the TCGA (RNA-seq) and GEO (microarray) repositories to identify miRNA datasets with available expression and clinical data. A PubMed search was performed to identify prognostic miRNAs for HCC. We performed a uni- and multivariate Cox regression analysis to validate the prognostic significance of these miRNAs. The Limma R package was applied to compare the expression of miRNAs between tumor and normal tissues. We uncovered 214 publications containing 223 miRNAs identified as potential prognostic biomarkers for HCC. In the survival analysis, the expression levels of 55 and 84 miRNAs were significantly correlated with overall survival in RNA-seq and gene chip datasets, respectively. The most significant miRNAs were hsa-miR-149, hsa-miR-139, and hsa-miR-3677 in the RNA-seq and hsa-miR-146b-3p, hsa-miR-584, and hsa-miR-31 in the microarray dataset. Of the 223 miRNAs studied, the expression was significantly altered in 102 miRNAs in tumors compared to normal liver tissues. In summary, we set up an integrated miRNA expression database and validated prognostic miRNAs in HCC.
Assuntos
Biomarcadores Tumorais/biossíntese , Carcinoma Hepatocelular/metabolismo , Bases de Dados de Ácidos Nucleicos , Neoplasias Hepáticas/metabolismo , MicroRNAs/biossíntese , RNA Neoplásico/metabolismo , Biomarcadores Tumorais/genética , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/mortalidade , Carcinoma Hepatocelular/patologia , Intervalo Livre de Doença , Feminino , Humanos , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/mortalidade , Neoplasias Hepáticas/patologia , Masculino , MicroRNAs/genética , RNA Neoplásico/genética , Taxa de SobrevidaRESUMO
INTRODUCTION: Multiple gene expression based prognostic biomarkers have been repeatedly identified in gastric carcinoma. However, without confirmation in an independent validation study, their clinical utility is limited. Our goal was to establish a robust database enabling the swift validation of previous and future gastric cancer survival biomarker candidates. RESULTS: The entire database incorporates 1,065 gastric carcinoma samples, gene expression data. Out of 29 established markers, higher expression of BECN1 (HR = 0.68, p = 1.5E-05), CASP3 (HR = 0.5, p = 6E-14), COX2 (HR = 0.72, p = 0.0013), CTGF (HR = 0.72, p = 0.00051), CTNNB1 (HR = 0.47, p = 4.3E-15), MET (HR = 0.63, p = 1.3E-05), and SIRT1 (HR = 0.64, p = 2.2E-07) correlated to longer OS. Higher expression of BIRC5 (HR = 1.45, p = 1E-04), CNTN1 (HR = 1.44, p = 3.5E- 05), EGFR (HR = 1.86, p = 8.5E-11), ERCC1 (HR = 1.36, p = 0.0012), HER2 (HR = 1.41, p = 0.00011), MMP2 (HR = 1.78, p = 2.6E-09), PFKB4 (HR = 1.56, p = 3.2E-07), SPHK1 (HR = 1.61, p = 3.1E-06), SP1 (HR = 1.45, p = 1.6E-05), TIMP1 (HR = 1.92, p = 2.2E- 10) and VEGF (HR = 1.53, p = 5.7E-06) were predictive for poor OS. MATERIALS AND METHODS: We integrated samples of three major cancer research centers (Berlin, Bethesda and Melbourne datasets) and publicly available datasets with available follow-up data to form a single integrated database. Subsequently, we performed a literature search for prognostic markers in gastric carcinomas (PubMed, 2012-2015) and re-validated their findings predicting first progression (FP) and overall survival (OS) using uni- and multivariate Cox proportional hazards regression analysis. CONCLUSIONS: The major advantage of our analysis is that we evaluated all genes in the same set of patients thereby making direct comparison of the markers feasible. The best performing genes include BIRC5, CASP3, CTNNB1, TIMP-1, MMP-2, SIRT, and VEGF.
Assuntos
Biomarcadores Tumorais/metabolismo , Neoplasias Gástricas/metabolismo , Transcriptoma , Caspase 3/metabolismo , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Proteínas Inibidoras de Apoptose/metabolismo , Masculino , Metaloproteinase 2 da Matriz/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos , Modelos de Riscos Proporcionais , Sirtuína 1/metabolismo , Survivina , Inibidor Tecidual de Metaloproteinase-1/metabolismo , Resultado do Tratamento , Fator A de Crescimento do Endotélio Vascular/metabolismo , beta Catenina/metabolismoRESUMO
Defects in AU-rich elements (ARE)-mediated posttranscriptional control can lead to several abnormal processes that underlie carcinogenesis. Here, we performed a systematic analysis of ARE-mRNA expression across multiple cancer types. First, the ARE database (ARED) was intersected with The Cancer Genome Atlas databases and others. A large set of ARE-mRNAs was over-represented in cancer and, unlike non-ARE-mRNAs, correlated with the reversed balance in the expression of the RNA-binding proteins tristetraprolin (TTP, ZFP36) and HuR (ELAVL1). Serial statistical and functional enrichment clustering identified a cluster of 11 overexpressed ARE-mRNAs (CDC6, KIF11, PRC1, NEK2, NCAPG, CENPA, NUF2, KIF18A, CENPE, PBK, TOP2A) that negatively correlated with TTP/HuR mRNA ratios and was involved in the mitotic cell cycle. This cluster was upregulated in a number of solid cancers. Experimentally, we demonstrated that the ARE-mRNA cluster is upregulated in a number of tumor breast cell lines when compared with noninvasive and normal-like breast cancer cells. RNA-IP demonstrated the association of the ARE-mRNAs with TTP and HuR. Experimental modulation of TTP or HuR expression led to changes in the mitosis ARE-mRNAs. Posttranscriptional reporter assays confirmed the functionality of AREs. Moreover, TTP augmented mitotic cell-cycle arrest as demonstrated by flow cytometry and histone H3 phosphorylation. We found that poor breast cancer patient survival was significantly associated with low TTP/HuR mRNA ratios and correlated with high levels of the mitotic ARE-mRNA signature. These results significantly broaden the role of AREs and their binding proteins in cancer, and demonstrate that TTP induces an antimitotic pathway that is diminished in cancer. Cancer Res; 76(14); 4068-80. ©2016 AACR.
Assuntos
Regulação Neoplásica da Expressão Gênica , Neoplasias/genética , Proteínas de Ligação a RNA/metabolismo , Adenina/análise , Pontos de Checagem do Ciclo Celular , Linhagem Celular Tumoral , Proteína Semelhante a ELAV 1/genética , Humanos , Poliadenilação , RNA Mensageiro/análise , Tristetraprolina/genética , Uridina/análiseRESUMO
In the last decade, optimized treatment for non-small cell lung cancer had lead to improved prognosis, but the overall survival is still very short. To further understand the molecular basis of the disease we have to identify biomarkers related to survival. Here we present the development of an online tool suitable for the real-time meta-analysis of published lung cancer microarray datasets to identify biomarkers related to survival. We searched the caBIG, GEO and TCGA repositories to identify samples with published gene expression data and survival information. Univariate and multivariate Cox regression analysis, Kaplan-Meier survival plot with hazard ratio and logrank P value are calculated and plotted in R. The complete analysis tool can be accessed online at: www.kmplot.com/lung. All together 1,715 samples of ten independent datasets were integrated into the system. As a demonstration, we used the tool to validate 21 previously published survival associated biomarkers. Of these, survival was best predicted by CDK1 (p<1E-16), CD24 (p<1E-16) and CADM1 (p = 7E-12) in adenocarcinomas and by CCNE1 (p = 2.3E-09) and VEGF (p = 3.3E-10) in all NSCLC patients. Additional genes significantly correlated to survival include RAD51, CDKN2A, OPN, EZH2, ANXA3, ADAM28 and ERCC1. In summary, we established an integrated database and an online tool capable of uni- and multivariate analysis for in silico validation of new biomarker candidates in non-small cell lung cancer.
Assuntos
Carcinoma Pulmonar de Células não Pequenas/genética , Neoplasias Pulmonares/genética , Software , Biomarcadores , Humanos , Estimativa de Kaplan-Meier , Análise Multivariada , Modelos de Riscos Proporcionais , Transcriptoma/genéticaRESUMO
The validation of prognostic biomarkers in large independent patient cohorts is a major bottleneck in ovarian cancer research. We implemented an online tool to assess the prognostic value of the expression levels of all microarray-quantified genes in ovarian cancer patients. First, a database was set up using gene expression data and survival information of 1287 ovarian cancer patients downloaded from Gene Expression Omnibus and The Cancer Genome Atlas (Affymetrix HG-U133A, HG-U133A 2.0, and HG-U133 Plus 2.0 microarrays). After quality control and normalization, only probes present on all three Affymetrix platforms were retained (n=22,277). To analyze the prognostic value of the selected gene, we divided the patients into two groups according to various quantile expressions of the gene. These groups were then compared using progression-free survival (n=1090) or overall survival (n=1287). A Kaplan-Meier survival plot was generated and significance was computed. The tool can be accessed online at www.kmplot.com/ovar. We used this integrative data analysis tool to validate the prognostic power of 37 biomarkers identified in the literature. Of these, CA125 (MUC16; P=3.7×10(-5), hazard ratio (HR)=1.4), CDKN1B (P=5.4×10(-5), HR=1.4), KLK6 (P=0.002, HR=0.79), IFNG (P=0.004, HR=0.81), P16 (P=0.02, HR=0.66), and BIRC5 (P=0.00017, HR=0.75) were associated with survival. The combination of several probe sets can further increase prediction efficiency. In summary, we developed a global online biomarker validation platform that mines all available microarray data to assess the prognostic power of 22,277 genes in 1287 ovarian cancer patients. We specifically used this tool to evaluate the effect of 37 previously published biomarkers on ovarian cancer prognosis.