RESUMO
BACKGROUND: The detection of somatic mutations in primary tumors is critical for the understanding of cancer evolution and targeting therapy. Multiple technologies have been developed to enable the detection of such mutations. Next generation sequencing (NGS) is a new platform that is gradually becoming the technology of choice for genotyping cancer samples, owing to its ability to simultaneously interrogate many genomic loci at massively high efficiency and increasingly lower cost. However, multiple barriers still exist for its broader adoption in clinical research practice, such as fragmented workflow and complex bioinformatics analysis and interpretation. METHODS: We performed validation of the QIAGEN GeneReader NGS System using the QIAact Actionable Insights Tumor Panel, focusing on clinically meaningful mutations by using DNA extracted from formalin-fixed paraffin-embedded (FFPE) colorectal tissue with known KRAS mutations. The performance of the GeneReader was evaluated and compared to data generated from alternative technologies (PCR and pyrosequencing) as well as an alternative NGS platform. The results were further confirmed with Sanger sequencing. RESULTS: The data generated from the GeneReader achieved 100% concordance with reference technologies. Furthermore, the GeneReader workflow provides a truly integrated workflow, eliminating artifacts resulting from routine sample preparation; and providing up-to-date interpretation of test results. CONCLUSION: The GeneReader NGS system offers an effective and efficient method to identify somatic (KRAS) cancer mutations.
Assuntos
Análise Mutacional de DNA , Proteínas Proto-Oncogênicas p21(ras)/genética , Neoplasias Colorretais/genética , Fixadores/química , Formaldeído/química , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação , Inclusão em Parafina , Reação em Cadeia da PolimeraseRESUMO
Alternative splicing enhances proteome diversity and modulates cancer-associated proteins. To identify tissue- and tumor-specific alternative splicing, we used the GeneChip Human Exon 1.0 ST Array to measure whole-genome exon expression in 102 normal and cancer tissue samples of different stages from colon, urinary bladder, and prostate. We identified 2069 candidate alternative splicing events between normal tissue samples from colon, bladder, and prostate and selected 15 splicing events for RT-PCR validation, 10 of which were successfully validated by RT-PCR and sequencing. Furthermore 23, 19, and 18 candidate tumor-specific splicing alterations in colon, bladder, and prostate, respectively, were selected for RT-PCR validation on an independent set of 81 normal and tumor tissue samples. In total, seven genes with tumor-specific splice variants were identified (ACTN1, CALD1, COL6A3, LRRFIP2, PIK4CB, TPM1, and VCL). The validated tumor-specific splicing alterations were highly consistent, enabling clear separation of normal and cancer samples and in some cases even of different tumor stages. A subset of the tumor-specific splicing alterations (ACTN1, CALD1, and VCL) was found in all three organs and may represent general cancer-related splicing events. In silico protein predictions suggest that the identified cancer-specific splice variants encode proteins with potentially altered functions, indicating that they may be involved in pathogenesis and hence represent novel therapeutic targets. In conclusion, we identified and validated alternative splicing between normal tissue samples from colon, bladder, and prostate in addition to cancer-specific splicing events in colon, bladder, and prostate cancer that may have diagnostic and prognostic implications.
Assuntos
Adenoma/genética , Processamento Alternativo/genética , Neoplasias do Colo/genética , Éxons , Neoplasias da Próstata/genética , Neoplasias da Bexiga Urinária/genética , Actinina/genética , Actinina/metabolismo , Adenoma/metabolismo , Adenoma/patologia , Proteínas de Ligação a Calmodulina/genética , Proteínas de Ligação a Calmodulina/metabolismo , Neoplasias do Colo/metabolismo , Neoplasias do Colo/patologia , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Modelos Moleculares , Estadiamento de Neoplasias , Análise de Sequência com Séries de Oligonucleotídeos , Neoplasias da Próstata/metabolismo , Neoplasias da Próstata/patologia , Neoplasias da Bexiga Urinária/metabolismo , Neoplasias da Bexiga Urinária/patologia , Vinculina/química , Vinculina/genética , Vinculina/metabolismoRESUMO
PURPOSE: Clinically useful molecular markers predicting the clinical course of patients diagnosed with non-muscle-invasive bladder cancer are needed to improve treatment outcome. Here, we validated four previously reported gene expression signatures for molecular diagnosis of disease stage and carcinoma in situ (CIS) and for predicting disease recurrence and progression. EXPERIMENTAL DESIGN: We analyzed tumors from 404 patients diagnosed with bladder cancer in hospitals in Denmark, Sweden, England, Spain, and France using custom microarrays. Molecular classifications were compared with pathologic diagnosis and clinical outcome. RESULTS: Classification of disease stage using a 52-gene classifier was found to be highly significantly correlated with pathologic stage (P < 0.001). Furthermore, the classifier added information regarding disease progression of T(a) or T(1) tumors (P < 0.001). The molecular 88-gene progression classifier was highly significantly correlated with progression-free survival (P < 0.001) and cancer-specific survival (P = 0.001). Multivariate Cox regression analysis showed the progression classifier to be an independently significant variable associated with disease progression after adjustment for age, sex, stage, grade, and treatment (hazard ratio, 2.3; P = 0.007). The diagnosis of CIS using a 68-gene classifier showed a highly significant correlation with histopathologic CIS diagnosis (odds ratio, 5.8; P < 0.001) in multivariate logistic regression analysis. CONCLUSION: This multicenter validation study confirms in an independent series the clinical utility of molecular classifiers to predict the outcome of patients initially diagnosed with non-muscle-invasive bladder cancer. This information may be useful to better guide patient treatment.
Assuntos
Biomarcadores Tumorais/análise , Perfilação da Expressão Gênica , Neoplasias da Bexiga Urinária/genética , Neoplasias da Bexiga Urinária/patologia , Idoso , Biomarcadores Tumorais/genética , Progressão da Doença , Feminino , Expressão Gênica , Humanos , Estimativa de Kaplan-Meier , Masculino , Pessoa de Meia-Idade , Estadiamento de Neoplasias , Prognóstico , Neoplasias da Bexiga Urinária/mortalidadeRESUMO
BACKGROUND: Affymetrix 3' GeneChip microarrays are widely used to profile the expression of thousands of genes simultaneously. They differ from many other microarray types in that GeneChips are hybridised using a single labelled extract and because they contain multiple 'match' and 'mismatch' sequences for each transcript. Most algorithms extract the signal from GeneChip experiments in a sequence of separate steps, including background correction and normalisation, which inhibits the simultaneous use of all available information. They principally provide a point estimate of gene expression and, in contrast to BGX, do not fully integrate the uncertainty arising from potentially heterogeneous responses of the probes. RESULTS: BGX is a new Bioconductor R package that implements an integrated Bayesian approach to the analysis of 3' GeneChip data. The software takes into account additive and multiplicative error, non-specific hybridisation and replicate summarisation in the spirit of the model outlined in 1. It also provides a posterior distribution for the expression of each gene. Moreover, BGX can take into account probe affinity effects from probe sequence information where available. The package employs a novel adaptive Markov chain Monte Carlo (MCMC) algorithm that raises considerably the efficiency with which the posterior distributions are sampled from. Finally, BGX incorporates various ways to analyse the results, such as ranking genes by expression level as well as statistically based methods for estimating the amount of up and down regulated genes between two conditions. CONCLUSION: BGX performs well relative to other widely used methods at estimating expression levels and fold changes. It has the advantage that it provides a statistically sound measure of uncertainty for its estimates. BGX includes various analysis functions to visualise and exploit the rich output that is produced by the Bayesian model.
Assuntos
Algoritmos , Perfilação da Expressão Gênica/métodos , Modelos Genéticos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Linguagens de Programação , Análise de Sequência de DNA/métodos , Software , Teorema de Bayes , Simulação por Computador , Interpretação Estatística de Dados , Perfilação da Expressão Gênica/instrumentação , Modelos Estatísticos , Análise de Sequência com Séries de Oligonucleotídeos/instrumentação , Integração de SistemasRESUMO
BACKGROUND: Studies of differential expression that use Affymetrix GeneChip arrays are often carried out with a limited number of replicates. Reasons for this include financial considerations and limits on the available amount of RNA for sample preparation. In addition, failed hybridizations are not uncommon leading to a further reduction in the number of replicates available for analysis. Most existing methods for studying differential expression rely on the availability of replicates and the demand for alternative methods that require few or no replicates is high. RESULTS: We describe a statistical procedure for performing differential expression analysis without replicates. The procedure relies on a Bayesian integrated approach (BGX) to the analysis of Affymetrix GeneChips. The BGX method estimates a posterior distribution of expression for each gene and condition, from a simultaneous consideration of the available probe intensities representing the gene in a condition. Importantly, posterior distributions of expression are obtained regardless of the number of replicates available. We exploit these posterior distributions to create ranked gene lists that take into account the estimated expression difference as well as its associated uncertainty. We estimate the proportion of non-differentially expressed genes empirically, allowing an informed choice of cut-off for the ranked gene list, adapting an approach proposed by Efron. We assess the performance of the method, and compare it to those of other methods, on publicly available spike-in data sets, as well as in a proper biological setting. CONCLUSION: The method presented is a powerful tool for extracting information on differential expression from GeneChip expression studies with limited or no replicates.
Assuntos
Regulação da Expressão Gênica , Técnicas Genéticas , Análise de Sequência com Séries de Oligonucleotídeos , Algoritmos , Animais , Teorema de Bayes , Perfilação da Expressão Gênica , Humanos , Camundongos , Modelos Estatísticos , Hibridização de Ácido Nucleico , RNA/química , Curva ROCRESUMO
Formalin-fixed, paraffin-embedded (FFPE) tissues are an invaluable resource for clinical research. However, nucleic acids extracted from FFPE tissues are fragmented and chemically modified making them challenging to use in molecular studies. We analysed 23 fresh-frozen (FF), 35 FFPE and 38 paired FF/FFPE specimens, representing six different human tissue types (bladder, prostate and colon carcinoma; liver and colon normal tissue; reactive tonsil) in order to examine the potential use of FFPE samples in next-generation sequencing (NGS) based retrospective and prospective clinical studies. Two methods for DNA and three methods for RNA extraction from FFPE tissues were compared and were found to affect nucleic acid quantity and quality. DNA and RNA from selected FFPE and paired FF/FFPE specimens were used for exome and transcriptome analysis. Preparations of DNA Exome-Seq libraries was more challenging (29.5% success) than that of RNA-Seq libraries, presumably because of modifications to FFPE tissue-derived DNA. Libraries could still be prepared from RNA isolated from two-decade old FFPE tissues. Data were analysed using the CLC Bio Genomics Workbench and revealed systematic differences between FF and FFPE tissue-derived nucleic acid libraries. In spite of this, pairwise analysis of DNA Exome-Seq data showed concordance for 70-80% of variants in FF and FFPE samples stored for fewer than three years. RNA-Seq data showed high correlation of expression profiles in FF/FFPE pairs (Pearson Correlations of 0.90 +/- 0.05), irrespective of storage time (up to 244 months) and tissue type. A common set of 1,494 genes was identified with expression profiles that were significantly different between paired FF and FFPE samples irrespective of tissue type. Our results are promising and suggest that NGS can be used to study FFPE specimens in both prospective and retrospective archive-based studies in which FF specimens are not available.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Neoplasias/genética , Neoplasias/patologia , Inclusão em Parafina , Análise de Sequência de DNA/métodos , Análise de Sequência de RNA/métodos , Fixação de Tecidos , Criopreservação , DNA/genética , DNA/isolamento & purificação , Exoma/genética , Formaldeído/farmacologia , Perfilação da Expressão Gênica , Humanos , Proteínas Proto-Oncogênicas/genética , Proteínas Proto-Oncogênicas B-raf/genética , Proteínas Proto-Oncogênicas p21(ras) , RNA/genética , RNA/isolamento & purificação , Proteínas ras/genéticaRESUMO
BACKGROUND: Mechanisms underlying the malignant development in bladder cancer are still not well understood. Lipolysis stimulated lipoprotein receptor (LSR) has previously been found to be upregulated by P53. Furthermore, we have previously found LSR to be differentially expressed in bladder cancer. Here we investigated the role of LSR in bladder cancer. METHODS: A time course siRNA knock down experiment was performed to investigate the functional role of LSR in SW780 bladder cancer cells. Since LSR was previously shown to be regulated by P53, siRNA against TP53 was included in the experimental setup. We used Affymetrix GeneChips for measuring gene expression changes and we used Ingenuity Pathway Analysis to investigate the relationship among differentially expressed genes upon siRNA knockdown. RESULTS: By Ingenuity Pathway analysis of the microarray data from the different timepoints we identified six gene networks containing genes mainly related to the functional categories "cancer", "cell death", and "cellular movement". We determined that genes annotated to the functional category "cellular movement" including "invasion" and "cell motility" were highly significantly overrepresented. A matrigel assay showed that 24 h after transfection the invasion capacity was significantly increased 3-fold (p < 0.02) in LSR-siRNA transfected cells, and 2.7-fold (p < 0.02) in TP53-siRNA transfected cells compared to controls. After 48 h the motility capacity was significantly increased 3.5-fold (p < 0.004) in LSR-siRNA transfected cells, and 4.7-fold (p < 0.002) in TP53-siRNA transfected cells compared to controls. CONCLUSION: We conclude that LSR may impair bladder cancer cells from gaining invasive properties.
RESUMO
We present Bayesian hierarchical models for the analysis of Affymetrix GeneChip data. The approach we take differs from other available approaches in two fundamental aspects. Firstly, we aim to integrate all processing steps of the raw data in a common statistically coherent framework, allowing all components and thus associated errors to be considered simultaneously. Secondly, inference is based on the full posterior distribution of gene expression indices and derived quantities, such as fold changes or ranks, rather than on single point estimates. Measures of uncertainty on these quantities are thus available. The models presented represent the first building block for integrated Bayesian Analysis of Affymetrix GeneChip data: the models take into account additive as well as multiplicative error, gene expression levels are estimated using perfect match and a fraction of mismatch probes and are modeled on the log scale. Background correction is incorporated by modeling true signal and cross-hybridization explicitly, and a need for further normalization is considerably reduced by allowing for array-specific distributions of nonspecific hybridization. When replicate arrays are available for a condition, posterior distributions of condition-specific gene expression indices are estimated directly, by a simultaneous consideration of replicate probe sets, avoiding averaging over estimates obtained from individual replicate arrays. The performance of the Bayesian model is compared to that of standard available point estimate methods on subsets of the well known GeneLogic and Affymetrix spike-in data. The Bayesian model is found to perform well and the integrated procedure presented appears to hold considerable promise for further development.