Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 62
Filtrar
1.
Science ; 385(6713): eadk9217, 2024 09 06.
Artigo em Inglês | MEDLINE | ID: mdl-39236169

RESUMO

To identify cancer-associated gene regulatory changes, we generated single-cell chromatin accessibility landscapes across eight tumor types as part of The Cancer Genome Atlas. Tumor chromatin accessibility is strongly influenced by copy number alterations that can be used to identify subclones, yet underlying cis-regulatory landscapes retain cancer type-specific features. Using organ-matched healthy tissues, we identified the "nearest healthy" cell types in diverse cancers, demonstrating that the chromatin signature of basal-like-subtype breast cancer is most similar to secretory-type luminal epithelial cells. Neural network models trained to learn regulatory programs in cancer revealed enrichment of model-prioritized somatic noncoding mutations near cancer-associated genes, suggesting that dispersed, nonrecurrent, noncoding mutations in cancer are functional. Overall, these data and interpretable gene regulatory models for cancer and healthy tissue provide a framework for understanding cancer-specific gene regulation.


Assuntos
Cromatina , Regulação Neoplásica da Expressão Gênica , Neoplasias , Análise de Célula Única , Humanos , Cromatina/metabolismo , Cromatina/genética , Neoplasias/genética , Redes Neurais de Computação , Mutação , Variações do Número de Cópias de DNA , Neoplasias da Mama/genética , Neoplasias da Mama/patologia
2.
Genome Biol ; 25(1): 220, 2024 Aug 14.
Artigo em Inglês | MEDLINE | ID: mdl-39143494

RESUMO

Inferring gene regulatory networks from single-cell RNA-sequencing trajectories has been an active area of research yet methods are still needed to identify regulators governing cell transitions. We developed DREAMIT (Dynamic Regulation of Expression Across Modules in Inferred Trajectories) to annotate transcription-factor activity along single-cell trajectory branches, using ensembles of relations to target genes. Using a benchmark representing several different tissues, as well as external validation with ATAC-Seq and Perturb-Seq data on hematopoietic cells, the method was found to have higher tissue-specific sensitivity and specificity over competing approaches.


Assuntos
Análise de Célula Única , Fatores de Transcrição , Análise de Célula Única/métodos , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Humanos , Redes Reguladoras de Genes , Análise de Sequência de RNA
3.
Cell Rep Methods ; 4(6): 100799, 2024 Jun 17.
Artigo em Inglês | MEDLINE | ID: mdl-38889686

RESUMO

The cellular components of tumors and their microenvironment play pivotal roles in tumor progression, patient survival, and the response to cancer treatments. Unveiling a comprehensive cellular profile within bulk tumors via single-cell RNA sequencing (scRNA-seq) data is crucial, as it unveils intrinsic tumor cellular traits that elude identification through conventional cancer subtyping methods. Our contribution, scBeacon, is a tool that derives cell-type signatures by integrating and clustering multiple scRNA-seq datasets to extract signatures for deconvolving unrelated tumor datasets on bulk samples. Through the employment of scBeacon on the The Cancer Genome Atlas (TCGA) cohort, we find cellular and molecular attributes within specific tumor categories, many with patient outcome relevance. We developed a tumor cell-type map to visually depict the relationships among TCGA samples based on the cell-type inferences.


Assuntos
Neoplasias , Análise de Célula Única , Microambiente Tumoral , Humanos , Microambiente Tumoral/genética , Análise de Célula Única/métodos , Neoplasias/genética , Neoplasias/patologia , Análise de Sequência de RNA , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Análise por Conglomerados
5.
Cell Syst ; 12(8): 827-838.e5, 2021 08 18.
Artigo em Inglês | MEDLINE | ID: mdl-34146471

RESUMO

The accurate identification and quantitation of RNA isoforms present in the cancer transcriptome is key for analyses ranging from the inference of the impacts of somatic variants to pathway analysis to biomarker development and subtype discovery. The ICGC-TCGA DREAM Somatic Mutation Calling in RNA (SMC-RNA) challenge was a crowd-sourced effort to benchmark methods for RNA isoform quantification and fusion detection from bulk cancer RNA sequencing (RNA-seq) data. It concluded in 2018 with a comparison of 77 fusion detection entries and 65 isoform quantification entries on 51 synthetic tumors and 32 cell lines with spiked-in fusion constructs. We report the entries used to build this benchmark, the leaderboard results, and the experimental features associated with the accurate prediction of RNA species. This challenge required submissions to be in the form of containerized workflows, meaning each of the entries described is easily reusable through CWL and Docker containers at https://github.com/SMC-RNA-challenge. A record of this paper's transparent peer review process is included in the supplemental information.


Assuntos
Neoplasias , Humanos , Neoplasias/genética , Isoformas de Proteínas/genética , RNA/genética , RNA-Seq , Análise de Sequência de RNA
6.
PLoS Comput Biol ; 17(4): e1008878, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33861732

RESUMO

Advancements in sequencing have led to the proliferation of multi-omic profiles of human cells under different conditions and perturbations. In addition, many databases have amassed information about pathways and gene "signatures"-patterns of gene expression associated with specific cellular and phenotypic contexts. An important current challenge in systems biology is to leverage such knowledge about gene coordination to maximize the predictive power and generalization of models applied to high-throughput datasets. However, few such integrative approaches exist that also provide interpretable results quantifying the importance of individual genes and pathways to model accuracy. We introduce AKLIMATE, a first kernel-based stacked learner that seamlessly incorporates multi-omics feature data with prior information in the form of pathways for either regression or classification tasks. AKLIMATE uses a novel multiple-kernel learning framework where individual kernels capture the prediction propensities recorded in random forests, each built from a specific pathway gene set that integrates all omics data for its member genes. AKLIMATE has comparable or improved performance relative to state-of-the-art methods on diverse phenotype learning tasks, including predicting microsatellite instability in endometrial and colorectal cancer, survival in breast cancer, and cell line response to gene knockdowns. We show how AKLIMATE is able to connect feature data across data platforms through their common pathways to identify examples of several known and novel contributors of cancer and synthetic lethality.


Assuntos
Genômica , Aprendizado de Máquina , Neoplasias/classificação , Neoplasias/genética , Linhagem Celular Tumoral , Técnicas de Silenciamento de Genes , Humanos , Fenótipo , RNA Interferente Pequeno/genética , Análise de Sobrevida
7.
Prostate Cancer Prostatic Dis ; 24(1): 81-87, 2021 03.
Artigo em Inglês | MEDLINE | ID: mdl-32286548

RESUMO

BACKGROUND: Metastatic disease burden out of proportion to serum PSA has been used as a marker of aggressive phenotype prostate cancer but is not well defined as a distinct subgroup. We sought to prospectively characterize the molecular features and clinical outcomes of Low PSA Secretors. METHODS: Eligible metastatic castration resistant prostate cancer (mCRPC) patients without prior small cell histology underwent metastatic tumor biopsy with molecular characterization. Low PSA secretion was defined as serum PSA < 2, 5, or 10 ng/mL plus >5 metastases with radiographic progression at study entry. Clinical and molecular features were compared between low PSA vs. normal secretors in a post-hoc fashion. RESULTS: 183 patients were enrolled, including 15 (8%) identified as Low PSA Secretors using optimal PSA cut point of 5 ng/mL. Biopsies from Low PSA Secretors demonstrated higher t-SCNC and RB1 loss and lower AR transcriptional signature scores compared with normal secretors. Genomic loss of RB1 and/or TP53 was more common in Low PSA Secretors (80% vs. 41%). Overall survival (OS) was shorter in Low PSA Secretors (median OS = 26.7 vs. 46.0 months, hazard ratio = 2.465 (95% CI: 0.982-6.183). Progression-free survival (PFS) on post-biopsy treatment with AR-targeted therapy was shorter than with chemotherapy (median PFS 6.2 vs. 4.1 months). CONCLUSIONS: Low PSA secretion in relation to metastatic tumor burden may be a readily available clinical selection tool for de-differentiated mCRPC with molecular features consistent with t-SCNC. Prospective validation is warranted.


Assuntos
Adenocarcinoma/sangue , Estadiamento de Neoplasias , Neoplasias de Próstata Resistentes à Castração/sangue , Proteínas de Ligação a Retinoblastoma/genética , Proteína Supressora de Tumor p53/genética , Ubiquitina-Proteína Ligases/genética , Adenocarcinoma/genética , Adenocarcinoma/secundário , Idoso , Idoso de 80 Anos ou mais , Biomarcadores Tumorais/sangue , Biópsia , DNA de Neoplasias/genética , Intervalo Livre de Doença , Feminino , Seguimentos , Genômica , Humanos , Masculino , Pessoa de Meia-Idade , Metástase Neoplásica , Antígeno Prostático Específico/sangue , Neoplasias de Próstata Resistentes à Castração/genética , Neoplasias de Próstata Resistentes à Castração/patologia , Proteínas de Ligação a Retinoblastoma/metabolismo , Estudos Retrospectivos , Proteína Supressora de Tumor p53/metabolismo , Ubiquitina-Proteína Ligases/metabolismo
8.
Urol Oncol ; 38(12): 931.e9-931.e16, 2020 12.
Artigo em Inglês | MEDLINE | ID: mdl-32624423

RESUMO

OBJECTIVES: The net oncogenic effect of ß2-adrenergic receptor ADRB2, whose downstream elements induce neuroendocrine differentiation and whose expression is regulated by EZH2, is unclear. ADRB2 expression and associated clinical outcomes in metastatic castration-resistant prostate cancer (mCRPC) are unknown. METHODS AND MATERIALS: This was a retrospective analysis of a multi-center, prospectively enrolled cohort of mCRPC patients. Metastatic biopsies were obtained at progression, and specimens underwent laser capture microdissection and RNA-seq. ADRB2 expression was stratified by histology and clustering based on unsupervised hierarchical transcriptome analysis and correlated with EZH2 expression; an external dataset was used for validation. The association between ADRB2 expression and overall survival (OS) was assessed by log-rank test and a multivariable Cox proportional hazard model. RESULTS: One hundred and twenty-seven patients with progressive mCRPC had sufficient metastatic tumor for RNA-seq. ADRB2 expression was lowest in the small cell-enriched transcriptional cluster (P < 0.01) and correlated inversely with EZH2 expression (r = -0.28, P < 0.01). These findings were validated in an external cohort enriched for neuroendocrine differentiation. Patients with tumors harboring low ADRB2 expression (lowest quartile) had a shorter median OS than those with higher (9.5 vs. 20.5 months, P = 0.02). In multivariable analysis, low ADRB2 expression was associated with a trend toward shorter OS (HR for death = 1.54, 95%CI 0.98-2.44). Conversely, higher expression of upstream transcriptional regulator EZH2 was associated with shortened OS (HR for death = 3.01, 95%CI 1.12-8.09). CONCLUSIONS: Low ADRB2 expression is associated with neuroendocrine differentiation and is associated with shortened survival. EZH2 is a potential therapeutic target for preventing neuroendocrine transdifferentiation and improving outcomes in mCRPC. Further studies of agents targeting ß-adrenergic signaling are warranted.


Assuntos
Carcinoma Neuroendócrino/genética , Carcinoma de Células Pequenas/genética , Regulação Neoplásica da Expressão Gênica , Neoplasias de Próstata Resistentes à Castração/genética , Idoso , Idoso de 80 Anos ou mais , Carcinoma Neuroendócrino/mortalidade , Carcinoma de Células Pequenas/mortalidade , Regulação para Baixo , Humanos , Masculino , Pessoa de Meia-Idade , Neoplasias de Próstata Resistentes à Castração/mortalidade , Receptores Adrenérgicos beta 2 , Estudos Retrospectivos , Taxa de Sobrevida
9.
Clin Cancer Res ; 26(17): 4616-4624, 2020 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-32727885

RESUMO

PURPOSE: The purpose of this study was to measure genomic changes that emerge with enzalutamide treatment using analyses of whole-genome sequencing and RNA sequencing. EXPERIMENTAL DESIGN: One hundred and one tumors from men with metastatic castration-resistant prostate cancer (mCRPC) who had not been treated with enzalutamide (n = 64) or who had enzalutamide-resistant mCRPC (n = 37) underwent whole genome sequencing. Ninety-nine of these tumors also underwent RNA sequencing. We analyzed the genomes and transcriptomes of these mCRPC tumors. RESULTS: Copy number loss was more common than gain in enzalutamide-resistant tumors. Specially, we identified 124 protein-coding genes that were more commonly lost in enzalutamide-resistant samples. These 124 genes included eight putative tumor suppressors located at nine distinct genomic regions. We demonstrated that focal deletion of the 17q22 locus that includes RNF43 and SRSF1 was not present in any patient with enzalutamide-naïve mCRPC but was present in 16% (6/37) of patients with enzalutamide-resistant mCRPC. 17q22 loss was associated with lower RNF43 and SRSF1 expression and poor overall survival from time of biopsy [median overall survival of 19.3 months in 17q22 intact vs. 8.9 months in 17q22 loss, HR, 3.44 95% confidence interval (CI), 1.338-8.867, log-rank P = 0.006]. Finally, 17q22 loss was linked with activation of several targetable factors, including CDK1/2, Akt, and PLK1, demonstrating the potential therapeutic relevance of 17q22 loss in mCRPC. CONCLUSIONS: Copy number loss is common in enzalutamide-resistant tumors. Focal deletion of chromosome 17q22 defines a previously unappreciated molecular subset of enzalutamide-resistant mCRPC associated with poor clinical outcome.


Assuntos
Benzamidas/farmacologia , Biomarcadores Tumorais/genética , Cromossomos Humanos Par 17/genética , Resistencia a Medicamentos Antineoplásicos/genética , Nitrilas/farmacologia , Feniltioidantoína/farmacologia , Neoplasias de Próstata Resistentes à Castração/genética , Benzamidas/uso terapêutico , Biópsia , Variações do Número de Cópias de DNA , Intervalo Livre de Doença , Humanos , Masculino , Nitrilas/uso terapêutico , Feniltioidantoína/uso terapêutico , Próstata/patologia , Neoplasias de Próstata Resistentes à Castração/tratamento farmacológico , Neoplasias de Próstata Resistentes à Castração/mortalidade , Neoplasias de Próstata Resistentes à Castração/patologia , RNA-Seq , Análise de Sobrevida
10.
Proc Natl Acad Sci U S A ; 117(22): 12315-12323, 2020 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-32424106

RESUMO

The androgen receptor (AR) antagonist enzalutamide is one of the principal treatments for men with castration-resistant prostate cancer (CRPC). However, not all patients respond, and resistance mechanisms are largely unknown. We hypothesized that genomic and transcriptional features from metastatic CRPC biopsies prior to treatment would be predictive of de novo treatment resistance. To this end, we conducted a phase II trial of enzalutamide treatment (160 mg/d) in 36 men with metastatic CRPC. Thirty-four patients were evaluable for the primary end point of a prostate-specific antigen (PSA)50 response (PSA decline ≥50% at 12 wk vs. baseline). Nine patients were classified as nonresponders (PSA decline <50%), and 25 patients were classified as responders (PSA decline ≥50%). Failure to achieve a PSA50 was associated with shorter progression-free survival, time on treatment, and overall survival, demonstrating PSA50's utility. Targeted DNA-sequencing was performed on 26 of 36 biopsies, and RNA-sequencing was performed on 25 of 36 biopsies that contained sufficient material. Using computational methods, we measured AR transcriptional function and performed gene set enrichment analysis (GSEA) to identify pathways whose activity state correlated with de novo resistance. TP53 gene alterations were more common in nonresponders, although this did not reach statistical significance (P = 0.055). AR gene alterations and AR expression were similar between groups. Importantly, however, transcriptional measurements demonstrated that specific gene sets-including those linked to low AR transcriptional activity and a stemness program-were activated in nonresponders. Our results suggest that patients whose tumors harbor this program should be considered for clinical trials testing rational agents to overcome de novo enzalutamide resistance.


Assuntos
Antineoplásicos/administração & dosagem , Resistencia a Medicamentos Antineoplásicos , Feniltioidantoína/análogos & derivados , Neoplasias de Próstata Resistentes à Castração/genética , Receptores Androgênicos/administração & dosagem , Receptores Androgênicos/genética , Idoso , Idoso de 80 Anos ou mais , Benzamidas , Perfilação da Expressão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Nitrilas , Feniltioidantoína/administração & dosagem , Antígeno Prostático Específico/metabolismo , Neoplasias de Próstata Resistentes à Castração/tratamento farmacológico , Neoplasias de Próstata Resistentes à Castração/metabolismo , Receptores Androgênicos/metabolismo
11.
JCO Clin Cancer Inform ; 4: 147-159, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-32097025

RESUMO

PURPOSE: The analysis of cancer biology data involves extremely heterogeneous data sets, including information from RNA sequencing, genome-wide copy number, DNA methylation data reporting on epigenetic regulation, somatic mutations from whole-exome or whole-genome analyses, pathology estimates from imaging sections or subtyping, drug response or other treatment outcomes, and various other clinical and phenotypic measurements. Bringing these different resources into a common framework, with a data model that allows for complex relationships as well as dense vectors of features, will unlock integrated data set analysis. METHODS: We introduce the BioMedical Evidence Graph (BMEG), a graph database and query engine for discovery and analysis of cancer biology. The BMEG is unique from other biologic data graphs in that sample-level molecular and clinical information is connected to reference knowledge bases. It combines gene expression and mutation data with drug-response experiments, pathway information databases, and literature-derived associations. RESULTS: The construction of the BMEG has resulted in a graph containing > 41 million vertices and 57 million edges. The BMEG system provides a graph query-based application programming interface to enable analysis, with client code available for Python, Javascript, and R, and a server online at bmeg.io. Using this system, we have demonstrated several forms of cross-data set analysis to show the utility of the system. CONCLUSION: The BMEG is an evolving resource dedicated to enabling integrative analysis. We have demonstrated queries on the system that illustrate mutation significance analysis, drug-response machine learning, patient-level knowledge-base queries, and pathway level analysis. We have compared the resulting graph to other available integrated graph systems and demonstrated the former is unique in the scale of the graph and the type of data it makes available.


Assuntos
Antineoplásicos/uso terapêutico , Biomarcadores Tumorais/genética , Biologia Computacional/métodos , Regulação Neoplásica da Expressão Gênica/efeitos dos fármacos , Informática Médica , Neoplasias/diagnóstico , Neoplasias/tratamento farmacológico , Gráficos por Computador , Bases de Dados Factuais , Redes Reguladoras de Genes , Humanos , Neoplasias/genética , Transdução de Sinais
12.
Nat Commun ; 11(1): 729, 2020 02 05.
Artigo em Inglês | MEDLINE | ID: mdl-32024854

RESUMO

The catalog of cancer driver mutations in protein-coding genes has greatly expanded in the past decade. However, non-coding cancer driver mutations are less well-characterized and only a handful of recurrent non-coding mutations, most notably TERT promoter mutations, have been reported. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancer across 38 tumor types, we perform multi-faceted pathway and network analyses of non-coding mutations across 2583 whole cancer genomes from 27 tumor types compiled by the ICGC/TCGA PCAWG project that was motivated by the success of pathway and network analyses in prioritizing rare mutations in protein-coding genes. While few non-coding genomic elements are recurrently mutated in this cohort, we identify 93 genes harboring non-coding mutations that cluster into several modules of interacting proteins. Among these are promoter mutations associated with reduced mRNA expression in TP53, TLE4, and TCF4. We find that biological processes had variable proportions of coding and non-coding mutations, with chromatin remodeling and proliferation pathways altered primarily by coding mutations, while developmental pathways, including Wnt and Notch, altered by both coding and non-coding mutations. RNA splicing is primarily altered by non-coding mutations in this cohort, and samples containing non-coding mutations in well-known RNA splicing factors exhibit similar gene expression signatures as samples with coding mutations in these genes. These analyses contribute a new repertoire of possible cancer genes and mechanisms that are altered by non-coding mutations and offer insights into additional cancer vulnerabilities that can be investigated for potential therapeutic treatments.


Assuntos
Regulação Neoplásica da Expressão Gênica , Mutação , Neoplasias/genética , Splicing de RNA , Montagem e Desmontagem da Cromatina , Biologia Computacional/métodos , Bases de Dados Genéticas , Genoma Humano , Humanos , Redes e Vias Metabólicas/genética , Neoplasias/metabolismo , Regiões Promotoras Genéticas
13.
Nat Biotechnol ; 38(1): 97-107, 2020 01.
Artigo em Inglês | MEDLINE | ID: mdl-31919445

RESUMO

Tumor DNA sequencing data can be interpreted by computational methods that analyze genomic heterogeneity to infer evolutionary dynamics. A growing number of studies have used these approaches to link cancer evolution with clinical progression and response to therapy. Although the inference of tumor phylogenies is rapidly becoming standard practice in cancer genome analyses, standards for evaluating them are lacking. To address this need, we systematically assess methods for reconstructing tumor subclonality. First, we elucidate the main algorithmic problems in subclonal reconstruction and develop quantitative metrics for evaluating them. Then we simulate realistic tumor genomes that harbor all known clonal and subclonal mutation types and processes. Finally, we benchmark 580 tumor reconstructions, varying tumor read depth, tumor type and somatic variant detection. Our analysis provides a baseline for the establishment of gold-standard methods to analyze tumor heterogeneity.


Assuntos
Algoritmos , Neoplasias/patologia , Células Clonais , Simulação por Computador , Variações do Número de Cópias de DNA/genética , Dosagem de Genes , Genoma , Humanos , Mutação/genética , Neoplasias/genética , Polimorfismo de Nucleotídeo Único/genética , Padrões de Referência
14.
Pac Symp Biocomput ; 25: 343-354, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-31797609

RESUMO

Cancer genome projects have produced multidimensional datasets on thousands of samples. Yet, depending on the tumor type, 5-50% of samples have no known driving event. We introduce a semi-supervised method called Learning UnRealized Events (LURE) that uses a progressive label learning framework and minimum spanning analysis to predict cancer drivers based on their altered samples sharing a gene expression signature with the samples of a known event. We demonstrate the utility of the method on the TCGA Pan-Cancer Atlas dataset for which it produced a high-confidence result relating 59 new connections to 18 known mutation events including alterations in the same gene, family, and pathway. We give examples of predicted drivers involved in TP53, telomere maintenance, and MAPK/RTK signaling pathways. LURE identifies connections between genes with no known prior relationship, some of which may offer clues for targeting specific forms of cancer. Code and Supplemental Material are available on the LURE website: https://sysbiowiki.soe.ucsc.edu/lure.


Assuntos
Biologia Computacional , Neoplasias , Humanos , Mutação , Neoplasias/genética
15.
Pac Symp Biocomput ; 24: 136-147, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30864317

RESUMO

Cancer is a complex collection of diseases that are to some degree unique to each patient. Precision oncology aims to identify the best drug treatment regime using molecular data on tumor samples. While omics-level data is becoming more widely available for tumor specimens, the datasets upon which computational learning methods can be trained vary in coverage from sample to sample and from data type to data type. Methods that can 'connect the dots' to leverage more of the information provided by these studies could offer major advantages for maximizing predictive potential. We introduce a multi-view machinelearning strategy called PLATYPUS that builds 'views' from multiple data sources that are all used as features for predicting patient outcomes. We show that a learning strategy that finds agreement across the views on unlabeled data increases the performance of the learning methods over any single view. We illustrate the power of the approach by deriving signatures for drug sensitivity in a large cancer cell line database. Code and additional information are available from the PLATYPUS website https://sysbiowiki.soe.ucsc.edu/platypus.


Assuntos
Resistencia a Medicamentos Antineoplásicos , Aprendizado de Máquina , Neoplasias/tratamento farmacológico , Antineoplásicos/uso terapêutico , Linhagem Celular Tumoral , Biologia Computacional/métodos , Bases de Dados Factuais , Resistencia a Medicamentos Antineoplásicos/genética , Humanos , Armazenamento e Recuperação da Informação , Aprendizado de Máquina/estatística & dados numéricos , Neoplasias/genética , Modelagem Computacional Específica para o Paciente , Variantes Farmacogenômicos , Medicina de Precisão , Software , Aprendizado de Máquina Supervisionado/estatística & dados numéricos
17.
Genome Biol ; 19(1): 188, 2018 11 06.
Artigo em Inglês | MEDLINE | ID: mdl-30400818

RESUMO

BACKGROUND: The phenotypes of cancer cells are driven in part by somatic structural variants. Structural variants can initiate tumors, enhance their aggressiveness, and provide unique therapeutic opportunities. Whole-genome sequencing of tumors can allow exhaustive identification of the specific structural variants present in an individual cancer, facilitating both clinical diagnostics and the discovery of novel mutagenic mechanisms. A plethora of somatic structural variant detection algorithms have been created to enable these discoveries; however, there are no systematic benchmarks of them. Rigorous performance evaluation of somatic structural variant detection methods has been challenged by the lack of gold standards, extensive resource requirements, and difficulties arising from the need to share personal genomic information. RESULTS: To facilitate structural variant detection algorithm evaluations, we create a robust simulation framework for somatic structural variants by extending the BAMSurgeon algorithm. We then organize and enable a crowdsourced benchmarking within the ICGC-TCGA DREAM Somatic Mutation Calling Challenge (SMC-DNA). We report here the results of structural variant benchmarking on three different tumors, comprising 204 submissions from 15 teams. In addition to ranking methods, we identify characteristic error profiles of individual algorithms and general trends across them. Surprisingly, we find that ensembles of analysis pipelines do not always outperform the best individual method, indicating a need for new ways to aggregate somatic structural variant detection approaches. CONCLUSIONS: The synthetic tumors and somatic structural variant detection leaderboards remain available as a community benchmarking resource, and BAMSurgeon is available at https://github.com/adamewing/bamsurgeon .


Assuntos
Benchmarking , Simulação por Computador , Crowdsourcing , Variação Genética , Genoma Humano , Genômica/métodos , Neoplasias/genética , Algoritmos , Bases de Dados Genéticas , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Software
19.
Cell ; 174(3): 758-769.e9, 2018 07 26.
Artigo em Inglês | MEDLINE | ID: mdl-30033370

RESUMO

While mutations affecting protein-coding regions have been examined across many cancers, structural variants at the genome-wide level are still poorly defined. Through integrative deep whole-genome and -transcriptome analysis of 101 castration-resistant prostate cancer metastases (109X tumor/38X normal coverage), we identified structural variants altering critical regulators of tumorigenesis and progression not detectable by exome approaches. Notably, we observed amplification of an intergenic enhancer region 624 kb upstream of the androgen receptor (AR) in 81% of patients, correlating with increased AR expression. Tandem duplication hotspots also occur near MYC, in lncRNAs associated with post-translational MYC regulation. Classes of structural variations were linked to distinct DNA repair deficiencies, suggesting their etiology, including associations of CDK12 mutation with tandem duplications, TP53 inactivation with inverted rearrangements and chromothripsis, and BRCA2 inactivation with deletions. Together, these observations provide a comprehensive view of how structural variations affect critical regulators in metastatic prostate cancer.


Assuntos
Variação Estrutural do Genoma/genética , Neoplasias da Próstata/genética , Idoso , Idoso de 80 Anos ou mais , Proteína BRCA2/metabolismo , Quinases Ciclina-Dependentes/metabolismo , Variações do Número de Cópias de DNA , Exoma , Perfilação da Expressão Gênica/métodos , Genômica/métodos , Humanos , Masculino , Pessoa de Meia-Idade , Mutação , Metástase Neoplásica/genética , Proteínas Proto-Oncogênicas c-myc/genética , Proteínas Proto-Oncogênicas c-myc/metabolismo , Receptores Androgênicos/genética , Receptores Androgênicos/metabolismo , Sequências de Repetição em Tandem/genética , Proteína Supressora de Tumor p53/metabolismo , Sequenciamento Completo do Genoma/métodos
20.
J Clin Oncol ; 36(24): 2492-2503, 2018 08 20.
Artigo em Inglês | MEDLINE | ID: mdl-29985747

RESUMO

Purpose The prevalence and features of treatment-emergent small-cell neuroendocrine prostate cancer (t-SCNC) are not well characterized in the era of modern androgen receptor (AR)-targeting therapy. We sought to characterize the clinical and genomic features of t-SCNC in a multi-institutional prospective study. Methods Patients with progressive, metastatic castration-resistant prostate cancer (mCRPC) underwent metastatic tumor biopsy and were followed for survival. Metastatic biopsy specimens underwent independent, blinded pathology review along with RNA/DNA sequencing. Results A total of 202 consecutive patients were enrolled. One hundred forty-eight (73%) had prior disease progression on abiraterone and/or enzalutamide. The biopsy evaluable rate was 79%. The overall incidence of t-SCNC detection was 17%. AR amplification and protein expression were present in 67% and 75%, respectively, of t-SCNC biopsy specimens. t-SCNC was detected at similar proportions in bone, node, and visceral organ biopsy specimens. Genomic alterations in the DNA repair pathway were nearly mutually exclusive with t-SCNC differentiation ( P = .035). Detection of t-SCNC was associated with shortened overall survival among patients with prior AR-targeting therapy for mCRPC (hazard ratio, 2.02; 95% CI, 1.07 to 3.82). Unsupervised hierarchical clustering of the transcriptome identified a small-cell-like cluster that further enriched for adverse survival outcomes (hazard ratio, 3.00; 95% CI, 1.25 to 7.19). A t-SCNC transcriptional signature was developed and validated in multiple external data sets with > 90% accuracy. Multiple transcriptional regulators of t-SCNC were identified, including the pancreatic neuroendocrine marker PDX1. Conclusion t-SCNC is present in nearly one fifth of patients with mCRPC and is associated with shortened survival. The near-mutual exclusivity with DNA repair alterations suggests t-SCNC may be a distinct subset of mCRPC. Transcriptional profiling facilitates the identification of t-SCNC and novel therapeutic targets.


Assuntos
Carcinoma Neuroendócrino/genética , Carcinoma Neuroendócrino/patologia , Neoplasias de Próstata Resistentes à Castração/genética , Neoplasias de Próstata Resistentes à Castração/patologia , Idoso , Idoso de 80 Anos ou mais , Carcinoma Neuroendócrino/epidemiologia , Reparo do DNA/genética , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Prospectivos , Neoplasias de Próstata Resistentes à Castração/epidemiologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA