RESUMO
Somatic mutation phasing informs our understanding of cancer-related events, like driver mutations. We generated linked-read whole genome sequencing data for 23 samples across disease stages from 14 multiple myeloma (MM) patients and systematically assigned somatic mutations to haplotypes using linked-reads. Here, we report the reconstructed cancer haplotypes and phase blocks from several MM samples and show how phase block length can be extended by integrating samples from the same individual. We also uncover phasing information in genes frequently mutated in MM, including DIS3, HIST1H1E, KRAS, NRAS, and TP53, phasing 79.4% of 20,705 high-confidence somatic mutations. In some cases, this enabled us to interpret clonal evolution models at higher resolution using pairs of phased somatic mutations. For example, our analysis of one patient suggested that two NRAS hotspot mutations occurred on the same haplotype but were independent events in different subclones. Given sufficient tumor purity and data quality, our framework illustrates how haplotype-aware analysis of somatic mutations in cancer can be beneficial for some cancer cases.
RESUMO
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigates tumors from a proteogenomic perspective, creating rich multi-omics datasets connecting genomic aberrations to cancer phenotypes. To facilitate pan-cancer investigations, we have generated harmonized genomic, transcriptomic, proteomic, and clinical data for >1000 tumors in 10 cohorts to create a cohesive and powerful dataset for scientific discovery. We outline efforts by the CPTAC pan-cancer working group in data harmonization, data dissemination, and computational resources for aiding biological discoveries. We also discuss challenges for multi-omics data integration and analysis, specifically the unique challenges of working with both nucleotide sequencing and mass spectrometry proteomics data.
Assuntos
Neoplasias , Proteogenômica , Humanos , Proteômica , Genômica , Neoplasias/genética , Perfilação da Expressão GênicaRESUMO
DNA methylation plays a critical role in establishing and maintaining cellular identity. However, it is frequently dysregulated during tumor development and is closely intertwined with other genetic alterations. Here, we leveraged multi-omic profiling of 687 tumors and matched non-involved adjacent tissues from the kidney, brain, pancreas, lung, head and neck, and endometrium to identify aberrant methylation associated with RNA and protein abundance changes and build a Pan-Cancer catalog. We uncovered lineage-specific epigenetic drivers including hypomethylated FGFR2 in endometrial cancer. We showed that hypermethylated STAT5A is associated with pervasive regulon downregulation and immune cell depletion, suggesting that epigenetic regulation of STAT5A expression constitutes a molecular switch for immunosuppression in squamous tumors. We further demonstrated that methylation subtype-enrichment information can explain cell-of-origin, intra-tumor heterogeneity, and tumor phenotypes. Overall, we identified cis-acting DNA methylation events that drive transcriptional and translational changes, shedding light on the tumor's epigenetic landscape and the role of its cell-of-origin.
Assuntos
Metilação de DNA , Neoplasias do Endométrio , Feminino , Humanos , Epigênese Genética , Multiômica , Regulação Neoplásica da Expressão Gênica , Neoplasias do Endométrio/genéticaRESUMO
Pediatric brain and spinal cancers are collectively the leading disease-related cause of death in children; thus, we urgently need curative therapeutic strategies for these tumors. To accelerate such discoveries, the Children's Brain Tumor Network (CBTN) and Pacific Pediatric Neuro-Oncology Consortium (PNOC) created a systematic process for tumor biobanking, model generation, and sequencing with immediate access to harmonized data. We leverage these data to establish OpenPBTA, an open collaborative project with over 40 scalable analysis modules that genomically characterize 1,074 pediatric brain tumors. Transcriptomic classification reveals universal TP53 dysregulation in mismatch repair-deficient hypermutant high-grade gliomas and TP53 loss as a significant marker for poor overall survival in ependymomas and H3 K28-mutant diffuse midline gliomas. Already being actively applied to other pediatric cancers and PNOC molecular tumor board decision-making, OpenPBTA is an invaluable resource to the pediatric oncology community.
RESUMO
Multiple myeloma (MM) is a highly refractory hematologic cancer. Targeted immunotherapy has shown promise in MM but remains hindered by the challenge of identifying specific yet broadly representative tumor markers. We analyzed 53 bone marrow (BM) aspirates from 41 MM patients using an unbiased, high-throughput pipeline for therapeutic target discovery via single-cell transcriptomic profiling, yielding 38 MM marker genes encoding cell-surface proteins and 15 encoding intracellular proteins. Of these, 20 candidate genes were highlighted that are not yet under clinical study, 11 of which were previously uncharacterized as therapeutic targets. The findings were cross-validated using bulk RNA sequencing, flow cytometry, and proteomic mass spectrometry of MM cell lines and patient BM, demonstrating high overall concordance across data types. Independent discovery using bulk RNA sequencing reiterated top candidates, further affirming the ability of single-cell transcriptomics to accurately capture marker expression despite limitations in sample size or sequencing depth. Target dynamics and heterogeneity were further examined using both transcriptomic and immuno-imaging methods. In summary, this study presents a robust and broadly applicable strategy for identifying tumor markers to better inform the development of targeted cancer therapy. SIGNIFICANCE: Single-cell transcriptomic profiling and multiomic cross-validation to uncover therapeutic targets identifies 38 myeloma marker genes, including 11 transcribing surface proteins with previously uncharacterized potential for targeted antitumor therapy.
Assuntos
Mieloma Múltiplo , Humanos , Mieloma Múltiplo/tratamento farmacológico , Mieloma Múltiplo/genética , Multiômica , Proteômica , Biomarcadores Tumorais/genética , Perfilação da Expressão Gênica/métodosRESUMO
Large compendia of gene expression data have proven valuable for the discovery of novel biological relationships. Historically, most available RNA assays were run on microarray, while RNA-seq is now the platform of choice for many new experiments. The data structure and distributions between the platforms differ, making it challenging to combine them directly. Here we perform supervised and unsupervised machine learning evaluations to assess which existing normalization methods are best suited for combining microarray and RNA-seq data. We find that quantile and Training Distribution Matching normalization allow for supervised and unsupervised model training on microarray and RNA-seq data simultaneously. Nonparanormal normalization and z-scores are also appropriate for some applications, including pathway analysis with Pathway-Level Information Extractor (PLIER). We demonstrate that it is possible to perform effective cross-platform normalization using existing methods to combine microarray and RNA-seq data for machine learning applications.
Assuntos
Perfilação da Expressão Gênica , Aprendizado de Máquina , RNA-Seq , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Análise em MicrossériesRESUMO
Multiple myeloma (MM) is characterized by the uncontrolled proliferation of plasma cells. Despite recent treatment advances, it is still incurable as disease progression is not fully understood. To investigate MM and its immune environment, we apply single cell RNA and linked-read whole genome sequencing to profile 29 longitudinal samples at different disease stages from 14 patients. Here, we collect 17,267 plasma cells and 57,719 immune cells, discovering patient-specific plasma cell profiles and immune cell expression changes. Patients with the same genetic alterations tend to have both plasma cells and immune cells clustered together. By integrating bulk genomics and single cell mapping, we track plasma cell subpopulations across disease stages and find three patterns: stability (from precancer to diagnosis), and gain or loss (from diagnosis to relapse). In multiple patients, we detect "B cell-featured" plasma cell subpopulations that cluster closely with B cells, implicating their cell of origin. We validate AP-1 complex differential expression (JUN and FOS) in plasma cell subpopulations using CyTOF-based protein assays, and integrated analysis of single-cell RNA and CyTOF data reveals AP-1 downstream targets (IL6 and IL1B) potentially leading to inflammation regulation. Our work represents a longitudinal investigation for tumor and microenvironment during MM progression and paves the way for expanding treatment options.
Assuntos
Linfócitos B/metabolismo , Regulação Neoplásica da Expressão Gênica/genética , Mieloma Múltiplo/genética , Mieloma Múltiplo/imunologia , Recidiva Local de Neoplasia/genética , Microambiente Tumoral/imunologia , Idoso , Linfócitos B/citologia , Linfócitos B/imunologia , Linhagem da Célula , Evolução Clonal/genética , Estudos de Coortes , Progressão da Doença , Feminino , Regulação Neoplásica da Expressão Gênica/imunologia , Haplótipos , Humanos , Interleucina-1beta/sangue , Interleucina-6/sangue , Masculino , Espectrometria de Massas , Pessoa de Meia-Idade , Família Multigênica , Mieloma Múltiplo/sangue , Mieloma Múltiplo/patologia , Mutação , Recidiva Local de Neoplasia/sangue , Recidiva Local de Neoplasia/imunologia , Proteínas Proto-Oncogênicas c-fos/sangue , Proteínas Proto-Oncogênicas c-jun/sangue , RNA-Seq , Transdução de Sinais/genética , Transdução de Sinais/imunologia , Análise de Célula ÚnicaRESUMO
Multiple myeloma is a plasma cell blood cancer with frequent chromosomal translocations leading to gene fusions. To determine the clinical relevance of fusion events, we detect gene fusions from a cohort of 742 patients from the Multiple Myeloma Research Foundation CoMMpass Study. Patients with multiple clinic visits enable us to track tumor and fusion evolution, and cases with matching peripheral blood and bone marrow samples allow us to evaluate the concordance of fusion calls in patients with high tumor burden. We examine the joint upregulation of WHSC1 and FGFR3 in samples with t(4;14)-related fusions, and we illustrate a method for detecting fusions from single cell RNA-seq. We report fusions at MYC and a neighboring gene, PVT1, which are related to MYC translocations and associated with divergent progression-free survival patterns. Finally, we find that 4% of patients may be eligible for targeted fusion therapies, including three with an NTRK1 fusion.
Assuntos
Fusão Gênica/genética , Histona-Lisina N-Metiltransferase/genética , Mieloma Múltiplo/genética , Proteínas Proto-Oncogênicas c-myc/genética , Receptor Tipo 3 de Fator de Crescimento de Fibroblastos/genética , Proteínas Repressoras/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Variações do Número de Cópias de DNA/genética , Perfilação da Expressão Gênica/métodos , Histona-Lisina N-Metiltransferase/biossíntese , Humanos , Imunoglobulinas/genética , Pessoa de Meia-Idade , Intervalo Livre de Progressão , RNA Longo não Codificante/genética , RNA-Seq/métodos , Receptor Tipo 3 de Fator de Crescimento de Fibroblastos/biossíntese , Receptor trkA/genética , Proteínas Repressoras/biossínteseRESUMO
Genetic alterations in signaling pathways that control cell-cycle progression, apoptosis, and cell growth are common hallmarks of cancer, but the extent, mechanisms, and co-occurrence of alterations in these pathways differ between individual tumors and tumor types. Using mutations, copy-number changes, mRNA expression, gene fusions and DNA methylation in 9,125 tumors profiled by The Cancer Genome Atlas (TCGA), we analyzed the mechanisms and patterns of somatic alterations in ten canonical pathways: cell cycle, Hippo, Myc, Notch, Nrf2, PI-3-Kinase/Akt, RTK-RAS, TGFß signaling, p53 and ß-catenin/Wnt. We charted the detailed landscape of pathway alterations in 33 cancer types, stratified into 64 subtypes, and identified patterns of co-occurrence and mutual exclusivity. Eighty-nine percent of tumors had at least one driver alteration in these pathways, and 57% percent of tumors had at least one alteration potentially targetable by currently available drugs. Thirty percent of tumors had multiple targetable alterations, indicating opportunities for combination therapy.
Assuntos
Bases de Dados Genéticas , Neoplasias/patologia , Transdução de Sinais/genética , Genes Neoplásicos , Humanos , Neoplasias/genética , Fosfatidilinositol 3-Quinases/genética , Fosfatidilinositol 3-Quinases/metabolismo , Fator de Crescimento Transformador beta/genética , Fator de Crescimento Transformador beta/metabolismo , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo , Proteínas Wnt/genética , Proteínas Wnt/metabolismoRESUMO
Gene fusions represent an important class of somatic alterations in cancer. We systematically investigated fusions in 9,624 tumors across 33 cancer types using multiple fusion calling tools. We identified a total of 25,664 fusions, with a 63% validation rate. Integration of gene expression, copy number, and fusion annotation data revealed that fusions involving oncogenes tend to exhibit increased expression, whereas fusions involving tumor suppressors have the opposite effect. For fusions involving kinases, we found 1,275 with an intact kinase domain, the proportion of which varied significantly across cancer types. Our study suggests that fusions drive the development of 16.5% of cancer cases and function as the sole driver in more than 1% of them. Finally, we identified druggable fusions involving genes such as TMPRSS2, RET, FGFR3, ALK, and ESR1 in 6.0% of cases, and we predicted immunogenic peptides, suggesting that fusions may provide leads for targeted drug and immune therapy.
Assuntos
Antineoplásicos/farmacologia , Carcinogênese/genética , Neoplasias/genética , Fusão Oncogênica , Antineoplásicos/uso terapêutico , Carcinogênese/efeitos dos fármacos , Carcinogênese/metabolismo , Linhagem Celular Tumoral , Humanos , Terapia de Alvo Molecular/métodos , Neoplasias/tratamento farmacológico , Neoplasias/patologia , Proteínas de Fusão Oncogênica/química , Proteínas de Fusão Oncogênica/genética , Proteínas de Fusão Oncogênica/metabolismoRESUMO
SUMMARY: MIRMMR predicts microsatellite instability status in cancer samples using methylation and mutation information, in contrast to existing methods that rely on observed microsatellites. Additionally, MIRMMR highlights those genetic alterations contributing to microsatellite instability. AVAILABILITY AND IMPLEMENTATION: Source code is freely available at https://github.com/ding-lab/MIRMMR under the MIT license, implemented in R and supported on Unix/OS X operating systems. CONTACT: smfoltz@wustl.edu or lding@wustl.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Metilação de DNA , Genômica/métodos , Instabilidade de Microssatélites , Mutação , Neoplasias/genética , Software , Humanos , Neoplasias/metabolismoRESUMO
CONTEXT: Financial and demographic pressures in US require an understanding of the most efficient distribution of physicians to maximize population-level health benefits. Prior work has assumed a constant negative relationship between physician supply and mortality outcomes throughout the US and has not addressed regional variation. METHODS: In this ecological analysis, geographically weighted regression was used to identify spatially varying relationships between local urologist density and prostate cancer mortality at the county level. Data from 1,492 counties in 30 eastern and southern states from 2006-2010 were analyzed. FINDINGS: The ordinary least squares (OLS) regression found that, on average, increasing urologist density by 1 urologist per 100,000 people resulted in an expected decrease in prostate cancer mortality of -0.499 deaths per 100,000 men (95% CI -0.709 to -0.289, p-value < 0.001), or a 1.5% decrease. Geographic weighted regression demonstrated that the addition of one urologist per 100,000 people in counties in the southern Mississippi River states of Arkansas, Mississippi, and Louisiana, as well as parts of Illinois, Indiana, and Wisconsin is associated with decrease of 0.411 to 0.916 in prostate cancer mortality per 100,000 men (1.6-3.6%). In contrast, the urologist density was not significantly associated with the prostate state mortality in the new England region. CONCLUSIONS: The strength of association between urologist density and prostate cancer mortality varied regionally. Those areas with the highest potential for effects could be targeted for increasing the supply of urologists, as it associated with the largest predicted improvement in prostate cancer mortality.