ABSTRACT
We performed an extensive immunogenomic analysis of more than 10,000 tumors comprising 33 diverse cancer types by utilizing data compiled by TCGA. Across cancer types, we identified six immune subtypes-wound healing, IFN-γ dominant, inflammatory, lymphocyte depleted, immunologically quiet, and TGF-ß dominant-characterized by differences in macrophage or lymphocyte signatures, Th1:Th2 cell ratio, extent of intratumoral heterogeneity, aneuploidy, extent of neoantigen load, overall cell proliferation, expression of immunomodulatory genes, and prognosis. Specific driver mutations correlated with lower (CTNNB1, NRAS, or IDH1) or higher (BRAF, TP53, or CASP8) leukocyte levels across all cancers. Multiple control modalities of the intracellular and extracellular networks (transcription, microRNAs, copy number, and epigenetic processes) were involved in tumor-immune cell interactions, both across and within immune subtypes. Our immunogenomics pipeline to characterize these heterogeneous tumors and the resulting data are intended to serve as a resource for future targeted studies to further advance the field.
Subject(s)
Genomics/methods , Neoplasms , Adolescent , Adult , Aged , Aged, 80 and over , Child , Female , Humans , Interferon-gamma/genetics , Interferon-gamma/immunology , Macrophages/immunology , Male , Middle Aged , Neoplasms/classification , Neoplasms/genetics , Neoplasms/immunology , Prognosis , Th1-Th2 Balance/physiology , Transforming Growth Factor beta/genetics , Transforming Growth Factor beta/immunology , Wound Healing/genetics , Wound Healing/immunology , Young AdultABSTRACT
Mining gene expression profiles has proven valuable for identifying signatures serving as surrogates of cancer phenotypes. However, the similarities of such signatures across different cancer types have not been strong enough to conclude that they represent a universal biological mechanism shared among multiple cancer types. Here we present a computational method for generating signatures using an iterative process that converges to one of several precise attractors defining signatures representing biomolecular events, such as cell transdifferentiation or the presence of an amplicon. By analyzing rich gene expression datasets from different cancer types, we identified several such biomolecular events, some of which are universally present in all tested cancer types in nearly identical form. Although the method is unsupervised, we show that it often leads to attractors with strong phenotypic associations. We present several such multi-cancer attractors, focusing on three that are prominent and sharply defined in all cases: a mesenchymal transition attractor strongly associated with tumor stage, a mitotic chromosomal instability attractor strongly associated with tumor grade, and a lymphocyte-specific attractor.
Subject(s)
Computational Biology/methods , Models, Biological , Neoplasms/genetics , Algorithms , Data Mining , Databases, Genetic , Epithelial-Mesenchymal Transition , Gene Expression Profiling/methods , Genome/genetics , Humans , Kaplan-Meier Estimate , Kinetochores , Mitosis/genetics , Neoplasms/metabolism , Neoplasms/pathology , Oncogenes , Phenotype , PrognosisABSTRACT
Introduction: Fibroblast activation protein (FAP) is predominantly upregulated in various tumor microenvironments and scarcely expressed in normal tissues. Methods: We analyzed FAP across 1216 tissue samples covering 23 tumor types and 70 subtypes. Results: Elevated FAP levels were notable in breast, pancreatic, esophageal, and lung cancers. Using immunohistochemistry and RNAseq, a correlation between FAP gene and protein expression was found. Evaluating FAP's clinical significance, we assessed 29 cohorts from 12 clinical trials, including both mono and combination therapies with the PD-L1 inhibitor atezolizumab and chemotherapy. A trend links higher FAP expression to poorer prognosis, particularly in RCC, across both treatment arms. However, four cohorts showed improved survival with high FAP, while in four others, FAP had no apparent survival impact. Conclusions: Our results emphasize FAP's multifaceted role in therapy response, suggesting its potential as a cancer immunotherapy biomarker.
Subject(s)
Lung Neoplasms , Serine Endopeptidases , Humans , Serine Endopeptidases/genetics , Serine Endopeptidases/metabolism , Immunotherapy , Lung Neoplasms/pathology , Biomarkers, Tumor/genetics , Biomarkers, Tumor/metabolism , Fibroblasts/metabolism , Tumor Microenvironment/geneticsABSTRACT
Subclonal reconstruction algorithms use bulk DNA sequencing data to quantify parameters of tumor evolution, allowing an assessment of how cancers initiate, progress and respond to selective pressures. We launched the ICGC-TCGA (International Cancer Genome Consortium-The Cancer Genome Atlas) DREAM Somatic Mutation Calling Tumor Heterogeneity and Evolution Challenge to benchmark existing subclonal reconstruction algorithms. This 7-year community effort used cloud computing to benchmark 31 subclonal reconstruction algorithms on 51 simulated tumors. Algorithms were scored on seven independent tasks, leading to 12,061 total runs. Algorithm choice influenced performance substantially more than tumor features but purity-adjusted read depth, copy-number state and read mappability were associated with the performance of most algorithms on most tasks. No single algorithm was a top performer for all seven tasks and existing ensemble strategies were unable to outperform the best individual methods, highlighting a key research need. All containerized methods, evaluation code and datasets are available to support further assessment of the determinants of subclonal reconstruction accuracy and development of improved methods to understand tumor evolution.
ABSTRACT
Tau is a microtubule-associated protein (MAPT, tau) implicated in the pathogenesis of tauopathies, a spectrum of neurodegenerative disorders characterized by accumulation of hyperphosphorylated, aggregated tau. Because tau pathology can be distinct across diseases, a pragmatic therapeutic approach may be to intervene at the level of the tau transcript, as it makes no assumptions to mechanisms of tau toxicity. Here we performed a large library screen of locked-nucleic-acid (LNA)-modified antisense oligonucleotides (ASOs), where careful tiling of the MAPT locus resulted in the identification of hot spots for activity in the 3' UTR. Further modifications to the LNA design resulted in the generation of ASO-001933, which selectively and potently reduces tau in primary cultures from hTau mice, monkey, and human neurons. ASO-001933 was well tolerated and produced a robust, long-lasting reduction in tau protein in both mouse and cynomolgus monkey brain. In monkey, tau protein reduction was maintained in brain for 20 weeks post injection and corresponded with tau protein reduction in the cerebrospinal fluid (CSF). Our results demonstrate that LNA-ASOs exhibit excellent drug-like properties and sustained efficacy likely translating to infrequent, intrathecal dosing in patients. These data further support the development of LNA-ASOs against tau for the treatment of tauopathies.
ABSTRACT
Tumor DNA sequencing data can be interpreted by computational methods that analyze genomic heterogeneity to infer evolutionary dynamics. A growing number of studies have used these approaches to link cancer evolution with clinical progression and response to therapy. Although the inference of tumor phylogenies is rapidly becoming standard practice in cancer genome analyses, standards for evaluating them are lacking. To address this need, we systematically assess methods for reconstructing tumor subclonality. First, we elucidate the main algorithmic problems in subclonal reconstruction and develop quantitative metrics for evaluating them. Then we simulate realistic tumor genomes that harbor all known clonal and subclonal mutation types and processes. Finally, we benchmark 580 tumor reconstructions, varying tumor read depth, tumor type and somatic variant detection. Our analysis provides a baseline for the establishment of gold-standard methods to analyze tumor heterogeneity.
Subject(s)
Algorithms , Neoplasms/pathology , Clone Cells , Computer Simulation , DNA Copy Number Variations/genetics , Gene Dosage , Genome , Humans , Mutation/genetics , Neoplasms/genetics , Polymorphism, Single Nucleotide/genetics , Reference StandardsABSTRACT
Similar environmental risk factors have been implicated in different neuropsychiatric disorders (including major psychiatric and neurodegenerative diseases), indicating the existence of common epigenetic mechanisms underlying the pathogenesis shared by different illnesses. To investigate such commonality, we applied an unsupervised computational approach identifying several consensus co-expression and co-methylation signatures from a data cohort of postmortem prefrontal cortex (PFC) samples from individuals with six different neuropsychiatric disorders-schizophrenia, bipolar disorder, major depression, alcoholism, Alzheimer's and Parkinson's-as well as healthy controls. Among our results, we identified a pair of strongly interrelated co-expression and co-methylation (E-M) signatures showing consistent and significant disease association in multiple types of disorders. This E-M signature was enriched for interneuron markers, and we further demonstrated that it is unlikely for this enrichment to be due to varying subpopulation abundance of normal interneurons across samples. Moreover, gene set enrichment analysis revealed overrepresentation of stress-related biological processes in this E-M signature. Our integrative analysis of expression and methylation profiles, therefore, suggests a stress-related epigenetic mechanism in the brain, which could be associated with the pathogenesis of multiple neuropsychiatric diseases.
Subject(s)
Alcoholism/genetics , Alzheimer Disease/genetics , Bipolar Disorder/genetics , DNA Methylation , Depressive Disorder, Major/genetics , Parkinson Disease/genetics , Schizophrenia/genetics , Epigenesis, Genetic , Gene Regulatory Networks , HumansABSTRACT
PURPOSE: Idasanutlin is a selective small-molecule MDM2 antagonist. It activates the tumor suppressor TP53 and is in phase 3 clinical trial for acute myeloid leukemia. Nonclinical studies have shown that glucuronidation is the major metabolizing mechanism for idasanutlin and UGT1A3 is the major metabolizing enzyme. There are reported examples of UGT polymorphisms associated with drug metabolism or response. Thus, the aim of this analysis is to investigate if UGT polymorphism is associated with idasanutlin pharmacokinetics. METHOD: Idasanutlin clearance was derived and normalized from two phase I studies. Its clearance level was compared between patients with different genotypes at 44 non-monomorphic UGT SNPs. Several single-locus and multi-locus association analysis, including haplotype association analysis and pairwise SNP interaction (epistasis) analyses were performed to investigate if there is any association between UGT genotypes and idasanutlin clearance. RESULTS AND CONCLUSION: A total of 69 patients who have both idasanutlin pharmacokinetic data and UGT genotyping data were analyzed for association. The major clearance enzyme for idasanutlin, UGT1A3, has no association with idasanutlin clearance. Further single-locus and multi-locus association analyses also suggest that no significant UGT polymorphism association with idasanutlin clearance can be detected with the current datasets. However, the possibility of association with rare allele(s) of UGT family genes cannot be excluded due to the limited sample size of the current phase I studies.
Subject(s)
Glucuronosyltransferase/genetics , Neoplasms/genetics , Neoplasms/metabolism , Polymorphism, Single Nucleotide , Proto-Oncogene Proteins c-mdm2/antagonists & inhibitors , Pyrrolidines/pharmacokinetics , para-Aminobenzoates/pharmacokinetics , Humans , Neoplasms/drug therapy , Neoplasms/pathology , Prognosis , Pyrrolidines/pharmacology , Tissue Distribution , para-Aminobenzoates/pharmacologyABSTRACT
Exploring linkage disequilibrium (LD) patterns among the single nucleotide polymorphism (SNP) sites can improve the accuracy and cost-effectiveness of genomic association studies, whereby representative (tag) SNPs are identified to sufficiently represent the genomic diversity in populations. There has been considerable amount of effort in developing efficient algorithms to select tag SNPs from the growing large-scale data sets. Methods using the classical pairwise-LD and multi-locus LD measures have been proposed that aim to reduce the computational complexity and to increase the accuracy, respectively. The present work solves the tag SNP selection problem by efficiently balancing the computational complexity and accuracy, and improves the coverage in genomic diversity in a cost-effective manner. The employed algorithm makes use of mutual information to explore the multi-locus association between SNPs and can handle different data types and conditions. Experiments with benchmark HapMap data sets show comparable or better performance against the state-of-the-art algorithms. In particular, as a novel application, the genome-wide SNP tagging is performed in the 1000 Genomes Project data sets, and produced a well-annotated database of tagging variants that capture the common genotype diversity in 2,504 samples from 26 human populations. Compared to conventional methods, the algorithm requires as input only the genotype (or haplotype) sequences, can scale up to genome-wide analyses, and produces accurate solutions with more information-rich output, providing an improved platform for researchers towards the subsequent association studies.
Subject(s)
Algorithms , Chromosome Mapping/methods , Genome-Wide Association Study/methods , Polymorphism, Single Nucleotide , Base Sequence , Cluster Analysis , Databases, Genetic , Epistasis, Genetic , Expressed Sequence Tags , Genetic Association Studies , Haplotypes , Humans , Linkage Disequilibrium , Sequence Homology, Nucleic AcidABSTRACT
BACKGROUND: The winning model of the Sage Bionetworks/DREAM Breast Cancer Prognosis Challenge made use of several molecular features, called attractor metagenes, as well as another metagene defined by the average expression level of the two genes FGD3 and SUSD3. This is a follow-up study toward developing a breast cancer prognostic test derived from and improving upon that model. METHODS: We designed a feature selector facility calculating the prognostic scores of combinations of features, including those that we had used earlier, as well as those used in existing breast cancer biomarker assays, identifying the optimal selection of features for the test. RESULTS: The resulting test, called BCAM (Breast Cancer Attractor Metagenes), is universally applicable to all clinical subtypes and stages of breast cancer and does not make any use of breast cancer molecular subtype or hormonal status information, none of which provided additional prognostic value. BCAM is composed of several molecular features: the breast cancer-specific FGD3-SUSD3 metagene, four attractor metagenes present in multiple cancer types (CIN, MES, LYM, and END), three additional individual genes (CD68, DNAJB9, and CXCL12), tumor size, and the number of positive lymph nodes. CONCLUSIONS: Our analysis leads to the unexpected and remarkable suggestion that ER, PR, and HER2 status, or molecular subtype classification, do not provide additional prognostic value when the values of the FGD3-SUSD3 and attractor metagenes are taken into consideration. IMPACT: Our results suggest that BCAM's prognostic predictions show potential to outperform those resulting from existing breast cancer biomarker assays.
Subject(s)
Breast Neoplasms/genetics , Biomarkers, Tumor , Breast Neoplasms/mortality , Female , Gene Expression Profiling , Humans , Metagenomics , Prognosis , Survival RateABSTRACT
Janus kinase-2 (JAK2) supports breast cancer growth, and clinical trials testing JAK2 inhibitors are under way. In addition to the tumor epithelium, JAK2 is also expressed in other tissues including immune cells; whether the JAK2 mRNA levels in breast tumors correlate with outcomes has not been evaluated. Using a case-control design, JAK2 mRNA was measured in 223 archived breast tumors and associations with distant recurrence were evaluated by logistic regression. The frequency of correct pairwise comparisons of patient rankings based on JAK2 levels versus survival outcomes, the concordance index (CI), was evaluated using data from 2,460 patients in three cohorts. In the case-control study, increased JAK2 was associated with a decreasing risk of recurrence (multivariate P = 0.003, n = 223). Similarly, JAK2 was associated with a protective CI (<0.5) in the public cohorts: NETHERLANDS CI = 0.376, n = 295; METABRIC CI = 0.462, n = 1,981; OSLOVAL CI = 0.452, n = 184. Furthermore, JAK2 was strongly correlated with the favorable prognosis LYM metagene signature for infiltrating T cells (r = 0.5; P < 2 × 10(-16); n = 1,981) and with severe lymphocyte infiltration (P = 0.00003, n = 156). Moreover, the JAK1/2 inhibitor ruxolitinib potently inhibited the anti-CD3-dependent production of IFN-γ, a marker of the differentiation of Th cells along the tumor-inhibitory Th1 pathway. The potential for JAK2 inhibitors to interfere with the antitumor capacities of T cells should be evaluated.
Subject(s)
Breast Neoplasms/genetics , Breast Neoplasms/immunology , Gene Expression , Janus Kinase 2/genetics , Lymphocytes, Tumor-Infiltrating/immunology , Lymphocytes, Tumor-Infiltrating/metabolism , Breast Neoplasms/drug therapy , Breast Neoplasms/mortality , Breast Neoplasms/pathology , Case-Control Studies , Female , Humans , Janus Kinase 2/antagonists & inhibitors , Janus Kinase 2/metabolism , Protein Kinase Inhibitors/pharmacology , Protein Kinase Inhibitors/therapeutic use , RNA, Messenger/genetics , Recurrence , Treatment OutcomeABSTRACT
The accuracy with which cancer phenotypes can be predicted by selecting and combining molecular features is compromised by the large number of potential features available. In an effort to design a robust prognostic model to predict breast cancer survival, we hypothesized that signatures consisting of genes that are coexpressed in multiple cancer types should correspond to molecular events that are prognostic in all cancers, including breast cancer. We previously identified several such signatures--called attractor metagenes--in an analysis of multiple tumor types. We then tested our attractor metagene hypothesis as participants in the Sage Bionetworks-DREAM Breast Cancer Prognosis Challenge. Using a rich training data set that included gene expression and clinical features for breast cancer patients, we developed a prognostic model that was independently validated in a newly generated patient data set. We describe our model, which was based on three attractor metagenes associated with mitotic chromosomal instability, mesenchymal transition, or lymphocyte-based immune recruitment.
Subject(s)
Breast Neoplasms/diagnosis , Breast Neoplasms/genetics , Models, Biological , Breast Neoplasms/pathology , Chromosomal Instability/genetics , Databases, Genetic , Epithelial-Mesenchymal Transition/genetics , Female , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Genes, Neoplasm/genetics , Humans , Lymphocytes/metabolism , Mitosis/genetics , Prognosis , Survival AnalysisABSTRACT
We herein introduce an automated three-dimensional (3D) locomotion tracking and pose reconstruction system for rodents with superior robustness, rapidity, reliability, resolution, simplicity, and cost. An off-the-shelf composite infrared (IR) range camera was adopted to grab high-resolution depth images (640×480×2048 pixels at 20Hz) in our system for automated behavior analysis. For the inherent 3D structure of the depth images, we developed a compact algorithm to reconstruct the locomotion and body behavior with superior temporal and solid spatial resolution. Since the range camera operates in the IR spectrum, interference from the visible light spectrum did not affect the tracking performance. The accuracy of our system was 98.1±3.2%. We also validated the system, which yielded strong correlation with automated and manual tracking. Meanwhile, the system replicates a detailed dynamic rat model in virtual space, which demonstrates the movements of the extremities of the body and locomotion in detail on varied terrain.