Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 27
Filter
1.
Nucleic Acids Res ; 2024 Oct 26.
Article in English | MEDLINE | ID: mdl-39460617

ABSTRACT

The UCSC Genome Browser (https://genome.ucsc.edu) is a widely utilized web-based tool for visualization and analysis of genomic data, encompassing over 4000 assemblies from diverse organisms. Since its release in 2001, it has become an essential resource for genomics and bioinformatics research. Annotation data available on Genome Browser includes both internally created and maintained tracks as well as custom tracks and track hubs provided by the research community. This last year's updates include over 25 new annotation tracks such as the gnomAD 4.1 track on the human GRCh38/hg38 assembly, the addition of three new public hubs, and significant expansions to the Genome Archive[GenArk) system for interacting with the enormous variety of assemblies. We have also made improvements to our interface, including updates to the browser graphic page, such as a new popup dialog feature that now displays item details without requiring navigation away from the main Genome Browser page. GenePred tracks have been upgraded with right-click options for zooming and precise navigation, along with enhanced mouseOver functions. Additional improvements include a new grouping feature for track hubs and hub description info links. A new tutorial focusing on Clinical Genetics has also been added to the UCSC Genome Browser.

2.
Int J Cancer ; 151(11): 1947-1959, 2022 Dec 01.
Article in English | MEDLINE | ID: mdl-35837755

ABSTRACT

The incidence of esophageal squamous cell carcinoma (ESCC) is disproportionately high in the eastern corridor of Africa and parts of Asia. Emerging research has identified a potential association between poor oral health and ESCC. One possible link between poor oral health and ESCC involves the alteration of the microbiome. We performed an integrated analysis of four independent sequencing efforts of ESCC tumors from patients from high- and low-incidence regions of the world. Using whole genome sequencing (WGS) and RNA sequencing (RNAseq) of ESCC tumors from 61 patients in Tanzania, we identified a community of bacteria, including members of the genera Fusobacterium, Selenomonas, Prevotella, Streptococcus, Porphyromonas, Veillonella and Campylobacter, present at high abundance in ESCC tumors. We then characterized the microbiome of 238 ESCC tumor specimens collected in two additional independent sequencing efforts consisting of patients from other high-ESCC incidence regions (Tanzania, Malawi, Kenya, Iran, China). This analysis revealed similar ESCC-associated bacterial communities in these cancers. Because these genera are traditionally considered members of the oral microbiota, we next explored whether there was a relationship between the synchronous saliva and tumor microbiomes of ESCC patients in Tanzania. Comparative analyses revealed that paired saliva and tumor microbiomes were significantly similar with a specific enrichment of Fusobacterium and Prevotella in the tumor microbiome. Together, these data indicate that cancer-associated oral bacteria are associated with ESCC tumors at the time of diagnosis and support a model in which oral bacteria are present in high abundance in both saliva and tumors of some ESCC patients.


Subject(s)
Esophageal Neoplasms , Esophageal Squamous Cell Carcinoma , Microbiota , Bacteria/genetics , Esophageal Neoplasms/genetics , Humans , Kenya , Microbiota/genetics
3.
Breast Cancer Res ; 22(1): 12, 2020 01 28.
Article in English | MEDLINE | ID: mdl-31992350

ABSTRACT

BACKGROUND: Breast cancer intrinsic molecular subtype (IMS) as classified by the expression-based PAM50 assay is considered a strong prognostic feature, even when controlled for by standard clinicopathological features such as age, grade, and nodal status, yet the molecular testing required to elucidate these subtypes is not routinely performed. Furthermore, when such bulk assays as RNA sequencing are performed, intratumoral heterogeneity that may affect prognosis and therapeutic decision-making can be missed. METHODS: As a more facile and readily available method for determining IMS in breast cancer, we developed a deep learning approach for approximating PAM50 intrinsic subtyping using only whole-slide images of H&E-stained breast biopsy tissue sections. This algorithm was trained on images from 443 tumors that had previously undergone PAM50 subtyping to classify small patches of the images into four major molecular subtypes-Basal-like, HER2-enriched, Luminal A, and Luminal B-as well as Basal vs. non-Basal. The algorithm was subsequently used for subtype classification of a held-out set of 222 tumors. RESULTS: This deep learning image-based classifier correctly subtyped the majority of samples in the held-out set of tumors. However, in many cases, significant heterogeneity was observed in assigned subtypes across patches from within a single whole-slide image. We performed further analysis of heterogeneity, focusing on contrasting Luminal A and Basal-like subtypes because classifications from our deep learning algorithm-similar to PAM50-are associated with significant differences in survival between these two subtypes. Patients with tumors classified as heterogeneous were found to have survival intermediate between Luminal A and Basal patients, as well as more varied levels of hormone receptor expression patterns. CONCLUSIONS: Here, we present a method for minimizing manual work required to identify cancer-rich patches among all multiscale patches in H&E-stained WSIs that can be generalized to any indication. These results suggest that advanced deep machine learning methods that use only routinely collected whole-slide images can approximate RNA-seq-based molecular tests such as PAM50 and, importantly, may increase detection of heterogeneous tumors that may require more detailed subtype analysis.


Subject(s)
Biomarkers, Tumor/genetics , Breast Neoplasms/mortality , Breast Neoplasms/pathology , Deep Learning , Gene Expression Regulation, Neoplastic , Image Processing, Computer-Assisted/methods , Molecular Typing/methods , Breast Neoplasms/classification , Breast Neoplasms/genetics , Female , Humans , Neoplasm Grading , Receptor, ErbB-2/metabolism , Survival Rate
4.
Breast Cancer Res ; 19(1): 44, 2017 03 29.
Article in English | MEDLINE | ID: mdl-28356166

ABSTRACT

BACKGROUND: Breast cancer is a heterogeneous disease at the clinical and molecular level. In this study we integrate classifications extracted from five different molecular levels in order to identify integrated subtypes. METHODS: Tumor tissue from 425 patients with primary breast cancer from the Oslo2 study was cut and blended, and divided into fractions for DNA, RNA and protein isolation and metabolomics, allowing the acquisition of representative and comparable molecular data. Patients were stratified into groups based on their tumor characteristics from five different molecular levels, using various clustering methods. Finally, all previously identified and newly determined subgroups were combined in a multilevel classification using a "cluster-of-clusters" approach with consensus clustering. RESULTS: Based on DNA copy number data, tumors were categorized into three groups according to the complex arm aberration index. mRNA expression profiles divided tumors into five molecular subgroups according to PAM50 subtyping, and clustering based on microRNA expression revealed four subgroups. Reverse-phase protein array data divided tumors into five subgroups. Hierarchical clustering of tumor metabolic profiles revealed three clusters. Combining DNA copy number and mRNA expression classified tumors into seven clusters based on pathway activity levels, and tumors were classified into ten subtypes using integrative clustering. The final consensus clustering that incorporated all aforementioned subtypes revealed six major groups. Five corresponded well with the mRNA subtypes, while a sixth group resulted from a split of the luminal A subtype; these tumors belonged to distinct microRNA clusters. Gain-of-function studies using MCF-7 cells showed that microRNAs differentially expressed between the luminal A clusters were important for cancer cell survival. These microRNAs were used to validate the split in luminal A tumors in four independent breast cancer cohorts. In two cohorts the microRNAs divided tumors into subgroups with significantly different outcomes, and in another a trend was observed. CONCLUSIONS: The six integrated subtypes identified confirm the heterogeneity of breast cancer and show that finer subdivisions of subtypes are evident. Increasing knowledge of the heterogeneity of the luminal A subtype may add pivotal information to guide therapeutic choices, evidently bringing us closer to improved treatment for this largest subgroup of breast cancer.


Subject(s)
Biomarkers, Tumor , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Cluster Analysis , Breast Neoplasms/epidemiology , Breast Neoplasms/mortality , DNA Copy Number Variations , Female , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Gene Regulatory Networks , Humans , Metabolic Networks and Pathways , Metabolomics/methods , MicroRNAs/genetics , Norway/epidemiology , Prognosis , RNA, Messenger/genetics
5.
Proc Natl Acad Sci U S A ; 111(44): E4726-35, 2014 Nov 04.
Article in English | MEDLINE | ID: mdl-25339441

ABSTRACT

The acute cellular response to stress generates a subpopulation of reversibly stress-tolerant cells under conditions that are lethal to the majority of the population. Stress tolerance is attributed to heterogeneity of gene expression within the population to ensure survival of a minority. We performed whole transcriptome sequencing analyses of metastatic human breast cancer cells subjected to the chemotherapeutic agent paclitaxel at the single-cell and population levels. Here we show that specific transcriptional programs are enacted within untreated, stressed, and drug-tolerant cell groups while generating high heterogeneity between single cells within and between groups. We further demonstrate that drug-tolerant cells contain specific RNA variants residing in genes involved in microtubule organization and stabilization, as well as cell adhesion and cell surface signaling. In addition, the gene expression profile of drug-tolerant cells is similar to that of untreated cells within a few doublings. Thus, single-cell analyses reveal the dynamics of the stress response in terms of cell-specific RNA variants driving heterogeneity, the survival of a minority population through generation of specific RNA variants, and the efficient reconversion of stress-tolerant cells back to normalcy.


Subject(s)
Antineoplastic Agents, Phytogenic/pharmacology , Breast Neoplasms , Drug Resistance, Neoplasm , Paclitaxel/pharmacology , RNA, Neoplasm , Sequence Analysis, RNA , Transcription, Genetic , Breast Neoplasms/drug therapy , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Breast Neoplasms/pathology , Cell Line, Tumor , Drug Resistance, Neoplasm/drug effects , Drug Resistance, Neoplasm/genetics , Female , Humans , RNA, Neoplasm/biosynthesis , RNA, Neoplasm/genetics , Transcription, Genetic/drug effects , Transcription, Genetic/genetics
6.
Breast Cancer Res Treat ; 154(1): 23-32, 2015 Nov.
Article in English | MEDLINE | ID: mdl-26456572

ABSTRACT

FOXM1 is a key transcription factor regulating cell cycle progression, DNA damage response, and a host of other hallmark cancer features, but the role of the FOXM1 cistrome in driving estrogen receptor-positive (ER+) versus estrogen receptor-negative (ER-) breast cancer clinical outcomes remains undefined. Chromatin immunoprecipitation sequencing (ChIP-Seq) coupled with RNA sequencing (RNA-Seq) analyses was used to identify FOXM1 target genes in breast cancer cells (MCF-7) where FOXM1 expression was either induced by cell proliferation or repressed by p53 upregulation. The prognostic performance of these FOXM1 target genes was assessed relative to FOXM transcript levels and a 61-gene proliferation score (PS) for their ability to dichotomize a pooled cohort of 683 adjuvant chemotherapy-naĆÆve, node-negative breast cancer cases (447 ER+, 236 ER-). Differences in distant metastasis-free survival (DMFS) between the dichotomized expression groups were determined by Cox proportional hazard modeling. Proliferation-associated FOXM1 upregulation induced a set of 145 differentially bound and expressed genes (direct targets), and these demonstrated minimal overlap with differentially bound and expressed genes following FOXM1 repression by p53 upregulation. This proliferation-associated FOXM1 cistrome was not only better at significantly predicting metastatic outcome of ER+ breast cancers (HR: 2.8 (2.0-3.8), pĀ =Ā 8.13E-10), but was the only parameter trending toward significance in predicting ER- metastatic outcome (HR: 1.6 (0.9-2.9), pĀ =Ā 0.087). Our findings demonstrate that FOXM1 target genes are highly dependent on the cellular context in which FOXM1 expression is modulated, and a newly identified proliferation-associated FOXM1 cistromic signature best predicts breast cancer metastatic outcome.


Subject(s)
Breast Neoplasms/genetics , Forkhead Transcription Factors/genetics , Genes/genetics , Prognosis , Breast Neoplasms/pathology , Cell Proliferation/genetics , Disease-Free Survival , Estrogen Receptor alpha/genetics , Female , Forkhead Box Protein M1 , Forkhead Transcription Factors/biosynthesis , Gene Expression Regulation, Neoplastic , Humans , Kaplan-Meier Estimate , MCF-7 Cells , Mitotic Index , Neoplasm Proteins/biosynthesis
7.
Nucleic Acids Res ; 41(Web Server issue): W218-24, 2013 Jul.
Article in English | MEDLINE | ID: mdl-23748957

ABSTRACT

High-throughput data sets such as genome-wide protein-protein interactions, protein-DNA interactions and gene expression data have been published for several model systems, especially for human cancer samples. The University of California, Santa Cruz (UCSC) Interaction Browser (http://sysbio.soe.ucsc.edu/nets) is an online tool for biologists to view high-throughput data sets simultaneously for the analysis of functional relationships between biological entities. Users can access several public interaction networks and functional genomics data sets through the portal as well as upload their own networks and data sets for analysis. Users can navigate through correlative relationships for focused sets of genes belonging to biological pathways using a standard web browser. Using a new visual modality called the CircleMap, multiple 'omics' data sets can be viewed simultaneously within the context of curated, predicted, directed and undirected regulatory interactions. The Interaction Browser provides an integrative viewing of biological networks based on the consensus of many observations about genes and their products, which may provide new insights about normal and disease processes not obvious from any isolated data set.


Subject(s)
Gene Regulatory Networks , Software , Colorectal Neoplasms/genetics , Computer Graphics , DNA Copy Number Variations , DNA Methylation , Gene Expression , Genomics , Humans , Internet , Mutation , Protein Interaction Mapping
8.
Proc Natl Acad Sci U S A ; 109(8): 2802-7, 2012 Feb 21.
Article in English | MEDLINE | ID: mdl-21908711

ABSTRACT

We use an integrated approach to understand breast cancer heterogeneity by modeling mRNA, copy number alterations, microRNAs, and methylation in a pathway context utilizing the pathway recognition algorithm using data integration on genomic models (PARADIGM). We demonstrate that combining mRNA expression and DNA copy number classified the patients in groups that provide the best predictive value with respect to prognosis and identified key molecular and stromal signatures. A chronic inflammatory signature, which promotes the development and/or progression of various epithelial tumors, is uniformly present in all breast cancers. We further demonstrate that within the adaptive immune lineage, the strongest predictor of good outcome is the acquisition of a gene signature that favors a high T-helper 1 (Th1)/cytotoxic T-lymphocyte response at the expense of Th2-driven humoral immunity. Patients who have breast cancer with a basal HER2-negative molecular profile (PDGM2) are characterized by high expression of protumorigenic Th2/humoral-related genes (24-38%) and a low Th1/Th2 ratio. The luminal molecular subtypes are again differentiated by low or high FOXM1 and ERBB4 signaling. We show that the interleukin signaling profiles observed in invasive cancers are absent or weakly expressed in healthy tissue but already prominent in ductal carcinoma in situ, together with ECM and cell-cell adhesion regulating pathways. The most prominent difference between low and high mammographic density in healthy breast tissue by PARADIGM was that of STAT4 signaling. In conclusion, by means of a pathway-based modeling methodology (PARADIGM) integrating different layers of molecular data from whole-tumor samples, we demonstrate that we can stratify immune signatures that predict patient survival.


Subject(s)
Breast Neoplasms/blood supply , Breast Neoplasms/genetics , Carcinoma, Intraductal, Noninfiltrating/genetics , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Interleukins/metabolism , Signal Transduction/genetics , Algorithms , Breast Neoplasms/classification , Breast Neoplasms/pathology , Carcinoma, Intraductal, Noninfiltrating/classification , Carcinoma, Intraductal, Noninfiltrating/immunology , Carcinoma, Intraductal, Noninfiltrating/pathology , Databases, Genetic , Female , Genomics , Humans , Lymphocyte Count , Lymphocytes, Tumor-Infiltrating/immunology , Mammography , Neoplasm Invasiveness , Prognosis , Reproducibility of Results , Survival Analysis , Th1 Cells/immunology , Th2 Cells/immunology
9.
Bioinformatics ; 29(13): i62-70, 2013 Jul 01.
Article in English | MEDLINE | ID: mdl-23813010

ABSTRACT

UNLABELLED: High-dimensional '-omics' profiling provides a detailed molecular view of individual cancers; however, understanding the mechanisms by which tumors evade cellular defenses requires deep knowledge of the underlying cellular pathways within each cancer sample. We extended the PARADIGM algorithm (Vaske et al., 2010, Bioinformatics, 26, i237-i245), a pathway analysis method for combining multiple '-omics' data types, to learn the strength and direction of 9139 gene and protein interactions curated from the literature. Using genomic and mRNA expression data from 1936 samples in The Cancer Genome Atlas (TCGA) cohort, we learned interactions that provided support for and relative strength of 7138 (78%) of the curated links. Gene set enrichment found that genes involved in the strongest interactions were significantly enriched for transcriptional regulation, apoptosis, cell cycle regulation and response to tumor cells. Within the TCGA breast cancer cohort, we assessed different interaction strengths between breast cancer subtypes, and found interactions associated with the MYC pathway and the ER alpha network to be among the most differential between basal and luminal A subtypes. PARADIGM with the Naive Bayesian assumption produced gene activity predictions that, when clustered, found groups of patients with better separation in survival than both the original version of PARADIGM and a version without the assumption. We found that this Naive Bayes assumption was valid for the vast majority of co-regulators, indicating that most co-regulators act independently on their shared target. AVAILABILITY: http://paradigm.five3genomics.com. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Algorithms , Gene Expression Regulation, Neoplastic , Bayes Theorem , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Cluster Analysis , Estrogen Receptor alpha/metabolism , Female , Gene Expression Profiling , Genomics , Humans , Models, Statistical , Neoplasms/genetics , Ovarian Neoplasms/genetics , Ovarian Neoplasms/metabolism , Ovarian Neoplasms/mortality , Survival Analysis
10.
Cancer Discov ; 14(9): 1631-1652, 2024 Sep 04.
Article in English | MEDLINE | ID: mdl-39058036

ABSTRACT

Merkel cell carcinoma (MCC) is an aggressive neuroendocrine skin cancer with a Ć¢ĀˆĀ¼50% response rate to immune checkpoint blockade (ICB) therapy. To identify predictive biomarkers, we integrated bulk and single-cell RNA sequencing (RNA-seq) with spatial transcriptomics from a cohort of 186 samples from 116 patients, including bulk RNA-seq from 14 matched pairs pre- and post-ICB. In nonresponders, tumors show evidence of increased tumor proliferation, neuronal stem cell markers, and IL1. Responders have increased type I/II interferons and preexisting tissue resident (Trm) CD8 or VƎĀ“1 ƎĀ³ĆŽĀ“ T cells that functionally converge with overlapping antigen-specific transcriptional programs and clonal expansion of public T-cell receptors. Spatial transcriptomics demonstrated colocalization of T cells with B and dendritic cells, which supply chemokines and costimulation. Lastly, ICB significantly increased clonal expansion or recruitment of Trm and VƎĀ“1 cells in tumors specifically in responders, underscoring their therapeutic importance. These data identify potential clinically actionable biomarkers and therapeutic targets for MCC. Significance: MCC serves as a model of ICB response. We utilized the largest-to-date, multimodal MCC dataset (n = 116 patients) to uncover unique tumor-intrinsic properties and immune circuits that predict response. We identified CD8 Trm and VƎĀ“1 T cells as clinically actionable mediators of ICB response in major histocompatibility complex-high and -low MCCs, respectively.


Subject(s)
CD8-Positive T-Lymphocytes , Carcinoma, Merkel Cell , Immunotherapy , Skin Neoplasms , Humans , Carcinoma, Merkel Cell/immunology , Carcinoma, Merkel Cell/drug therapy , Carcinoma, Merkel Cell/pathology , Carcinoma, Merkel Cell/therapy , CD8-Positive T-Lymphocytes/immunology , CD8-Positive T-Lymphocytes/metabolism , Skin Neoplasms/immunology , Skin Neoplasms/drug therapy , Skin Neoplasms/pathology , Skin Neoplasms/therapy , Skin Neoplasms/genetics , Immunotherapy/methods , Immune Checkpoint Inhibitors/therapeutic use , Immune Checkpoint Inhibitors/pharmacology
11.
Nucleic Acids Res ; 39(Database issue): D951-9, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21059681

ABSTRACT

The UCSC Cancer Genomics Browser (https://genome-cancer.ucsc.edu) comprises a suite of web-based tools to integrate, visualize and analyze cancer genomics and clinical data. The browser displays whole-genome views of genome-wide experimental measurements for multiple samples alongside their associated clinical information. Multiple data sets can be viewed simultaneously as coordinated 'heatmap tracks' to compare across studies or different data modalities. Users can order, filter, aggregate, classify and display data interactively based on any given feature set including clinical features, annotated biological pathways and user-contributed collections of genes. Integrated standard statistical tools provide dynamic quantitative analysis within all available data sets. The browser hosts a growing body of publicly available cancer genomics data from a variety of cancer types, including data generated from the Cancer Genome Atlas project. Multiple consortiums use the browser on confidential prepublication data enabled by private installations. Many new features have been added, including the hgMicroscope tumor image viewer, hgSignature for real-time genomic signature evaluation on any browser track, and 'PARADIGM' pathway tracks to display integrative pathway activities. The browser is integrated with the UCSC Genome Browser; thus inheriting and integrating the Genome Browser's rich set of human biology and genetics data that enhances the interpretability of the cancer genomics data.


Subject(s)
Databases, Genetic , Genomics , Neoplasms/genetics , DNA Copy Number Variations , Gene Expression , Genome, Human , Humans , Internet , Neoplasms/metabolism , Neoplasms/pathology , Software
12.
Cancer Epidemiol Biomarkers Prev ; 32(10): 1411-1420, 2023 10 02.
Article in English | MEDLINE | ID: mdl-37505926

ABSTRACT

BACKGROUND: Esophageal squamous cell carcinoma (ESCC) comprises 90% of all esophageal cancer cases globally and is the most common histology in low-resource settings. Eastern Africa has a disproportionately high incidence of ESCC. METHODS: We describe the genomic profiles of 61 ESCC cases from Tanzania and compare them to profiles from an existing cohort of ESCC cases from Malawi. We also provide a comparison to ESCC tumors in The Cancer Genome Atlas (TCGA). RESULTS: We observed substantial transcriptional overlap with other squamous histologies via comparison with TCGA PanCan dataset. DNA analysis revealed known mutational patterns, both genome-wide as well as in genes known to be commonly mutated in ESCC. TP53 mutations were the most common somatic mutation in tumors from both Tanzania and Malawi but were detected at lower frequencies than previously reported in ESCC cases from other settings. In a combined analysis, two unique transcriptional clusters were identified: a proliferative/epithelial cluster and an invasive/migrative/mesenchymal cluster. Mutational signature analysis of the Tanzanian cohort revealed common signatures associated with aging and cytidine deaminase activity (APOBEC) and an absence of signature 29, which was previously reported in the Malawi cohort. CONCLUSIONS: This study defines the molecular characteristics of ESCC in Tanzania, and enriches the Eastern African dataset, with findings of overall similarities but also some heterogeneity across two unique sites. IMPACT: Despite a high burden of ESCC in Eastern Africa, investigations into the genomics in this region are nascent. This represents the largest comprehensive genomic analysis ESCC from sub-Saharan Africa to date.


Subject(s)
Carcinoma, Squamous Cell , Esophageal Neoplasms , Esophageal Squamous Cell Carcinoma , Humans , Esophageal Squamous Cell Carcinoma/genetics , Esophageal Neoplasms/epidemiology , Esophageal Neoplasms/genetics , Esophageal Neoplasms/pathology , Carcinoma, Squamous Cell/epidemiology , Carcinoma, Squamous Cell/genetics , Carcinoma, Squamous Cell/pathology , Genomics , Tanzania/epidemiology
13.
Front Genet ; 13: 932763, 2022.
Article in English | MEDLINE | ID: mdl-36147501

ABSTRACT

The clustering of mutations observed in cancer cells is reminiscent of the stress-induced mutagenesis (SIM) response in bacteria. Bacteria deploy SIM when faced with DNA double-strand breaks in the presence of conditions that elicit an SOS response. SIM employs DinB, the evolutionary precursor to human trans-lesion synthesis (TLS) error-prone polymerases, and results in mutations concentrated around DNA double-strand breaks with an abundance that decays with distance. We performed a quantitative study on single nucleotide variant calls for whole-genome sequencing data from 1950 tumors, non-inherited mutations from 129 normal samples, and acquired mutations in 3 cell line models of stress-induced adaptive mutation. We introduce statistical methods to identify mutational clusters, quantify their shapes and tease out the potential mechanism that produced them. Our results show that mutations in both normal and cancer samples are indeed clustered and have shapes indicative of SIM. Clusters in normal samples occur more often in the same genomic location across samples than in cancer suggesting loss of regulation over the mutational process during carcinogenesis. Additionally, the signatures of TLS contribute the most to mutational cluster formation in both patient samples as well as experimental models of SIM. Furthermore, a measure of cluster shape heterogeneity was associated with cancer patient survival with a hazard ratio of 5.744 (Cox Proportional Hazard Regression, 95% CI: 1.824-18.09). Our results support the conclusion that the ancient and evolutionary-conserved adaptive mutation response found in bacteria is a source of genomic instability in cancer. Biological adaptation through SIM might explain the ability of tumors to evolve in the face of strong selective pressures such as treatment and suggests that the conventional 'hit it hard' approaches to therapy could prove themselves counterproductive.

14.
Bioinformatics ; 26(12): i237-45, 2010 Jun 15.
Article in English | MEDLINE | ID: mdl-20529912

ABSTRACT

MOTIVATION: High-throughput data is providing a comprehensive view of the molecular changes in cancer tissues. New technologies allow for the simultaneous genome-wide assay of the state of genome copy number variation, gene expression, DNA methylation and epigenetics of tumor samples and cancer cell lines. Analyses of current data sets find that genetic alterations between patients can differ but often involve common pathways. It is therefore critical to identify relevant pathways involved in cancer progression and detect how they are altered in different patients. RESULTS: We present a novel method for inferring patient-specific genetic activities incorporating curated pathway interactions among genes. A gene is modeled by a factor graph as a set of interconnected variables encoding the expression and known activity of a gene and its products, allowing the incorporation of many types of omic data as evidence. The method predicts the degree to which a pathway's activities (e.g. internal gene states, interactions or high-level 'outputs') are altered in the patient using probabilistic inference. Compared with a competing pathway activity inference approach called SPIA, our method identifies altered activities in cancer-related pathways with fewer false-positives in both a glioblastoma multiform (GBM) and a breast cancer dataset. PARADIGM identified consistent pathway-level activities for subsets of the GBM patients that are overlooked when genes are considered in isolation. Further, grouping GBM patients based on their significant pathway perturbations divides them into clinically-relevant subgroups having significantly different survival outcomes. These findings suggest that therapeutics might be chosen that target genes at critical points in the commonly perturbed pathway(s) of a group of patients. AVAILABILITY: Source code available at http://sbenz.github.com/Paradigm,. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genomics/methods , Neoplasms/genetics , Software , Breast Neoplasms/genetics , DNA Copy Number Variations , Female , Gene Expression Profiling/methods , Glioblastoma/genetics , Humans
15.
J Immunother Cancer ; 9(6)2021 06.
Article in English | MEDLINE | ID: mdl-34172517

ABSTRACT

BACKGROUND: Therapeutic regimens designed to augment the immunological response of a patient with breast cancer (BC) to tumor tissue are critically informed by tumor mutational burden and the antigenicity of expressed neoepitopes. Herein we describe a neoepitope and cognate neoepitope-reactive T-cell identification and validation program that supports the development of next-generation immunotherapies. METHODS: Using GPS Cancer, NantOmics research, and The Cancer Genome Atlas databases, we developed a novel bioinformatic-based approach which assesses mutational load, neoepitope expression, human leukocyte antigen (HLA)-binding prediction, and in vitro confirmation of T-cell recognition to preferentially identify targetable neoepitopes. This program was validated by application to a BC cell line and confirmed using tumor biopsies from two patients with BC enrolled in the Tumor-Infiltrating Lymphocytes and Genomics (TILGen) study. RESULTS: The antigenicity and HLA-A2 restriction of the BC cell line predicted neoepitopes were determined by reactivity of T cells from HLA-A2-expressing healthy donors. For the TILGen subjects, tumor-infiltrating lymphocytes (TILs) recognized the predicted neoepitopes both as peptides and on retroviral expression in HLA-matched Epstein-Barr virus-lymphoblastoid cell line and BC cell line MCF-7 cells; PCR clonotyping revealed the presence of T cells in the periphery with T-cell receptors for the predicted neoepitopes. These high-avidity immune responses were polyclonal, mutation-specific and restricted to either HLA class I or II. Interestingly, we observed the persistence and expansion of polyclonal T-cell responses following neoadjuvant chemotherapy. CONCLUSIONS: We demonstrate our neoepitope prediction program allows for the successful identification of neoepitopes targeted by TILs in patients with BC, providing a means to identify tumor-specific immunogenic targets for individualized treatment, including vaccines or adoptively transferred cellular therapies.


Subject(s)
Antigens, Neoplasm/genetics , Breast Neoplasms/genetics , Histocompatibility Antigens Class I/immunology , Immunotherapy/methods , Female , Humans
16.
PLoS Comput Biol ; 5(1): e1000274, 2009 Jan.
Article in English | MEDLINE | ID: mdl-19180177

ABSTRACT

Complex phenotypes such as the transformation of a normal population of cells into cancerous tissue result from a series of molecular triggers gone awry. We describe a method that searches for a genetic network consistent with expression changes observed under the knock-down of a set of genes that share a common role in the cell, such as a disease phenotype. The method extends the Nested Effects Model of Markowetz et al. (2005) by using a probabilistic factor graph to search for a network representing interactions among these silenced genes. The method also expands the network by attaching new genes at specific downstream points, providing candidates for subsequent perturbations to further characterize the pathway. We investigated an extension provided by the factor graph approach in which the model distinguishes between inhibitory and stimulatory interactions. We found that the extension yielded significant improvements in recovering the structure of simulated and Saccharomyces cerevisae networks. We applied the approach to discover a signaling network among genes involved in a human colon cancer cell invasiveness pathway. The method predicts several genes with new roles in the invasiveness process. We knocked down two genes identified by our approach and found that both knock-downs produce loss of invasive potential in a colon cancer cell line. Nested effects models may be a powerful tool for inferring regulatory connections and genes that operate in normal and disease-related processes.


Subject(s)
Colonic Neoplasms/genetics , Computational Biology/methods , Gene Expression Regulation, Neoplastic , Gene Expression Regulation , Gene Regulatory Networks/physiology , Gene Silencing , Saccharomyces cerevisiae/genetics , Algorithms , Colonic Neoplasms/pathology , Computer Simulation , Data Interpretation, Statistical , HT29 Cells , Humans , Models, Genetic , Neoplasm Invasiveness , Normal Distribution , Oligonucleotide Array Sequence Analysis , Signal Transduction
17.
Sci Rep ; 10(1): 17597, 2020 10 19.
Article in English | MEDLINE | ID: mdl-33077815

ABSTRACT

Transcriptome profiling can provide information of great value in clinical decision-making, yet RNA from readily available formalin-fixed paraffin-embedded (FFPE) tissue is often too degraded for quality sequencing. To assess the clinical utility of FFPE-derived RNA, we performed ribo-deplete RNA extractions on > 3200 FFPE slide samples; 25 of these had direct FFPE vs. fresh frozen (FF) replicates, 57 were sequenced in 2 different labs, 87 underwent multiple library analyses, and 16 had direct microdissected vs. macrodissected replicates. Poly-A versus ribo-depletion RNA extraction methods were compared using transcriptomes of TCGA cohort and 3116 FFPE samples. Compared to FF, FFPE transcripts coding for nuclear/cytoplasmic proteins involved in DNA packaging, replication, and protein synthesis were detected at lower rates and zinc finger family transcripts were of poorer quality. The greatest difference in extraction methods was in histone transcripts which typically lack poly-A tails. Encouragingly, the overall sequencing success rate was 81%. Exome coverage was highly concordant in direct FFPE and FF replicates, with 98% agreement in coding exon coverage and a median correlation of whole transcriptome profiles of 0.95. We provide strong rationale for clinical use of FFPE-derived RNA based on the robustness, reproducibility, and consistency of whole transcriptome profiling.


Subject(s)
Gene Expression Profiling/methods , High-Throughput Nucleotide Sequencing/methods , Computational Biology , Databases, Factual , Humans , Paraffin Embedding , Reproducibility of Results , Sequence Analysis, RNA , Tissue Fixation/methods
18.
JCI Insight ; 5(11)2020 06 04.
Article in English | MEDLINE | ID: mdl-32493840

ABSTRACT

Next-generation sequencing (NGS) has not revealed all the mechanisms underlying resistance to genomically matched drugs. Here, we performed in 1417 tumors whole-exome tumor (somatic)/normal (germline) NGS and whole-transcriptome sequencing, the latter focusing on a clinically oriented 50-gene panel in order to examine transcriptomic silencing of putative driver alterations. In this large-scale study, approximately 13% of the somatic single nucleotide variants (SNVs) were unexpectedly not expressed as RNA; 23% of patients had ≥1 nonexpressed SNV. SNV-bearing genes consistently transcribed were TP53, PIK3CA, and KRAS; those with lower transcription rates were ALK, CSF1R, ERBB4, FLT3, GNAS, HNF1A, KDR, PDGFRA, RET, and SMO. We also determined the frequency of tumor mutations being germline, rather than somatic, in these and an additional 462 tumors with tumor/normal exomes; 33.8% of germline SNVs within the gene panel were rare (not found after filtering through variant information domains) and at risk of being falsely reported as somatic. Both the frequency of silenced variant transcription and the risk of falsely identifying germline mutations as somatic/tumor related are important phenomena. Therefore, transcriptomics is a critical adjunct to genomics when interrogating patient tumors for actionable alterations, because, without expression of the target aberrations, there will likely be therapeutic resistance.


Subject(s)
Gene Silencing , Germ-Line Mutation , Neoplasms , Polymorphism, Single Nucleotide , Transcriptome , Female , High-Throughput Nucleotide Sequencing , Humans , Male , Middle Aged , Neoplasms/genetics , Neoplasms/metabolism , Neoplasms/therapy
19.
Cell Rep ; 33(13): 108562, 2020 12 29.
Article in English | MEDLINE | ID: mdl-33378680

ABSTRACT

Generating mammalian cells with desired mitochondrial DNA (mtDNA) sequences is enabling for studies of mitochondria, disease modeling, and potential regenerative therapies. MitoPunch, a high-throughput mitochondrial transfer device, produces cells with specific mtDNA-nuclear DNA (nDNA) combinations by transferring isolated mitochondria from mouse or human cells into primary or immortal mtDNA-deficient (ρ0) cells. Stable isolated mitochondrial recipient (SIMR) cells isolated in restrictive media permanently retain donor mtDNA and reacquire respiration. However, SIMR fibroblasts maintain a ρ0-like cell metabolome and transcriptome despite growth in restrictive media. We reprogrammed non-immortal SIMR fibroblasts into induced pluripotent stem cells (iPSCs) with subsequent differentiation into diverse functional cell types, including mesenchymal stem cells (MSCs), adipocytes, osteoblasts, and chondrocytes. Remarkably, after reprogramming and differentiation, SIMR fibroblasts molecularly and phenotypically resemble unmanipulated control fibroblasts carried through the same protocol. Thus, our MitoPunch "pipeline" enables the production of SIMR cells with unique mtDNA-nDNA combinations for additional studies and applications in multiple cell types.


Subject(s)
Cellular Reprogramming , Fibroblasts/metabolism , Gene Transfer Techniques , High-Throughput Screening Assays/methods , Mitochondria/genetics , Mitochondria/metabolism , Mitochondria/transplantation , Animals , Cell Differentiation , Cell Line , DNA, Mitochondrial/metabolism , HEK293 Cells , Humans , Induced Pluripotent Stem Cells/metabolism , Metabolome , Mice , Mice, Inbred C57BL , Transcriptome
20.
iScience ; 20: 119-136, 2019 Oct 25.
Article in English | MEDLINE | ID: mdl-31563852

ABSTRACT

DNA accessibility is a key dynamic feature of chromatin regulation that can potentiate transcriptional events and tumor progression. To gain insight into chromatin state across existing tumor data, we improved neural network models for predicting accessibility from DNA sequence and extended them to incorporate a global set of RNA sequencing gene expression inputs. Our expression-informed model expanded the application domain beyond specific tissue types to tissues not present in training and achieved consistently high accuracy in predicting DNA accessibility at promoter and promoter flank regions. We then leveraged our new tool by analyzing the DNA accessibility landscape of promoters across The Cancer Genome Atlas. We show that in lung adenocarcinoma the accessibility perspective uniquely highlights immune pathways inversely correlated with a more open chromatin state and that accessibility patterns learned from even a single tumor type can discriminate immune inflammation across many cancers, often with direct relation to patient prognosis.

SELECTION OF CITATIONS
SEARCH DETAIL