RESUMO
The urea cycle (UC) is the main pathway by which mammals dispose of waste nitrogen. We find that specific alterations in the expression of most UC enzymes occur in many tumors, leading to a general metabolic hallmark termed "UC dysregulation" (UCD). UCD elicits nitrogen diversion toward carbamoyl-phosphate synthetase2, aspartate transcarbamylase, and dihydrooratase (CAD) activation and enhances pyrimidine synthesis, resulting in detectable changes in nitrogen metabolites in both patient tumors and their bio-fluids. The accompanying excess of pyrimidine versus purine nucleotides results in a genomic signature consisting of transversion mutations at the DNA, RNA, and protein levels. This mutational bias is associated with increased numbers of hydrophobic tumor antigens and a better response to immune checkpoint inhibitors independent of mutational load. Taken together, our findings demonstrate that UCD is a common feature of tumors that profoundly affects carcinogenesis, mutagenesis, and immunotherapy response.
Assuntos
Genômica , Metabolômica , Neoplasias/patologia , Ureia/metabolismo , Sistemas de Transporte de Aminoácidos Básicos/metabolismo , Animais , Aspartato Carbamoiltransferase/genética , Aspartato Carbamoiltransferase/metabolismo , Carbamoil Fosfato Sintase (Glutamina-Hidrolizante)/genética , Carbamoil Fosfato Sintase (Glutamina-Hidrolizante)/metabolismo , Linhagem Celular Tumoral , Di-Hidro-Orotase/genética , Di-Hidro-Orotase/metabolismo , Feminino , Humanos , Camundongos , Camundongos Endogâmicos C57BL , Camundongos SCID , Proteínas de Transporte da Membrana Mitocondrial , Neoplasias/metabolismo , Ornitina Carbamoiltransferase/antagonistas & inibidores , Ornitina Carbamoiltransferase/genética , Ornitina Carbamoiltransferase/metabolismo , Fosforilação/efeitos dos fármacos , Pirimidinas/biossíntese , Pirimidinas/química , Interferência de RNA , RNA Interferente Pequeno/metabolismo , Sirolimo/farmacologia , Serina-Treonina Quinases TOR/antagonistas & inibidores , Serina-Treonina Quinases TOR/metabolismoRESUMO
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) poses an immediate, major threat to public health across the globe. Here we report an in-depth molecular analysis to reconstruct the evolutionary origins of the enhanced pathogenicity of SARS-CoV-2 and other coronaviruses that are severe human pathogens. Using integrated comparative genomics and machine learning techniques, we identify key genomic features that differentiate SARS-CoV-2 and the viruses behind the two previous deadly coronavirus outbreaks, SARS-CoV and Middle East respiratory syndrome coronavirus (MERS-CoV), from less pathogenic coronaviruses. These features include enhancement of the nuclear localization signals in the nucleocapsid protein and distinct inserts in the spike glycoprotein that appear to be associated with high case fatality rate of these coronaviruses as well as the host switch from animals to humans. The identified features could be crucial contributors to coronavirus pathogenicity and possible targets for diagnostics, prognostication, and interventions.
Assuntos
Betacoronavirus/genética , Evolução Molecular , Genoma Viral , Proteínas do Nucleocapsídeo/genética , Glicoproteína da Espícula de Coronavírus/genética , Animais , Betacoronavirus/classificação , Betacoronavirus/patogenicidade , Especificidade de Hospedeiro , Humanos , Aprendizado de Máquina , Coronavírus da Síndrome Respiratória do Oriente Médio/classificação , Coronavírus da Síndrome Respiratória do Oriente Médio/genética , Coronavírus da Síndrome Respiratória do Oriente Médio/patogenicidade , Mutagênese Insercional , Sinais de Localização Nuclear/genética , Proteínas do Nucleocapsídeo/química , Filogenia , SARS-CoV-2 , Homologia de Sequência , Glicoproteína da Espícula de Coronavírus/química , Virulência/genéticaRESUMO
PURPOSE: Preoperative (neoadjuvant) chemoradiotherapy (CRT) and total mesorectal excision is the standard treatment for rectal cancer patients (UICC stage II/III). Up to one-third of patients treated with CRT achieve a pathological complete response (pCR). These patients could be spared from surgery and its associated morbidity and mortality, and assigned to a "watch and wait" strategy. However, reliably identifying pCR based on clinical or imaging parameters remains challenging. EXPERIMENTAL DESIGN: We generated gene-expression profiles of 175 patients with locally advanced rectal cancer enrolled in the CAO/ARO/AIO-94 and -04 trials. One hundred and sixty-one samples were used for building, training and validating a predictor of pCR using a machine learning algorithm. The performance of the classifier was validated in three independent cohorts, comprising 76 patients from (i) the CAO/ARO/AIO-94 and -04 trials (n = 14), (ii) a publicly available dataset (n = 38) and (iii) in 24 prospectively collected samples from the TransValid A trial. RESULTS: A 21-transcript signature yielded the best classification of pCR in 161 patients (Sensitivity: 0.31; AUC: 0.81), when not allowing misclassification of non-complete-responders (False-positive rate = 0). The classifier remained robust when applied to three independent datasets (n = 76). CONCLUSION: The classifier can identify >1/3 of rectal cancer patients with a pCR while never classifying patients with an incomplete response as having pCR. Importantly, we could validate this finding in three independent datasets, including a prospectively collected cohort. Therefore, this classifier could help select rectal cancer patients for a "watch and wait" strategy. TRANSLATIONAL RELEVANCE: Forgoing surgery with its associated side effects could be an option for rectal cancer patients if the prediction of a pathological complete response (pCR) after preoperative chemoradiotherapy would be possible. Based on gene-expression profiles of 161 patients a classifier was developed and validated in three independent datasets (n = 76), identifying over 1/3 of patients with pCR, while never misclassifying a non-complete-responder. Therefore, the classifier can identify patients suited for "watch and wait".
Assuntos
Quimiorradioterapia , Neoplasias Retais , Biópsia , Ensaios Clínicos como Assunto , Humanos , Terapia Neoadjuvante , Neoplasias Retais/genética , Neoplasias Retais/patologia , Neoplasias Retais/terapia , Resultado do TratamentoRESUMO
Recent advances in metagenomic sequencing have enabled discovery of diverse, distinct microbes and viruses. Bacteriophages, the most abundant biological entity on Earth, evolve rapidly, and therefore, detection of unknown bacteriophages in sequence datasets is a challenge. Most of the existing detection methods rely on sequence similarity to known bacteriophage sequences, impeding the identification and characterization of distinct, highly divergent bacteriophage families. Here we present Seeker, a deep-learning tool for alignment-free identification of phage sequences. Seeker allows rapid detection of phages in sequence datasets and differentiation of phage sequences from bacterial ones, even when those phages exhibit little sequence similarity to established phage families. We comprehensively validate Seeker's ability to identify previously unidentified phages, and employ this method to detect unknown phages, some of which are highly divergent from the known phage families. We provide a web portal (seeker.pythonanywhere.com) and a user-friendly Python package (github.com/gussow/seeker) allowing researchers to easily apply Seeker in metagenomic studies, for the detection of diverse unknown bacteriophages.
Assuntos
Bactérias/virologia , Bacteriófagos/genética , DNA Viral/genética , Genoma Viral , Metagenoma , Software , Bactérias/genética , Bacteriófagos/classificação , Evolução Biológica , Aprendizado Profundo , Humanos , Metagenômica/métodos , Filogenia , Análise de Sequência de DNARESUMO
Cancer arises through the accumulation of somatic mutations over time. Understanding the sequence of mutation occurrence during cancer progression can assist early and accurate diagnosis and improve clinical decision-making. Here we employ long short-term memory (LSTM) networks, a class of recurrent neural network, to learn the evolution of a tumor through an ordered sequence of mutations. We demonstrate the capacity of LSTMs to learn complex dynamics of the mutational time series governing tumor progression, allowing accurate prediction of the mutational burden and the occurrence of mutations in the sequence. Using the probabilities learned by the LSTM, we simulate mutational data and show that the simulation results are statistically indistinguishable from the empirical data. We identify passenger mutations that are significantly associated with established cancer drivers in the sequence and demonstrate that the genes carrying these mutations are substantially enriched in interactions with the corresponding driver genes. Breaking the network into modules consisting of driver genes and their interactors, we show that these interactions are associated with poor patient prognosis, thus likely conferring growth advantage for tumor progression. Thus, application of LSTM provides for prediction of numerous additional conditional drivers and reveals hitherto unknown aspects of cancer evolution.
Assuntos
Simulação por Computador , Bases de Dados Genéticas , Evolução Molecular , Modelos Genéticos , Mutação , Neoplasias/genética , Algoritmos , Humanos , Redes Neurais de ComputaçãoRESUMO
Modifier genes are believed to account for the clinical variability observed in many Mendelian disorders, but their identification remains challenging due to the limited availability of genomics data from large patient cohorts. Here, we present GENDULF (GENetic moDULators identiFication), one of the first methods to facilitate prediction of disease modifiers using healthy and diseased tissue gene expression data. GENDULF is designed for monogenic diseases in which the mechanism is loss of function leading to reduced expression of the mutated gene. When applied to cystic fibrosis, GENDULF successfully identifies multiple, previously established disease modifiers, including EHF, SLC6A14, and CLCA1. It is then utilized in spinal muscular atrophy (SMA) and predicts U2AF1 as a modifier whose low expression correlates with higher SMN2 pre-mRNA exon 7 retention. Indeed, knockdown of U2AF1 in SMA patient-derived cells leads to increased full-length SMN2 transcript and SMN protein expression. Taking advantage of the increasing availability of transcriptomic data, GENDULF is a novel addition to existing strategies for prediction of genetic disease modifiers, providing insights into disease pathogenesis and uncovering novel therapeutic targets.
Assuntos
Algoritmos , Mineração de Dados , Doença/genética , Genes Modificadores , Transcriptoma/genética , Estudos de Associação Genética , Ligação Genética , Células HEK293 , Humanos , Reprodutibilidade dos TestesRESUMO
BACKGROUND: A crucial factor in mitigating respiratory viral outbreaks is early determination of the duration of the incubation period and, accordingly, the required quarantine time for potentially exposed individuals. At the time of the COVID-19 pandemic, optimization of quarantine regimes becomes paramount for public health, societal well-being, and global economy. However, biological factors that determine the duration of the virus incubation period remain poorly understood. RESULTS: We demonstrate a strong positive correlation between the length of the incubation period and disease severity for a wide range of human pathogenic viruses. Using a machine learning approach, we develop a predictive model that accurately estimates, solely from several virus genome features, in particular, the number of protein-coding genes and the GC content, the incubation time ranges for diverse human pathogenic RNA viruses including SARS-CoV-2. The predictive approach described here can directly help in establishing the appropriate quarantine durations and thus facilitate controlling future outbreaks. CONCLUSIONS: The length of the incubation period in viral diseases strongly correlates with disease severity, emphasizing the biological and epidemiological importance of the incubation period. Perhaps, surprisingly, incubation times of pathogenic RNA viruses can be accurately predicted solely from generic features of virus genomes. Elucidation of the biological underpinnings of the connections between these features and disease progression can be expected to reveal key aspects of virus pathogenesis.
Assuntos
COVID-19/patologia , COVID-19/virologia , Período de Incubação de Doenças Infecciosas , SARS-CoV-2/genética , Simulação por Computador , Genoma Viral , Humanos , Modelos Biológicos , Mutação , QuarentenaRESUMO
The exponential growth of biomedical data in recent years has urged the application of numerous machine learning techniques to address emerging problems in biology and clinical research. By enabling the automatic feature extraction, selection, and generation of predictive models, these methods can be used to efficiently study complex biological systems. Machine learning techniques are frequently integrated with bioinformatic methods, as well as curated databases and biological networks, to enhance training and validation, identify the best interpretable features, and enable feature and model investigation. Here, we review recently developed methods that incorporate machine learning within the same framework with techniques from molecular evolution, protein structure analysis, systems biology, and disease genomics. We outline the challenges posed for machine learning, and, in particular, deep learning in biomedicine, and suggest unique opportunities for machine learning techniques integrated with established bioinformatics approaches to overcome some of these challenges.
Assuntos
Biologia Computacional/tendências , Bases de Dados Factuais/tendências , Aprendizado de Máquina/tendências , Biologia de Sistemas/tendências , Algoritmos , HumanosRESUMO
Metabolic alterations play an important role in cancer and yet, few metabolic cancer driver genes are known. Here we perform a combined genomic and metabolic modeling analysis searching for metabolic drivers of colorectal cancer. Our analysis predicts FUT9, which catalyzes the biosynthesis of Ley glycolipids, as a driver of advanced-stage colon cancer. Experimental testing reveals FUT9's complex dual role; while its knockdown enhances proliferation and migration in monolayers, it suppresses colon cancer cells expansion in tumorspheres and inhibits tumor development in a mouse xenograft models. These results suggest that FUT9's inhibition may attenuate tumor-initiating cells (TICs) that are known to dominate tumorspheres and early tumor growth, but promote bulk tumor cells. In agreement, we find that FUT9 silencing decreases the expression of the colorectal cancer TIC marker CD44 and the level of the OCT4 transcription factor, which is known to support cancer stemness. Beyond its current application, this work presents a novel genomic and metabolic modeling computational approach that can facilitate the systematic discovery of metabolic driver genes in other types of cancer.
Assuntos
Neoplasias Colorretais/metabolismo , Biologia Computacional/métodos , Fucosiltransferases/metabolismo , Algoritmos , Animais , Carcinogênese/metabolismo , Carcinogênese/patologia , Linhagem Celular Tumoral , Neoplasias Colorretais/genética , Modelos Animais de Doenças , Fucosiltransferases/genética , Técnicas de Silenciamento de Genes , Genes Supressores de Tumor , Genômica , Humanos , Camundongos Endogâmicos NOD , Camundongos SCID , Invasividade Neoplásica , Células-Tronco Neoplásicas/metabolismo , Células-Tronco Neoplásicas/patologiaRESUMO
Altered cellular metabolism is an important characteristic and driver of cancer. Surprisingly, however, we find here that aggregating individual gene expression using canonical metabolic pathways fails to enhance the classification of noncancerous vs. cancerous tissues and the prediction of cancer patient survival. This supports the notion that metabolic alterations in cancer rewire cellular metabolism through unconventional pathways. Here we present MCF (Metabolic classifier and feature generator), which incorporates gene expression measurements into a human metabolic network to infer new cancer-mediated pathway compositions that enhance cancer vs. adjacent noncancerous tissue classification across five different cancer types. MCF outperforms standard classifiers based on individual gene expression and on canonical human curated metabolic pathways. It successfully builds robust classifiers integrating different datasets of the same cancer type. Reassuringly, the MCF pathways identified lead to metabolites known to be associated with the pertaining specific cancer types. Aggregating gene expression through MCF pathways leads to markedly better predictions of breast cancer patients' survival in an independent cohort than using the canonical human metabolic pathways (C-index = 0.69 vs. 0.52, respectively). Notably, the survival predictive power of individual MCF pathways strongly correlates with their power in predicting cancer vs. noncancerous samples. The more predictive composite pathways identified via MCF are hence more likely to capture key metabolic alterations occurring in cancer than the canonical pathways characterizing healthy human metabolism.
RESUMO
S-acyltransferases play integral roles in essential physiological processes including regulation of oncogenic signaling pathways. While discovered over 40 years ago the field still lacks specific S-acylation inhibitors thus the potential benefit of pharmacologically targeting S-acyltransferases for human disease is still unknown. Here we report the identification of an orally bioavailable acyltransferase inhibitor SD-066-4 that inhibits the acyltransferase ZDHHC20. We identified a specific alanine residue that accommodates the methyl group of SD-066-4, thus providing isoform selectivity. SD-066-4 stably reduces EGFR S-acylation in Kras mutant cells and blocks the growth of Kras mutant lung tumors extending overall survival. We find that lung cancer patients harboring deletions in ZDHHC20 or ZDHHC14 concurrent with Kras alterations have a significant survival benefit, underscoring the translational importance of these enzymes.
RESUMO
The activation of innate immunity and associated interferon (IFN) signaling have been implicated in cancer, but the regulators are elusive and a link to tumor suppression undetermined. Here, we found that Parkin, an E3 ubiquitin ligase altered in Parkinson's Disease was epigenetically silenced in cancer and its re-expression by clinically approved demethylating therapy stimulated transcription of a potent IFN response in tumor cells. This pathway required Parkin E3 ubiquitin ligase activity, involved the subcellular trafficking and release of the alarmin High Mobility Group Box 1 (HMGB1) and was associated with inhibition of NFκB gene expression. In turn, Parkin-expressing cells released an IFN secretome that upregulated effector and cytotoxic CD8 T cell markers, lowered the expression of immune inhibitory receptors, TIM3 and LAG3, and stimulated high content of the self-renewal/stem cell factor, TCF1. Parkin-induced CD8 T cells selectively accumulated in the microenvironment and inhibited transgenic and syngeneic tumor growth, in vivo. Therefore, Parkin is an epigenetically regulated activator of innate immunity and dual mode tumor suppressor, inhibiting intrinsic tumor traits of metabolism and cell invasion, while simultaneously reinvigorating CD8 T cell functions in the microenvironment.
RESUMO
Epstein-Barr virus (EBV) is an aetiologic risk factor for the development of multiple sclerosis (MS). However, the role of EBV-infected B cells in the immunopathology of MS is not well understood. Here we characterized spontaneous lymphoblastoid cell lines (SLCLs) isolated from MS patients and healthy controls (HC) ex vivo to study EBV and host gene expression in the context of an individual's endogenous EBV. SLCLs derived from MS patient B cells during active disease had higher EBV lytic gene expression than SLCLs from MS patients with stable disease or HCs. Host gene expression analysis revealed activation of pathways associated with hypercytokinemia and interferon signalling in MS SLCLs and upregulation of forkhead box protein 1 (FOXP1), which contributes to EBV lytic gene expression. We demonstrate that antiviral approaches targeting EBV replication decreased cytokine production and autologous CD4+ T cell responses in this ex vivo model. These data suggest that dysregulation of intrinsic B cell control of EBV gene expression drives a pro-inflammatory, pathogenic B cell phenotype that can be attenuated by suppressing EBV lytic gene expression.
Assuntos
Linfócitos B , Infecções por Vírus Epstein-Barr , Herpesvirus Humano 4 , Esclerose Múltipla , Humanos , Herpesvirus Humano 4/genética , Esclerose Múltipla/virologia , Esclerose Múltipla/imunologia , Esclerose Múltipla/genética , Esclerose Múltipla/metabolismo , Linfócitos B/imunologia , Linfócitos B/metabolismo , Linfócitos B/virologia , Infecções por Vírus Epstein-Barr/virologia , Infecções por Vírus Epstein-Barr/imunologia , Infecções por Vírus Epstein-Barr/genética , Infecções por Vírus Epstein-Barr/complicações , Citocinas/metabolismo , Linfócitos T CD4-Positivos/imunologia , Linfócitos T CD4-Positivos/virologia , Linfócitos T CD4-Positivos/metabolismo , Fatores de Transcrição Forkhead/genética , Fatores de Transcrição Forkhead/metabolismo , Transcriptoma , Replicação Viral , Regulação Viral da Expressão Gênica , Linhagem Celular , Interações Hospedeiro-Patógeno/genética , Interações Hospedeiro-Patógeno/imunologia , Perfilação da Expressão Gênica , Adulto , Feminino , MasculinoRESUMO
Ovarian cancer remains a major health threat with limited treatment options available. It is characterized by immunosuppressive tumor microenvironment (TME) maintained by tumor- associated macrophages (TAMs) hindering anti-tumor responses and immunotherapy efficacy. Here we show that targeting retinoblastoma protein (Rb) by disruption of its LxCxE cleft pocket, causes cell death in TAMs by induction of ER stress, p53 and mitochondria-related cell death pathways. A reduction of pro-tumor Rb high M2-type macrophages from TME in vivo enhanced T cell infiltration and inhibited cancer progression. We demonstrate an increased Rb expression in TAMs in women with ovarian cancer is associated with poorer prognosis. Ex vivo, we show analogous cell death induction by therapeutic Rb targeting in TAMs in post-surgery ascites from ovarian cancer patients. Overall, our data elucidates therapeutic targeting of the Rb LxCxE cleft pocket as a novel promising approach for ovarian cancer treatment through depletion of TAMs and re-shaping TME immune landscape. Statement of significance: Currently, targeting immunosuppressive myeloid cells in ovarian cancer microenvironment is the first priority need to enable successful immunotherapy, but no effective solutions are clinically available. We show that targeting LxCxE cleft pocket of Retinoblastoma protein unexpectedly induces preferential cell death in M2 tumor-associated macrophages. Depletion of immunosuppressive M2 tumor-associated macrophages reshapes tumor microenvironment, enhances anti-tumor T cell responses, and inhibits ovarian cancer. Thus, we identify a novel paradoxical function of Retinoblastoma protein in regulating macrophage viability as well as a promising target to enhance immunotherapy efficacy in ovarian cancer.
RESUMO
Since the rise of next-generation sequencing technologies, the catalogue of mutations in cancer has been continuously expanding. To address the complexity of the cancer-genomic landscape and extract meaningful insights, numerous computational approaches have been developed over the last two decades. In this review, we survey the current leading computational methods to derive intricate mutational patterns in the context of clinical relevance. We begin with mutation signatures, explaining first how mutation signatures were developed and then examining the utility of studies using mutation signatures to correlate environmental effects on the cancer genome. Next, we examine current clinical research that employs mutation signatures and discuss the potential use cases and challenges of mutation signatures in clinical decision-making. We then examine computational studies developing tools to investigate complex patterns of mutations beyond the context of mutational signatures. We survey methods to identify cancer-driver genes, from single-driver studies to pathway and network analyses. In addition, we review methods inferring complex combinations of mutations for clinical tasks and using mutations integrated with multi-omics data to better predict cancer phenotypes. We examine the use of these tools for either discovery or prediction, including prediction of tumor origin, treatment outcomes, prognosis, and cancer typing. We further discuss the main limitations preventing widespread clinical integration of computational tools for the diagnosis and treatment of cancer. We end by proposing solutions to address these challenges using recent advances in machine learning.
RESUMO
Local microbiome shifts are implicated in the development and progression of gastrointestinal cancers, and in particular, esophageal carcinoma (ESCA), which is among the most aggressive malignancies. Short-read RNA sequencing (RNAseq) is currently the leading technology to study gene expression changes in cancer. However, using RNAseq to study microbial gene expression is challenging. Here, we establish a new tool to efficiently detect viral and bacterial expression in human tissues through RNAseq. This approach employs a neural network to predict reads of likely microbial origin, which are targeted for assembly into longer contigs, improving identification of microbial species and genes. This approach is applied to perform a systematic comparison of bacterial expression in ESCA and healthy esophagi. We uncover bacterial genera that are over or underabundant in ESCA vs healthy esophagi both before and after correction for possible covariates, including patient metadata. However, we find that bacterial taxonomies are not significantly associated with clinical outcomes. Strikingly, in contrast, dozens of microbial proteins were significantly associated with poor patient outcomes and in particular, proteins that perform mitochondrial functions and iron-sulfur coordination. We further demonstrate associations between these microbial proteins and dysregulated host pathways in ESCA patients. Overall, these results suggest possible influences of bacteria on the development of ESCA and uncover new prognostic biomarkers based on microbial genes. In addition, this study provides a framework for the analysis of other human malignancies whose development may be driven by pathogens.
RESUMO
Epidemiological studies have demonstrated that Epstein-Barr virus (EBV) is a known etiologic risk factor, and perhaps prerequisite, for the development of MS. EBV establishes life-long latent infection in a subpopulation of memory B cells. Although the role of memory B cells in the pathobiology of MS is well established, studies characterizing EBV-associated mechanisms of B cell inflammation and disease pathogenesis in EBV (+) B cells from MS patients are limited. Accordingly, we analyzed spontaneous lymphoblastoid cell lines (SLCLs) from multiple sclerosis patients and healthy controls to study host-virus interactions in B cells, in the context of an individual's endogenous EBV. We identify differences in EBV gene expression and regulation of both viral and cellular genes in SLCLs. Our data suggest that EBV latency is dysregulated in MS SLCLs with increased lytic gene expression observed in MS patient B cells, especially those generated from samples obtained during "active" disease. Moreover, we show increased inflammatory gene expression and cytokine production in MS patient SLCLs and demonstrate that tenofovir alafenamide, an antiviral that targets EBV replication, decreases EBV viral loads, EBV lytic gene expression, and EBV-mediated inflammation in both SLCLs and in a mixed lymphocyte assay. Collectively, these data suggest that dysregulation of EBV latency in MS drives a pro-inflammatory, pathogenic phenotype in memory B cells and that this response can be attenuated by suppressing EBV lytic activation. This study provides further support for the development of antiviral agents that target EBV-infection for use in MS.
RESUMO
About 15% of human cancer cases are attributed to viral infections. To date, virus expression in tumor tissues has been mostly studied by aligning tumor RNA sequencing reads to databases of known viruses. To allow identification of divergent viruses and rapid characterization of the tumor virome, we develop viRNAtrap, an alignment-free pipeline to identify viral reads and assemble viral contigs. We utilize viRNAtrap, which is based on a deep learning model trained to discriminate viral RNAseq reads, to explore viral expression in cancers and apply it to 14 cancer types from The Cancer Genome Atlas (TCGA). Using viRNAtrap, we uncover expression of unexpected and divergent viruses that have not previously been implicated in cancer and disclose human endogenous viruses whose expression is associated with poor overall survival. The viRNAtrap pipeline provides a way forward to study viral infections associated with different clinical conditions.
Assuntos
Aprendizado Profundo , Neoplasias , Vírus , Humanos , Neoplasias/genética , Vírus/genética , Genoma Viral , Sequenciamento de Nucleotídeos em Larga EscalaRESUMO
TP53 is the most frequently mutated gene in cancer, yet key target genes for p53-mediated tumor suppression remain unidentified. Here, we characterize a rare, African-specific germline variant of TP53 in the DNA-binding domain Tyr107His (Y107H). Nuclear magnetic resonance and crystal structures reveal that Y107H is structurally similar to wild-type p53. Consistent with this, we find that Y107H can suppress tumor colony formation and is impaired for the transactivation of only a small subset of p53 target genes; this includes the epigenetic modifier PADI4, which deiminates arginine to the nonnatural amino acid citrulline. Surprisingly, we show that Y107H mice develop spontaneous cancers and metastases and that Y107H shows impaired tumor suppression in two other models. We show that PADI4 is itself tumor suppressive and that it requires an intact immune system for tumor suppression. We identify a p53-PADI4 gene signature that is predictive of survival and the efficacy of immune-checkpoint inhibitors. SIGNIFICANCE: We analyze the African-centric Y107H hypomorphic variant and show that it confers increased cancer risk; we use Y107H in order to identify PADI4 as a key tumor-suppressive p53 target gene that contributes to an immune modulation signature and that is predictive of cancer survival and the success of immunotherapy. See related commentary by Bhatta and Cooks, p. 1518. This article is highlighted in the In This Issue feature, p. 1501.
Assuntos
Genes p53 , Neoplasias , Proteína Supressora de Tumor p53 , Animais , Humanos , Camundongos , População Africana/genética , Neoplasias/genética , Proteína Supressora de Tumor p53/metabolismoRESUMO
Immune Checkpoint Inhibitor (ICI) therapy has revolutionized treatment for advanced melanoma; however, only a subset of patients benefit from this treatment. Despite considerable efforts, the Tumor Mutation Burden (TMB) is the only FDA-approved biomarker in melanoma. However, the mechanisms underlying TMB association with prolonged ICI survival are not entirely understood and may depend on numerous confounding factors. To identify more interpretable ICI response biomarkers based on tumor mutations, we train classifiers using mutations within distinct biological processes. We evaluate a variety of feature selection and classification methods and identify key mutated biological processes that provide improved predictive capability compared to the TMB. The top mutated processes we identify are leukocyte and T-cell proliferation regulation, which demonstrate stable predictive performance across different data cohorts of melanoma patients treated with ICI. This study provides biologically interpretable genomic predictors of ICI response with substantially improved predictive performance over the TMB.