RESUMO
Transcript alterations often result from somatic changes in cancer genomes1. Various forms of RNA alterations have been described in cancer, including overexpression2, altered splicing3 and gene fusions4; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relatively small cohorts of patients for whom samples have been analysed by both transcriptome and whole-genome sequencing. Here we present, to our knowledge, the most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)5. Using matched whole-genome sequencing data, we associated several categories of RNA alterations with germline and somatic DNA alterations, and identified probable genetic mechanisms. Somatic copy-number alterations were the major drivers of variations in total gene and allele-specific expression. We identified 649 associations of somatic single-nucleotide variants with gene expression in cis, of which 68.4% involved associations with flanking non-coding regions of the gene. We found 1,900 splicing alterations associated with somatic mutations, including the formation of exons within introns in proximity to Alu elements. In addition, 82% of gene fusions were associated with structural variants, including 75 of a new class, termed 'bridged' fusions, in which a third genomic location bridges two genes. We observed transcriptomic alteration signatures that differ between cancer types and have associations with variations in DNA mutational signatures. This compendium of RNA alterations in the genomic context provides a rich resource for identifying genes and mechanisms that are functionally implicated in cancer.
Assuntos
Regulação Neoplásica da Expressão Gênica , Neoplasias/genética , RNA/genética , Variações do Número de Cópias de DNA , DNA de Neoplasias , Genoma Humano , Genômica , Humanos , TranscriptomaRESUMO
The activation of memory T cells is a very rapid and concerted cellular response that requires coordination between cellular processes in different compartments and on different time scales. In this study, we use ribosome profiling and deep RNA sequencing to define the acute mRNA translation changes in CD8 memory T cells following initial activation events. We find that initial translation enables subsequent events of human and mouse T cell activation and expansion. Briefly, early events in the activation of Ag-experienced CD8 T cells are insensitive to transcriptional blockade with actinomycin D, and instead depend on the translation of pre-existing mRNAs and are blocked by cycloheximide. Ribosome profiling identifies â¼92 mRNAs that are recruited into ribosomes following CD8 T cell stimulation. These mRNAs typically have structured GC and pyrimidine-rich 5' untranslated regions and they encode key regulators of T cell activation and proliferation such as Notch1, Ifngr1, Il2rb, and serine metabolism enzymes Psat1 and Shmt2 (serine hydroxymethyltransferase 2), as well as translation factors eEF1a1 (eukaryotic elongation factor α1) and eEF2 (eukaryotic elongation factor 2). The increased production of receptors of IL-2 and IFN-γ precedes the activation of gene expression and augments cellular signals and T cell activation. Taken together, we identify an early RNA translation program that acts in a feed-forward manner to enable the rapid and dramatic process of CD8 memory T cell expansion and activation.
Assuntos
Glicina Hidroximetiltransferase , Interleucina-2 , Regiões 5' não Traduzidas , Animais , Linfócitos T CD8-Positivos , Cicloeximida/metabolismo , Dactinomicina/metabolismo , Glicina Hidroximetiltransferase/genética , Glicina Hidroximetiltransferase/metabolismo , Humanos , Memória Imunológica , Interleucina-2/metabolismo , Ativação Linfocitária , Células T de Memória , Camundongos , Fator 2 de Elongação de Peptídeos/genética , Fator 2 de Elongação de Peptídeos/metabolismo , Fatores de Alongamento de Peptídeos/genética , Pirimidinas/metabolismo , RNA Mensageiro/genética , Serina/genéticaRESUMO
Science journalism is a critical way for the public to learn about and benefit from scientific findings. Such journalism shapes the public's view of the current state of science and legitimizes experts. Journalists can only cite and quote a limited number of sources, who they may discover in their research, including recommendations by other scientists. Biases in either process may influence who is identified and ultimately included as a source. To examine potential biases in science journalism, we analyzed 22,001 non-research articles published by Nature and compared these with Nature-published research articles with respect to predicted gender and name origin. We extracted cited authors' names and those of quoted speakers. While citations and quotations within a piece do not reflect the entire information-gathering process, they can provide insight into the demographics of visible sources. We then predicted gender and name origin of the cited authors and speakers. We compared articles with a comparator set made up of first and last authors within primary research articles in Nature and a subset of Springer Nature articles in the same time period. In our analysis, we found a skew toward quoting men in Nature science journalism. However, quotation is trending toward equal representation at a faster rate than authorship rates in academic publishing. Gender disparity in Nature quotes was dependent on the article type. We found a significant over-representation of names with predicted Celtic/English origin and under-representation of names with a predicted East Asian origin in both in extracted quotes and journal citations but dampened in citations.
Assuntos
Jornalismo , Humanos , Masculino , Feminino , Ciência , Autoria , Fatores Sexuais , Publicações Periódicas como Assunto/estatística & dados numéricos , Bibliometria , Sexismo/estatística & dados numéricosRESUMO
While single-cell experiments provide deep cellular resolution within a single sample, some single-cell experiments are inherently more challenging than bulk experiments due to dissociation difficulties, cost, or limited tissue availability. This creates a situation where we have deep cellular profiles of one sample or condition, and bulk profiles across multiple samples and conditions. To bridge this gap, we propose BuDDI (BUlk Deconvolution with Domain Invariance). BuDDI utilizes domain adaptation techniques to effectively integrate available corpora of case-control bulk and reference scRNA-seq observations to infer cell-type-specific perturbation effects. BuDDI achieves this by learning independent latent spaces within a single variational autoencoder (VAE) encompassing at least four sources of variability: 1) cell type proportion, 2) perturbation effect, 3) structured experimental variability, and 4) remaining variability. Since each latent space is encouraged to be independent, we simulate perturbation responses by independently composing each latent space to simulate cell-type-specific perturbation responses. We evaluated BuDDI's performance on simulated and real data with experimental designs of increasing complexity. We first validated that BuDDI could learn domain invariant latent spaces on data with matched samples across each source of variability. Then we validated that BuDDI could accurately predict cell-type-specific perturbation response when no single-cell perturbed profiles were used during training; instead, only bulk samples had both perturbed and non-perturbed observations. Finally, we validated BuDDI on predicting sex-specific differences, an experimental design where it is not possible to have matched samples. In each experiment, BuDDI outperformed all other comparative methods and baselines. As more reference atlases are completed, BuDDI provides a path to combine these resources with bulk-profiled treatment or disease signatures to study perturbations, sex differences, or other factors at single-cell resolution.
RESUMO
BACKGROUND: High-grade serous carcinoma (HGSC) gene expression subtypes are associated with differential survival. We characterized HGSC gene expression in Black individuals and considered whether gene expression differences by self-identified race may contribute to poorer HGSC survival among Black versus White individuals. METHODS: We included newly generated RNA sequencing data from Black and White individuals and array-based genotyping data from four existing studies of White and Japanese individuals. We used K-means clustering, a method with no predefined number of clusters or dataset-specific features, to assign subtypes. Cluster- and dataset-specific gene expression patterns were summarized by moderated t-scores. We compared cluster-specific gene expression patterns across datasets by calculating the correlation between the summarized vectors of moderated t-scores. After mapping to The Cancer Genome Atlas-derived HGSC subtypes, we used Cox proportional hazards models to estimate subtype-specific survival by dataset. RESULTS: Cluster-specific gene expression was similar across gene expression platforms and racial groups. Comparing the Black population with the White and Japanese populations, the immunoreactive subtype was more common (39% vs. 23%-28%) and the differentiated subtype was less common (7% vs. 22%-31%). Patterns of subtype-specific survival were similar between the Black and White populations with RNA sequencing data; compared with mesenchymal cases, the risk of death was similar for proliferative and differentiated cases and suggestively lower for immunoreactive cases [Black population HR = 0.79 (0.55, 1.13); White population HR = 0.86 (0.62, 1.19)]. CONCLUSIONS: Although the prevalence of HGSC subtypes varied by race, subtype-specific survival was similar. IMPACT: HGSC subtypes can be consistently assigned across platforms and self-identified racial groups.
Assuntos
Cistadenocarcinoma Seroso , Neoplasias Ovarianas , Humanos , Feminino , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/etnologia , Neoplasias Ovarianas/patologia , Neoplasias Ovarianas/mortalidade , Cistadenocarcinoma Seroso/genética , Cistadenocarcinoma Seroso/patologia , Cistadenocarcinoma Seroso/etnologia , Cistadenocarcinoma Seroso/mortalidade , Pessoa de Meia-Idade , População Branca/genética , População Branca/estatística & dados numéricos , Gradação de Tumores , Idoso , Negro ou Afro-Americano/genética , Negro ou Afro-Americano/estatística & dados numéricosRESUMO
Introduction: High-grade serous carcinoma (HGSC) gene expression subtypes are associated with differential survival. We characterized HGSC gene expression in Black individuals and considered whether gene expression differences by race may contribute to poorer HGSC survival among Black versus non-Hispanic White individuals. Methods: We included newly generated RNA-Seq data from Black and White individuals, and array-based genotyping data from four existing studies of White and Japanese individuals. We assigned subtypes using K-means clustering. Cluster- and dataset-specific gene expression patterns were summarized by moderated t-scores. We compared cluster-specific gene expression patterns across datasets by calculating the correlation between the summarized vectors of moderated t-scores. Following mapping to The Cancer Genome Atlas (TCGA)-derived HGSC subtypes, we used Cox proportional hazards models to estimate subtype-specific survival by dataset. Results: Cluster-specific gene expression was similar across gene expression platforms. Comparing the Black study population to the White and Japanese study populations, the immunoreactive subtype was more common (39% versus 23%-28%) and the differentiated subtype less common (7% versus 22%-31%). Patterns of subtype-specific survival were similar between the Black and White populations with RNA-Seq data; compared to mesenchymal cases, the risk of death was similar for proliferative and differentiated cases and suggestively lower for immunoreactive cases (Black population HR=0.79 [0.55, 1.13], White population HR=0.86 [0.62, 1.19]). Conclusions: A single, platform-agnostic pipeline can be used to assign HGSC gene expression subtypes. While the observed prevalence of HGSC subtypes varied by race, subtype-specific survival was similar.
RESUMO
Mutations in the splicing factor SF3B1 are frequently occurring in various cancers and drive tumor progression through the activation of cryptic splice sites in multiple genes. Recent studies also demonstrate a positive correlation between the expression levels of wild-type SF3B1 and tumor malignancy. Here, we demonstrate that SF3B1 is a hypoxia-inducible factor (HIF)-1 target gene that positively regulates HIF1 pathway activity. By physically interacting with HIF1α, SF3B1 facilitates binding of the HIF1 complex to hypoxia response elements (HREs) to activate target gene expression. To further validate the relevance of this mechanism for tumor progression, we show that a reduction in SF3B1 levels via monoallelic deletion of Sf3b1 impedes tumor formation and progression via impaired HIF signaling in a mouse model for pancreatic cancer. Our work uncovers an essential role of SF3B1 in HIF1 signaling, thereby providing a potential explanation for the link between high SF3B1 expression and aggressiveness of solid tumors.
Assuntos
Neoplasias Pancreáticas , Transdução de Sinais , Animais , Linhagem Celular Tumoral , Hipóxia/metabolismo , Fator 1 Induzível por Hipóxia/metabolismo , Subunidade alfa do Fator 1 Induzível por Hipóxia/genética , Subunidade alfa do Fator 1 Induzível por Hipóxia/metabolismo , Camundongos , Neoplasias Pancreáticas/genética , Fosfoproteínas/genética , Fosfoproteínas/metabolismo , Sítios de Splice de RNA , Fatores de Processamento de RNA/genética , Fatores de Processamento de RNA/metabolismo , Neoplasias PancreáticasRESUMO
Translation initiation is orchestrated by the cap binding and 43S preinitiation complexes (PIC). Eukaryotic initiation factor 1A (EIF1A) is essential for recruitment of the ternary complex and for assembling the 43S PIC. Recurrent EIF1AX mutations in papillary thyroid cancers are mutually exclusive with other drivers, including RAS. EIF1AX mutations are enriched in advanced thyroid cancers, where they display a striking co-occurrence with RAS, which cooperates to induce tumorigenesis in mice and isogenic cell lines. The C-terminal EIF1AX-A113splice mutation is the most prevalent in advanced thyroid cancer. EIF1AX-A113splice variants stabilize the PIC and induce ATF4, a sensor of cellular stress, which is co-opted to suppress EIF2α phosphorylation, enabling a general increase in protein synthesis. RAS stabilizes c-MYC, an effect augmented by EIF1AX-A113splice. ATF4 and c-MYC induce expression of amino acid transporters and enhance sensitivity of mTOR to amino acid supply. These mutually reinforcing events generate therapeutic vulnerabilities to MEK, BRD4, and mTOR kinase inhibitors. SIGNIFICANCE: Mutations of EIF1AX, a component of the translation PIC, co-occur with RAS in advanced thyroid cancers and promote tumorigenesis. EIF1AX-A113splice drives an ATF4-induced dephosphorylation of EIF2α, resulting in increased protein synthesis. ATF4 also cooperates with c-MYC to sensitize mTOR to amino acid supply, thus generating vulnerability to mTOR kinase inhibitors. This article is highlighted in the In This Issue feature, p. 151.
Assuntos
Fator 4 Ativador da Transcrição/metabolismo , Processamento Alternativo , Carcinogênese/patologia , Fator de Iniciação 1 em Eucariotos/genética , Mutação , Neoplasias da Glândula Tireoide/patologia , Proteínas ras/genética , Fator 4 Ativador da Transcrição/genética , Animais , Apoptose , Carcinogênese/efeitos dos fármacos , Carcinogênese/genética , Carcinogênese/metabolismo , Proliferação de Células , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Camundongos , Camundongos Endogâmicos NOD , Camundongos SCID , Fosforilação , Biossíntese de Proteínas , Inibidores de Proteínas Quinases/farmacologia , Serina-Treonina Quinases TOR/genética , Serina-Treonina Quinases TOR/metabolismo , Neoplasias da Glândula Tireoide/tratamento farmacológico , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/metabolismo , Células Tumorais Cultivadas , Ensaios Antitumorais Modelo de XenoenxertoRESUMO
BACKGROUND AND OBJECTIVES: We describe and illustrate use of DISTING, a novel web application for computing alternative structurally identifiable linear compartmental models that are input-output indistinguishable from a postulated linear compartmental model. Several computer packages are available for analysing the structural identifiability of such models, but DISTING is the first to be made available for assessing indistinguishability. METHODS: The computational algorithms embedded in DISTING are based on advanced versions of established geometric and algebraic properties of linear compartmental models, embedded in a user-friendly graphic model user interface. Novel computational tools greatly speed up the overall procedure. These include algorithms for Jacobian matrix reduction, submatrix rank reduction, and parallelization of candidate rank computations in symbolic matrix analysis. RESULTS: The application of DISTING to three postulated models with respectively two, three and four compartments is given. The 2-compartment example is used to illustrate the indistinguishability problem; the original (unidentifiable) model is found to have two structurally identifiable models that are indistinguishable from it. The 3-compartment example has three structurally identifiable indistinguishable models. It is found from DISTING that the four-compartment example has five structurally identifiable models indistinguishable from the original postulated model. This example shows that care is needed when dealing with models that have two or more compartments which are neither perturbed nor observed, because the numbering of these compartments may be arbitrary. CONCLUSIONS: DISTING is universally and freely available via the Internet. It is easy to use and circumvents tedious and complicated algebraic analysis previously done by hand.
Assuntos
Algoritmos , Gráficos por Computador , Internet , Software , Simulação por Computador , Humanos , Modelos Lineares , Fígado/efeitos dos fármacos , Preparações Farmacêuticas , Sulfobromoftaleína/química , Biologia de SistemasRESUMO
The Bene Israel Jewish community from West India is a unique population whose history before the 18th century remains largely unknown. Bene Israel members consider themselves as descendants of Jews, yet the identity of Jewish ancestors and their arrival time to India are unknown, with speculations on arrival time varying between the 8th century BCE and the 6th century CE. Here, we characterize the genetic history of Bene Israel by collecting and genotyping 18 Bene Israel individuals. Combining with 486 individuals from 41 other Jewish, Indian and Pakistani populations, and additional individuals from worldwide populations, we conducted comprehensive genome-wide analyses based on FST, principal component analysis, ADMIXTURE, identity-by-descent sharing, admixture linkage disequilibrium decay, haplotype sharing and allele sharing autocorrelation decay, as well as contrasted patterns between the X chromosome and the autosomes. The genetics of Bene Israel individuals resemble local Indian populations, while at the same time constituting a clearly separated and unique population in India. They are unique among Indian and Pakistani populations we analyzed in sharing considerable genetic ancestry with other Jewish populations. Putting together the results from all analyses point to Bene Israel being an admixed population with both Jewish and Indian ancestry, with the genetic contribution of each of these ancestral populations being substantial. The admixture took place in the last millennium, about 19-33 generations ago. It involved Middle-Eastern Jews and was sex-biased, with more male Jewish and local female contribution. It was followed by a population bottleneck and high endogamy, which can lead to increased prevalence of recessive diseases in this population. This study provides an example of how genetic analysis advances our knowledge of human history in cases where other disciplines lack the relevant data to do so.