ABSTRACT
BACKGROUND: The plastid genomes of the green algal order Chlamydomonadales tend to expand their non-coding regions, but this phenomenon is poorly understood. Here we shed new light on organellar genome evolution in Chlamydomonadales by studying a previously unknown non-photosynthetic lineage. We established cultures of two new Polytoma-like flagellates, defined their basic characteristics and phylogenetic position, and obtained complete organellar genome sequences and a transcriptome assembly for one of them. RESULTS: We discovered a novel deeply diverged chlamydomonadalean lineage that has no close photosynthetic relatives and represents an independent case of photosynthesis loss. To accommodate these organisms, we establish the new genus Leontynka, with two species (L. pallida and L. elongata) distinguishable through both their morphological and molecular characteristics. Notable features of the colourless plastid of L. pallida deduced from the plastid genome (plastome) sequence and transcriptome assembly include the retention of ATP synthase, thylakoid-associated proteins, the carotenoid biosynthesis pathway, and a plastoquinone-based electron transport chain, the latter two modules having an obvious functional link to the eyespot present in Leontynka. Most strikingly, the ~362 kbp plastome of L. pallida is by far the largest among the non-photosynthetic eukaryotes investigated to date due to an extreme proliferation of sequence repeats. These repeats are also present in coding sequences, with one repeat type found in the exons of 11 out of 34 protein-coding genes, with up to 36 copies per gene, thus affecting the encoded proteins. The mitochondrial genome of L. pallida is likewise exceptionally large, with its >104 kbp surpassed only by the mitogenome of Haematococcus lacustris among all members of Chlamydomonadales hitherto studied. It is also bloated with repeats, though entirely different from those in the L. pallida plastome, which contrasts with the situation in H. lacustris where both the organellar genomes have accumulated related repeats. Furthermore, the L. pallida mitogenome exhibits an extremely high GC content in both coding and non-coding regions and, strikingly, a high number of predicted G-quadruplexes. CONCLUSIONS: With its unprecedented combination of plastid and mitochondrial genome characteristics, Leontynka pushes the frontiers of organellar genome diversity and is an interesting model for studying organellar genome evolution.
Subject(s)
Chlorophyceae , Chlorophyta , Genome, Plastid , Chlorophyta/genetics , Evolution, Molecular , Photosynthesis/genetics , Phylogeny , PlastidsABSTRACT
BACKGROUND: Departures from the standard genetic code in eukaryotic nuclear genomes are known for only a handful of lineages and only a few genetic code variants seem to exist outside the ciliates, the most creative group in this regard. Most frequent code modifications entail reassignment of the UAG and UAA codons, with evidence for at least 13 independent cases of a coordinated change in the meaning of both codons. However, no change affecting each of the two codons separately has been documented, suggesting the existence of underlying evolutionary or mechanistic constraints. RESULTS: Here, we present the discovery of two new variants of the nuclear genetic code, in which UAG is translated as an amino acid while UAA is kept as a termination codon (along with UGA). The first variant occurs in an organism noticed in a (meta)transcriptome from the heteropteran Lygus hesperus and demonstrated to be a novel insect-dwelling member of Rhizaria (specifically Sainouroidea). This first documented case of a rhizarian with a non-canonical genetic code employs UAG to encode leucine and represents an unprecedented change among nuclear codon reassignments. The second code variant was found in the recently described anaerobic flagellate Iotanema spirale (Metamonada: Fornicata). Analyses of transcriptomic data revealed that I. spirale uses UAG to encode glutamine, similarly to the most common variant of a non-canonical code known from several unrelated eukaryotic groups, including hexamitin diplomonads (also a lineage of fornicates). However, in these organisms, UAA also encodes glutamine, whereas it is the primary termination codon in I. spirale. Along with phylogenetic evidence for distant relationship of I. spirale and hexamitins, this indicates two independent genetic code changes in fornicates. CONCLUSIONS: Our study documents, for the first time, that evolutionary changes of the meaning of UAG and UAA codons in nuclear genomes can be decoupled and that the interpretation of the two codons by the cytoplasmic translation apparatus is mechanistically separable. The latter conclusion has interesting implications for possibilities of genetic code engineering in eukaryotes. We also present a newly developed generally applicable phylogeny-informed method for inferring the meaning of reassigned codons.
Subject(s)
Cell Nucleus/genetics , Codon/genetics , Genetic Code , Animals , Ciliophora/genetics , Evolution, Molecular , Glutamine/genetics , Insecta/parasitology , Leucine/genetics , Open Reading Frames/genetics , Phylogeny , Rhizaria/geneticsABSTRACT
BACKGROUND: Evidence suggests that some human endogenous retroviruses and endogenous retrovirus-like repeats (here collectively ERVs) regulate the expression of neighboring genes in normal and disease states; e.g. the human globin locus is regulated by an ERV9 that coordinates long-range gene switching during hematopoiesis and activates also intergenic transcripts. While complex transcription regulation is associated with integration of certain exogenous retroviruses, comparable regulation sustained by ERVs is less understood. FINDINGS: We analyzed ERV transcription using ERV9 consensus sequences and publically available RNA-sequencing, chromatin immunoprecipitation with sequencing (ChIP-seq) and cap analysis gene expression (CAGE) data from ENCODE. We discovered previously undescribed and advanced transcription regulation mechanisms in several human reference cell lines. We show that regulation by ERVs involves long-ranging activations including complex RNA splicing patterns, and transcription of large unannotated regions ranging in size from several hundred kb to around 1 Mb. Moreover, regulation was found to be cooperatively sustained in some loci by multiple ERVs and also non-LTR repeats. CONCLUSION: Our analyses show that endogenous retroviruses sustain advanced transcription regulation in human cell lines, which shows similarities to complex insertional mutagenesis effects exerted by exogenous retroviruses. By exposing previously undescribed regulation effects, this study should prove useful for understanding fundamental transcription mechanisms resulting from evolutionary acquisition of retroviral sequence in the human genome.
Subject(s)
Endogenous Retroviruses/genetics , Gene Expression Regulation , Genetic Loci , Host-Pathogen Interactions , Cell Line , Humans , RNA Splicing , Transcription, GeneticABSTRACT
BACKGROUND: Insertional mutagenesis screens of retrovirus-induced mouse tumors have proven valuable in human cancer research and for understanding adverse effects of retroviral-based gene therapies. In previous studies, the assignment of mouse genes to individual retroviral integration sites has been based on close proximity and expression patterns of annotated genes at target positions in the genome. We here employed next-generation RNA sequencing to map retroviral-mouse chimeric junctions genome-wide, and to identify local patterns of transcription activation in T-lymphomas induced by the murine leukemia gamma-retrovirus SL3-3. Moreover, to determine epigenetic integration preferences underlying long-range gene activation by retroviruses, the colocalization propensity with common epigenetic enhancer markers (H3K4Me1 and H3K27Ac) of 6,117 integrations derived from end-stage tumors of more than 2,000 mice was examined. RESULTS: We detected several novel mechanisms of retroviral insertional mutagenesis: bidirectional activation of mouse transcripts on opposite sides of a provirus including transcription of unannotated mouse sequence; sense/antisense-type activation of genes located on opposite DNA strands; tandem-type activation of distal genes that are positioned adjacently on the same DNA strand; activation of genes that are not the direct integration targets; combination-type insertional mutagenesis, in which enhancer activation, alternative chimeric splicing and retroviral promoter insertion are induced by a single retrovirus. We also show that irrespective of the distance to transcription start sites, the far majority of retroviruses in end-stage tumors colocalize with H3K4Me1 and H3K27Ac-enriched regions in murine lymphoid tissues. CONCLUSIONS: We expose novel retrovirus-induced host transcription activation patterns that reach beyond a single and nearest annotated gene target. Awareness of this previously undescribed layer of complexity may prove important for elucidation of adverse effects in retroviral-based gene therapies. We also show that wild-type gamma-retroviruses are frequently positioned at enhancers, suggesting that integration into regulatory regions is specific and also subject to positive selection for sustaining long-range gene activation in end-stage tumors. Altogether, this study should prove useful for extrapolating adverse outcomes of retroviral vector therapies, and for understanding fundamental cellular regulatory principles and retroviral biology.
Subject(s)
Leukemia Virus, Murine/genetics , Mutagenesis, Insertional/genetics , Retroviridae/genetics , Transcriptional Activation/genetics , Animals , Epigenesis, Genetic , Genetic Therapy/methods , Genetic Vectors/genetics , Mice , Neoplasms/genetics , Promoter Regions, Genetic , Proviruses/genetics , T-Lymphocytes/metabolism , Transcription Initiation Site , Virus Integration/geneticsABSTRACT
This paper has two interrelated aims. First, it attempts to sketch a preliminary map of economic winners and losers to highlight the emerging economic geographies of the coronavirus pandemic. Second, it aims to explore the links between these emerging economic geographies and the processes of 'financialisation', drawing on the concept of 'financial chains'. Regarding the first aim, the paper argues that the pandemic-induced crisis will exacerbate social inequalities and deepen uneven development at multiple geographical scales. Regarding the second aim, the paper argues that the 'financialisation' perspective in general, and the concept of 'financial chains' in particular, provide useful insights into the crisis and its uneven effects, by shedding light on the complex web of flows of value and power relations established/emerging between the prospective winners and losers. It also highlights the prominent role of debt and debt-based financial chains in shaping economic geographies in times of major global crisis.
ABSTRACT
Circular RNAs (circRNAs) have recently gained substantial attention in the cancer research field where most, including the putative oncogene ciRS-7 (CDR1as), have been proposed to function as competitive endogenous RNAs (ceRNAs) by sponging specific microRNAs. Here, we report the first spatially resolved cellular expression patterns of ciRS-7 in colon cancer and show that ciRS-7 is completely absent in the cancer cells, but highly expressed in stromal cells within the tumor microenvironment. Additionally, our data suggest that this generally apply to classical oncogene-driven adenocarcinomas, but not to other cancers, including malignant melanoma. Moreover, we find that correlations between circRNA and mRNA expression, which are commonly interpreted as evidence of a ceRNA function, can be explained by different cancer-to-stromal cell ratios among the studied tumor specimens. Together, these results have wide implications for future circRNA studies and highlight the importance of spatially resolving expression patterns of circRNAs proposed to function as ceRNAs.
Subject(s)
Colonic Neoplasms/genetics , Gene Expression Regulation, Neoplastic , MicroRNAs/metabolism , RNA, Circular/metabolism , RNA, Long Noncoding/metabolism , Tumor Microenvironment/genetics , Aged , Colonic Neoplasms/pathology , Female , Humans , Male , Middle Aged , Oncogenes/genetics , Prospective Studies , RNA, Circular/genetics , RNA, Long Noncoding/genetics , Spatial AnalysisABSTRACT
Several studies have shown that human endogenous retroviruses and endogenous retrovirus-like repeats (here collectively HERVs) impose direct regulation on human genes through enhancer and promoter motifs present in their long terminal repeats (LTRs). Although chimeric transcription in which novel gene isoforms containing retroviral and human sequence are transcribed from viral promoters are commonly associated with disease, regulation by HERVs is beneficial in other settings; for example, in human testis chimeric isoforms of TP63 induced by an ERV9 LTR protect the male germ line upon DNA damage by inducing apoptosis, whereas in the human globin locus the γ- and ß-globin switch during normal hematopoiesis is mediated by complex interactions of an ERV9 LTR and surrounding human sequence. The advent of deep sequencing or next-generation sequencing (NGS) has revolutionized the way researchers solve important scientific questions and develop novel hypotheses in relation to human genome regulation. We recently applied next-generation paired-end RNA-sequencing (RNA-seq) together with chromatin immunoprecipitation with sequencing (ChIP-seq) to examine ERV9 chimeric transcription in human reference cell lines from Encyclopedia of DNA Elements (ENCODE). This led to the discovery of advanced regulation mechanisms by ERV9s and other HERVs across numerous human loci including transcription of large gene-unannotated genomic regions, as well as cooperative regulation by multiple HERVs and non-LTR repeats such as Alu elements. In this article, well-established examples of human gene regulation by HERVs are reviewed followed by a description of paired-end RNA-seq, and its application in identifying chimeric transcription genome-widely. Based on integrative analyses of RNA-seq and ChIP-seq, data we then present novel examples of regulation by ERV9s of tumor suppressor genes CADM2 and SEMA3A, as well as transcription of an unannotated region. Taken together, this article highlights the high suitability of contemporary sequencing methods in future analyses of human biology in relation to evolutionary acquired retroviruses in the human genome.