ABSTRACT
The 5S rRNA genes are among the most conserved nucleotide sequences across all species. Similar to the 5S preservation we observe the occurrence of 5S-related nonautonomous retrotransposons, so-called Cassandras. Cassandras harbor highly conserved 5S rDNA-related sequences within their long terminal repeats, advantageously providing them with the 5S internal promoter. However, the dynamics of Cassandra retrotransposon evolution in the context of 5S rRNA gene sequence information and structural arrangement are still unclear, especially: (1) do we observe repeated or gradual domestication of the highly conserved 5S promoter by Cassandras and (2) do changes in 5S organization such as in the linked 35S-5S rDNA arrangements impact Cassandra evolution? Here, we show evidence for gradual co-evolution of Cassandra sequences with their corresponding 5S rDNAs. To follow the impact of 5S rDNA variability on Cassandra TEs, we investigate the Asteraceae family where highly variable 5S rDNAs, including 5S promoter shifts and both linked and separated 35S-5S rDNA arrangements have been reported. Cassandras within the Asteraceae mirror 5S rDNA promoter mutations of their host genome, likely as an adaptation to the host's specific 5S transcription factors and hence compensating for evolutionary changes in the 5S rDNA sequence. Changes in the 5S rDNA sequence and in Cassandras seem uncorrelated with linked/separated rDNA arrangements. We place all these observations into the context of angiosperm 5S rDNA-Cassandra evolution, discuss Cassandra's origin hypotheses (single or multiple) and Cassandra's possible impact on rDNA and plant genome organization, giving new insights into the interplay of ribosomal genes and transposable elements.
Subject(s)
RNA, Ribosomal, 5S , Retroelements , RNA, Ribosomal, 5S/genetics , Retroelements/genetics , Genes, rRNA , Base Sequence , DNA, Ribosomal/genetics , Genome, Plant , Mutation , Evolution, MolecularABSTRACT
Although both are salient features of genomes, at first glance ribosomal DNAs and transposable elements are genetic elements with not much in common: whereas ribosomal DNAs are mainly viewed as housekeeping genes that uphold all prime genome functions, transposable elements are generally portrayed as selfish and disruptive. These opposing characteristics are also mirrored in other attributes: organization in tandem (ribosomal DNAs) versus organization in a dispersed manner (transposable elements); evolution in a concerted manner (ribosomal DNAs) versus evolution by diversification (transposable elements); and activity that prolongs genomic stability (ribosomal DNAs) versus activity that shortens it (transposable elements). Re-visiting relevant instances in which ribosomal DNA-transposable element interactions have been reported, we note that both repeat types share at least four structural and functional hallmarks: (1) they are repetitive DNAs that shape genomes in evolutionary timescales, (2) they exchange structural motifs and can enter co-evolution processes, (3) they are tightly controlled genomic stress sensors playing key roles in senescence/aging, and (4) they share common epigenetic marks such as DNA methylation and histone modification. Here, we give an overview of the structural, functional, and evolutionary characteristics of both ribosomal DNAs and transposable elements, discuss their roles and interactions, and highlight trends and future directions as we move forward in understanding ribosomal DNA-transposable element associations.
Subject(s)
DNA Transposable Elements , Genomics , DNA, Ribosomal , DNA Methylation , Cytogenetic Analysis , Evolution, MolecularABSTRACT
BACKGROUND: Despite the many cheap and fast ways to generate genomic data, good and exact genome assembly is still a problem, with especially the repeats being vastly underrepresented and often misassembled. As short reads in low coverage are already sufficient to represent the repeat landscape of any given genome, many read cluster algorithms were brought forward that provide repeat identification and classification. But how can trustworthy, reliable and representative repeat consensuses be derived from unassembled genomes? RESULTS: Here, we combine methods from repeat identification and genome assembly to derive these robust consensuses. We test several use cases, such as (1) consensus building from clustered short reads of non-model genomes, (2) from genome-wide amplification setups, and (3) specific repeat-centred questions, such as the linked vs. unlinked arrangement of ribosomal genes. In all our use cases, the derived consensuses are robust and representative. To evaluate overall performance, we compare our high-fidelity repeat consensuses to RepeatExplorer2-derived contigs and check, if they represent real transposable elements as found in long reads. Our results demonstrate that it is possible to generate useful, reliable and trustworthy consensuses from short reads by a combination from read cluster and genome assembly methods in an automatable way. CONCLUSION: We anticipate that our workflow opens the way towards more efficient and less manual repeat characterization and annotation, benefitting all genome studies, but especially those of non-model organisms.
Subject(s)
Algorithms , DNA Transposable Elements , Consensus Sequence , Cluster Analysis , GenomicsABSTRACT
BACKGROUND: Extrachromosomal circular DNAs (eccDNAs) are ring-like DNA structures physically separated from the chromosomes with 100 bp to several megabasepairs in size. Apart from carrying tandemly repeated DNA, eccDNAs may also harbor extra copies of genes or recently activated transposable elements. As eccDNAs occur in all eukaryotes investigated so far and likely play roles in stress, cancer, and aging, they have been prime targets in recent research-with their investigation limited by the scarcity of computational tools. RESULTS: Here, we present the ECCsplorer, a bioinformatics pipeline to detect eccDNAs in any kind of organism or tissue using next-generation sequencing techniques. Following Illumina-sequencing of amplified circular DNA (circSeq), the ECCsplorer enables an easy and automated discovery of eccDNA candidates. The data analysis encompasses two major procedures: first, read mapping to the reference genome allows the detection of informative read distributions including high coverage, discordant mapping, and split reads. Second, reference-free comparison of read clusters from amplified eccDNA against control sample data reveals specifically enriched DNA circles. Both software parts can be run separately or jointly, depending on the individual aim or data availability. To illustrate the wide applicability of our approach, we analyzed semi-artificial and published circSeq data from the model organisms Homo sapiens and Arabidopsis thaliana, and generated circSeq reads from the non-model crop plant Beta vulgaris. We clearly identified eccDNA candidates from all datasets, with and without reference genomes. The ECCsplorer pipeline specifically detected mitochondrial mini-circles and retrotransposon activation, showcasing the ECCsplorer's sensitivity and specificity. CONCLUSION: The ECCsplorer (available online at https://github.com/crimBubble/ECCsplorer ) is a bioinformatics pipeline to detect eccDNAs in any kind of organism or tissue using next-generation sequencing data. The derived eccDNA targets are valuable for a wide range of downstream investigations-from analysis of cancer-related eccDNAs over organelle genomics to identification of active transposable elements.
Subject(s)
DNA, Circular , DNA , Chromosomes , Cytoplasm , DNA/genetics , DNA, Circular/genetics , High-Throughput Nucleotide Sequencing , HumansABSTRACT
Fluorescence in situ hybridization (FISH) using oligonucleotide-based probes is an innovative modification of classic FISH techniques, enabling karyotypic identifications. Here, we exemplarily describe the design and in silico visualization of oligonucleotide-based probes derived from the Cucumis sativus genome. Additionally, the probes are also plotted comparatively to the closely related Cucumis melo genome. The visualization process is realized in R using various libraries for linear or circular plots including RIdeogram, KaryoploteR, and Circlize.
Subject(s)
Cucumis sativus , Genome , In Situ Hybridization, Fluorescence , Oligonucleotide Probes/genetics , Cucumis sativus/genetics , Oligonucleotides/geneticsABSTRACT
Saffron crocus (Crocus sativus) is a male-sterile, triploid flower crop, and source of the spice and colorant saffron. For over three millennia, it was cultivated across the Mediterranean, including ancient Greece, Persia, and other cultures, later spreading all over the world. Despite saffron crocus' early omnipresence, its origin has been the matter of a century-old debate, in terms of area and time as well as parental species contribution. While remnants of the ancient arts, crafts, and texts still provide hints on its origin, modern genetics has the potential to efficiently follow these leads, thus shedding light on new possible lines of descent. In this review, we follow ancient arts and recent genetics to trace the evolutionary origin of saffron crocus. We focus on the place and time of saffron domestication and cultivation, and address its presumed autopolyploid origin involving cytotypes of wild Crocus cartwrightianus. Both ancient arts from Greece, Iran, and Mesopotamia as well as recent cytogenetic and comparative next-generation sequencing approaches point to saffron's emergence and domestication in ancient Greece, showing how both disciplines converge in tracing its origin.