Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 296
Filter
1.
Nature ; 619(7970): 555-562, 2023 Jul.
Article in English | MEDLINE | ID: mdl-37380776

ABSTRACT

Whole-genome synthesis provides a powerful approach for understanding and expanding organism function1-3. To build large genomes rapidly, scalably and in parallel, we need (1) methods for assembling megabases of DNA from shorter precursors and (2) strategies for rapidly and scalably replacing the genomic DNA of organisms with synthetic DNA. Here we develop bacterial artificial chromosome (BAC) stepwise insertion synthesis (BASIS)-a method for megabase-scale assembly of DNA in Escherichia coli episomes. We used BASIS to assemble 1.1 Mb of human DNA containing numerous exons, introns, repetitive sequences, G-quadruplexes, and long and short interspersed nuclear elements (LINEs and SINEs). BASIS provides a powerful platform for building synthetic genomes for diverse organisms. We also developed continuous genome synthesis (CGS)-a method for continuously replacing sequential 100 kb stretches of the E. coli genome with synthetic DNA; CGS minimizes crossovers1,4 between the synthetic DNA and the genome such that the output for each 100 kb replacement provides, without sequencing, the input for the next 100 kb replacement. Using CGS, we synthesized a 0.5 Mb section of the E. coli genome-a key intermediate in its total synthesis1-from five episomes in 10 days. By parallelizing CGS and combining it with rapid oligonucleotide synthesis and episome assembly5,6, along with rapid methods for compiling a single genome from strains bearing distinct synthetic genome sections1,7,8, we anticipate that it will be possible to synthesize entire E. coli genomes from functional designs in less than 2 months.


Subject(s)
Chromosomes, Artificial, Bacterial , DNA , Escherichia coli , Genome, Bacterial , Synthetic Biology , Humans , DNA/genetics , DNA/metabolism , Escherichia coli/genetics , Genome, Bacterial/genetics , Plasmids/genetics , Repetitive Sequences, Nucleic Acid/genetics , Synthetic Biology/methods , Chromosomes, Artificial, Bacterial/genetics , Exons , Introns , G-Quadruplexes , Long Interspersed Nucleotide Elements/genetics , Short Interspersed Nucleotide Elements/genetics , Oligodeoxyribonucleotides/biosynthesis , Oligodeoxyribonucleotides/genetics , Oligodeoxyribonucleotides/metabolism , Time Factors
2.
Annu Rev Genomics Hum Genet ; 22: 199-218, 2021 08 31.
Article in English | MEDLINE | ID: mdl-33792357

ABSTRACT

Short interspersed nuclear elements (SINEs) are nonautonomous retrotransposons that occupy approximately 13% of the human genome. They are transcribed by RNA polymerase III and can be retrotranscribed and inserted back into the genome with the help of other autonomous retroelements. Because they are preferentially located close to or within gene-rich regions, they can regulate gene expression by various mechanisms that act at both the DNA and the RNA levels. In this review, we summarize recent findings on the involvement of SINEs in different types of gene regulation and discuss the potential regulatory functions of SINEs that are in close proximity to genes, Pol III-transcribed SINE RNAs, and embedded SINE sequences within Pol II-transcribed genes in the human genome. These discoveries illustrate how the human genome has exapted some SINEs into functional regulatory elements.


Subject(s)
Genome, Human , Transcription, Genetic , Gene Expression Regulation , Humans , RNA Polymerase III/genetics , Short Interspersed Nucleotide Elements/genetics
3.
Proc Natl Acad Sci U S A ; 117(14): 7905-7916, 2020 04 07.
Article in English | MEDLINE | ID: mdl-32193341

ABSTRACT

Transposable elements (TEs) compose nearly half of mammalian genomes and provide building blocks for cis-regulatory elements. Using high-throughput sequencing, we show that 84 TE subfamilies are overrepresented, and distributed in a lineage-specific fashion in core and boundary domains of CD8+ T cell enhancers. Endogenous retroviruses are most significantly enriched in core domains with accessible chromatin, and bear recognition motifs for immune-related transcription factors. In contrast, short interspersed elements (SINEs) are preferentially overrepresented in nucleosome-containing boundaries. A substantial proportion of these SINEs harbor a high density of the enhancer-specific histone mark H3K4me1 and carry sequences that match enhancer boundary nucleotide composition. Motifs with regulatory features are better preserved within enhancer-enriched TE copies compared to their subfamily equivalents located in gene deserts. TE-rich and TE-poor enhancers associate with both shared and unique gene groups and are enriched in overlapping functions related to lymphocyte and leukocyte biology. The majority of T cell enhancers are shared with other immune lineages and are accessible in common hematopoietic progenitors. A higher proportion of immune tissue-specific enhancers are TE-rich compared to enhancers specific to other tissues, correlating with higher TE occurrence in immune gene-associated genomic regions. Our results suggest that during evolution, TEs abundant in these regions and carrying motifs potentially beneficial for enhancer architecture and immune functions were particularly frequently incorporated by evolving enhancers. Their putative selection and regulatory cooption may have accelerated the evolution of immune regulatory networks.


Subject(s)
DNA Transposable Elements/genetics , Enhancer Elements, Genetic/genetics , Evolution, Molecular , T-Lymphocytes/immunology , Animals , Chromatin/genetics , Chromatin/immunology , DNA Transposable Elements/immunology , Endogenous Retroviruses/genetics , Endogenous Retroviruses/immunology , Enhancer Elements, Genetic/immunology , Gene Regulatory Networks/genetics , Genome, Human/genetics , Genome, Human/immunology , Genomics/methods , High-Throughput Nucleotide Sequencing , Humans , Short Interspersed Nucleotide Elements/genetics , Short Interspersed Nucleotide Elements/immunology
4.
Int J Mol Sci ; 24(19)2023 Sep 27.
Article in English | MEDLINE | ID: mdl-37834047

ABSTRACT

Short Interspersed Elements (SINEs) are common in the genomes of most multicellular organisms. They are transcribed by RNA polymerase III from an internal promoter comprising boxes A and B. As transcripts of certain SINEs from mammalian genomes can be polyadenylated, such transcripts should contain the AATAAA sequence as well as those called ß- and τ-signals. One of the goals of this work was to evaluate how autonomous and independent other SINE parts are ß- and τ-signals. Extended regions outside of ß- and τ-signals were deleted from SINEs B2 and Ves and the derived constructs were used to transfect HeLa cells in order to evaluate the relative levels of their transcripts as well as their polyadenylation efficiency. If the deleted regions affected boxes A and B, the 5'-flanking region of the U6 RNA gene with the external promoter was inserted upstream. Such substitution of the internal promoter in B2 completely restored its transcription. Almost all tested deletions/substitutions did not reduce the polyadenylation capacity of the transcripts, indicating a weak dependence of the function of ß- and τ-signals on the neighboring sequences. A similar analysis of B2 and Ves constructs containing a 55-bp foreign sequence inserted between ß- and τ-signals showed an equal polyadenylation efficiency of their transcripts compared to those of constructs without the insertion. The acquired poly(A)-tails significantly increased the lifetime and thus the cellular level of such transcripts. The data obtained highlight the potential of B2 and Ves SINEs as cassettes for the expression of relatively short sequences for various applications.


Subject(s)
Polyadenylation , RNA Polymerase III , Animals , Humans , Polyadenylation/genetics , RNA Polymerase III/genetics , HeLa Cells , Short Interspersed Nucleotide Elements/genetics , Promoter Regions, Genetic , Mammals/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism
5.
Mol Genet Genomics ; 297(1): 1-18, 2022 Jan.
Article in English | MEDLINE | ID: mdl-34999963

ABSTRACT

Genome sequence analysis of Entamoeba species revealed various classes of transposable elements. While E. histolytica and E. dispar are rich in non-long terminal repeat (LTR) retrotransposons, E. invadens contains predominantly DNA transposons. Non-LTR retrotransposons of E. histolytica constitute three families of long interspersed nuclear elements (LINEs), and their short, nonautonomous partners, SINEs. They occupy ~ 11% of the genome. The EhLINE1/EhSINE1 family is the most abundant and best studied. EhLINE1 is 4.8 kb, with two ORFs that encode functions needed for retrotransposition. ORF1 codes for the nucleic acid-binding protein, and ORF2 has domains for reverse transcriptase (RT) and endonuclease (EN). Most copies of EhLINEs lack complete ORFs. ORF1p is expressed constitutively, but ORF2p is not detected. Retrotransposition could be demonstrated upon ectopic over expression of ORF2p, showing that retrotransposition machinery is functional. The newly retrotransposed sequences showed a high degree of recombination. In transcriptomic analysis, RNA-Seq reads were mapped to individual EhLINE1 copies. Although full-length copies were transcribed, no full-length 4.8 kb transcripts were seen. Rather, sense transcripts mapped to ORF1, RT and EN domains. Intriguingly, there was strong antisense transcription almost exclusively from the RT domain. These unique features of EhLINE1 could serve to attenuate retrotransposition in E. histolytica.


Subject(s)
Entamoeba histolytica/genetics , Entamoeba histolytica/physiology , Animals , Chromosome Mapping , Genome, Protozoan/genetics , Genomics , Humans , Long Interspersed Nucleotide Elements/genetics , Open Reading Frames/genetics , Retroelements , Short Interspersed Nucleotide Elements/genetics
6.
Nucleic Acids Res ; 48(16): 9346-9360, 2020 09 18.
Article in English | MEDLINE | ID: mdl-32697302

ABSTRACT

Long non-coding RNAs (lncRNAs) are attracting widespread attention for their emerging regulatory, transcriptional, epigenetic, structural and various other functions. Comprehensive transcriptome analysis has revealed that retrotransposon elements (REs) are transcribed and enriched in lncRNA sequences. However, the functions of lncRNAs and the molecular roles of the embedded REs are largely unknown. The secondary and tertiary structures of lncRNAs and their embedded REs are likely to have essential functional roles, but experimental determination and reliable computational prediction of large RNA structures have been extremely challenging. We report here the nuclear magnetic resonance (NMR)-based secondary structure determination of the 167-nt inverted short interspersed nuclear element (SINE) B2, which is embedded in antisense Uchl1 lncRNA and upregulates the translation of sense Uchl1 mRNAs. By using NMR 'fingerprints' as a sensitive probe in the domain survey, we successfully divided the full-length inverted SINE B2 into minimal units made of two discrete structured domains and one dynamic domain without altering their original structures after careful boundary adjustments. This approach allowed us to identify a structured domain in nucleotides 31-119 of the inverted SINE B2. This approach will be applicable to determining the structures of other regulatory lncRNAs.


Subject(s)
Nucleic Acid Conformation , RNA, Long Noncoding/ultrastructure , Retroelements/genetics , Short Interspersed Nucleotide Elements/genetics , Computational Biology , Humans , Magnetic Resonance Spectroscopy , RNA, Antisense/genetics , RNA, Antisense/ultrastructure , RNA, Long Noncoding/genetics , Transcriptome/genetics , Ubiquitin Thiolesterase/genetics , Ubiquitin Thiolesterase/ultrastructure
7.
Nucleic Acids Res ; 48(5): 2621-2642, 2020 03 18.
Article in English | MEDLINE | ID: mdl-31863590

ABSTRACT

Transposable elements (TEs) comprise a large proportion of long non-coding RNAs (lncRNAs). Here, we employed CRISPR to delete a short interspersed nuclear element (SINE) in Malat1, a cancer-associated lncRNA, to investigate its significance in cellular physiology. We show that Malat1 with a SINE deletion forms diffuse nuclear speckles and is frequently translocated to the cytoplasm. SINE-deleted cells exhibit an activated unfolded protein response and PKR and markedly increased DNA damage and apoptosis caused by dysregulation of TDP-43 localization and formation of cytotoxic inclusions. TDP-43 binds stronger to Malat1 without the SINE and is likely 'hijacked' by cytoplasmic Malat1 to the cytoplasm, resulting in the depletion of nuclear TDP-43 and redistribution of TDP-43 binding to repetitive element transcripts and mRNAs encoding mitotic and nuclear-cytoplasmic regulators. The SINE promotes Malat1 nuclear retention by facilitating Malat1 binding to HNRNPK, a protein that drives RNA nuclear retention, potentially through direct interactions of the SINE with KHDRBS1 and TRA2A, which bind to HNRNPK. Losing these RNA-protein interactions due to the SINE deletion likely creates more available TDP-43 binding sites on Malat1 and subsequent TDP-43 aggregation. These results highlight the significance of lncRNA TEs in TDP-43 proteostasis with potential implications in both cancer and neurodegenerative diseases.


Subject(s)
DNA-Binding Proteins/metabolism , Proteostasis/genetics , RNA, Long Noncoding/genetics , Short Interspersed Nucleotide Elements/genetics , Apoptosis , Cell Line , Cytoplasm/metabolism , DNA Damage , Endoplasmic Reticulum Stress , Enzyme Activation , Gene Dosage , Heterogeneous-Nuclear Ribonucleoprotein K/metabolism , Humans , Mitosis , Models, Biological , Protein Transport , RNA, Messenger/genetics , RNA, Messenger/metabolism , Sequence Deletion/genetics , eIF-2 Kinase
8.
Int J Mol Sci ; 23(4)2022 Feb 17.
Article in English | MEDLINE | ID: mdl-35216353

ABSTRACT

X-linked dystonia-parkinsonism (XDP) is a severe neurodegenerative disorder that manifests as adult-onset dystonia combined with parkinsonism. A SINE-VNTR-Alu (SVA) retrotransposon inserted in an intron of the TAF1 gene reduces its expression and alters splicing in XDP patient-derived cells. As a consequence, increased levels of the TAF1 intron retention transcript TAF1-32i can be found in XDP cells as compared to healthy controls. Here, we investigate the sequence of the deep intronic region included in this transcript and show that it is also present in cells from healthy individuals, albeit in lower amounts than in XDP cells, and that it undergoes degradation by nonsense-mediated mRNA decay. Furthermore, we investigate epigenetic marks (e.g., DNA methylation and histone modifications) present in this intronic region and the spanning sequence. Finally, we show that the SVA evinces regulatory potential, as demonstrated by its ability to repress the TAF1 promoter in vitro. Our results enable a better understanding of the disease mechanisms underlying XDP and transcriptional alterations caused by SVA retrotransposons.


Subject(s)
Dystonic Disorders/genetics , Genetic Diseases, X-Linked/genetics , Parkinsonian Disorders/genetics , Retroelements/genetics , Transcription, Genetic/genetics , Adolescent , Adult , DNA Methylation/genetics , Female , Histone Acetyltransferases/genetics , Humans , Introns/genetics , Male , Middle Aged , Promoter Regions, Genetic/genetics , Short Interspersed Nucleotide Elements/genetics , TATA-Binding Protein Associated Factors/genetics , Transcription Factor TFIID/genetics , Young Adult
9.
BMC Bioinformatics ; 22(1): 42, 2021 Feb 02.
Article in English | MEDLINE | ID: mdl-33530928

ABSTRACT

BACKGROUND: Transposable elements (TEs) constitute a significant part of eukaryotic genomes. Short interspersed nuclear elements (SINEs) are non-autonomous TEs, which are widely represented in mammalian genomes and also found in plants. After insertion in a new position in the genome, TEs quickly accumulate mutations, which complicate their identification and annotation by modern bioinformatics methods. In this study, we searched for highly divergent SINE copies in the genome of rice (Oryza sativa subsp. japonica) using the Highly Divergent Repeat Search Method (HDRSM). RESULTS: The HDRSM considers correlations of neighboring symbols to construct position weight matrix (PWM) for a SINE family, which is then used to perform a search for new copies. In order to evaluate the accuracy of the method and compare it with the RepeatMasker program, we generated a set of SINE copies containing nucleotide substitutions and indels and inserted them into an artificial chromosome for analysis. The HDRSM showed better results both in terms of the number of identified inserted repeats and the accuracy of determining their boundaries. A search for the copies of 39 SINE families in the rice genome produced 14,030 hits; among them, 5704 were not detected by RepeatMasker. CONCLUSIONS: The HDRSM could find divergent SINE copies, correctly determine their boundaries, and offer a high level of statistical significance. We also found that RepeatMasker is able to find relatively short copies of the SINE families with a higher level of similarity, while HDRSM is able to find more diverged copies. To obtain a comprehensive profile of SINE distribution in the genome, combined application of the HDRSM and RepeatMasker is recommended.


Subject(s)
DNA Transposable Elements , Oryza , Short Interspersed Nucleotide Elements , Animals , DNA Transposable Elements/genetics , Evolution, Molecular , Humans , Oryza/genetics , Phylogeny , Position-Specific Scoring Matrices , Short Interspersed Nucleotide Elements/genetics
10.
Plant J ; 101(3): 681-699, 2020 02.
Article in English | MEDLINE | ID: mdl-31610059

ABSTRACT

Repetitive sequences are ubiquitous components of eukaryotic genomes affecting genome size and evolution as well as gene regulation. Among them, short interspersed nuclear elements (SINEs) are non-coding retrotransposons usually shorter than 1000 bp. They contain only few short conserved structural motifs, in particular an internal promoter derived from cellular RNAs and a mostly AT-rich 3' tail, whereas the remaining regions are highly variable. SINEs emerge and vanish during evolution, and often diversify into numerous families and subfamilies that are usually specific for only a limited number of species. In contrast, at the 3' end of multiple plant SINEs we detected the highly conserved 'Angio-domain'. This 37 bp segment defines the Angio-SINE superfamily, which encompasses 24 plant SINE families widely distributed across 13 orders within the plant kingdom. We retrieved 28 433 full-length Angio-SINE copies from genome assemblies of 46 plant species, frequently located in genes. Compensatory mutations in and adjacent to the Angio-domain imply selective restraints maintaining its RNA structure. Angio-SINE families share segmental sequence similarities, indicating a modular evolution with strong Angio-domain preservation. We suggest that the conserved domain contributes to the evolutionary success of Angio-SINEs through either structural interactions between SINE RNA and proteins increasing their transpositional efficiency, or by enhancing their accumulation in genes.


Subject(s)
Embryophyta/genetics , Genome, Plant/genetics , Genomics , Short Interspersed Nucleotide Elements/genetics , Evolution, Molecular , Retroelements/genetics
11.
Plant J ; 103(1): 443-458, 2020 07.
Article in English | MEDLINE | ID: mdl-32056333

ABSTRACT

Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.


Subject(s)
3' Flanking Region/genetics , Salicaceae/genetics , Short Interspersed Nucleotide Elements/genetics , Biological Evolution , Chromosome Mapping , Chromosomes, Plant/genetics , Conserved Sequence/genetics , Genes, Plant/genetics , Genome, Plant/genetics , Phylogeny , Populus/genetics , Salix/genetics
12.
BMC Genomics ; 22(1): 226, 2021 Mar 31.
Article in English | MEDLINE | ID: mdl-33789582

ABSTRACT

BACKGROUND: Short interspersed nuclear elements (SINEs) belong to non-long terminal repeat (non-LTR) retrotransposons, which can mobilize dependent on the help of counterpart long interspersed nuclear elements (LINEs). Although 234 SINEs have been identified so far, only 23 are from insect species (SINEbase: http://sines.eimb.ru/ ). RESULTS: Here, five SINEs were identified from the genome of Plutella xylostella, among which PxSE1, PxSE2 and PxSE3 were tRNA-derived SINEs, PxSE4 and PxSE5 were 5S RNA-derived SINEs. A total of 18 related SINEs were further identified in 13 lepidopteran insects and a baculovirus. The 3'-tail of PxSE5 shares highly identity with that of LINE retrotransposon, PxLINE1. The analysis of relative age distribution profiles revealed that PxSE1 is a relatively young retrotransposon in the genome of P. xylostella and was generated by recent explosive amplification. Integration pattern analysis showed that SINEs in P. xylostella prefer to insert into or accumulate in introns and regions 5 kb downstream of genes. In particular, the PxSE1-like element, SlNPVSE1, in Spodoptera litura nucleopolyhedrovirus II genome is highly identical to SfSE1 in Spodoptera frugiperda, SlittSE1 in Spodoptera littoralis, and SlituSE1 in Spodoptera litura, suggesting the occurrence of horizontal transfer. CONCLUSIONS: Lepidopteran insect genomes harbor a diversity of SINEs. The retrotransposition activity and copy number of these SINEs varies considerably between host lineages and SINE lineages. Host-parasite interactions facilitate the horizontal transfer of SINE between baculovirus and its lepidopteran hosts.


Subject(s)
Baculoviridae , Short Interspersed Nucleotide Elements , Animals , Baculoviridae/genetics , Evolution, Molecular , Insecta , Long Interspersed Nucleotide Elements/genetics , Phylogeny , Short Interspersed Nucleotide Elements/genetics
13.
Genes Dev ; 27(7): 793-804, 2013 Apr 01.
Article in English | MEDLINE | ID: mdl-23558772

ABSTRACT

Staufen1-mediated mRNA decay (SMD) degrades mRNAs that harbor a Staufen1-binding site (SBS) in their 3' untranslated regions (UTRs). Human SBSs can form by intermolecular base-pairing between a 3' UTR Alu element and an Alu element within a long noncoding RNA (lncRNA) called a ½-sbsRNA. Since Alu elements are confined to primates, it was unclear how SMD occurs in rodents. Here we identify mouse mRNA 3' UTRs and lncRNAs that contain a B1, B2, B4, or identifier (ID) element. We show that SMD occurs in mouse cells via mRNA-lncRNA base-pairing of partially complementary elements and that mouse ½-sbsRNA (m½-sbsRNA)-triggered SMD regulates C2C12 cell myogenesis. Our findings define new roles for lncRNAs as well as B and ID short interspersed elements (SINEs) in mice that undoubtedly influence many developmental and homeostatic pathways.


Subject(s)
Muscle Development/genetics , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , Short Interspersed Nucleotide Elements/genetics , 3' Untranslated Regions/genetics , Animals , Base Pairing , Cell Line , Mice , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA-Binding Proteins/metabolism , Trans-Activators/metabolism
14.
Int J Mol Sci ; 22(18)2021 Sep 13.
Article in English | MEDLINE | ID: mdl-34576060

ABSTRACT

Short Interspersed Elements (SINEs) are eukaryotic non-autonomous retrotransposons transcribed by RNA polymerase III (pol III). The 3'-terminus of many mammalian SINEs has a polyadenylation signal (AATAAA), pol III transcription terminator, and A-rich tail. The RNAs of such SINEs can be polyadenylated, which is unique for pol III transcripts. Here, B2 (mice and related rodents), Dip (jerboas), and Ves (vespertilionid bats) SINE families were thoroughly studied. They were divided into subfamilies reliably distinguished by relatively long indels. The age of SINE subfamilies can be estimated, which allows us to reconstruct their evolution. The youngest and most active variants of SINE subfamilies were given special attention. The shortest pol III transcription terminators are TCTTT (B2), TATTT (Ves and Dip), and the rarer TTTT. The last nucleotide of the terminator is often not transcribed; accordingly, the truncated terminator of its descendant becomes nonfunctional. The incidence of complete transcription of the TCTTT terminator is twice higher compared to TTTT and thus functional terminators are more likely preserved in daughter SINE copies. Young copies have long poly(A) tails; however, they gradually shorten in host generations. Unexpectedly, the tail shortening below A10 increases the incidence of terminator elongation by Ts thus restoring its efficiency. This process can be critical for the maintenance of SINE activity in the genome.


Subject(s)
Evolution, Molecular , Retroelements/genetics , Short Interspersed Nucleotide Elements/genetics , Transcription Termination, Genetic , Animals , Humans , Mice , Poly A/genetics , Polyadenylation/genetics , RNA/genetics , RNA 3' Polyadenylation Signals/genetics , RNA Polymerase III/genetics , RNA, Messenger/genetics , Transcription, Genetic/genetics
15.
Int J Mol Sci ; 22(12)2021 Jun 12.
Article in English | MEDLINE | ID: mdl-34204806

ABSTRACT

Transposable elements (TEs) are repetitive elements that belong to a variety of functional classes and have an important role in shaping genome evolution. Around 50% of the human genome contains TEs, and they have been termed the "dark matter" of the genome because relatively little is known about their function. While TEs have been shown to participate in aberrant gene regulation and the pathogenesis of diseases, only a few studies have explored the systemic effect of TEs on gene expression. In the present study, we analysed whole genome sequences and blood whole transcriptome data from 570 individuals within the Parkinson's Progressive Markers Initiative (PPMI) cohort to identify expression quantitative trait loci (eQTL) regulating genome-wide gene expression associated with TEs. We identified 2132 reference TEs that were polymorphic for their presence or absence in our study cohort. The presence or absence of the TE element could change the expression of the gene or gene clusters from zero to tens of thousands of copies of RNA. The main finding is that many TEs possess very strong regulatory effects, and they have the potential to modulate large genetic networks with hundreds of target genes over the genome. We illustrate the plethora of regulatory mechanisms using examples of their action at the HLA gene cluster and data showing different TEs' convergence to modulate WFS1 gene expression. In conclusion, the presence or absence of polymorphisms of TEs has an eminent genome-wide regulatory function with large effect size at the level of the whole transcriptome. The role of TEs in explaining, in part, the missing heritability for complex traits is convincing and should be considered.


Subject(s)
Quantitative Trait Loci/genetics , Retroelements/genetics , Transcriptome/genetics , Alu Elements/genetics , Genome, Human , Humans , Minisatellite Repeats/genetics , Parkinson Disease/genetics , Short Interspersed Nucleotide Elements/genetics
16.
Int J Mol Sci ; 22(4)2021 Feb 15.
Article in English | MEDLINE | ID: mdl-33671852

ABSTRACT

SINE-VNTR-Alu (SVA) retrotransposons are a subclass of transposable elements (TEs) that exist only in primate genomes. TE insertions can be co-opted as cis-regulatory elements (CREs); however, the regulatory potential of SVAs has predominantly been demonstrated using bioinformatic approaches and reporter gene assays. The objective of this study was to demonstrate SVA cis-regulatory activity by CRISPR (clustered regularly interspaced short palindromic repeats) deletion and subsequent measurement of direct effects on local gene expression. We identified a region on chromosome 17 that was enriched with human-specific SVAs. Comparative gene expression analysis at this region revealed co-expression of TRPV1 and TRPV3 in multiple human tissues, which was not observed in mouse, highlighting key regulatory differences between the two species. Furthermore, the intergenic region between TRPV1 and TRPV3 coding sequences contained a human specific SVA insertion located upstream of the TRPV3 promoter and downstream of the 3' end of TRPV1, highlighting this SVA as a candidate to study its potential cis-regulatory activity on both genes. Firstly, we generated SVA reporter gene constructs and demonstrated their transcriptional regulatory activity in HEK293 cells. We then devised a dual-targeting CRISPR strategy to facilitate the deletion of this entire SVA sequence and generated edited HEK293 clonal cell lines containing homozygous and heterozygous SVA deletions. In edited homozygous ∆SVA clones, we observed a significant decrease in both TRPV1 and TRPV3 mRNA expression, compared to unedited HEK293. In addition, we also observed an increase in the variability of mRNA expression levels in heterozygous ∆SVA clones. Overall, in edited HEK293 with SVA deletions, we observed a disruption to the co-expression of TRPV1 and TRPV3. Here we provide an example of a human specific SVA with cis-regulatory activity in situ, supporting the role of SVA retrotransposons as contributors to species-specific gene expression.


Subject(s)
Alu Elements/genetics , CRISPR-Cas Systems , Clustered Regularly Interspaced Short Palindromic Repeats/genetics , DNA, Intergenic/genetics , Minisatellite Repeats/genetics , Promoter Regions, Genetic/genetics , Short Interspersed Nucleotide Elements/genetics , TRPV Cation Channels/genetics , Animals , Gene Expression , Gene Expression Regulation , Genes, Reporter , HEK293 Cells , Humans , Mice , Primates/genetics
17.
Int J Mol Sci ; 22(21)2021 Nov 08.
Article in English | MEDLINE | ID: mdl-34769514

ABSTRACT

Tumor-associated cell-free DNAs (cfDNA) play an important role in the promotion of metastases. Previous studies proved the high antimetastatic potential of bovine pancreatic DNase I and identified short interspersed nuclear elements (SINEs) and long interspersed nuclear elements (LINEs)and fragments of oncogenes in cfDNA as the main molecular targets of enzyme in the bloodstream. Here, recombinant human DNase I (commercial name Pulmozyme®), which is used for the treatment of cystic fibrosis in humans, was repurposed for the inhibition of lung metastases in the B16 melanoma model in mice. We found that Pulmozyme® strongly reduced migration and induced apoptosis of B16 cells in vitro and effectively inhibited metastases in lungs and liver in vivo. Pulmozyme® was shown to be two times more effective when administered intranasally (i.n.) than bovine DNase I, but intramuscular (i.m.) administration forced it to exhibit as high an antimetastatic activity as bovine DNase I. Both DNases administered to mice either i.m. or i.n. enhanced the DNase activity of blood serum to the level of healthy animals, significantly decreased cfDNA concentrations, efficiently degraded SINE and LINE repeats and c-Myc fragments in the bloodstream and induced apoptosis and disintegration of neutrophil extracellular traps in metastatic foci; as a result, this manifested as the inhibition of metastases spread. Thus, Pulmozyme®, which is already an approved drug, can be recommended for use in the treatment of lung metastases.


Subject(s)
Cell-Free Nucleic Acids/blood , Deoxyribonuclease I/metabolism , Long Interspersed Nucleotide Elements/genetics , Lung Neoplasms/drug therapy , Melanoma, Experimental/drug therapy , Proto-Oncogene Proteins c-myc/antagonists & inhibitors , Short Interspersed Nucleotide Elements/genetics , Animals , Cell Line, Tumor , Deoxyribonuclease I/pharmacology , Disease Models, Animal , Drug Repositioning , Lung Neoplasms/genetics , Lung Neoplasms/metabolism , Lung Neoplasms/secondary , Male , Melanoma, Experimental/genetics , Melanoma, Experimental/metabolism , Melanoma, Experimental/pathology , Mice , Proto-Oncogene Proteins c-myc/blood , Proto-Oncogene Proteins c-myc/genetics , Recombinant Proteins/pharmacology
18.
J Virol ; 93(6)2019 03 15.
Article in English | MEDLINE | ID: mdl-30626674

ABSTRACT

Horizontal transfer of genetic materials between virus and host has been frequently identified. Three rice planthoppers, Laodelphax striatellus, Nilaparvata lugens, and Sogatella furcifera, are agriculturally important insects because they are destructive rice pests and also the vector of a number of phytopathogenic viruses. In this study, we discovered that a small region (∼300 nucleotides [nt]) of the genome of invertebrate iridescent virus 6 (IIV-6; genus Iridovirus, family Iridoviridae), a giant DNA virus that infects invertebrates but is not known to infect planthoppers, is highly homologous to the sequences present in high copy numbers in these three planthopper genomes. These sequences are related to the short interspersed nuclear elements (SINEs), a class of non-long terminal repeat (LTR) retrotransposons (retroposons), suggesting a horizontal transfer event of a transposable element from the rice planthopper genome to the IIV-6 genome. In addition, a number of planthopper transcripts mapped to these rice planthopper SINE-like sequences (RPSlSs) were identified and appear to be transcriptionally regulated along the different developmental stages of planthoppers. Small RNAs derived from these RPSlSs are predominantly 26 to 28 nt long, which is a typical characteristic of PIWI-interacting RNAs. Phylogenetic analysis suggests that IIV-6 acquires a SINE-like retrotransposon from S. furcifera after the evolutionary divergence of the three rice planthoppers. This study provides further examples of the horizontal transfer of an insect transposon to virus and suggests the association of rice planthoppers with iridoviruses in the past or present.IMPORTANCE This study provides an example of the horizontal transfer event from a rice planthopper genome to an IIV-6 genome. A small region of the IIV-6 genome (∼300 nt) is highly homologous to the sequences presented in high copy numbers of three rice planthopper genomes that are related to the SINEs, a class of retroposons. The expression of these planthopper SINE-like sequences was confirmed, and corresponding Piwi-interacting RNA-like small RNAs were identified and comprehensively characterized. Phylogenetic analysis suggests that the giant invertebrate iridovirus IIV-6 obtains this SINE-related sequence from Sogatella furcifera through a horizontal transfer event in the past. To the best of our knowledge, this is the first report of a horizontal transfer event between a planthopper and a giant DNA virus and also is the first evidence for the eukaryotic origin of genetic material in iridoviruses.


Subject(s)
DNA Viruses/genetics , Insect Viruses/genetics , Insecta/virology , Oryza/virology , Retroelements/genetics , Animals , Biological Evolution , Hemiptera/virology , Phylogeny , Short Interspersed Nucleotide Elements/genetics
19.
Nature ; 511(7511): 606-10, 2014 Jul 31.
Article in English | MEDLINE | ID: mdl-25079557

ABSTRACT

DNA methylation is a crucial element in the epigenetic regulation of mammalian embryonic development. However, its dynamic patterns have not been analysed at the genome scale in human pre-implantation embryos due to technical difficulties and the scarcity of required materials. Here we systematically profile the methylome of human early embryos from the zygotic stage through to post-implantation by reduced representation bisulphite sequencing and whole-genome bisulphite sequencing. We show that the major wave of genome-wide demethylation is complete at the 2-cell stage, contrary to previous observations in mice. Moreover, the demethylation of the paternal genome is much faster than that of the maternal genome, and by the end of the zygotic stage the genome-wide methylation level in male pronuclei is already lower than that in female pronuclei. The inverse correlation between promoter methylation and gene expression gradually strengthens during early embryonic development, reaching its peak at the post-implantation stage. Furthermore, we show that active genes, with the trimethylation of histone H3 at lysine 4 (H3K4me3) mark at the promoter regions in pluripotent human embryonic stem cells, are essentially devoid of DNA methylation in both mature gametes and throughout pre-implantation development. Finally, we also show that long interspersed nuclear elements or short interspersed nuclear elements that are evolutionarily young are demethylated to a milder extent compared to older elements in the same family and have higher abundance of transcripts, indicating that early embryos tend to retain higher residual methylation at the evolutionarily younger and more active transposable elements. Our work provides insights into the critical features of the methylome of human early embryos, as well as its functional relation to the regulation of gene expression and the repression of transposable elements.


Subject(s)
DNA Methylation , Epigenesis, Genetic , Gene Expression Regulation, Developmental , Animals , DNA Transposable Elements/genetics , Embryo, Mammalian , Embryonic Stem Cells/physiology , Female , Gene Expression Profiling , Germ Cells/metabolism , Histones/metabolism , Humans , Long Interspersed Nucleotide Elements/genetics , Male , Mice , Promoter Regions, Genetic/genetics , Short Interspersed Nucleotide Elements/genetics
20.
Nucleic Acids Res ; 46(16): 8299-8310, 2018 09 19.
Article in English | MEDLINE | ID: mdl-29986092

ABSTRACT

Mammalian DNA replication is a highly organized and regulated process. Large, Mb-sized regions are replicated at defined times along S-phase. Replication Timing (RT) is thought to play a role in shaping the mammalian genome by affecting mutation rates. Previous analyses relied on somatic RT profiles. However, only germline mutations are passed on to offspring and affect genomic composition. Therefore, germ cell RT information is necessary to evaluate the influences of RT on the mammalian genome. We adapted the RT mapping technique for limited amounts of cells, and measured RT from two stages in the mouse germline - primordial germ cells (PGCs) and spermatogonial stem cells (SSCs). RT in germline cells exhibited stronger correlations to both mutation rate and recombination hotspots density than those of RT in somatic tissues, emphasizing the importance of using correct tissues-of-origin for RT profiling. Germline RT maps exhibited stronger correlations to additional genetic features including GC-content, transposable elements (SINEs and LINEs), and gene density. GC content stratification and multiple regression analysis revealed independent contributions of RT to SINE, gene, mutation, and recombination hotspot densities. Together, our results establish a central role for RT in shaping multiple levels of mammalian genome composition.


Subject(s)
DNA Replication Timing/genetics , DNA Replication/genetics , Genome/genetics , Germ Cells/metabolism , Stem Cells/metabolism , Animals , Base Composition/genetics , Cell Line, Tumor , Cells, Cultured , DNA Transposable Elements/genetics , Female , Germ Cells/cytology , Germ-Line Mutation , Male , Mammals/genetics , Mice, 129 Strain , Mice, Inbred C57BL , Mice, Inbred DBA , Mice, Inbred NOD , Mice, SCID , Mice, Transgenic , Short Interspersed Nucleotide Elements/genetics , Stem Cells/cytology
SELECTION OF CITATIONS
SEARCH DETAIL