Search | Virtual Health Library

1.

G-quadruplexes in the evolution of hepatitis B virus.

Brázda, Václav; Dobrovolná, Michaela; Bohálová, Natália; Mergny, Jean-Louis.

Nucleic Acids Res ; 51(14): 7198-7204, 2023 08 11.

Article in English | MEDLINE | ID: mdl-37395407

ABSTRACT

Hepatitis B virus (HBV) is one of the most dangerous human pathogenic viruses found in all corners of the world. Recent sequencing of ancient HBV viruses revealed that these viruses have accompanied humanity for several millenia. As G-quadruplexes are considered to be potential therapeutic targets in virology, we examined G-quadruplex-forming sequences (PQS) in modern and ancient HBV genomes. Our analyses showed the presence of PQS in all 232 tested HBV genomes, with a total number of 1258 motifs and an average frequency of 1.69 PQS per kbp. Notably, the PQS with the highest G4Hunter score in the reference genome is the most highly conserved. Interestingly, the density of PQS motifs is lower in ancient HBV genomes than in their modern counterparts (1.5 and 1.9/kb, respectively). This modern frequency of 1.90 is very close to the PQS frequency of the human genome (1.93) using identical parameters. This indicates that the PQS content in HBV increased over time to become closer to the PQS frequency in the human genome. No statistically significant differences were found between PQS densities in HBV lineages found in different continents. These results, which constitute the first paleogenomics analysis of G4 propensity, are in agreement with our hypothesis that, for viruses causing chronic infections, their PQS frequencies tend to converge evolutionarily with those of their hosts, as a kind of 'genetic camouflage' to both hijack host cell transcriptional regulatory systems and to avoid recognition as foreign material.

Subject(s)

G-Quadruplexes , Hepatitis B virus , Humans , Genome, Human , Genomics , Hepatitis B virus/genetics , Paleontology , Biological Evolution

2.

Evolution of Diverse Strategies for Promoter Regulation.

Brázda, Václav; Bartas, Martin; Bowater, Richard P.

Trends Genet ; 37(8): 730-744, 2021 08.

Article in English | MEDLINE | ID: mdl-33931265

ABSTRACT

DNA is fundamentally important for all cellular organisms due to its role as a store of hereditary genetic information. The precise and accurate regulation of gene transcription depends primarily on promoters, which vary significantly within and between genomes. Some promoters are rich in specific types of bases, while others have more varied, complex sequence characteristics. However, it is not only base sequence but also epigenetic modifications and altered DNA structure that regulate promoter activity. Significantly, many promoters across all organisms contain sequences that can form intrastrand hairpins (cruciforms) or four-stranded structures (G-quadruplex or i-motif). In this review we integrate recent studies on promoter regulation that highlight the importance of DNA structure in the evolutionary adaptation of promoter sequences.

Subject(s)

DNA/genetics , Evolution, Molecular , Promoter Regions, Genetic/genetics , Transcription, Genetic/genetics , DNA/ultrastructure , G-Quadruplexes , Nucleic Acid Conformation

3.

Unheeded SARS-CoV-2 proteins? A deep look into negative-sense RNA.

Bartas, Martin; Volná, Adriana; Beaudoin, Christopher A; Poulsen, Ebbe Toftgaard; Cerven, Jirí; Brázda, Václav; Spunda, Vladimír; Blundell, Tom L; Pecinka, Petr.

Brief Bioinform ; 23(3)2022 05 13.

Article in English | MEDLINE | ID: mdl-35229157

ABSTRACT

SARS-CoV-2 is a novel positive-sense single-stranded RNA virus from the Coronaviridae family (genus Betacoronavirus), which has been established as causing the COVID-19 pandemic. The genome of SARS-CoV-2 is one of the largest among known RNA viruses, comprising of at least 26 known protein-coding loci. Studies thus far have outlined the coding capacity of the positive-sense strand of the SARS-CoV-2 genome, which can be used directly for protein translation. However, it has been recently shown that transcribed negative-sense viral RNA intermediates that arise during viral genome replication from positive-sense viruses can also code for proteins. No studies have yet explored the potential for negative-sense SARS-CoV-2 RNA intermediates to contain protein-coding loci. Thus, using sequence and structure-based bioinformatics methodologies, we have investigated the presence and validity of putative negative-sense ORFs (nsORFs) in the SARS-CoV-2 genome. Nine nsORFs were discovered to contain strong eukaryotic translation initiation signals and high codon adaptability scores, and several of the nsORFs were predicted to interact with RNA-binding proteins. Evolutionary conservation analyses indicated that some of the nsORFs are deeply conserved among related coronaviruses. Three-dimensional protein modeling revealed the presence of higher order folding among all putative SARS-CoV-2 nsORFs, and subsequent structural mimicry analyses suggest similarity of the nsORFs to DNA/RNA-binding proteins and proteins involved in immune signaling pathways. Altogether, these results suggest the potential existence of still undescribed SARS-CoV-2 proteins, which may play an important role in the viral lifecycle and COVID-19 pathogenesis.

Subject(s)

COVID-19 , SARS-CoV-2 , COVID-19/genetics , Genome, Viral , Humans , Pandemics , RNA, Viral/chemistry , RNA, Viral/genetics , RNA-Binding Proteins/genetics , SARS-CoV-2/genetics

4.

G-quadruplexes in helminth parasites.

Cantara, Alessio; Luo, Yu; Dobrovolná, Michaela; Bohalova, Natalia; Fojta, Miroslav; Verga, Daniela; Guittat, Lionel; Cucchiarini, Anne; Savrimoutou, Solène; Häberli, Cécile; Guillon, Jean; Keiser, Jennifer; Brázda, Václav; Mergny, Jean Louis.

Nucleic Acids Res ; 50(5): 2719-2735, 2022 03 21.

Article in English | MEDLINE | ID: mdl-35234933

ABSTRACT

Parasitic helminths infecting humans are highly prevalent infecting â¼2 billion people worldwide, causing inflammatory responses, malnutrition and anemia that are the primary cause of morbidity. In addition, helminth infections of cattle have a significant economic impact on livestock production, milk yield and fertility. The etiological agents of helminth infections are mainly Nematodes (roundworms) and Platyhelminths (flatworms). G-quadruplexes (G4) are unusual nucleic acid structures formed by G-rich sequences that can be recognized by specific G4 ligands. Here we used the G4Hunter Web Tool to identify and compare potential G4 sequences (PQS) in the nuclear and mitochondrial genomes of various helminths to identify G4 ligand targets. PQS are nonrandomly distributed in these genomes and often located in the proximity of genes. Unexpectedly, a Nematode, Ascaris lumbricoides, was found to be highly enriched in stable PQS. This species can tolerate high-stability G4 structures, which are not counter selected at all, in stark contrast to most other species. We experimentally confirmed G4 formation for sequences found in four different parasitic helminths. Small molecules able to selectively recognize G4 were found to bind to Schistosoma mansoni G4 motifs. Two of these ligands demonstrated potent activity both against larval and adult stages of this parasite.

Subject(s)

G-Quadruplexes , Nematoda , Parasites/genetics , Platyhelminths , Animals , Cattle , Genome , Helminths/genetics , Humans , Ligands , Nematoda/genetics , Platyhelminths/genetics

5.

Special Issue "Bioinformatics of Unusual DNA and RNA Structures".

Bartas, Martin; Brázda, Václav; Pecinka, Petr.

Int J Mol Sci ; 25(10)2024 May 10.

Article in English | MEDLINE | ID: mdl-38791265

ABSTRACT

Nucleic acids are not only static carriers of genetic information but also play vital roles in controlling cellular lifecycles through their fascinating structural diversity [...].

Subject(s)

Computational Biology , DNA , Nucleic Acid Conformation , RNA , RNA/chemistry , RNA/metabolism , DNA/chemistry , DNA/metabolism , Computational Biology/methods , Humans

6.

DNA Quadruplex Structure with a Unique Cation Dependency.

Gajarsky, Martin; Stadlbauer, Petr; Sponer, Jiri; Cucchiarini, Anne; Dobrovolna, Michaela; Brazda, Vaclav; Mergny, Jean-Louis; Trantirek, Lukas; Lenarcic Zivkovic, Martina.

Angew Chem Int Ed Engl ; 63(7): e202313226, 2024 02 12.

Article in English | MEDLINE | ID: mdl-38143239

ABSTRACT

DNA quadruplex structures provide an additional layer of regulatory control in genome maintenance and gene expression and are widely used in nanotechnology. We report the discovery of an unprecedented tetrastranded structure formed from a native G-rich DNA sequence originating from the telomeric region of Caenorhabditis elegans. The structure is defined by multiple properties that distinguish it from all other known DNA quadruplexes. Most notably, the formation of a stable so-called KNa-quadruplex (KNaQ) requires concurrent coordination of K+ and Na+ ions at two distinct binding sites. This structure provides novel insight into G-rich DNA folding under ionic conditions relevant to eukaryotic cell physiology and the structural evolution of telomeric DNA. It highlights the differences between the structural organization of human and nematode telomeric DNA, which should be considered when using C. elegans as a model in telomere biology, particularly in drug screening applications. Additionally, the absence/presence of KNaQ motifs in the host/parasite introduces an intriguing possibility of exploiting the KNaQ fold as a plausible antiparasitic drug target. The structure's unique shape and ion dependency and the possibility of controlling its folding by using low-molecular-weight ligands can be used for the design or discovery of novel recognition DNA elements and sensors.

Subject(s)

G-Quadruplexes , Animals , Humans , Caenorhabditis elegans/genetics , DNA/chemistry , Base Sequence , Cations , Telomere/genetics

7.

Dual mode of IFI16 binding to supercoiled and linear DNA: A closer insight.

Valková, Natália; Kratochvilová, Libuse; Martinková, Lucia; Brázda, Václav.

Biochem Biophys Res Commun ; 667: 89-94, 2023 07 30.

Article in English | MEDLINE | ID: mdl-37209567

ABSTRACT

IFI16 (Interferon inducible protein 16) is a DNA sensor responsible for innate immune response stimulation and a direct viral restriction by modulating gene expression and replication. Many IFI16-DNA binding properties were described - length-dependent and sequence-independent binding, oligomerization of IFI16 upon recognition, sliding on the DNA, and preference for supercoiled DNA. However, the question of the role of IFI16-DNA binding in distinct IFI16 functions remains unclear. Here we demonstrate two modes of IFI16 binding to DNA using atomic force microscopy and electrophoretic mobility shift assays. In our study, we show that IFI16 can bind to DNA in the form of globular complexes or oligomers depending on DNA topology and molar ratios. The stability of the complexes is different in higher salt concentrations. In addition, we observed no preferential binding with the HIN-A or HIN-B domains to supercoiled DNA, revealing the importance of the whole protein for this specificity. These results provide more profound insight into IFI16-DNA interactions and may be important in answering the question of self- and non-self-DNA binding by the IFI16 protein and potentially could shed light on the role of DNA binding in distinct IFI16 functions.

Subject(s)

DNA, Superhelical , DNA , DNA/metabolism , Phosphoproteins/metabolism , Immunity, Innate

8.

Letter to the Editor: Significant mutation enrichment in inverted repeat sites of new SARS-CoV-2 strains.

Bartas, Martin; Goswami, Pratik; Lexa, Matej; Cerven, Jirí; Volná, Adriana; Fojta, Miroslav; Brázda, Václav; Pecinka, Petr.

Brief Bioinform ; 22(5)2021 09 02.

Article in English | MEDLINE | ID: mdl-33837760

ABSTRACT

In a recently published paper, we have found that SARS-CoV-2 hot-spot mutations are significantly associated with inverted repeat loci and CG dinucleotides. However, fast-spreading strains with new mutations (so-called mink farm mutations, England mutations and Japan mutations) have been recently described. We used the new datasets to check the positioning of mutation sites in genomes of the new SARS-CoV-2 strains. Using an open-access Palindrome analyzer tool, we found mutations in these new strains to be significantly enriched in inverted repeat loci.

Subject(s)

Mutation , SARS-CoV-2/genetics , COVID-19/virology , Genome, Viral , Humans

9.

SARS-CoV-2 hot-spot mutations are significantly enriched within inverted repeats and CpG island loci.

Goswami, Pratik; Bartas, Martin; Lexa, Matej; Bohálová, Natália; Volná, Adriana; Cerven, Jirí; Cervenová, Veronika; Pecinka, Petr; Spunda, Vladimír; Fojta, Miroslav; Brázda, Václav.

Brief Bioinform ; 22(2): 1338-1345, 2021 03 22.

Article in English | MEDLINE | ID: mdl-33341900

ABSTRACT

SARS-CoV-2 is an intensively investigated virus from the order Nidovirales (Coronaviridae family) that causes COVID-19 disease in humans. Through enormous scientific effort, thousands of viral strains have been sequenced to date, thereby creating a strong background for deep bioinformatics studies of the SARS-CoV-2 genome. In this study, we inspected high-frequency mutations of SARS-CoV-2 and carried out systematic analyses of their overlay with inverted repeat (IR) loci and CpG islands. The main conclusion of our study is that SARS-CoV-2 hot-spot mutations are significantly enriched within both IRs and CpG island loci. This points to their role in genomic instability and may predict further mutational drive of the SARS-CoV-2 genome. Moreover, CpG islands are strongly enriched upstream from viral ORFs and thus could play important roles in transcription and the viral life cycle. We hypothesize that hypermethylation of these loci will decrease the transcription of viral ORFs and could therefore limit the progression of the disease.

Subject(s)

COVID-19/virology , CpG Islands , Mutation , SARS-CoV-2/genetics , DNA Methylation , Genome, Viral , Humans , Protein Binding

10.

Inverted repeats in the monkeypox virus genome are hot spots for mutation.

Dobrovolná, Michaela; Brázda, Václav; Warner, Emily F; Bidula, Stefan.

J Med Virol ; 95(1): e28322, 2023 01.

Article in English | MEDLINE | ID: mdl-36400742

ABSTRACT

The current monkeypox virus (MPXV) strain differs from the strain arising in 2018 by 50+ single nucleotide polymorphisms (SNPs) and is mutating much faster than expected. The cytidine deaminase apolipoprotein B messenger RNA editing enzyme, catalytic subunit B (APOBEC3) was hypothesized to be driving this increased mutation. APOBEC has recently been identified to preferentially mutate cruciform DNA secondary structures formed by inverted repeats (IRs). IRs were recently identified as hot spots for mutation in severe acute respiratory syndrome coronavirus 2, and we aimed to identify whether IRs were also hot spots for mutation within MPXV genomes. We found that MPXV genomes were replete with IR sequences. Of the 50+ SNPs identified in the 2022 outbreak strain, 63.9% of these were found to have arisen within IR regions in the 2018 reference strain (MT903344.1). Notably, IR sequences found in the 2018 reference strain were significantly lost over time, with an average of 32.5% of these sequences being conserved in the 2022 MPXV genomes. This evidence was highly indicative that mutations were arising within IRs. This data provides further support to the hypothesis that APOBEC may be driving MPXV mutation and highlights the necessity for greater surveillance of IRs of MPXV genomes to detect new mutations.

Subject(s)

COVID-19 , Monkeypox virus , Humans , Monkeypox virus/genetics , Mutation , SARS-CoV-2

11.

Impacts of Molecular Structure on Nucleic Acid-Protein Interactions.

Bowater, Richard P; Brázda, Václav.

Int J Mol Sci ; 24(1)2022 Dec 26.

Article in English | MEDLINE | ID: mdl-36613851

ABSTRACT

Interactions between nucleic acids and proteins are some of the most important interactions in biology because they are the cornerstones for fundamental biological processes, such as replication, transcription, and recombination [...].

Subject(s)

G-Quadruplexes , Nucleic Acids , DNA/chemistry , Molecular Structure , Nucleic Acid Conformation , Proteins/metabolism

12.

Interaction of Proteins with Inverted Repeats and Cruciform Structures in Nucleic Acids.

Bowater, Richard P; Bohálová, Natália; Brázda, Václav.

Int J Mol Sci ; 23(11)2022 May 31.

Article in English | MEDLINE | ID: mdl-35682854

ABSTRACT

Cruciforms occur when inverted repeat sequences in double-stranded DNA adopt intra-strand hairpins on opposing strands. Biophysical and molecular studies of these structures confirm their characterization as four-way junctions and have demonstrated that several factors influence their stability, including overall chromatin structure and DNA supercoiling. Here, we review our understanding of processes that influence the formation and stability of cruciforms in genomes, covering the range of sequences shown to have biological significance. It is challenging to accurately sequence repetitive DNA sequences, but recent advances in sequencing methods have deepened understanding about the amounts of inverted repeats in genomes from all forms of life. We highlight that, in the majority of genomes, inverted repeats are present in higher numbers than is expected from a random occurrence. It is, therefore, becoming clear that inverted repeats play important roles in regulating many aspects of DNA metabolism, including replication, gene expression, and recombination. Cruciforms are targets for many architectural and regulatory proteins, including topoisomerases, p53, Rif1, and others. Notably, some of these proteins can induce the formation of cruciform structures when they bind to DNA. Inverted repeat sequences also influence the evolution of genomes, and growing evidence highlights their significance in several human diseases, suggesting that the inverted repeat sequences and/or DNA cruciforms could be useful therapeutic targets in some cases.

Subject(s)

Nucleic Acids , DNA/genetics , DNA, Cruciform , Humans , Inverted Repeat Sequences , Nucleic Acid Conformation , Repetitive Sequences, Nucleic Acid/genetics

13.

Searching for New Z-DNA/Z-RNA Binding Proteins Based on Structural Similarity to Experimentally Validated Zα Domain.

Bartas, Martin; Slychko, Kristyna; Brázda, Václav; Cerven, Jirí; Beaudoin, Christopher A; Blundell, Tom L; Pecinka, Petr.

Int J Mol Sci ; 23(2)2022 Jan 11.

Article in English | MEDLINE | ID: mdl-35054954

ABSTRACT

Z-DNA and Z-RNA are functionally important left-handed structures of nucleic acids, which play a significant role in several molecular and biological processes including DNA replication, gene expression regulation and viral nucleic acid sensing. Most proteins that have been proven to interact with Z-DNA/Z-RNA contain the so-called Zα domain, which is structurally well conserved. To date, only eight proteins with Zα domain have been described within a few organisms (including human, mouse, Danio rerio, Trypanosoma brucei and some viruses). Therefore, this paper aimed to search for new Z-DNA/Z-RNA binding proteins in the complete PDB structures database and from the AlphaFold2 protein models. A structure-based similarity search found 14 proteins with highly similar Zα domain structure in experimentally-defined proteins and 185 proteins with a putative Zα domain using the AlphaFold2 models. Structure-based alignment and molecular docking confirmed high functional conservation of amino acids involved in Z-DNA/Z-RNA, suggesting that Z-DNA/Z-RNA recognition may play an important role in a variety of cellular processes.

Subject(s)

DNA, Z-Form/chemistry , DNA-Binding Proteins/chemistry , Models, Molecular , Protein Interaction Domains and Motifs , RNA-Binding Proteins/chemistry , RNA/chemistry , Amino Acid Sequence , Binding Sites , DNA, Z-Form/metabolism , DNA-Binding Proteins/metabolism , Molecular Docking Simulation , Molecular Dynamics Simulation , Nucleic Acid Conformation , Protein Binding , Protein Conformation , RNA/metabolism , RNA-Binding Proteins/metabolism , Structure-Activity Relationship

14.

The Newly Sequenced Genome of Pisum sativum Is Replete with Potential G-Quadruplex-Forming Sequences-Implications for Evolution and Biological Regulation.

Dobrovolná, Michaela; Bohálová, Natália; Peska, Vratislav; Wang, Jiawei; Luo, Yu; Bartas, Martin; Volná, Adriana; Mergny, Jean-Louis; Brázda, Václav.

Int J Mol Sci ; 23(15)2022 Jul 30.

Article in English | MEDLINE | ID: mdl-35955617

ABSTRACT

G-quadruplexes (G4s) have been long considered rare and physiologically unimportant in vitro curiosities, but recent methodological advances have proved their presence and functions in vivo. Moreover, in addition to their functional relevance in bacteria and animals, including humans, their importance has been recently demonstrated in evolutionarily distinct plant species. In this study, we analyzed the genome of Pisum sativum (garden pea, or the so-called green pea), a unique member of the Fabaceae family. Our results showed that this genome contained putative G4 sequences (PQSs). Interestingly, these PQSs were located nonrandomly in the nuclear genome. We also found PQSs in mitochondrial (mt) and chloroplast (cp) DNA, and we experimentally confirmed G4 formation for sequences found in these two organelles. The frequency of PQSs for nuclear DNA was 0.42 PQSs per thousand base pairs (kbp), in the same range as for cpDNA (0.53/kbp), but significantly lower than what was found for mitochondrial DNA (1.58/kbp). In the nuclear genome, PQSs were mainly associated with regulatory regions, including 5'UTRs, and upstream of the rRNA region. In contrast to genomic DNA, PQSs were located around RNA genes in cpDNA and mtDNA. Interestingly, PQSs were also associated with specific transposable elements such as TIR and LTR and around them, pointing to their role in their spreading in nuclear DNA. The nonrandom localization of PQSs uncovered their evolutionary and functional significance in the Pisum sativum genome.

Subject(s)

G-Quadruplexes , 5' Untranslated Regions , Animals , Base Sequence , DNA Transposable Elements/genetics , Genome, Plant , Humans , Pisum sativum/genetics

15.

G-quadruplexes in H1N1 influenza genomes.

Brázda, Václav; Porubiaková, Otília; Cantara, Alessio; Bohálová, Natália; Coufal, Jan; Bartas, Martin; Fojta, Miroslav; Mergny, Jean-Louis.

BMC Genomics ; 22(1): 77, 2021 Jan 23.

Article in English | MEDLINE | ID: mdl-33485319

ABSTRACT

BACKGROUND: Influenza viruses are dangerous pathogens. Seventy-Seven genomes of recently emerged genotype 4 reassortant Eurasian avian-like H1N1 virus (G4-EA-H1N1) are currently available. We investigated the presence and variation of potential G-quadruplex forming sequences (PQS), which can serve as targets for antiviral treatment. RESULTS: PQS were identified in all 77 genomes. The total number of PQS in G4-EA-H1N1 genomes was 571. Interestingly, the number of PQS per genome in individual close relative viruses varied from 4 to 12. PQS were not randomly distributed in the 8 segments of the G4-EA-H1N1 genome, the highest frequency of PQS being found in the NP segment (1.39 per 1000 nt), which is considered a potential target for antiviral therapy. In contrast, no PQS was found in the NS segment. Analyses of variability pointed the importance of some PQS; even if genome variation of influenza virus is extreme, the PQS with the highest G4Hunter score is the most conserved in all tested genomes. G-quadruplex formation in vitro was experimentally confirmed using spectroscopic methods. CONCLUSIONS: The results presented here hint several G-quadruplex-forming sequences in G4-EA-H1N1 genomes, that could provide good therapeutic targets.

Subject(s)

G-Quadruplexes , Influenza A Virus, H1N1 Subtype , Influenza, Human , Genome, Viral , Genotype , Humans , Influenza A Virus, H1N1 Subtype/genetics , Reassortant Viruses/genetics

16.

G4Killer web application: a tool to design G-quadruplex mutations.

Brazda, Vaclav; Kolomaznik, Jan; Mergny, Jean-Louis; Stastny, Jiri.

Bioinformatics ; 36(10): 3246-3247, 2020 05 01.

Article in English | MEDLINE | ID: mdl-31985802

ABSTRACT

MOTIVATION: G-quadruplexes (G4) are important regulatory non-B DNA structures with therapeutic potential. A tool for rational design of mutations leading to decreased propensity for G4 formation should be useful in studying G4 functions. Although tools exist for G4 prediction, no easily accessible tool for the rational design of G4 mutations has been available. RESULTS: We developed a web-based tool termed G4Killer that is based on the G4Hunter algorithm. This new tool is a platform-independent and user-friendly application to design mutations crippling G4 propensity in a parsimonious way (i.e., keeping the primary sequence as close as possible to the original one). The tool is integrated into our DNA analyzer server and allows for generating mutated DNA sequences having the desired lowered G4Hunter score with minimal mutation steps. AVAILABILITY AND IMPLEMENTATION: The G4Killer web tool can be accessed at: http://bioinformatics.ibp.cz. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

G-Quadruplexes , Algorithms , DNA , Mutation , Sequence Analysis, DNA

17.

Structures and stability of simple DNA repeats from bacteria.

Brazda, Vaclav; Fojta, Miroslav; Bowater, Richard P.

Biochem J ; 477(2): 325-339, 2020 01 31.

Article in English | MEDLINE | ID: mdl-31967649

ABSTRACT

DNA is a fundamentally important molecule for all cellular organisms due to its biological role as the store of hereditary, genetic information. On the one hand, genomic DNA is very stable, both in chemical and biological contexts, and this assists its genetic functions. On the other hand, it is also a dynamic molecule, and constant changes in its structure and sequence drive many biological processes, including adaptation and evolution of organisms. DNA genomes contain significant amounts of repetitive sequences, which have divergent functions in the complex processes that involve DNA, including replication, recombination, repair, and transcription. Through their involvement in these processes, repetitive DNA sequences influence the genetic instability and evolution of DNA molecules and they are located non-randomly in all genomes. Mechanisms that influence such genetic instability have been studied in many organisms, including within human genomes where they are linked to various human diseases. Here, we review our understanding of short, simple DNA repeats across a diverse range of bacteria, comparing the prevalence of repetitive DNA sequences in different genomes. We describe the range of DNA structures that have been observed in such repeats, focusing on their propensity to form local, non-B-DNA structures. Finally, we discuss the biological significance of such unusual DNA structures and relate this to studies where the impacts of DNA metabolism on genetic stability are linked to human diseases. Overall, we show that simple DNA repeats in bacteria serve as excellent and tractable experimental models for biochemical studies of their cellular functions and influences.

Subject(s)

Bacteria/genetics , DNA/genetics , Microsatellite Repeats/genetics , Repetitive Sequences, Nucleic Acid/genetics , DNA/ultrastructure , Genome, Bacterial/genetics , Genome, Human/genetics , Genomic Instability/genetics , Humans , Nucleic Acid Conformation

18.

Global analysis of inverted repeat sequences in human gene promoters reveals their non-random distribution and association with specific biological pathways.

Brázda, Václav; Bartas, Martin; Lýsek, Jirí; Coufal, Jan; Fojta, Miroslav.

Genomics ; 112(4): 2772-2777, 2020 07.

Article in English | MEDLINE | ID: mdl-32234431

ABSTRACT

Inverted repeats (IR) play important roles in specific DNA-dependent processes in simple prokaryotes to complex eukaryotes. They are recognized by a variety of proteins including restriction enzymes, helicases and transcription factors. We evaluate the presence and localization of IRs in all validated human promoter sequences within 1000â¯bp upstream and downstream of the transcription start site (TSS). The occurrence of 7â¯bp and longer IRs is located non-randomly in promoter regions, with enrichment within 200â¯bp upstream of the TSS. The highest frequency of IRs is just before TSS for repeats of 8â¯bp or longer. A comparison of promoters divided according to the occurrence of five individual promoter motifs shows unique location patterns of IRs. Principal component analyses and hierarchical clustering of IRs abundance demonstrated that they are depleted and/or not enriched in the promoters of stably expressed genes, but show significant enrichments for specific dynamically regulated biological pathways.

Subject(s)

Inverted Repeat Sequences , Promoter Regions, Genetic , Cluster Analysis , Humans , Principal Component Analysis , Transcription Initiation Site

19.

Divergent distributions of inverted repeats and G-quadruplex forming sequences in Saccharomyces cerevisiae.

Cutová, Michaela; Manta, Jacinta; Porubiaková, Otília; Kaura, Patrik; Stastný, Jirí; Jagelská, Eva B; Goswami, Pratik; Bartas, Martin; Brázda, Václav.

Genomics ; 112(2): 1897-1901, 2020 03.

Article in English | MEDLINE | ID: mdl-31706022

ABSTRACT

The importance of DNA structure in the regulation of basic cellular processes is an emerging field of research. Among local non-B DNA structures, inverted repeat (IR) sequences that form cruciforms and G-rich sequences that form G-quadruplexes (G4) are found in all prokaryotic and eukaryotic organisms and are targets for regulatory proteins. We analyzed IRs and G4 sequences in the genome of the most important biotechnology microorganism, S. cerevisiae. IR and G4-prone sequences are enriched in specific genomic locations and differ markedly between mitochondrial and nuclear DNA. While G4s are overrepresented in telomeres and regions surrounding tRNAs, IRs are most enriched in centromeres, rDNA, replication origins and surrounding tRNAs. Mitochondrial DNA is enriched in both IR and G4-prone sequences relative to the nuclear genome. This extensive analysis of local DNA structures adds to the emerging picture of their importance in genome maintenance, DNA replication and transcription of subsets of genes.

Subject(s)

DNA, Fungal/genetics , G-Quadruplexes , Inverted Repeat Sequences , Centromere/genetics , DNA, Fungal/chemistry , Genome, Fungal , RNA, Ribosomal/genetics , Saccharomyces cerevisiae , Telomere/genetics

20.

R-Loop Tracker: Web Access-Based Tool for R-Loop Detection and Analysis in Genomic DNA Sequences.

Brázda, Václav; Havlík, Jan; Kolomazník, Jan; Trenz, Oldrich; Stastný, Jirí.

Int J Mol Sci ; 22(23)2021 Nov 27.

Article in English | MEDLINE | ID: mdl-34884661

ABSTRACT

R-loops are common non-B nucleic acid structures formed by a three-stranded nucleic acid composed of an RNA-DNA hybrid and a displaced single-stranded DNA (ssDNA) loop. Because the aberrant R-loop formation leads to increased mutagenesis, hyper-recombination, rearrangements, and transcription-replication collisions, it is regarded as important in human diseases. Therefore, its prevalence and distribution in genomes are studied intensively. However, in silico tools for R-loop prediction are limited, and therefore, we have developed the R-loop tracker tool, which was implemented as a part of the DNA Analyser web server. This new tool is focused upon (1) prediction of R-loops in genomic DNA without length and sequence limitations; (2) integration of R-loop tracker results with other tools for nucleic acids analyses, including Genome Browser; (3) internal cross-evaluation of in silico results with experimental data, where available; (4) easy export and correlation analyses with other genome features and markers; and (5) enhanced visualization outputs. Our new R-loop tracker tool is freely accessible on the web pages of DNA Analyser tools, and its implementation on the web-based server allows effective analyses not only for DNA segments but also for full chromosomes and genomes.

Subject(s)

Algorithms , DNA/chemistry , DNA/genetics , Genomic Instability , Genomics/methods , Internet/statistics & numerical data , R-Loop Structures , Humans , Software

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL