Search | VHL Regional Portal

1.

Quercetin Exhibits Preferential Binding Interaction by Selectively Targeting HRAS1 I-Motif DNA-Forming Promoter Sequences.

Bag, Sagar; Ghosal, Souvik; Mukherjee, Moupriya; Pramanik, Goutam; Bhowmik, Sudipta.

Langmuir ; 40(19): 10157-10170, 2024 May 14.

Article in English | MEDLINE | ID: mdl-38700902

ABSTRACT

I-Motif (iM) DNA structures are among the most significant noncanonical nucleic acid configurations. iM-forming DNA sequences are found in an array of vital genomic locations and are particularly frequent in the promoter islands of various oncogenes. iM DNA is thus a crucial candidate target for anticancer medicines, and binding interactions between iM DNA and small molecular ligands, such as flavonoids, are critically important. Extensive sets of spectroscopic strategies and thermodynamic analysis were utilized in the present investigation to characterize the favorable interaction of quercetin (Que), a dietary flavonoid that has various health-promoting characteristics, including anticancer properties, with noncanonical iM DNA structures. Spectroscopic studies and thermal analysis revealed that Que interacts preferentially with HRAS1 iM DNA compared with VEGF, BCL2 iM, and duplex DNA. Que, therefore, emerged as a suitable natural-product-oriented antagonist for targeting HRAS1 iM DNA. The innovative spectroscopic as well as mechanical features of Que and its specific affinity for HRAS1 iM may be useful for therapeutic applications and provide crucial insights for the design of compounds with remarkable medicinal properties.

Subject(s)

DNA , Promoter Regions, Genetic , Proto-Oncogene Proteins p21(ras) , Quercetin , Quercetin/chemistry , Quercetin/pharmacology , Quercetin/metabolism , DNA/chemistry , DNA/metabolism , Proto-Oncogene Proteins p21(ras)/genetics , Proto-Oncogene Proteins p21(ras)/chemistry , Proto-Oncogene Proteins p21(ras)/antagonists & inhibitors , Proto-Oncogene Proteins p21(ras)/metabolism , Thermodynamics , Humans , Nucleotide Motifs , Binding Sites

2.

Resolving the intricate binding of neomycin B to multiple binding motifs of a neomycin-sensing riboswitch aptamer by native top-down mass spectrometry and NMR spectroscopy.

Heel, Sarah Viola; Juen, Fabian; Bartosik, Karolina; Micura, Ronald; Kreutz, Christoph; Breuker, Kathrin.

Nucleic Acids Res ; 52(8): 4691-4701, 2024 May 08.

Article in English | MEDLINE | ID: mdl-38567725

ABSTRACT

Understanding small molecule binding to RNA can be complicated by an intricate interplay between binding stoichiometry, multiple binding motifs, different occupancies of different binding motifs, and changes in the structure of the RNA under study. Here, we use native top-down mass spectrometry (MS) and nuclear magnetic resonance (NMR) spectroscopy to experimentally resolve these factors and gain a better understanding of the interactions between neomycin B and the 40 nt aptamer domain of a neomycin-sensing riboswitch engineered in yeast. Data from collisionally activated dissociation of the 1:1, 1:2 and 1:3 RNA-neomycin B complexes identified a third binding motif C of the riboswitch in addition to the two motifs A and B found in our previous study, and provided occupancies of the different binding motifs for each complex stoichiometry. Binding of a fourth neomycin B molecule was unspecific according to both MS and NMR data. Intriguingly, all major changes in the aptamer structure can be induced by the binding of the first neomycin B molecule regardless of whether it binds to motif A or B as evidenced by stoichiometry-resolved MS data together with titration data from 1H NMR spectroscopy in the imino proton region. Specific binding of the second and third neomycin B molecules further stabilizes the riboswitch aptamer, thereby allowing for a gradual response to increasing concentrations of neomycin B, which likely leads to a fine-tuning of the cellular regulatory mechanism.

Subject(s)

Aptamers, Nucleotide , Framycetin , Nucleic Acid Conformation , Riboswitch , Aptamers, Nucleotide/chemistry , Aptamers, Nucleotide/metabolism , Aptamers, Nucleotide/genetics , Framycetin/chemistry , Framycetin/metabolism , Binding Sites , Magnetic Resonance Spectroscopy/methods , Neomycin/chemistry , Mass Spectrometry/methods , Nucleotide Motifs , Nuclear Magnetic Resonance, Biomolecular

3.

Discovering DNA shape motifs with multiple DNA shape features: generalization, methods, and validation.

Chen, Nanjun; Yu, Jixiang; Liu, Zhe; Meng, Lingkuan; Li, Xiangtao; Wong, Ka-Chun.

Nucleic Acids Res ; 52(8): 4137-4150, 2024 May 08.

Article in English | MEDLINE | ID: mdl-38572749

ABSTRACT

DNA motifs are crucial patterns in gene regulation. DNA-binding proteins (DBPs), including transcription factors, can bind to specific DNA motifs to regulate gene expression and other cellular activities. Past studies suggest that DNA shape features could be subtly involved in DNA-DBP interactions. Therefore, shape motif annotations based on intrinsic DNA topology can deepen the understanding of DNA-DBP binding. Nevertheless, high-throughput tools for DNA shape motif discovery that incorporate multiple features together remain insufficient. To address this gap, we propose a series of methods to discover non-redundant DNA shape motifs, generalized to multiple motifs in multiple shape features. Specifically, an existing Gibbs sampling method is generalized to multiple DNA motif discovery with multiple shape features. Meanwhile, an expectation-maximization (EM) method and a hybrid method coupling EM with Gibbs sampling are proposed and developed with promising performance, convergence capability, and efficiency. The discovered DNA shape motif instances reveal insights into low-signal ChIP-seq peak summits, complementing existing work on sequence motif discovery. Additionally, our modelling captures the potential interplay across multiple DNA shape features. We provide a valuable platform of tools for DNA shape motif discovery. An R package is built for open accessibility and long-lasting impact: https://zenodo.org/doi/10.5281/zenodo.10558980.

Subject(s)

DNA , Nucleotide Motifs , DNA/chemistry , DNA/genetics , DNA/metabolism , DNA-Binding Proteins/metabolism , DNA-Binding Proteins/chemistry , DNA-Binding Proteins/genetics , Algorithms , Nucleic Acid Conformation , Chromatin Immunoprecipitation Sequencing/methods , Binding Sites , Transcription Factors/metabolism , Transcription Factors/genetics , Transcription Factors/chemistry , Humans , Protein Binding

4.

Why Does the E1219V Mutation Expand T-Rich PAM Recognition in Cas9 from Streptococcus pyogenes?

Bhattacharya, Shreya; Satpati, Priyadarshi.

J Chem Inf Model ; 64(8): 3237-3247, 2024 Apr 22.

Article in English | MEDLINE | ID: mdl-38600752

ABSTRACT

Popular RNA-guided DNA endonuclease Cas9 from Streptococcus pyogenes (SpCas9) recognizes the canonical 5'-NGG-3' protospacer adjacent motif (PAM) and triggers double-stranded DNA cleavage activity. Mutations in SpCas9 were demonstrated to expand the PAM readability and hold promise for therapeutic and genome editing applications. However, the energetics of the PAM recognition and its relation to the atomic structure remain unknown. Using the X-ray structure (precatalytic SpCas9:sgRNA:dsDNA) as a template, we calculated the change in the PAM binding affinity in response to SpCas9 mutations using computer simulations. The E1219V mutation in SpCas9 fine-tunes the water accessibility in the PAM binding pocket and promotes new interactions in the SpCas9:noncanonical T-rich PAM, thus weakening the PAM stringency. The nucleotide-specific interaction of two arginine residues (i.e., R1333 and R1335 of SpCas9) ensured stringent 5'-NGG-3' PAM recognition. R1335A substitution (SpCas9R1335A) completely disrupts the direct interaction between SpCas9 and PAM sequences (canonical or noncanonical), accounting for the loss of editing activity. Interestingly, the double mutant (SpCas9R1335A,E1219V) boosts DNA binding affinity by favoring protein:PAM electrostatic contact in a desolvated pocket. The underlying thermodynamics explain the varied DNA cleavage activity of SpCas9 variants. A direct link between the energetics, structures, and activity is highlighted, which can aid in the rational design of improved SpCas9-based genome editing tools.
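The canonical 5'-NGG-3' PAM named in this abstract can be illustrated with a short scan. This is a minimal sketch, assuming a plain DNA string read 5' to 3' on one strand; the function name `find_ngg_pams` is hypothetical, and the code does not model the binding energetics or the mutant variants that the study computes.

```python
def find_ngg_pams(seq: str) -> list[int]:
    """Return 0-based start positions of every 5'-NGG-3' site on the given strand."""
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2):
        # N matches any of the four bases; the next two positions must both be G.
        if seq[i] in "ACGT" and seq[i + 1:i + 3] == "GG":
            hits.append(i)
    return hits


if __name__ == "__main__":
    print(find_ngg_pams("ATGGCCTGGGA"))
```

Overlapping sites are reported separately, so a run of three Gs preceded by another base yields two adjacent hits.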

Subject(s)

CRISPR-Associated Protein 9 , Mutation , Streptococcus pyogenes , Streptococcus pyogenes/enzymology , CRISPR-Associated Protein 9/metabolism , CRISPR-Associated Protein 9/chemistry , CRISPR-Associated Protein 9/genetics , Molecular Dynamics Simulation , Nucleotide Motifs , DNA/metabolism , DNA/chemistry , Protein Conformation , Models, Molecular , Thermodynamics , Protein Binding

5.

Uncovering uncharacterized binding of transcription factors from ATAC-seq footprinting data.

Schultheis, Hendrik; Bentsen, Mette; Heger, Vanessa; Looso, Mario.

Sci Rep ; 14(1): 9275, 2024 Apr 23.

Article in English | MEDLINE | ID: mdl-38654130

ABSTRACT

Transcription factors (TFs) are crucial epigenetic regulators, which enable cells to dynamically adjust gene expression in response to environmental signals. Computational procedures like digital genomic footprinting on chromatin accessibility assays such as ATAC-seq can be used to identify bound TFs on a genome-wide scale. This method utilizes short regions of low accessibility signals due to steric hindrance of DNA-bound proteins, called footprints (FPs), which are combined with motif databases for TF identification. However, while over 1600 TFs have been described in the human genome, only ~700 of these have a known binding motif. Thus, a substantial number of FPs without overlap to a known DNA motif are normally discarded from FP analysis. In addition, the FP method is restricted to organisms with a substantial number of known TF motifs. Here we present DENIS (DE Novo motIf diScovery), a framework to generate and systematically investigate the potential of de novo TF motif discovery from FPs. DENIS includes functionality (1) to isolate FPs without binding motifs, (2) to perform de novo motif generation and (3) to characterize novel motifs. Here, we show that the framework rediscovers artificially removed TF motifs, quantifies de novo motif usage in an example dataset of early embryonic development, and is able to analyze and uncover TF activity in organisms lacking canonical motifs. The latter task is exemplified by an investigation of a scATAC-seq dataset in zebrafish which covers different cell types during hematopoiesis.
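Step (1) of the abstract, isolating footprints that overlap no known motif, amounts to an interval-overlap filter. The following is a minimal sketch of that idea, assuming footprints and motif sites are given as `(chromosome, start, end)` tuples with half-open coordinates; the function names are hypothetical and this is not the DENIS implementation.

```python
def overlaps(a_start, a_end, b_start, b_end):
    """True if half-open intervals [a_start, a_end) and [b_start, b_end) share a base."""
    return a_start < b_end and b_start < a_end


def footprints_without_motif(footprints, motif_sites):
    """Keep footprints that overlap no known motif site on the same chromosome."""
    unexplained = []
    for chrom, start, end in footprints:
        hit = any(
            m_chrom == chrom and overlaps(start, end, m_start, m_end)
            for m_chrom, m_start, m_end in motif_sites
        )
        if not hit:
            unexplained.append((chrom, start, end))
    return unexplained


if __name__ == "__main__":
    fps = [("chr1", 100, 120), ("chr1", 300, 320), ("chr2", 100, 120)]
    motifs = [("chr1", 110, 125)]
    print(footprints_without_motif(fps, motifs))
```

The nested scan is quadratic in the worst case; for genome-scale inputs a sorted sweep or an interval tree would likely be a better fit.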

Subject(s)

Chromatin Immunoprecipitation Sequencing , Nucleotide Motifs , Transcription Factors , Zebrafish , Transcription Factors/metabolism , Transcription Factors/genetics , Animals , Zebrafish/genetics , Zebrafish/metabolism , Chromatin Immunoprecipitation Sequencing/methods , Humans , Binding Sites , Protein Binding , DNA Footprinting/methods , Computational Biology/methods , Chromatin/metabolism , Chromatin/genetics

6.

BIOMAPP::CHIP: large-scale motif analysis.

Garbelini, Jader M Caldonazzo; Sanches, Danilo S; Pozo, Aurora T Ramirez.

BMC Bioinformatics ; 25(1): 128, 2024 Mar 26.

Article in English | MEDLINE | ID: mdl-38528492

ABSTRACT

BACKGROUND: Discovering biological motifs plays a fundamental role in understanding regulatory mechanisms. Computationally, they can be efficiently represented as k-mers, making the counting of these elements a critical aspect for ensuring not only the accuracy but also the efficiency of the analytical process. This is particularly useful in scenarios involving large data volumes, such as those generated by the ChIP-seq protocol. Against this backdrop, we introduce BIOMAPP::CHIP, a tool specifically designed to optimize the discovery of biological motifs in large data volumes. RESULTS: We conducted a comprehensive set of comparative tests with state-of-the-art algorithms. Our analyses revealed that BIOMAPP::CHIP outperforms existing approaches in various metrics, excelling both in terms of performance and accuracy. The tests demonstrated a higher detection rate of significant motifs and also greater agility in the execution of the algorithm. Furthermore, the SMT component played a vital role in the system's efficiency, proving to be both agile and accurate in k-mer counting, which in turn improved the overall efficacy of our tool. CONCLUSION: BIOMAPP::CHIP represents a real advance in the discovery of biological motifs, particularly in large data volume scenarios, offering a relevant alternative for the analysis of ChIP-seq data, and has the potential to boost future research in the field. This software can be found at the following address: https://github.com/jadermcg/biomapp-chip.
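The k-mer counting that this abstract treats as the critical step can be shown in its simplest form. The sketch below uses a plain in-memory counter from the Python standard library; it is an illustration only and does not reproduce the SMT component of BIOMAPP::CHIP, and the function name `count_kmers` is hypothetical.

```python
from collections import Counter


def count_kmers(sequences, k):
    """Count every overlapping substring of length k across all sequences."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # A sequence shorter than k contributes nothing (the range is empty).
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


if __name__ == "__main__":
    print(count_kmers(["ACGTACGT"], 4).most_common(2))
```

A sequence of length n contributes n - k + 1 windows, so memory grows with the number of distinct k-mers rather than with the input size.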

Subject(s)

Algorithms , Software , Sequence Analysis, DNA/methods , Chromatin Immunoprecipitation/methods , Binding Sites , Nucleotide Motifs

7.

Potentiometric titrations to study ligand interactions with DNA i-motifs.

Boissieras, Joseph; Granzhan, Anton.

Methods Enzymol ; 695: 233-254, 2024.

Article in English | MEDLINE | ID: mdl-38521587

ABSTRACT

i-Motifs are non-canonical secondary structures of DNA formed by mutual intercalation of hemi-protonated cytosine-cytosine base pairs, most typically in slightly acidic conditions (pH<7.0). These structures are well-studied in vitro and have recently been suggested to exist in cells. Despite nearly a decade of active research, the quest for small-molecule ligands that could selectively bind to and stabilize i-motifs continues, and no reference, bona fide i-motif ligand is currently available. This is, at least in part, due to the lack of robust methods to assess the interaction of ligands with i-motifs, since many techniques well-established for studies of other secondary structures (such as CD-, UV-, and FRET-melting) may generate artifacts when applied to i-motifs. Here, we describe an implementation of automated, potentiometric (pH) titrations as a robust isothermal method to assess the impact of ligands or cosolutes on thermodynamic stability of i-motifs. This approach is validated through the use of a cosolute previously known to stabilize i-motifs (PEG2000) and three small-molecule ligands that are able to stabilize, destabilize, or have no effect on the stability of i-motifs, respectively.

Subject(s)

Cytosine , DNA , Ligands , Nucleotide Motifs , Base Pairing , DNA/chemistry , Cytosine/chemistry

8.

DNA methylation signatures of early-life adversity are exposure-dependent in wild baboons.

Anderson, Jordan A; Lin, Dana; Lea, Amanda J; Johnston, Rachel A; Voyles, Tawni; Akinyi, Mercy Y; Archie, Elizabeth A; Alberts, Susan C; Tung, Jenny.

Proc Natl Acad Sci U S A ; 121(11): e2309469121, 2024 Mar 12.

Article in English | MEDLINE | ID: mdl-38442181

ABSTRACT

The early-life environment can profoundly shape the trajectory of an animal's life, even years or decades later. One mechanism proposed to contribute to these early-life effects is DNA methylation. However, the frequency and functional importance of DNA methylation in shaping early-life effects on adult outcomes is poorly understood, especially in natural populations. Here, we integrate prospectively collected data on fitness-associated variation in the early environment with DNA methylation estimates at 477,270 CpG sites in 256 wild baboons. We find highly heterogeneous relationships between the early-life environment and DNA methylation in adulthood: aspects of the environment linked to resource limitation (e.g., low-quality habitat, early-life drought) are associated with many more CpG sites than other types of environmental stressors (e.g., low maternal social status). Sites associated with early resource limitation are enriched in gene bodies and putative enhancers, suggesting they are functionally relevant. Indeed, by deploying a baboon-specific, massively parallel reporter assay, we show that a subset of windows containing these sites are capable of regulatory activity, and that, for 88% of early drought-associated sites in these regulatory windows, enhancer activity is DNA methylation-dependent. Together, our results support the idea that DNA methylation patterns contain a persistent signature of the early-life environment. However, they also indicate that not all environmental exposures leave an equivalent mark and suggest that socioenvironmental variation at the time of sampling is more likely to be functionally important. Thus, multiple mechanisms must converge to explain early-life effects on fitness-related traits.

Subject(s)

Adverse Childhood Experiences , DNA Methylation , Animals , Nucleotide Motifs , Biological Assay , Papio/genetics

9.

Structural and dynamical aspect of DNA motif sequence specific binding of AP-1 transcription factor.

Patra, Piya; Gao, Yi Qin.

J Chem Phys ; 160(11), 2024 Mar 21.

Article in English | MEDLINE | ID: mdl-38506297

ABSTRACT

Activator protein-1 (AP-1) comprises one of the largest and most evolutionarily conserved families of ubiquitous eukaryotic transcription factors that act as pioneer factors. The diversity in DNA binding interactions of AP-1 through a conserved basic-zipper (bZIP) domain calls for an in-depth understanding of how AP-1 achieves its DNA binding selectivity and consequently gene regulation specificity. Here, we address the structural and dynamical aspects of the DNA target recognition process of AP-1 using microsecond-long atomistic simulations based on the structure of the human AP-1 FosB/JunD bZIP-DNA complex. Our results show the unique role of DNA shape features in selective base specific interactions, characteristic ion population, and solvation properties of DNA grooves to form the motif sequence specific AP-1-DNA complex. The TpG step at the two termini of the AP-1 site plays an important role in the structural adjustment of DNA by modifying the helical twist in the AP-1 bound state. We addressed the role of intrinsic motion of the bZIP domain, in terms of opening and closing gripper motions of DNA binding helices, in target site recognition and binding of AP-1 factors. Our observations suggest that binding to the cognate motif in DNA is mainly accompanied by the precise adjustment of the closing gripper motion of DNA binding helices of the bZIP domain.

Subject(s)

DNA , Transcription Factor AP-1 , Humans , Transcription Factor AP-1/metabolism , Nucleotide Motifs , DNA/chemistry , Binding Sites , Protein Binding

10.

Dynamic control of DNA condensation.

Agarwal, Siddharth; Osmanovic, Dino; Dizani, Mahdi; Klocke, Melissa A; Franco, Elisa.

Nat Commun ; 15(1): 1915, 2024 Mar 01.

Article in English | MEDLINE | ID: mdl-38429336

ABSTRACT

Artificial biomolecular condensates are emerging as a versatile approach to organize molecular targets and reactions without the need for lipid membranes. Here we ask whether the temporal response of artificial condensates can be controlled via designed chemical reactions. We address this general question by considering a model problem in which a phase separating component participates in reactions that dynamically activate or deactivate its ability to self-attract. Through a theoretical model we illustrate the transient and equilibrium effects of reactions, linking condensate response and reaction parameters. We experimentally realize our model problem using star-shaped DNA motifs known as nanostars to generate condensates, and we take advantage of strand invasion and displacement reactions to kinetically control the capacity of nanostars to interact. We demonstrate reversible dissolution and growth of DNA condensates in the presence of specific DNA inputs, and we characterize the role of toehold domains, nanostar size, and nanostar valency. Our results will support the development of artificial biomolecular condensates that can adapt to environmental changes with prescribed temporal dynamics.

Subject(s)

Biomolecular Condensates , DNA Packaging , DNA Replication , Gene Conversion , Nucleotide Motifs

11.

Prediction of DNA i-motifs via machine learning.

Yang, Bibo; Guneri, Dilek; Yu, Haopeng; Wright, Elisé P; Chen, Wenqian; Waller, Zoë A E; Ding, Yiliang.

Nucleic Acids Res ; 52(5): 2188-2197, 2024 Mar 21.

Article in English | MEDLINE | ID: mdl-38364855

ABSTRACT

i-Motifs (iMs) are secondary structures formed in cytosine-rich DNA sequences and are involved in multiple functions in the genome. Although putative iM-forming sequences are widely distributed in the human genome, the folding status and strength of putative iMs vary dramatically. Much previous research on iMs has focused on assessing the iM folding properties using biophysical experiments. However, there are no dedicated computational tools for predicting the folding status and strength of iM structures. Here, we introduce a machine learning pipeline, iM-Seeker, to predict both folding status and structural stability of DNA iMs. The programme iM-Seeker incorporates a Balanced Random Forest classifier trained on genome-wide iMab antibody-based CUT&Tag sequencing data to predict the folding status and an Extreme Gradient Boosting regressor to estimate the folding strength according to both literature biophysical data and our in-house biophysical experiments. iM-Seeker predicts DNA iM folding status with a classification accuracy of 81% and estimates the folding strength with a coefficient of determination (R²) of 0.642 on the test set. Model interpretation confirms that the nucleotide composition of the C-rich sequence significantly affects iM stability, with a positive correlation with sequences containing cytosine and thymine and a negative correlation with guanine and adenine.
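The coefficient of determination reported for the folding-strength regressor follows the standard definition, one minus the ratio of residual to total sum of squares. The sketch below implements that definition in plain Python; the function name and the example values are illustrative and are not taken from the iM-Seeker study.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    # Residual sum of squares: squared error of the predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: squared deviation of the observations from their mean.
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    # Raises ZeroDivisionError if all observed values are identical.
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    observed = [1.0, 2.0, 3.0, 4.0]
    print(r_squared(observed, observed))
    print(r_squared(observed, [2.5, 2.5, 2.5, 2.5]))
```

A perfect predictor scores 1, and a predictor that always returns the mean of the observations scores 0, which gives a sense of scale for the reported value.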

Subject(s)

DNA , Machine Learning , Nucleotide Motifs , Humans , Base Sequence , Cytosine/chemistry , DNA/chemistry , DNA/genetics

12.

DeepLocRNA: an interpretable deep learning model for predicting RNA subcellular localization with domain-specific transfer-learning.

Wang, Jun; Horlacher, Marc; Cheng, Lixin; Winther, Ole.

Bioinformatics ; 40(2), 2024 Feb 01.

Article in English | MEDLINE | ID: mdl-38317052

ABSTRACT

MOTIVATION: Accurate prediction of RNA subcellular localization plays an important role in understanding cellular processes and functions. Although post-transcriptional processes are governed by trans-acting RNA binding proteins (RBPs) through interaction with cis-regulatory RNA motifs, current methods do not incorporate RBP-binding information. RESULTS: In this article, we propose DeepLocRNA, an interpretable deep-learning model that leverages a pre-trained multi-task RBP-binding prediction model to predict the subcellular localization of RNA molecules via fine-tuning. We constructed DeepLocRNA using a comprehensive dataset with variant RNA types and evaluated it on the held-out dataset. Our model achieved state-of-the-art performance in predicting RNA subcellular localization in mRNA and miRNA. It has also demonstrated great generalization capabilities, performing well on both human and mouse RNA. Additionally, a motif analysis was performed to enhance the interpretability of the model, highlighting signal factors that contributed to the predictions. The proposed model provides general and powerful prediction abilities for different RNA types and species, offering valuable insights into the localization patterns of RNA molecules and contributing to our understanding of cellular processes at the molecular level. A user-friendly web server is available at: https://biolib.com/KU/DeepLocRNA/.

Subject(s)

Deep Learning , Animals , Humans , Mice , RNA/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Nucleotide Motifs , RNA-Binding Proteins/metabolism , Computational Biology/methods

13.

An RNA Motif That Enables Optozyme Control and Light-Dependent Gene Expression in Bacteria and Mammalian Cells.

Pietruschka, Georg; Ranzani, Américo T; Weber, Anna; Patwari, Tejal; Pilsl, Sebastian; Renzl, Christian; Otte, David M; Pyka, Daniel; Möglich, Andreas; Mayer, Günter.

Adv Sci (Weinh) ; 11(12): e2304519, 2024 Mar.

Article in English | MEDLINE | ID: mdl-38227373

ABSTRACT

The regulation of gene expression by light enables the versatile, spatiotemporal manipulation of biological function in bacterial and mammalian cells. Optoribogenetics extends this principle by molecular RNA devices acting on the RNA level whose functions are controlled by the photoinduced interaction of a light-oxygen-voltage photoreceptor with cognate RNA aptamers. Here light-responsive ribozymes, denoted optozymes, which undergo light-dependent self-cleavage and thereby control gene expression are described. This approach transcends existing aptamer-ribozyme chimera strategies that predominantly rely on aptamers binding to small molecules. The optozyme method thus stands to enable the graded, non-invasive, and spatiotemporally resolved control of gene expression. Optozymes are found efficient in bacteria and mammalian cells and usher in hitherto inaccessible optoribogenetic modalities with broad applicability in synthetic and systems biology.

Subject(s)

RNA, Catalytic , RNA , Animals , Nucleotide Motifs , RNA/genetics , RNA, Catalytic/chemistry , RNA, Catalytic/genetics , RNA, Catalytic/metabolism , Bacteria/metabolism , Gene Expression , Mammals/metabolism

14.

Concurrent prediction of RNA secondary structures with pseudoknots and local 3D motifs in an integer programming framework.

Loyer, Gabriel; Reinharz, Vladimir.

Bioinformatics ; 40(2), 2024 Feb 01.

Article in English | MEDLINE | ID: mdl-38230755

ABSTRACT

MOTIVATION: The prediction of RNA structure canonical base pairs from a single sequence, especially pseudoknotted ones, remains challenging in thermodynamic models that approximate the energy of the local 3D motifs joining canonical stems. It has become more and more apparent in recent years that the structural motifs in the loops, composed of noncanonical interactions, are essential for the final shape of the molecule enabling its multiple functions. Our capacity to predict accurate 3D structures is also limited when it comes to the organization of the large intricate network of interactions that form inside those loops. RESULTS: We previously developed the integer programming framework RNA Motifs over Integer Programming (RNAMoIP) to reconcile RNA secondary structure and local 3D motif information available in databases. We further develop our model to now simultaneously predict the canonical base pairs (with pseudoknots) from base pair probability matrices with or without alignment. We benchmarked our new method over all nonredundant RNAs below 150 nucleotides. We show that the joint prediction of canonical base pair structure and local conserved motifs (i) improves the ratio of well-predicted interactions in the secondary structure, (ii) predicts well canonical and Wobble pairs at the location where motifs are inserted, (iii) is greatly improved with evolutionary information, and (iv) noncanonical motifs at kink-turn locations. AVAILABILITY AND IMPLEMENTATION: The source code of the framework is available at https://gitlab.info.uqam.ca/cbe/RNAMoIP and an interactive web server at https://rnamoip.cbe.uqam.ca/.

Subject(s)

Algorithms , RNA , RNA/chemistry , Nucleic Acid Conformation , Software , Nucleotide Motifs

15.

PERFUMES: pipeline to extract RNA functional motifs and exposed structures.

Chol, Arnaud; Sarrazin-Gendron, Roman; Lécuyer, Éric; Blanchette, Mathieu; Waldispühl, Jérôme.

Bioinformatics ; 40(2), 2024 Feb 01.

Article in English | MEDLINE | ID: mdl-38291894

ABSTRACT

MOTIVATION: Up to 75% of the human genome encodes RNAs. The function of many non-coding RNAs relies on their ability to fold into 3D structures. Specifically, nucleotides inside secondary structure loops form non-canonical base pairs that help stabilize complex local 3D structures. These RNA 3D motifs can promote specific interactions with other molecules or serve as catalytic sites. RESULTS: We introduce PERFUMES, a computational pipeline to identify 3D motifs that can be associated with observable features. Given a set of RNA sequences with associated binary experimental measurements, PERFUMES searches for RNA 3D motifs using BayesPairing2 and extracts those that are over-represented in the set of positive sequences. It also conducts a thermodynamics analysis of the structural context that can support the interpretation of the predictions. We illustrate PERFUMES' usage on the SNRPA protein binding site, for which the tool retrieved both previously known binder motifs and new ones. AVAILABILITY AND IMPLEMENTATION: PERFUMES is an open-source Python package (https://jwgitlab.cs.mcgill.ca/arnaud_chol/perfumes).
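The abstract says PERFUMES extracts motifs that are over-represented among positive sequences but does not name the statistic. One common way to quantify such enrichment is a one-sided hypergeometric test. The sketch below uses that test as an assumption, not as the method PERFUMES applies, and the function and parameter names are hypothetical.

```python
from math import comb


def enrichment_p_value(k, n, K, N):
    """P(X >= k) for X ~ Hypergeometric(N, K, n).

    N: total sequences, K: positive sequences,
    n: sequences containing the motif, k: positives among those n.
    """
    upper = min(n, K)
    total = comb(N, n)
    # Sum the probability of seeing k or more positives among the n motif carriers.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


if __name__ == "__main__":
    # 10 sequences, 5 positive; the motif occurs in 5 sequences, all of them positive.
    print(enrichment_p_value(5, 5, 5, 10))
```

A small p-value indicates that the motif occurs in positive sequences more often than random assignment would explain; correcting for the number of motifs tested would be needed in practice.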

Subject(s)

Perfume , Humans , Nucleic Acid Conformation , Nucleotide Motifs , Base Pairing , RNA/chemistry

16.

Less-is-more: selecting transcription factor binding regions informative for motif inference.

Xu, Jinrui; Gao, Jiahao; Ni, Pengyu; Gerstein, Mark.

Nucleic Acids Res ; 52(4): e20, 2024 Feb 28.

Article in English | MEDLINE | ID: mdl-38214231

ABSTRACT

Numerous statistical methods have emerged for inferring DNA motifs for transcription factors (TFs) from genomic regions. However, the process of selecting informative regions for motif inference remains understudied. Current approaches select regions with strong ChIP-seq signal for a given TF, assuming that such strong signal primarily results from specific interactions between the TF and its motif. Additionally, these selection approaches do not account for non-target motifs, i.e. motifs of other TFs; they presume the occurrence of these non-target motifs infrequent compared to that of the target motif, and thus assume these have minimal interference with the identification of the target. Leveraging extensive ChIP-seq datasets, we introduced the concept of TF signal 'crowdedness', referred to as C-score, for each genomic region. The C-score helps in highlighting TF signals arising from non-specific interactions. Moreover, by considering the C-score (and adjusting for the length of genomic regions), we can effectively mitigate interference of non-target motifs. Using these tools, we find that in many instances, strong ChIP-seq signal stems mainly from non-specific interactions, and the occurrence of non-target motifs significantly impacts the accurate inference of the target motif. Prioritizing genomic regions with reduced crowdedness and short length markedly improves motif inference. This 'less-is-more' effect suggests that ChIP-seq region selection warrants more attention.

Subject(s)

Genomics , Nucleotide Motifs , Transcription Factors , Binding Sites , Chromatin Immunoprecipitation , Protein Binding , Transcription Factors/genetics , Transcription Factors/metabolism

17.

Specific Circular RNA Signature of Endothelial Cells: Potential Implications in Vascular Pathophysiology.

Diallo, Leïla Halidou; Mariette, Jérôme; Laugero, Nathalie; Touriol, Christian; Morfoisse, Florent; Prats, Anne-Catherine; Garmy-Susini, Barbara; Lacazette, Eric.

Int J Mol Sci ; 25(1), 2024 Jan 04.

Article in English | MEDLINE | ID: mdl-38203852

ABSTRACT

Circular RNAs (circRNAs) are a recently characterized family of gene transcripts forming a covalently closed loop of single-stranded RNA. The extent of their potential for fine-tuning gene expression is still being discovered. Several studies have implicated certain circular RNAs in pathophysiological processes within vascular endothelial cells and cancer cells independently. However, to date, no comparative study of circular RNA expression in different types of endothelial cells has been performed and analysed through the lens of their central role in vascular physiology and pathology. In this work, we analysed publicly available and original RNA sequencing datasets from arterial, venous, and lymphatic endothelial cells to identify common and distinct circRNA expression profiles. We identified 4713 distinct circRNAs in the compared endothelial cell types, 95% of which originated from exons. Interestingly, the results show that the expression profile of circular RNAs is much more specific to each cell type than that of linear RNAs, and therefore appears to be more suitable for distinguishing between them. As a result, we have discovered a specific circRNA signature for each given endothelial cell type. Furthermore, we identified a specific endothelial cell circRNA signature that is composed of four circRNAs: circCARD6, circPLXNA2, circCASC15 and circEPHB4. These circular RNAs are produced by genes that are related to endothelial cell migration pathways and cancer progression. More detailed studies of their functions could lead to a better understanding of the mechanisms involved in physiological and pathological (lymph)angiogenesis and might open new ways to tackle tumour spread through the vascular system.

Subject(s)

Endothelial Cells , RNA, Circular , RNA, Circular/genetics , Nucleotide Motifs , RNA/genetics , Cell Movement

18.

A new small molecule DoNA binding to CAG repeat RNA.

Chen, Qingwen; Yamada, Takeshi; Miyagawa, Koichi; Murata, Asako; Shoji, Mitsuo; Nakatani, Kazuhiko.

Bioorg Med Chem ; 98: 117580, 2024 Jan 15.

Article in English | MEDLINE | ID: mdl-38194737

ABSTRACT

Here we report a new molecule, DoNA, that binds to CAG repeat RNA. DoNA is a dimer of the NA molecule that we previously reported. NA binds with high affinity to CAG repeat DNA but not significantly to CAG repeat RNA. Binding analyses using SPR and CSI-TOF MS indicated a significant increase in the affinity of DoNA for single-stranded CAG repeat RNA compared with NA. Systematic investigation of the RNA motifs bound by DoNA using hairpin RNA models revealed that DoNA binds to the CAG units at overhang and terminal positions and, notably, to the structurally flexible internal and hairpin loop regions.

Subject(s)

RNA , Trinucleotide Repeats , RNA/chemistry , DNA/chemistry , Nucleotide Motifs

19.

MethMotif.Org 2024: a database integrating context-specific transcription factor-binding motifs with DNA methylation patterns.

Dyer, Matthew; Lin, Quy Xiao Xuan; Shapoval, Sofiia; Thieffry, Denis; Benoukraf, Touati.

Nucleic Acids Res ; 52(D1): D222-D228, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37850642

ABSTRACT

MethMotif (https://methmotif.org) is a publicly available database that provides a comprehensive repository of transcription factor (TF)-binding profiles, enriched with DNA methylation patterns. In this release, we have enhanced the platform, expanding our initial collection to over 700 position weight matrices (PWMs), all of which include DNA methylation profiles. One of the key advancements in this release is the segregation of TF-binding motifs based on their cofactors and DNA methylation status. We have previously demonstrated that enriched gene ontology (GO) terms associated with TF target genes may differ based on their association with alternative cofactors and DNA methylation status. MethMotif provides precomputed GO annotations for each human TF of interest, as well as for TF-co-TF complexes, enabling a comprehensive analysis of TF functions in the context of their cofactors. Additionally, MethMotif has been updated to encompass data for two new species, Mus musculus and Arabidopsis thaliana, widening its applicability to a broader community. MethMotif stands out as the first and only TF-binding motif database to incorporate context-specific PWMs coupled with epigenetic information, thereby shedding light on context-specific TF functions. This enhancement allows the community to explore and gain deeper insights into the regulatory mechanisms governing transcriptional processes.

Subject(s)

DNA Methylation , Databases, Genetic , Transcription Factors , Animals , Humans , Mice , Binding Sites , Molecular Sequence Annotation , Nucleotide Motifs , Protein Binding , Transcription Factors/metabolism

20.

TeloBase: a community-curated database of telomere sequences across the tree of life.

Lycka, Martin; Bubeník, Michal; Závodník, Michal; Peska, Vratislav; Fajkus, Petr; Demko, Martin; Fajkus, Jirí; Fojtová, Miloslava.

Nucleic Acids Res ; 52(D1): D311-D321, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37602392

ABSTRACT

Discoveries over the past decade have demonstrated the unexpected diversity of telomere DNA motifs in nature. However, currently available resources, 'Telomerase database' and 'Plant rDNA database', contain just fragments of all relevant literature published over decades of telomere research, as they have a different primary focus and limited updates. To fill this gap, we gathered data about telomere DNA sequences from a thorough literature screen as well as by analysing publicly available NGS data, and we created TeloBase (http://cfb.ceitec.muni.cz/telobase/) as a comprehensive database of information about telomere motif diversity. TeloBase is supplemented by an internal taxonomy utilizing popular on-line taxonomic resources that enables in-house data filtration and graphical visualisation of telomere DNA evolutionary dynamics in the form of heat tree plots. TeloBase avoids overreliance on administrators for future data updates by having a simple form and a community-curation system for application and approval, respectively, of new telomere sequences by users, which should ensure the timeliness and topicality of the database. To demonstrate TeloBase utility, we examined telomere motif diversity in species from the fungal genus Aspergillus, and discovered the (TTTATTAGGG)n sequence as a putative telomere motif in the plant family Chrysobalanaceae. This was bioinformatically confirmed by analysing template regions of identified telomerase RNAs.

Subject(s)

Databases, Genetic , Telomerase , Nucleotide Motifs , Plants/genetics , Telomerase/genetics , Telomere/genetics , Telomere/metabolism
