Search | VHL Search Portal

1.

Wiedemann, Jakub; Kaczor, Jacek; Milostan, Maciej; Zok, Tomasz; Blazewicz, Jacek; Szachniuk, Marta; Antczak, Maciej.

Bioinformatics ; 38(17): 4200-4205, 2022 09 02.

Article in English | MEDLINE | ID: mdl-35809063

ABSTRACT

MOTIVATION: Knowledge of the 3D structure of RNA supports discovering its functions and is crucial for designing drugs and modern therapeutic solutions. Thus, much attention is devoted to experimental determination and computational prediction targeting the global fold of RNA and its local substructures. The latter include multi-branched loops-functionally significant elements that highly affect the spatial shape of the entire molecule. Unfortunately, their computational modeling constitutes a weak point of structural bioinformatics. A remedy for this is in collecting these motifs and analyzing their features. RESULTS: RNAloops is a self-updating database that stores multi-branched loops identified in the PDB-deposited RNA structures. A description of each loop includes angular data-planar and Euler angles computed between pairs of adjacent helices to allow studying their mutual arrangement in space. The system enables search and analysis of multiloops, presents their structure details numerically and visually, and computes data statistics. AVAILABILITY AND IMPLEMENTATION: RNAloops is freely accessible at https://rnaloops.cs.put.poznan.pl. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

RNA , Software , RNA/chemistry , Nucleic Acid Conformation , Sequence Analysis, RNA , Databases, Factual

2.

Genome-scale de novo assembly using ALGA.

Swat, Sylwester; Laskowski, Artur; Badura, Jan; Frohmberg, Wojciech; Wojciechowski, Pawel; Swiercz, Aleksandra; Kasprzak, Marta; Blazewicz, Jacek.

Bioinformatics ; 37(12): 1644-1651, 2021 Jul 19.

Article in English | MEDLINE | ID: mdl-33471088

ABSTRACT

MOTIVATION: There are very few methods for de novo genome assembly based on the overlap graph approach. It is considered as giving more exact results than the so-called de Bruijn graph approach but in much greater time and of much higher memory usage. It is not uncommon that assembly methods involving the overlap graph model are not able to successfully compute greater datasets, mainly due to memory limitation of a computer. This was the reason for developing in last decades mainly de Bruijn-based assembly methods, fast and fairly accurate. However, the latter methods can fail for longer or more repetitive genomes, as they decompose reads to shorter fragments and lose a part of information. An efficient assembler for processing big datasets and using the overlap graph model is still looked out. RESULTS: We propose a new genome-scale de novo assembler based on the overlap graph approach, designed for short-read sequencing data. The method, ALGA, incorporates several new ideas resulting in more exact contigs produced in short time. Among these ideas, we have creation of a sparse but quite informative graph, reduction of the graph including a procedure referring to the problem of minimum spanning tree of a local subgraph, and graph traversal connected with simultaneous analysis of contigs stored so far. What is rare in genome assembly, the algorithm is almost parameter-free, with only one optional parameter to be set by a user. ALGA was compared with nine state-of-the-art assemblers in tests on genome-scale sequencing data obtained from real experiments on six organisms, differing in size, coverage, GC content and repetition rate. ALGA produced best results in the sense of overall quality of genome reconstruction, understood as a good balance between genome coverage, accuracy and length of resulting sequences. The algorithm is one of tools involved in processing data in currently realized national project Genomic Map of Poland. AVAILABILITY AND IMPLEMENTATION: ALGA is available at http://alga.put.poznan.pl. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

3.

Virxicon: a lexicon of viral sequences.

Kudla, Mateusz; Gutowska, Kaja; Synak, Jaroslaw; Weber, Mirko; Bohnsack, Katrin Sophie; Lukasiak, Piotr; Villmann, Thomas; Blazewicz, Jacek; Szachniuk, Marta.

Bioinformatics ; 36(22-23): 5507-5513, 2021 Apr 01.

Article in English | MEDLINE | ID: mdl-33367605

ABSTRACT

MOTIVATION: Viruses are the most abundant biological entities and constitute a large reservoir of genetic diversity. In recent years, knowledge about them has increased significantly as a result of dynamic development in life sciences and rapid technological progress. This knowledge is scattered across various data repositories, making a comprehensive analysis of viral data difficult. RESULTS: In response to the need for gathering a comprehensive knowledge of viruses and viral sequences, we developed Virxicon, a lexicon of all experimentally acquired sequences for RNA and DNA viruses. The ability to quickly obtain data for entire viral groups, searching sequences by levels of taxonomic hierarchy-according to the Baltimore classification and ICTV taxonomy-and tracking the distribution of viral data and its growth over time are unique features of our database compared to the other tools. AVAILABILITYAND IMPLEMENTATION: Virxicon is a publicly available resource, updated weekly. It has an intuitive web interface and can be freely accessed at http://virxicon.cs.put.poznan.pl/.

4.

RNA World Modeling: A Comparison of Two Complementary Approaches.

Synak, Jaroslaw; Rybarczyk, Agnieszka; Blazewicz, Jacek.

Entropy (Basel) ; 24(4)2022 Apr 11.

Article in English | MEDLINE | ID: mdl-35455198

ABSTRACT

The origin of life remains one of the major scientific questions in modern biology. Among many hypotheses aiming to explain how life on Earth started, RNA world is probably the most extensively studied. It assumes that, in the very beginning, RNA molecules served as both enzymes and as genetic information carriers. However, even if this is true, there are many questions that still need to be answered-for example, whether the population of such molecules could achieve stability and retain genetic information for many generations, which is necessary in order for evolution to start. In this paper, we try to answer this question based on the parasite-replicase model (RP model), which divides RNA molecules into enzymes (RNA replicases) capable of catalyzing replication and parasites that do not possess replicase activity but can be replicated by RNA replicases. We describe the aforementioned system using partial differential equations and, based on the analysis of the simulation, surmise general rules governing its evolution. We also compare this approach with one where the RP system is modeled and implemented using a multi-agent modeling technique. We show that approaching the description and analysis of the RP system from different perspectives (microscopic represented by MAS and macroscopic depicted by PDE) provides consistent results. Therefore, applying MAS does not lead to erroneous results and allows us to study more complex situations where many cases are concerned, which would not be possible through the PDE model.

5.

RNAvista: a webserver to assess RNA secondary structures with non-canonical base pairs.

Antczak, Maciej; Zablocki, Marcin; Zok, Tomasz; Rybarczyk, Agnieszka; Blazewicz, Jacek; Szachniuk, Marta.

Bioinformatics ; 35(1): 152-155, 2019 01 01.

Article in English | MEDLINE | ID: mdl-29985979

ABSTRACT

Motivation: In the study of 3D RNA structure, information about non-canonical interactions between nucleobases is increasingly important. Specialized databases support investigation of this issue based on experimental data, and several programs can annotate non-canonical base pairs in the RNA 3D structure. However, predicting the extended RNA secondary structure which describes both canonical and non-canonical interactions remains difficult. Results: Here, we present RNAvista that allows predicting an extended RNA secondary structure from sequence or from the list enumerating canonical base pairs only. RNAvista is implemented as a publicly available webserver with user-friendly interface. It runs on all major web browsers. Availability and implementation: http://rnavista.cs.put.poznan.pl.

Subject(s)

Base Pairing , Nucleic Acid Conformation , RNA/chemistry , Software , Computational Biology

6.

PUM1 and PUM2 exhibit different modes of regulation for SIAH1 that involve cooperativity with NANOS paralogues.

Sajek, Marcin; Janecki, Damian Mikolaj; Smialek, Maciej Jerzy; Ginter-Matuszewska, Barbara; Spik, Anna; Oczkowski, Slawomir; Ilaslan, Erkut; Kusz-Zamelczyk, Kamila; Kotecki, Maciej; Blazewicz, Jacek; Jaruzelska, Jadwiga.

Cell Mol Life Sci ; 76(1): 147-161, 2019 Jan.

Article in English | MEDLINE | ID: mdl-30269240

ABSTRACT

Pumilio (PUM) proteins are RNA-binding proteins that posttranscriptionally regulate gene expression in many organisms. Their PUF domain recognizes specific PUM-binding elements (PBE) in the 3' untranslated region of target mRNAs while engaging protein cofactors such as NANOS that repress the expression of target mRNAs through the recruitment of effector complexes. Although the general process whereby PUM recognizes individual mRNAs has been studied extensively, the particulars of the mechanism underlying PUM-NANOS cooperation in mRNA regulation and the functional overlap among PUM and NANOS paralogues in mammals have not been elucidated. Here, using the novel PUM1 and PUM2 mRNA target SIAH1 as a model, we show mechanistic differences between PUM1 and PUM2 and between NANOS1, 2, and 3 paralogues in the regulation of SIAH1. Specifically, unlike PUM2, PUM1 exhibited PBE-independent repression of SIAH1 3'UTR-dependent luciferase expression. Concordantly, the PUF domains of PUM1 and PUM2 showed different EMSA complex formation patterns with SIAH1 3'UTRs. Importantly, we show direct binding of NANOS3, but not NANOS2, to SIAH1 3'UTR, which did not require PBEs or the PUF domain. To the best of our knowledge, this is the first report, showing that an NANOS protein directly binds RNA. Finally, using NANOS1 and NANOS3 constructs carrying mutations identified in infertile patients, we show that these mutations disrupt repression of the SIAH1-luciferase reporter and that the central region in NANOS1 appears to contribute to the regulation of SIAH1. Our findings highlight the mechanistic versatility of the PUM/NANOS machinery in mammalian posttranscriptional regulation.

Subject(s)

Gene Expression Regulation , Nuclear Proteins/genetics , RNA-Binding Proteins/metabolism , Ubiquitin-Protein Ligases/genetics , 3' Untranslated Regions , Animals , Drosophila melanogaster , HEK293 Cells , Humans , Mutation , Nuclear Proteins/metabolism , Protein Binding , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA-Binding Proteins/genetics , Transcription, Genetic , Ubiquitin-Protein Ligases/metabolism

7.

RNApdbee 2.0: multifunctional tool for RNA structure annotation.

Zok, Tomasz; Antczak, Maciej; Zurkowski, Michal; Popenda, Mariusz; Blazewicz, Jacek; Adamiak, Ryszard W; Szachniuk, Marta.

Nucleic Acids Res ; 46(W1): W30-W35, 2018 07 02.

Article in English | MEDLINE | ID: mdl-29718468

ABSTRACT

In the field of RNA structural biology and bioinformatics, an access to correctly annotated RNA structure is of crucial importance, especially in the secondary and 3D structure predictions. RNApdbee webserver, introduced in 2014, primarily aimed to address the problem of RNA secondary structure extraction from the PDB files. Its new version, RNApdbee 2.0, is a highly advanced multifunctional tool for RNA structure annotation, revealing the relationship between RNA secondary and 3D structure given in the PDB or PDBx/mmCIF format. The upgraded version incorporates new algorithms for recognition and classification of high-ordered pseudoknots in large RNA structures. It allows analysis of isolated base pairs impact on RNA structure. It can visualize RNA secondary structures-including that of quadruplexes-with depiction of non-canonical interactions. It also annotates motifs to ease identification of stems, loops and single-stranded fragments in the input RNA structure. RNApdbee 2.0 is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/.

Subject(s)

Computational Biology , Internet , RNA/genetics , Software , Algorithms , Base Pairing/genetics , Databases, Nucleic Acid , Nucleic Acid Conformation , Protein Structure, Secondary/genetics , RNA/chemistry

8.

Detecting life signatures with RNA sequence similarity measures.

Wasik, Szymon; Szostak, Natalia; Kudla, Mateusz; Wachowiak, Michal; Krawiec, Krzysztof; Blazewicz, Jacek.

J Theor Biol ; 463: 110-120, 2019 02 21.

Article in English | MEDLINE | ID: mdl-30562502

ABSTRACT

The RNA World is currently the most plausible hypothesis for explaining the origins of life on Earth. The supporting body of evidence is growing and it comes from multiple areas, including astrobiology, chemistry, biology, mathematics, and, in particular, from computer simulations. Such methods frequently assume the existence of a hypothetical species on Earth, around three billion years ago, with a base sequence probably dissimilar from any in known genomes. However, it is often hard to verify whether or not a hypothetical sequence has the characteristics of biological sequences, and is thus likely to be functional. The primary objective of the presented research was to verify the possibility of building a computational 'life probe' for determining whether a given genetic sequence is biological, and assessing the sensitivity of such probes to the signatures of life present in known biological sequences. We have proposed decision algorithms based on the normalized compression distance (NCD) and Levenshtein distance (LD). We have validated the proposed method in the context of the RNA World hypothesis using short genetic sequences shorter than the error threshold value (i.e., 100 nucleotides). We have demonstrated that both measures can be successfully used to construct life probes that are significantly better than a random decision procedure, while varying from each other when it comes to detailed characteristics. We also observed that fragments of sequences related to replication have better discriminatory power than sequences having other molecular functions. In a broader context, this shows that the signatures of life in short RNA samples can be effectively detected using relatively simple means.

Subject(s)

Origin of Life , RNA/genetics , Algorithms , Base Sequence , Computer Simulation , RNA/physiology , Reproduction/genetics

9.

NPM1 alternative transcripts are upregulated in acute myeloid and lymphoblastic leukemia and their expression level affects patient outcome.

Handschuh, Luiza; Wojciechowski, Pawel; Kazmierczak, Maciej; Marcinkowska-Swojak, Malgorzata; Luczak, Magdalena; Lewandowski, Krzysztof; Komarnicki, Mieczyslaw; Blazewicz, Jacek; Figlerowicz, Marek; Kozlowski, Piotr.

J Transl Med ; 16(1): 232, 2018 08 20.

Article in English | MEDLINE | ID: mdl-30126426

ABSTRACT

BACKGROUND: Expression of the NPM1 gene, encoding nucleophosmin, is upregulated in cancers. Although more than ten NPM1 transcripts are known, the reports were usually limited to one predominant transcript. In leukemia, the NPM1 expression has not been widely studied so far. In acute myeloid leukemia (AML), the mutational status of the gene seems to play a pivotal role in carcinogenesis. Therefore, the aim of the study was to quantify alternative NPM1 transcripts in two types of acute leukemia, AML and ALL (acute lymphoblastic leukemia). METHODS: Using droplet digital PCR, we analyzed the levels of three protein-coding NPM1 transcripts in 66 samples collected from AML and ALL patients and 16 control samples. Using RNA-seq, we detected 8 additional NPM1 transcripts, including non-coding splice variants with retained introns. For data analysis, Welch two sample t-test, Pearson's correlation and Kaplan-Meier analysis were applied. RESULTS: The levels of the particular NPM1 transcripts were significantly different but highly correlated with each other in both leukemia and control samples. Transcript NPM1.1, encoding the longest protein (294 aa), had the highest level of accumulation and was one of the most abundant transcripts in the cell. Comparing to NPM1.1, the levels of the NPM1.2 and NPM1.3 transcripts, encoding a 265-aa and 259-aa proteins, were 30 and 3 times lower, respectively. All three NPM1 transcripts were proportionally upregulated in both types of leukemia compared to control samples. In AML, the levels of NPM1 transcripts decreased in complete remission and increased again with relapse of the disease. Low levels of NPM1.1 and NPM1.3 were associated with better prognosis. The contribution of non-coding transcripts to the total level of NPM1 gene seemed to be marginal, except for one short 5-end transcript accumulated at high levels in AML and control cells. Aberrant proportions of particular NPM1 splice variants could be linked to abnormal expression of genes encoding alternative splicing factors. CONCLUSIONS: The levels of the studied NPM1 transcripts were different but highly correlated with each other. Their upregulation in AML and ALL, decrease after therapy and association with patient outcome suggests the involvement of elevated NPM1 expression in the acute leukemia pathogenesis.

Subject(s)

Gene Expression Regulation, Leukemic , Leukemia, Myeloid, Acute/metabolism , Nuclear Proteins/metabolism , Precursor Cell Lymphoblastic Leukemia-Lymphoma/metabolism , Adult , Aged , DNA Mutational Analysis , Disease-Free Survival , Follow-Up Studies , Gene Expression Profiling , Humans , Introns , Kaplan-Meier Estimate , Leukemia, Myeloid, Acute/mortality , Leukemia, Myeloid, Acute/therapy , Middle Aged , Nucleophosmin , Polymerase Chain Reaction , Precursor Cell Lymphoblastic Leukemia-Lymphoma/mortality , Precursor Cell Lymphoblastic Leukemia-Lymphoma/therapy , Prognosis , Sequence Analysis, RNA , Treatment Outcome , Up-Regulation , Young Adult

10.

RNAssess--a web server for quality assessment of RNA 3D structures.

Lukasiak, Piotr; Antczak, Maciej; Ratajczak, Tomasz; Szachniuk, Marta; Popenda, Mariusz; Adamiak, Ryszard W; Blazewicz, Jacek.

Nucleic Acids Res ; 43(W1): W502-6, 2015 Jul 01.

Article in English | MEDLINE | ID: mdl-26068469

ABSTRACT

Nowadays, various methodologies can be applied to model RNA 3D structure. Thus, the plausible quality assessment of 3D models has a fundamental impact on the progress of structural bioinformatics. Here, we present RNAssess server, a novel tool dedicated to visual evaluation of RNA 3D models in the context of the known reference structure for a wide range of accuracy levels (from atomic to the whole molecule perspective). The proposed server is based on the concept of local neighborhood, defined as a set of atoms observed within a sphere localized around a central atom of a particular residue. A distinctive feature of our server is the ability to perform simultaneous visual analysis of the model-reference structure coherence. RNAssess supports the quality assessment through delivering both static and interactive visualizations that allows an easy identification of native-like models and/or chosen structural regions of the analyzed molecule. A combination of results provided by RNAssess allows us to rank analyzed models. RNAssess offers new route to a fast and efficient 3D model evaluation suitable for the RNA-Puzzles challenge. The proposed automated tool is implemented as a free and open to all users web server with an user-friendly interface and can be accessed at: http://rnassess.cs.put.poznan.pl/.

Subject(s)

Models, Molecular , RNA/chemistry , Software , Internet , Nucleic Acid Conformation , Sequence Analysis, RNA

11.

Structural alignment of protein descriptors - a combinatorial model.

Antczak, Maciej; Kasprzak, Marta; Lukasiak, Piotr; Blazewicz, Jacek.

BMC Bioinformatics ; 17: 383, 2016 Sep 17.

Article in English | MEDLINE | ID: mdl-27639380

ABSTRACT

BACKGROUND: Structural alignment of proteins is one of the most challenging problems in molecular biology. The tertiary structure of a protein strictly correlates with its function and computationally predicted structures are nowadays a main premise for understanding the latter. However, computationally derived 3D models often exhibit deviations from the native structure. A way to confirm a model is a comparison with other structures. The structural alignment of a pair of proteins can be defined with the use of a concept of protein descriptors. The protein descriptors are local substructures of protein molecules, which allow us to divide the original problem into a set of subproblems and, consequently, to propose a more efficient algorithmic solution. In the literature, one can find many applications of the descriptors concept that prove its usefulness for insight into protein 3D structures, but the proposed approaches are presented rather from the biological perspective than from the computational or algorithmic point of view. Efficient algorithms for identification and structural comparison of descriptors can become crucial components of methods for structural quality assessment as well as tertiary structure prediction. RESULTS: In this paper, we propose a new combinatorial model and new polynomial-time algorithms for the structural alignment of descriptors. The model is based on the maximum-size assignment problem, which we define here and prove that it can be solved in polynomial time. We demonstrate suitability of this approach by comparison with an exact backtracking algorithm. Besides a simplification coming from the combinatorial modeling, both on the conceptual and complexity level, we gain with this approach high quality of obtained results, in terms of 3D alignment accuracy and processing efficiency. CONCLUSIONS: All the proposed algorithms were developed and integrated in a computationally efficient tool descs-standalone, which allows the user to identify and structurally compare descriptors of biological molecules, such as proteins and RNAs. Both PDB (Protein Data Bank) and mmCIF (macromolecular Crystallographic Information File) formats are supported. The proposed tool is available as an open source project stored on GitHub ( https://github.com/mantczak/descs-standalone ).

Subject(s)

Proteins/chemistry , Sequence Alignment/methods , Algorithms , Amino Acid Sequence , Databases, Protein , Models, Molecular , Time Factors

12.

RNApdbee--a webserver to derive secondary structures from pdb files of knotted and unknotted RNAs.

Antczak, Maciej; Zok, Tomasz; Popenda, Mariusz; Lukasiak, Piotr; Adamiak, Ryszard W; Blazewicz, Jacek; Szachniuk, Marta.

Nucleic Acids Res ; 42(Web Server issue): W368-72, 2014 Jul.

Article in English | MEDLINE | ID: mdl-24771339

ABSTRACT

In RNA structural biology and bioinformatics an access to correct RNA secondary structure and its proper representation is of crucial importance. This is true especially in the field of secondary and 3D RNA structure prediction. Here, we introduce RNApdbee-a new tool that allows to extract RNA secondary structure from the pdb file, and presents it in both textual and graphical form. RNApdbee supports processing of knotted and unknotted structures of large RNAs, also within protein complexes. The method works not only for first but also for high order pseudoknots, and gives an information about canonical and non-canonical base pairs. A combination of these features is unique among existing applications for RNA structure analysis. Additionally, a function of converting between the text notations, i.e. BPSEQ, CT and extended dot-bracket, is provided. In order to facilitate a more comprehensive study, the webserver integrates the functionality of RNAView, MC-Annotate and 3DNA/DSSR, being the most common tools used for automated identification and classification of RNA base pairs. RNApdbee is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/.

Subject(s)

RNA/chemistry , Software , Base Pairing , Internet , Nucleic Acid Conformation

13.

New in silico approach to assessing RNA secondary structures with non-canonical base pairs.

Rybarczyk, Agnieszka; Szostak, Natalia; Antczak, Maciej; Zok, Tomasz; Popenda, Mariusz; Adamiak, Ryszard; Blazewicz, Jacek; Szachniuk, Marta.

BMC Bioinformatics ; 16: 276, 2015 Sep 02.

Article in English | MEDLINE | ID: mdl-26329823

ABSTRACT

BACKGROUND: The function of RNA is strongly dependent on its structure, so an appropriate recognition of this structure, on every level of organization, is of great importance. One particular concern is the assessment of base-base interactions, described as the secondary structure, the knowledge of which greatly facilitates an interpretation of RNA function and allows for structure analysis on the tertiary level. The RNA secondary structure can be predicted from a sequence using in silico methods often adjusted with experimental data, or assessed from 3D structure atom coordinates. Computational approaches typically consider only canonical, Watson-Crick and wobble base pairs. Handling of non-canonical interactions, important for a full description of RNA structure, is still very difficult. RESULTS: We introduce our novel approach to assessing an extended RNA secondary structure, which characterizes both canonical and non-canonical base pairs, along with their type classification. It is based on predicting the RNA 3D structure from a user-provided sequence or a secondary structure that only describes canonical base pairs, and then deriving the extended secondary structure from atom coordinates. In our example implementation, this was achieved by integrating the functionality of two fully automated, high fidelity methods in a computational pipeline: RNAComposer for the 3D RNA structure prediction and RNApdbee for base-pair annotation. CONCLUSIONS: The presented methodology ties together existing applications for RNA 3D structure prediction and base-pair annotation. The example performance, applying RNAComposer and RNApdbee, reveals better accuracy in non-canonical base pair assessment than the compared methods that directly predict RNA secondary structure.

Subject(s)

Base Pairing/genetics , Computer Simulation/trends , RNA/genetics , Protein Structure, Secondary , RNA/chemistry

14.

RNAlyzer--novel approach for quality analysis of RNA structural models.

Lukasiak, Piotr; Antczak, Maciej; Ratajczak, Tomasz; Bujnicki, Janusz M; Szachniuk, Marta; Adamiak, Ryszard W; Popenda, Mariusz; Blazewicz, Jacek.

Nucleic Acids Res ; 41(12): 5978-90, 2013 Jul.

Article in English | MEDLINE | ID: mdl-23620294

ABSTRACT

The continuously increasing amount of RNA sequence and experimentally determined 3D structure data drives the development of computational methods supporting exploration of these data. Contemporary functional analysis of RNA molecules, such as ribozymes or riboswitches, covers various issues, among which tertiary structure modeling becomes more and more important. A growing number of tools to model and predict RNA structure calls for an evaluation of these tools and the quality of outcomes their produce. Thus, the development of reliable methods designed to meet this need is relevant in the context of RNA tertiary structure analysis and can highly influence the quality and usefulness of RNA tertiary structure prediction in the nearest future. Here, we present RNAlyzer-a computational method for comparison of RNA 3D models with the reference structure and for discrimination between the correct and incorrect models. Our approach is based on the idea of local neighborhood, defined as a set of atoms included in the sphere centered around a user-defined atom. A unique feature of the RNAlyzer is the simultaneous visualization of the model-reference structure distance at different levels of detail, from the individual residues to the entire molecules.

Subject(s)

Models, Molecular , RNA/chemistry , Software , Computational Biology/methods , Nucleic Acid Conformation

15.

Sorting signal targeting mRNA into hepatic extracellular vesicles.

Szostak, Natalia; Royo, Felix; Rybarczyk, Agnieszka; Szachniuk, Marta; Blazewicz, Jacek; del Sol, Antonio; Falcon-Perez, Juan M.

RNA Biol ; 11(7): 836-44, 2014.

Article in English | MEDLINE | ID: mdl-24921245

ABSTRACT

Intercellular communication mediated by extracellular vesicles has proved to play an important role in normal and pathological scenarios. However not too much information about the sorting mechanisms involved in loading the vesicles is available. Recently, our group has characterized the mRNA content of vesicles released by hepatic cellular systems, showing that a set of transcripts was particularly enriched in the vesicles in comparison with their intracellular abundance. In the current work, based on in silico bioinformatics tools, we have mapped a novel sequence of 12 nucleotides C[TA]G[GC][AGT]G[CT]C[AT]GG[GA], which is significantly enriched in the set of mRNAs that accumulate in extracellular vesicles. By including a 3'-UTR containing this sequence in a luciferase mRNA reporter, we have shown that in a hepatic cellular system this reporter mRNA was incorporated into extracellular vesicles. This study identifies a sorting signal in mRNAs that is involved in their enrichment in EVs, within a hepatic non-tumoral cellular model.

Subject(s)

Computational Biology/methods , Liver/metabolism , Nucleotide Motifs , RNA Transport , RNA, Messenger/chemistry , 3' Untranslated Regions , Cell Communication , Cell Line , Gene Expression Profiling , Humans , Liver/cytology , Nucleic Acid Conformation , RNA, Messenger/metabolism , Transport Vesicles

16.

Automated 3D structure composition for large RNAs.

Popenda, Mariusz; Szachniuk, Marta; Antczak, Maciej; Purzycka, Katarzyna J; Lukasiak, Piotr; Bartol, Natalia; Blazewicz, Jacek; Adamiak, Ryszard W.

Nucleic Acids Res ; 40(14): e112, 2012 Aug.

Article in English | MEDLINE | ID: mdl-22539264

ABSTRACT

Understanding the numerous functions that RNAs play in living cells depends critically on knowledge of their three-dimensional structure. Due to the difficulties in experimentally assessing structures of large RNAs, there is currently great demand for new high-resolution structure prediction methods. We present the novel method for the fully automated prediction of RNA 3D structures from a user-defined secondary structure. The concept is founded on the machine translation system. The translation engine operates on the RNA FRABASE database tailored to the dictionary relating the RNA secondary structure and tertiary structure elements. The translation algorithm is very fast. Initial 3D structure is composed in a range of seconds on a single processor. The method assures the prediction of large RNA 3D structures of high quality. Our approach needs neither structural templates nor RNA sequence alignment, required for comparative methods. This enables the building of unresolved yet native and artificial RNA structures. The method is implemented in a publicly available, user-friendly server RNAComposer. It works in an interactive mode and a batch mode. The batch mode is designed for large-scale modelling and accepts atomic distance restraints. Presently, the server is set to build RNA structures of up to 500 residues.

Subject(s)

Algorithms , Models, Molecular , RNA/chemistry , Databases, Nucleic Acid , Internet , Nucleic Acid Conformation , RNA, Ribosomal, 5S/chemistry , Retroelements , Software

17.

In silico and in vitro analysis of the impact of single substitutions within EXO-motifs on Hsa-MiR-1246 intercellular transfer in breast cancer cell.

Rybarczyk, Agnieszka; Lehmann, Tomasz; Iwanczyk-Skalska, Ewa; Juzwa, Wojciech; Plawski, Andrzej; Kopciuch, Kamil; Blazewicz, Jacek; Jagodzinski, Pawel P.

J Appl Genet ; 64(1): 105-124, 2023 Feb.

Article in English | MEDLINE | ID: mdl-36394782

ABSTRACT

MiR-1246 has recently gained much attention and many studies have shown its oncogenic role in colorectal, breast, lung, and ovarian cancers. However, miR-1246 processing, stability, and mechanisms directing miR-1246 into neighbor cells remain still unclear. In this study, we aimed to determine the role of single-nucleotide substitutions within short exosome sorting motifs - so-called EXO-motifs: GGAG and GCAG present in miR-1246 sequence on its intracellular stability and extracellular transfer. We applied in silico methods such as 2D and 3D structure analysis and modeling of protein interactions. We also performed in vitro validation through the transfection of fluorescently labeled miRNA to MDA-MB-231 cells, which we analyzed by flow cytometry and fluorescent microscopy. Our results suggest that nucleotides alterations that disturbed miR-1246 EXO-motifs were able to modulate miRNA-1246 stability and its transfer level to the neighboring cells, suggesting that the molecular mechanism of RNA stability and intercellular transfer can be closely related.

Subject(s)

Breast Neoplasms , MicroRNAs , Humans , Female , Breast Neoplasms/genetics , MicroRNAs/genetics

18.

GeVaDSs - decision support system for novel Genetic Vaccine development process.

Blazewicz, Jacek; Borowski, Marcin; Chaara, Wahiba; Kedziora, Pawel; Klatzmann, David; Lukasiak, Piotr; Six, Adrien; Wojciechowski, Pawel.

BMC Bioinformatics ; 13: 91, 2012 May 10.

Article in English | MEDLINE | ID: mdl-22574945

ABSTRACT

BACKGROUND: The lack of a uniform way for qualitative and quantitative evaluation of vaccine candidates under development led us to set up a standardized scheme for vaccine efficacy and safety evaluation. We developed and implemented molecular and immunology methods, and designed support tools for immunization data storage and analyses. Such collection can create a unique opportunity for immunologists to analyse data delivered from their laboratories. RESULTS: We designed and implemented GeVaDSs (Genetic Vaccine Decision Support system) an interactive system for efficient storage, integration, retrieval and representation of data. Moreover, GeVaDSs allows for relevant association and interpretation of data, and thus for knowledge-based generation of testable hypotheses of vaccine responses. CONCLUSIONS: GeVaDSs has been tested by several laboratories in Europe, and proved its usefulness in vaccine analysis. Case study of its application is presented in the additional files. The system is available at: http://gevads.cs.put.poznan.pl/preview/(login: viewer, password: password).

Subject(s)

Decision Support Techniques , Drug Design , Drug Evaluation/statistics & numerical data , Information Storage and Retrieval/methods , Software , Vaccines, DNA/immunology , B-Lymphocytes/immunology , Europe , Humans , T-Lymphocytes/immunology , Vaccination/statistics & numerical data

19.

Hypercycle.

Szostak, Natalia; Wasik, Szymon; Blazewicz, Jacek.

PLoS Comput Biol ; 12(4): e1004853, 2016 Apr.

Article in English | MEDLINE | ID: mdl-27054759

Subject(s)

Macromolecular Substances/chemistry , Macromolecular Substances/metabolism , Models, Biological , Catalysis , Computational Biology , Evolution, Molecular , Feedback, Physiological , Models, Molecular , Nucleic Acids/chemistry , Nucleic Acids/genetics , Nucleic Acids/metabolism , Origin of Life , Peptides/chemistry , Peptides/genetics , Peptides/metabolism , Protein Biosynthesis , RNA, Catalytic/chemistry , RNA, Catalytic/genetics , RNA, Catalytic/metabolism

20.

Learning vector quantization as an interpretable classifier for the detection of SARS-CoV-2 types based on their RNA sequences.

Kaden, Marika; Bohnsack, Katrin Sophie; Weber, Mirko; Kudla, Mateusz; Gutowska, Kaja; Blazewicz, Jacek; Villmann, Thomas.

Neural Comput Appl ; 34(1): 67-78, 2022.

Article in English | MEDLINE | ID: mdl-33935376

ABSTRACT

We present an approach to discriminate SARS-CoV-2 virus types based on their RNA sequence descriptions avoiding a sequence alignment. For that purpose, sequences are preprocessed by feature extraction and the resulting feature vectors are analyzed by prototype-based classification to remain interpretable. In particular, we propose to use variants of learning vector quantization (LVQ) based on dissimilarity measures for RNA sequence data. The respective matrix LVQ provides additional knowledge about the classification decisions like discriminant feature correlations and, additionally, can be equipped with easy to realize reject options for uncertain data. Those options provide self-controlled evidence, i.e., the model refuses to make a classification decision if the model evidence for the presented data is not sufficient. This model is first trained using a GISAID dataset with given virus types detected according to the molecular differences in coronavirus populations by phylogenetic tree clustering. In a second step, we apply the trained model to another but unlabeled SARS-CoV-2 virus dataset. For these data, we can either assign a virus type to the sequences or reject atypical samples. Those rejected sequences allow to speculate about new virus types with respect to nucleotide base mutations in the viral sequences. Moreover, this rejection analysis improves model robustness. Last but not least, the presented approach has lower computational complexity compared to methods based on (multiple) sequence alignment. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00521-021-06018-2.

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL