Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 34
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Methods Mol Biol ; 2726: 347-376, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38780738

RESUMEN

Structural changes in RNAs are an important contributor to controlling gene expression not only at the posttranscriptional stage but also during transcription. A subclass of riboswitches and RNA thermometers located in the 5' region of the primary transcript regulates the downstream functional unit - usually an ORF - through premature termination of transcription. Not only such elements occur naturally, but they are also attractive devices in synthetic biology. The possibility to design such riboswitches or RNA thermometers is thus of considerable practical interest. Since these functional RNA elements act already during transcription, it is important to model and understand the dynamics of folding and, in particular, the formation of intermediate structures concurrently with transcription. Cotranscriptional folding simulations are therefore an important step to verify the functionality of design constructs before conducting expensive and labor-intensive wet lab experiments. For RNAs, full-fledged molecular dynamics simulations are far beyond practical reach because of both the size of the molecules and the timescales of interest. Even at the simplified level of secondary structures, further approximations are necessary. The BarMap approach is based on representing the secondary structure landscape for each individual transcription step by a coarse-grained representation that only retains a small set of low-energy local minima and the energy barriers between them. The folding dynamics between two transcriptional elongation steps is modeled as a Markov process on this representation. Maps between pairs of consecutive coarse-grained landscapes make it possible to follow the folding process as it changes in response to transcription elongation. In its original implementation, the BarMap software provides a general framework to investigate RNA folding dynamics on temporally changing landscapes. It is, however, difficult to use in particular for specific scenarios such as cotranscriptional folding. To overcome this limitation, we developed the user-friendly BarMap-QA pipeline described in detail in this contribution. It is illustrated here by an elaborate example that emphasizes the careful monitoring of several quality measures. Using an iterative workflow, a reliable and complete kinetics simulation of a synthetic, transcription-regulating riboswitch is obtained using minimal computational resources. All programs and scripts used in this contribution are free software and available for download as a source distribution for Linux® or as a platform-independent Docker® image including support for Apple macOS® and Microsoft Windows®.


Asunto(s)
Simulación de Dinámica Molecular , Conformación de Ácido Nucleico , Pliegue del ARN , Transcripción Genética , Riboswitch/genética , ARN/química , ARN/genética , Programas Informáticos
2.
NAR Genom Bioinform ; 5(3): lqad072, 2023 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-37608800

RESUMEN

The in silico prediction of non-coding and protein-coding genetic loci has received considerable attention in comparative genomics aiming in particular at the identification of properties of nucleotide sequences that are informative of their biological role in the cell. We present here a software framework for the alignment-based training, evaluation and application of machine learning models with user-defined parameters. Instead of focusing on the one-size-fits-all approach of pervasive in silico annotation pipelines, we offer a framework for the structured generation and evaluation of models based on arbitrary features and input data, focusing on stable and explainable results. Furthermore, we showcase the usage of our software package in a full-genome screen of Drosophila melanogaster and evaluate our results against the well-known but much less flexible program RNAz.

3.
Methods Mol Biol ; 2518: 179-202, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35666446

RESUMEN

Riboswitches are an attractive target for the directed design of RNA-based regulators by in silico prediction. These noncoding RNA elements consist of an aptamer platform for the highly selective ligand recognition and an expression platform which controls gene activity typically at the level of transcription or translation. In previous work, we could successfully apply RNA folding prediction to implement a new riboswitch mechanism regulating processing of a tRNA by RNase P. In this contribution, we present detailed information about our pipeline consisting of in silico design combined with the biochemical analysis for the verification of the implemented mechanism. Furthermore, we discuss the applicability of the presented biochemical in vivo and in vitro methods for the characterization of other artificial riboswitches.


Asunto(s)
Riboswitch , Conformación de Ácido Nucleico , ARN/química , Pliegue del ARN , ARN de Transferencia/química , ARN de Transferencia/genética , Ribonucleasa P/genética , Riboswitch/genética
4.
RNA ; 28(4): 551-567, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-35022261

RESUMEN

Removal of the 5'-leader region is an essential step in the maturation of tRNA molecules in all domains of life. This reaction is catalyzed by various RNase P activities, ranging from ribonucleoproteins with ribozyme activity to protein-only forms. In Escherichia coli, the efficiency of RNase P-mediated cleavage can be controlled by computationally designed riboswitch elements in a ligand-dependent way, where the 5'-leader sequence of a tRNA precursor is either sequestered in a hairpin structure or presented as a single-stranded region accessible for maturation. In the presented work, the regulatory potential of such artificial constructs is tested on different forms of eukaryotic RNase P enzymes-two protein-only RNase P enzymes (PRORP1 and PRORP2) from Arabidopsis thaliana and the ribonucleoprotein of Homo sapiens The PRORP enzymes were analyzed in vitro as well as in vivo in a bacterial RNase P complementation system. We also tested in HEK293T cells whether the riboswitches remain functional with human nuclear RNase P. While the regulatory principle of the synthetic riboswitches applies for all tested RNase P enzymes, the results also show differences in the substrate requirements of the individual enzyme versions. Hence, such designed RNase P riboswitches represent a novel tool to investigate the impact of the structural composition of the 5'-leader on substrate recognition by different types of RNase P enzymes.


Asunto(s)
Ribonucleasa P , Riboswitch , Eucariontes/genética , Células HEK293 , Humanos , Precursores del ARN/metabolismo , ARN de Transferencia/química , ARN de Transferencia/genética , Ribonucleasa P/metabolismo , Riboswitch/genética
5.
Noncoding RNA ; 7(4)2021 Dec 13.
Artículo en Inglés | MEDLINE | ID: mdl-34940758

RESUMEN

Long non-coding RNAs (lncRNAs) are widely recognized as important regulators of gene expression. Their molecular functions range from miRNA sponging to chromatin-associated mechanisms, leading to effects in disease progression and establishing them as diagnostic and therapeutic targets. Still, only a few representatives of this diverse class of RNAs are well studied, while the vast majority is poorly described beyond the existence of their transcripts. In this review we survey common in silico approaches for lncRNA annotation. We focus on the well-established sets of features used for classification and discuss their specific advantages and weaknesses. While the available tools perform very well for the task of distinguishing coding sequence from other RNAs, we find that current methods are not well suited to distinguish lncRNAs or parts thereof from other non-protein-coding input sequences. We conclude that the distinction of lncRNAs from intronic sequences and untranslated regions of coding mRNAs remains a pressing research gap.

6.
Nucleic Acids Res ; 49(3): 1784-1800, 2021 02 22.
Artículo en Inglés | MEDLINE | ID: mdl-33469651

RESUMEN

We describe a synthetic riboswitch element that implements a regulatory principle which directly addresses an essential tRNA maturation step. Constructed using a rational in silico design approach, this riboswitch regulates RNase P-catalyzed tRNA 5'-processing by either sequestering or exposing the single-stranded 5'-leader region of the tRNA precursor in response to a ligand. A single base pair in the 5'-leader defines the regulatory potential of the riboswitch both in vitro and in vivo. Our data provide proof for prior postulates on the importance of the structure of the leader region for tRNA maturation. We demonstrate that computational predictions of ligand-dependent structural rearrangements can address individual maturation steps of stable non-coding RNAs, thus making them amenable as promising target for regulatory devices that can be used as functional building blocks in synthetic biology.


Asunto(s)
Procesamiento Postranscripcional del ARN , ARN de Transferencia/metabolismo , Ribonucleasa P/metabolismo , Riboswitch , Simulación por Computador , Escherichia coli/genética , Ligandos , ARN de Transferencia/química , Ribonucleasa P/química , Termodinámica
7.
RNA Biol ; 18(4): 457-467, 2021 04.
Artículo en Inglés | MEDLINE | ID: mdl-32882151

RESUMEN

Gene regulation in prokaryotes often depends on RNA elements such as riboswitches or RNA thermometers located in the 5' untranslated region of mRNA. Rearrangements of the RNA structure in response, e.g., to the binding of small molecules or ions control translational initiation or premature termination of transcription and thus mRNA expression. Such structural responses are amenable to computational modelling, making it possible to rationally design synthetic riboswitches for a given aptamer. Starting from an artificial aptamer, we construct the first synthetic transcriptional riboswitches that respond to the antibiotic neomycin. We show that the switching behaviour in vivo critically depends not only on the sequence of the riboswitch itself, but also on its sequence context. We therefore developed in silico methods to predict the impact of the context, making it possible to adapt the design and to rescue non-functional riboswitches. We furthermore analyse the influence of 5' hairpins with varying stability on neomycin riboswitch activity. Our data highlight the limitations of a simple plug-and-play approach in the design of complex genetic circuits and demonstrate that detailed computational models significantly simplify, improve, and automate the design of transcriptional circuits. Our design software is available under a free licence on GitHub (https://github.com/xileF1337/riboswitch_design).


Asunto(s)
Clonación Molecular/métodos , Biología Computacional/métodos , Neomicina/química , Riboswitch/genética , Aptámeros de Nucleótidos/química , Aptámeros de Nucleótidos/genética , Simulación por Computador , Regulación Bacteriana de la Expresión Génica , Genes Reporteros/genética , Neomicina/farmacología , Conformación de Ácido Nucleico , ARN Bacteriano/análisis , ARN Bacteriano/química , ARN Bacteriano/genética , Programas Informáticos , Biología Sintética
8.
Methods ; 161: 54-63, 2019 05 15.
Artículo en Inglés | MEDLINE | ID: mdl-31059832

RESUMEN

Artificial RNA molecules with novel functionality have many applications in synthetic biology, pharmacy and white biotechnology. The de novo design of such devices using computational methods and prediction tools is a resource-efficient alternative to experimental screening and selection pipelines. In this review, we describe methods common to many such computational approaches, thoroughly dissect these methods and highlight open questions for the individual steps. Initially, it is essential to investigate the biological target system, the regulatory mechanism that will be exploited, as well as the desired components in order to define design objectives. Subsequent computational design is needed to combine the selected components and to obtain novel functionality. This process can usually be split into constrained sequence sampling, the formulation of an optimization problem and an in silico analysis to narrow down the number of candidates with respect to secondary goals. Finally, experimental analysis is important to check whether the defined design objectives are indeed met in the target environment and detailed characterization experiments should be performed to improve the mechanistic models and detect missing design requirements.


Asunto(s)
Biología Computacional/métodos , ARN/análisis , ARN/genética , Análisis de Secuencia de ARN/métodos , Animales , Biología Computacional/tendencias , Humanos , ARN no Traducido/análisis , ARN no Traducido/genética , Análisis de Secuencia de ARN/tendencias , Biología Sintética/métodos , Biología Sintética/tendencias
9.
ACS Synth Biol ; 7(5): 1269-1278, 2018 05 18.
Artículo en Inglés | MEDLINE | ID: mdl-29617125

RESUMEN

Riboregulators are short RNA sequences that, upon binding to a ligand, change their secondary structure and influence the expression rate of a downstream gene. They constitute an attractive alternative to transcription factors for building synthetic gene regulatory networks because they can be engineered de novo. However, riboregulators are generally designed in silico and tested in vivo, which provides little quantitative information about their performances, thus hindering the improvement of design algorithms. Here we show that a cell-free transcription-translation (TX-TL) system provides valuable information about the performances of in silico designed riboregulators. We first propose a simple model that provides a quantitative definition of the dynamic range of a riboregulator. We further characterize two types of translational riboregulators composed of a cis-repressed (cr) and a trans-activating (ta) strand. At the DNA level we demonstrate that high concentrations of taDNA poisoned the activator until total shut off, in agreement with our model, and that relative dynamic ranges of riboregulators determined in vitro are in agreement with published in vivo data. At the RNA level, we show that this approach provides a fast and simple way to measure dissociation constants of functional riboregulators, in contrast to standard mobility-shift assays. Our method opens the route for using cell-free TX-TL systems for the quantitative characterization of functional riboregulators in order to improve their design in silico.


Asunto(s)
Regulación de la Expresión Génica , Técnicas Genéticas , ARN/química , ARN/genética , Regiones no Traducidas 5' , Sistema Libre de Células , ADN/química , ADN/genética , ARN Polimerasas Dirigidas por ADN/genética , Fluorescencia , Modelos Genéticos , Conformación de Ácido Nucleico , Biosíntesis de Proteínas , ARN de Transferencia , Reacción en Cadena en Tiempo Real de la Polimerasa , Transcripción Genética , Proteínas Virales/genética
10.
Methods ; 143: 90-101, 2018 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-29660485

RESUMEN

This contribution sketches a work flow to design an RNA switch that is able to adapt two structural conformations in a ligand-dependent way. A well characterized RNA aptamer, i.e., knowing its Kd and adaptive structural features, is an essential ingredient of the described design process. We exemplify the principles using the well-known theophylline aptamer throughout this work. The aptamer in its ligand-binding competent structure represents one structural conformation of the switch while an alternative fold that disrupts the binding-competent structure forms the other conformation. To keep it simple we do not incorporate any regulatory mechanism to control transcription or translation. We elucidate a commonly used design process by explicitly dissecting and explaining the necessary steps in detail. We developed a novel objective function which specifies the mechanistics of this simple, ligand-triggered riboswitch and describe an extensive in silico analysis pipeline to evaluate important kinetic properties of the designed sequences. This protocol and the developed software can be easily extended or adapted to fit novel design scenarios and thus can serve as a template for future needs.


Asunto(s)
Aptámeros de Nucleótidos/síntesis química , Biología Computacional/métodos , Conformación de Ácido Nucleico , Riboswitch/genética , Aptámeros de Nucleótidos/genética , Biología Computacional/instrumentación , Cinética , Ligandos , Pliegue del ARN , Programas Informáticos
11.
Sensors (Basel) ; 17(9)2017 Aug 30.
Artículo en Inglés | MEDLINE | ID: mdl-28867802

RESUMEN

RNA aptamers readily recognize small organic molecules, polypeptides, as well as other nucleic acids in a highly specific manner. Many such aptamers have evolved as parts of regulatory systems in nature. Experimental selection techniques such as SELEX have been very successful in finding artificial aptamers for a wide variety of natural and synthetic ligands. Changes in structure and/or stability of aptamers upon ligand binding can propagate through larger RNA constructs and cause specific structural changes at distal positions. In turn, these may affect transcription, translation, splicing, or binding events. The RNA secondary structure model realistically describes both thermodynamic and kinetic aspects of RNA structure formation and refolding at a single, consistent level of modelling. Thus, this framework allows studying the function of natural riboswitches in silico. Moreover, it enables rationally designing artificial switches, combining essentially arbitrary sensors with a broad choice of read-out systems. Eventually, this approach sets the stage for constructing versatile biosensors.


Asunto(s)
Riboswitch , Aptámeros de Nucleótidos , Técnicas Biosensibles , Cinética , Ligandos
12.
Bioinformatics ; 33(18): 2850-2858, 2017 Sep 15.
Artículo en Inglés | MEDLINE | ID: mdl-28449031

RESUMEN

MOTIVATION: Realizing the value of synthetic biology in biotechnology and medicine requires the design of molecules with specialized functions. Due to its close structure to function relationship, and the availability of good structure prediction methods and energy models, RNA is perfectly suited to be synthetically engineered with predefined properties. However, currently available RNA design tools cannot be easily adapted to accommodate new design specifications. Furthermore, complicated sampling and optimization methods are often developed to suit a specific RNA design goal, adding to their inflexibility. RESULTS: We developed a C ++ library implementing a graph coloring approach to stochastically sample sequences compatible with structural and sequence constraints from the typically very large solution space. The approach allows to specify and explore the solution space in a well defined way. Our library also guarantees uniform sampling, which makes optimization runs performant by not only avoiding re-evaluation of already found solutions, but also by raising the probability of finding better solutions for long optimization runs. We show that our software can be combined with any other software package to allow diverse RNA design applications. Scripting interfaces allow the easy adaption of existing code to accommodate new scenarios, making the whole design process very flexible. We implemented example design approaches written in Python to demonstrate these advantages. AVAILABILITY AND IMPLEMENTATION: RNAblueprint , Python implementations and benchmark datasets are available at github: https://github.com/ViennaRNA . CONTACT: s.hammer@univie.ac.at, ivo@tbi.univie.ac.at or sven@tbi.univie.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Modelos Moleculares , ARN/química , Programas Informáticos , Biología Sintética/métodos , Conformación de Ácido Nucleico
13.
Nucleic Acids Res ; 45(7): 4108-4119, 2017 04 20.
Artículo en Inglés | MEDLINE | ID: mdl-27994029

RESUMEN

Riboswitches have gained attention as tools for synthetic biology, since they enable researchers to reprogram cells to sense and respond to exogenous molecules. In vitro evolutionary approaches produced numerous RNA aptamers that bind such small ligands, but their conversion into functional riboswitches remains difficult. We previously developed a computational approach for the design of synthetic theophylline riboswitches based on secondary structure prediction. These riboswitches have been constructed to regulate ligand-dependent transcription termination in Escherichia coli. Here, we test the usability of this design strategy by applying the approach to tetracycline and streptomycin aptamers. The resulting tetracycline riboswitches exhibit robust regulatory properties in vivo. Tandem fusions of these riboswitches with theophylline riboswitches represent logic gates responding to two different input signals. In contrast, the conversion of the streptomycin aptamer into functional riboswitches appears to be difficult. Investigations of the underlying aptamer secondary structure revealed differences between in silico prediction and structure probing. We conclude that only aptamers adopting the minimal free energy (MFE) structure are suitable targets for construction of synthetic riboswitches with design approaches based on equilibrium thermodynamics of RNA structures. Further improvements in the design strategy are required to implement aptamer structures not corresponding to the calculated MFE state.


Asunto(s)
Regulación de la Expresión Génica , Riboswitch , Terminación de la Transcripción Genética , Aptámeros de Nucleótidos/química , Biología Computacional/métodos , Simulación por Computador , Escherichia coli/genética , Genes Reporteros , Conformación de Ácido Nucleico , Riboswitch/efectos de los fármacos , Estreptomicina/farmacología , Tetraciclina/farmacología
14.
RNA Biol ; 12(2): 221-31, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25826571

RESUMEN

Riboswitches are RNA-based regulators of gene expression composed of a ligand-sensing aptamer domain followed by an overlapping expression platform. The regulation occurs at either the level of transcription (by formation of terminator or antiterminator structures) or translation (by presentation or sequestering of the ribosomal binding site). Due to a modular composition, these elements can be manipulated by combining different aptamers and expression platforms and therefore represent useful tools to regulate gene expression in synthetic biology. Using computationally designed theophylline-dependent riboswitches we show that 2 parameters, terminator hairpin stability and folding traps, have a major impact on the functionality of the designed constructs. These have to be considered very carefully during design phase. Furthermore, a combination of several copies of individual riboswitches leads to a much improved activation ratio between induced and uninduced gene activity and to a linear dose-dependent increase in reporter gene expression. Such serial arrangements of synthetic riboswitches closely resemble their natural counterparts and may form the basis for simple quantitative read out systems for the detection of specific target molecules in the cell.


Asunto(s)
Diseño de Fármacos , Riboswitch , Transcripción Genética , Escherichia coli/genética , Escherichia coli/metabolismo , Genes Reporteros , Proteínas Fluorescentes Verdes/genética , Proteínas Fluorescentes Verdes/metabolismo , Datos de Secuencia Molecular , Conformación de Ácido Nucleico , Relación Estructura-Actividad , Biología Sintética , Teofilina/química , Termodinámica , beta-Galactosidasa/genética , beta-Galactosidasa/metabolismo
15.
Methods Enzymol ; 550: 1-22, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25605378

RESUMEN

In this chapter, we review both computational and experimental aspects of de novo RNA sequence design. We give an overview of currently available design software and their limitations, and discuss the necessary setup to experimentally validate proper function in vitro and in vivo. We focus on transcription-regulating riboswitches, a task that has just recently lead to first successful designs of such RNA elements.


Asunto(s)
Riboswitch/fisiología , Secuencia de Bases/genética , Secuencia de Bases/fisiología , ARN/química , ARN/genética , Pliegue del ARN , Riboswitch/genética
16.
RNA Biol ; 11(5): 470-83, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-24755880

RESUMEN

The genome of most prokaryotes gives rise to surprisingly complex transcriptomes, comprising not only protein-coding mRNAs, often organized as operons, but also harbors dozens or even hundreds of highly structured small regulatory RNAs and unexpectedly large levels of anti-sense transcripts. Comprehensive surveys of prokaryotic transcriptomes and the need to characterize also their non-coding components is heavily dependent on computational methods and workflows, many of which have been developed or at least adapted specifically for the use with bacterial and archaeal data. This review provides an overview on the state-of-the-art of RNA bioinformatics focusing on applications to prokaryotes.


Asunto(s)
Biología Computacional , Células Procariotas/metabolismo , ARN/genética , Emparejamiento Base , Biología Computacional/métodos , Perfilación de la Expresión Génica , Genómica , Conformación de Ácido Nucleico , ARN/química , Transcriptoma
17.
BMC Bioinformatics ; 15: 89, 2014 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-24674136

RESUMEN

BACKGROUND: Differential RNA sequencing (dRNA-seq) is a high-throughput screening technique designed to examine the architecture of bacterial operons in general and the precise position of transcription start sites (TSS) in particular. Hitherto, dRNA-seq data were analyzed by visualizing the sequencing reads mapped to the reference genome and manually annotating reliable positions. This is very labor intensive and, due to the subjectivity, biased. RESULTS: Here, we present TSSAR, a tool for automated de novo TSS annotation from dRNA-seq data that respects the statistics of dRNA-seq libraries. TSSAR uses the premise that the number of sequencing reads starting at a certain genomic position within a transcriptional active region follows a Poisson distribution with a parameter that depends on the local strength of expression. The differences of two dRNA-seq library counts thus follow a Skellam distribution. This provides a statistical basis to identify significantly enriched primary transcripts.We assessed the performance by analyzing a publicly available dRNA-seq data set using TSSAR and two simple approaches that utilize user-defined score cutoffs. We evaluated the power of reproducing the manual TSS annotation. Furthermore, the same data set was used to reproduce 74 experimentally validated TSS in H. pylori from reliable techniques such as RACE or primer extension. Both analyses showed that TSSAR outperforms the static cutoff-dependent approaches. CONCLUSIONS: Having an automated and efficient tool for analyzing dRNA-seq data facilitates the use of the dRNA-seq technique and promotes its application to more sophisticated analysis. For instance, monitoring the plasticity and dynamics of the transcriptomal architecture triggered by different stimuli and growth conditions becomes possible.The main asset of a novel tool for dRNA-seq analysis that reaches out to a broad user community is usability. As such, we provide TSSAR both as intuitive RESTful Web service ( http://rna.tbi.univie.ac.at/TSSAR) together with a set of post-processing and analysis tools, as well as a stand-alone version for use in high-throughput dRNA-seq data analysis pipelines.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ARN/métodos , Secuencia de Bases , Bases de Datos de Ácidos Nucleicos , Genoma , Genómica/métodos , Helicobacter pylori/genética , Humanos , Programas Informáticos , Stenotrophomonas maltophilia/genética
18.
J Proteomics ; 86: 27-42, 2013 Jun 28.
Artículo en Inglés | MEDLINE | ID: mdl-23665149

RESUMEN

Correct annotation of protein coding genes is the basis of conventional data analysis in proteomic studies. Nevertheless, most protein sequence databases almost exclusively rely on gene finding software and inevitably also miss protein annotations or possess errors. Proteogenomics tries to overcome these issues by matching MS data directly against a genome sequence database. Here we report an in-depth proteogenomics study of Helicobacter pylori strain 26695. MS data was searched against a combined database of the NCBI annotations and a six-frame translation of the genome. Database searches with Mascot and X! Tandem revealed 1115 proteins identified by at least two peptides with a peptide false discovery rate below 1%. This represents 71% of the predicted proteome. So far this is the most extensive proteome study of Helicobacter pylori. Our proteogenomic approach unambiguously identified four previously missed annotations and furthermore allowed us to correct sequences of six annotated proteins. Since secreted proteins are often involved in pathogenic processes we further investigated signal peptidase cleavage sites. By applying a database search that accommodates the identification of semi-specific cleaved peptides, 63 previously unknown signal peptides were detected. The motif LXA showed to be the predominant recognition sequence for signal peptidases. BIOLOGICAL SIGNIFICANCE: The results of MS-based proteomic studies highly rely on correct annotation of protein coding genes which is the basis of conventional data analysis. However, the annotation of protein coding sequences in genomic data is usually based on gene finding software. These tools are limited in their prediction accuracy such as the problematic determination of exact gene boundaries. Thus, protein databases own partly erroneous or incomplete sequences. Additionally, some protein sequences might also be missing in the databases. Proteogenomics, a combination of proteomic and genomic data analyses, is well suited to detect previously not annotated proteins and to correct erroneous sequences. For this purpose, the existing database of the investigated species is typically supplemented with a six-frame translation of the genome. Here, we studied the proteome of the major human pathogen Helicobacter pylori that is responsible for many gastric diseases such as duodenal ulcers and gastric cancer. Our in-depth proteomic study highly reliably identified 1115 proteins (FDR<0.01%) by at least two peptides (FDR<1%) which represent 71% of the predicted proteome deposited at NCBI. The proteogenomic data analysis of our data set resulted in the unambiguous identification of four previously missed annotations, the correction of six annotated proteins as well as the detection of 63 previously unknown signal peptides. We have annotated proteins of particular biological interest like the ferrous iron transport protein A, the coiled-coil-rich protein HP0058 and the lipopolysaccharide biosynthesis protein HP0619. For instance, the protein HP0619 could be a drug target for the inhibition of the LPS synthesis pathway. Furthermore it has been proven that the motif "LXA" is the predominant recognition sequence for the signal peptidase I of H. pylori. Signal peptidases are essential enzymes for the viability of bacterial cells and are involved in pathogenesis. Therefore signal peptidases could be novel targets for antibiotics. The inclusion of the corrected and new annotated proteins as well as the information of signal peptide cleavage sites will help in the study of biological pathways involved in pathogenesis or drug response of H. pylori.


Asunto(s)
Bases de Datos de Proteínas , Genómica/métodos , Helicobacter pylori/genética , Proteínas de la Membrana/metabolismo , Señales de Clasificación de Proteína/genética , Proteómica/métodos , Serina Endopeptidasas/metabolismo , Mutación del Sistema de Lectura , Espectrometría de Masas , Anotación de Secuencia Molecular , Señales de Clasificación de Proteína/fisiología
19.
RNA Biol ; 10(7): 1204-10, 2013 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-23702463

RESUMEN

Prokaryotic transcripts constitute almost always uninterrupted intervals when mapped back to the genome. Split reads, i.e., RNA-seq reads consisting of parts that only map to discontiguous loci, are thus disregarded in most analysis pipelines. There are, however, some well-known exceptions, in particular, tRNA splicing and circularized small RNAs in Archaea as well as self-splicing introns. Here, we reanalyze a series of published RNA-seq data sets, screening them specifically for non-contiguously mapping reads. We recover most of the known cases together with several novel archaeal ncRNAs associated with circularized products. In Eubacteria, only a handful of interesting candidates were obtained beyond a few previously described group I and group II introns. Most of the atypically mapping reads do not appear to correspond to well-defined, specifically processed products. Whether this diffuse background is, at least in part, an incidental by-product of prokaryotic RNA processing or whether it consists entirely of technical artifacts of reverse transcription or amplification remains unknown.


Asunto(s)
Biología Computacional/métodos , Células Procariotas/metabolismo , ARN/química , Análisis de Secuencia de ARN , Transcriptoma , Archaea/genética , Bacterias/genética , Genómica/métodos , Anotación de Secuencia Molecular , ARN/genética
20.
Nucleic Acids Res ; 41(4): 2541-51, 2013 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-23275562

RESUMEN

Riboswitches are regulatory RNA elements typically located in the 5'-untranslated region of certain mRNAs and control gene expression at the level of transcription or translation. These elements consist of a sensor and an adjacent actuator domain. The sensor usually is an aptamer that specifically interacts with a ligand. The actuator contains an intrinsic terminator or a ribosomal binding site for transcriptional or translational regulation, respectively. Ligand binding leads to structural rearrangements of the riboswitch and to presentation or masking of these regulatory elements. Based on this modular organization, riboswitches are an ideal target for constructing synthetic regulatory systems for gene expression. Although riboswitches for translational control have been designed successfully, attempts to construct synthetic elements regulating transcription have failed so far. Here, we present an in silico pipeline for the rational design of synthetic riboswitches that regulate gene expression at the transcriptional level. Using the well-characterized theophylline aptamer as sensor, we designed the actuator part as RNA sequences that can fold into functional intrinsic terminator structures. In the biochemical characterization, several of the designed constructs show ligand-dependent control of gene expression in Escherichia coli, demonstrating that it is possible to engineer riboswitches not only for translational but also for transcriptional regulation.


Asunto(s)
Regulación de la Expresión Génica , Riboswitch , Regiones Terminadoras Genéticas , Terminación de la Transcripción Genética , Aptámeros de Nucleótidos/química , ARN/química , Riboswitch/efectos de los fármacos , Teofilina/farmacología , Terminación de la Transcripción Genética/efectos de los fármacos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...