Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 53
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Methods Mol Biol ; 2726: 125-141, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38780730

RESUMEN

Analysis of the folding space of RNA generally suffers from its exponential size. With classified Dynamic Programming algorithms, it is possible to alleviate this burden and to analyse the folding space of RNA in great depth. Key to classified DP is that the search space is partitioned into classes based on an on-the-fly computed feature. A class-wise evaluation is then used to compute class-wide properties, such as the lowest free energy structure for each class, or aggregate properties, such as the class' probability. In this paper we describe the well-known shape and hishape abstraction of RNA structures, their power to help better understand RNA function and related methods that are based on these abstractions.


Asunto(s)
Algoritmos , Biología Computacional , Conformación de Ácido Nucleico , Pliegue del ARN , ARN , ARN/química , ARN/genética , Biología Computacional/métodos , Programas Informáticos , Termodinámica
2.
Bioinformatics ; 39(11)2023 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-37944046

RESUMEN

SUMMARY: RNA molecules play crucial roles in various biological processes. They mediate their function mainly by interacting with other RNAs or proteins. At present, information about these interactions is distributed over different resources, often providing the data in simple tab-delimited formats that differ between the databases. There is no standardized data format that can capture the nature of all these different interactions in detail. AVAILABILITY AND IMPLEMENTATION: Here, we propose the RNA interaction format (RIF) for the detailed representation of RNA-RNA and RNA-Protein interactions and provide reference implementations in C/C++, Python, and JavaScript. RIF is released under licence GNU General Public License version 3 (GNU GPLv3) and is available on https://github.com/RNABioInfo/rna-interaction-format.


Asunto(s)
ARN , Programas Informáticos , Bases de Datos Factuales , Proteínas
3.
Front Mol Biosci ; 8: 640440, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34055875

RESUMEN

The function and mode of action of small regulatory RNAs is currently still understudied in archaea. In the halophilic archaeon Haloferax volcanii, a plethora of sRNAs have been identified; however, in-depth functional analysis is missing for most of them. We selected a small RNA (s479) from Haloferax volcanii for detailed characterization. The sRNA gene is encoded between a CRISPR RNA locus and the Cas protein gene cluster, and the s479 deletion strain is viable and was characterized in detail. Transcriptome studies of wild-type Haloferax cells and the deletion mutant revealed upregulation of six genes in the deletion strain, showing that this sRNA has a clearly defined function. Three of the six upregulated genes encode potential zinc transporter proteins (ZnuA1, ZnuB1, and ZnuC1) suggesting the involvement of s479 in the regulation of zinc transport. Upregulation of these genes in the deletion strain was confirmed by northern blot and proteome analyses. Furthermore, electrophoretic mobility shift assays demonstrate a direct interaction of s479 with the target znuC1 mRNA. Proteome comparison of wild-type and deletion strains further expanded the regulon of s479 deeply rooting this sRNA within the metabolism of H. volcanii especially the regulation of transporter abundance. Interestingly, s479 is not only encoded next to CRISPR-cas genes, but the mature s479 contains a crRNA-like 5' handle, and experiments with Cas protein deletion strains indicate maturation by Cas6 and interaction with Cas proteins. Together, this might suggest that the CRISPR-Cas system is involved in s479 function.

4.
Nucleic Acids Res ; 49(10): 5493-5501, 2021 06 04.
Artículo en Inglés | MEDLINE | ID: mdl-34019662

RESUMEN

RNA-RNA inter- and intramolecular interactions are fundamental for numerous biological processes. While there are reasonable approaches to map RNA secondary structures genome-wide, understanding how different RNAs interact to carry out their regulatory functions requires mapping of intermolecular base pairs. Recently, different strategies to detect RNA-RNA duplexes in living cells, so called direct duplex detection (DDD) methods, have been developed. Common to all is the Psoralen-mediated in vivo RNA crosslinking followed by RNA Proximity Ligation to join the two interacting RNA strands. Sequencing of the RNA via classical RNA-seq and subsequent specialised bioinformatic analyses the result in the prediction of inter- and intramolecular RNA-RNA interactions. Existing approaches adapt standard RNA-seq analysis pipelines, but often neglect inherent features of RNA-RNA interactions that are useful for filtering and statistical assessment. Here we present RNAnue, a general pipeline for the inference of RNA-RNA interactions from DDD experiments that takes into account hybridisation potential and statistical significance to improve prediction accuracy. We applied RNAnue to data from different DDD studies and compared our results to those of the original methods. This showed that RNAnue performs better in terms of quantity and quality of predictions.


Asunto(s)
Biología Computacional/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , ARN , Análisis de Secuencia de ARN/métodos , Emparejamiento Base , Línea Celular , Análisis de Datos , Humanos , Hibridación de Ácido Nucleico , ARN/química , ARN/metabolismo
5.
Genes (Basel) ; 12(3)2021 02 26.
Artículo en Inglés | MEDLINE | ID: mdl-33652983

RESUMEN

Studying the folding kinetics of an RNA can provide insight into its function and is thus a valuable method for RNA analyses. Computational approaches to the simulation of folding kinetics suffer from the exponentially large folding space that needs to be evaluated. Here, we present a new approach that combines structure abstraction with evolutionary conservation to restrict the analysis to common parts of folding spaces of related RNAs. The resulting algorithm can recapitulate the folding kinetics known for single RNAs and is able to analyse even long RNAs in reasonable time. Our program RNAliHiKinetics is the first algorithm for the simulation of consensus folding kinetics and addresses a long-standing problem in a new and unique way.


Asunto(s)
Algoritmos , Simulación por Computador , Modelos Químicos , Conformación de Ácido Nucleico , ARN/química , Cinética , ARN/genética
6.
Bioinformatics ; 36(15): 4357-4359, 2020 08 01.
Artículo en Inglés | MEDLINE | ID: mdl-32492127

RESUMEN

MOTIVATION: The correct prediction of bacterial sRNA homologs is a prerequisite for many downstream analyses based on comparative genomics, but it is frequently challenging due to the short length and distinct heterogeneity of such homologs. GLobal Automatic Small RNA Search go (GLASSgo) is an efficient tool for the prediction of sRNA homologs from a single input query. To make the algorithm available to a broader community, we offer a Docker container along with a free-access web service. For non-computer scientists, the web service provides a user-friendly interface. However, capabilities were lacking so far for batch processing, version control and direct interaction with compatible software applications as a workflow management system can provide. RESULTS: Here, we present GLASSgo 1.5.2, an updated version that is fully incorporated into the workflow management system Galaxy. The improved version contains a new feature for extracting the upstream regions, allowing the search for conserved promoter elements. Additionally, it supports the use of accession numbers instead of the outdated GI numbers, which widens the applicability of the tool. AVAILABILITY AND IMPLEMENTATION: GLASSgo is available at https://github.com/lotts/GLASSgo/ under the MIT license and is accompanied by instruction and application data. Furthermore, it can be installed into any Galaxy instance using the Galaxy ToolShed.


Asunto(s)
Biología Computacional , Programas Informáticos , Algoritmos , Genómica , Flujo de Trabajo
7.
Microb Biotechnol ; 13(4): 1145-1161, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32267616

RESUMEN

Pseudomonas putida is recognized as a very promising strain for industrial application due to its high redox capacity and frequently observed tolerance towards organic solvents. In this research, we studied the metabolic and transcriptional response of P. putida KT2440 exposed to large-scale heterogeneous mixing conditions in the form of repeated glucose shortage. Cellular responses were mimicked in an experimental setup comprising a stirred tank reactor and a connected plug flow reactor. We deciphered that a stringent response-like transcriptional regulation programme is frequently induced, which seems to be linked to the intracellular pool of 3-hydroxyalkanoates (3-HA) that are known to serve as precursors for polyhydroxyalkanoates (PHA). To be precise, P. putida is endowed with a survival strategy likely to access cellular PHA, amino acids and glycogen in few seconds under glucose starvation to obtain ATP from respiration, thereby replenishing the reduced ATP levels and the adenylate energy charge. Notably, cells only need 0.4% of glucose uptake to build those 3-HA-based energy buffers. Concomitantly, genes that are related to amino acid catabolism and ß-oxidation are upregulated during the transient absence of glucose. Furthermore, we provide a detailed list of transcriptional short- and long-term responses that increase the cellular maintenance by about 17% under the industrial-like conditions tested.


Asunto(s)
Polihidroxialcanoatos , Pseudomonas putida , Pseudomonas putida/genética
8.
Methods ; 172: 3-11, 2020 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-31326596

RESUMEN

Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated proteins (Cas) are essential genetic elements in many archaeal and bacterial genomes, playing a key role in a prokaryote adaptive immune system against invasive foreign elements. In recent years, the CRISPR-Cas system has also been engineered to facilitate target gene editing in eukaryotic genomes. Bioinformatics played an essential role in the detection and analysis of CRISPR systems and here we review the bioinformatics-based efforts that pushed the field of CRISPR-Cas research further. We discuss the bioinformatics tools that have been published over the last few years and, finally, present the most popular tools for the design of CRISPR-Cas9 guides.


Asunto(s)
Sistemas CRISPR-Cas/genética , Biología Computacional/métodos , Edición Génica , Algoritmos , Biología Computacional/tendencias , ARN Guía de Kinetoplastida/genética
9.
F1000Res ; 72018.
Artículo en Inglés | MEDLINE | ID: mdl-30519453

RESUMEN

Tight regulation of cellular processes is key to the development of complex organisms but also vital for simpler ones. During evolution, different regulatory systems have emerged, among them RNA-based regulation that is carried out mainly by intramolecular and intermolecular RNA-RNA interactions. However, methods for the transcriptome-wide detection of these interactions were long unavailable. Recently, three publications described high-throughput methods to directly detect RNA duplexes in living cells. This promises to enable in-depth studies of RNA-based regulation and will narrow the gaps in our understanding of RNA structure and function. In this review, we highlight the benefits of these methods and their commonalities and differences and, in particular, point to methodological shortcomings that hamper their wider application. We conclude by presenting ideas for how to overcome these problems and commenting on the prospects we see in this area of research.


Asunto(s)
ARN/química , Transcriptoma , Animales , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , ARN/genética , ARN/fisiología
10.
Front Genet ; 9: 124, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-29719549

RESUMEN

Bacterial small RNAs (sRNAs) are important post-transcriptional regulators of gene expression. The functional and evolutionary characterization of sRNAs requires the identification of homologs, which is frequently challenging due to their heterogeneity, short length and partly, little sequence conservation. We developed the GLobal Automatic Small RNA Search go (GLASSgo) algorithm to identify sRNA homologs in complex genomic databases starting from a single sequence. GLASSgo combines an iterative BLAST strategy with pairwise identity filtering and a graph-based clustering method that utilizes RNA secondary structure information. We tested the specificity, sensitivity and runtime of GLASSgo, BLAST and the combination RNAlien/cmsearch in a typical use case scenario on 40 bacterial sRNA families. The sensitivity of the tested methods was similar, while the specificity of GLASSgo and RNAlien/cmsearch was significantly higher than that of BLAST. GLASSgo was on average ∼87 times faster than RNAlien/cmsearch, and only ∼7.5 times slower than BLAST, which shows that GLASSgo optimizes the trade-off between speed and accuracy in the task of finding sRNA homologs. GLASSgo is fully automated, whereas BLAST often recovers only parts of homologs and RNAlien/cmsearch requires extensive additional bioinformatic work to get a comprehensive set of homologs. GLASSgo is available as an easy-to-use web server to find homologous sRNAs in large databases.

11.
Mar Genomics ; 33: 21-25, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-28237778

RESUMEN

De-novo assembly of a metagenomic dataset obtained from a decaying cyanobacterial Trichodesmium bloom from the New Caledonian lagoon resulted in a complete giant phage genome of 257,908bp, obtained independently with multiple assembly tools. Noteworthy, gammaproteobacteria were an abundant fraction in the sequenced samples. Mapping of the raw reads with 99% accuracy to the giant phage genome resulted in an average coverage of 262X. The closest sequenced relatives, albeit still distant, are the Pseudomonas phages PaBG from Lake Baikal and Lu11 isolated from a soil sample from the Philippines. The phage reported here might belong to the same family within the Myoviridae as PaBG and Lu11 and would thus be its first marine member, indicating a more widespread occurrence of this group. We named this phage NCTB (New Caledonia Trichodesmium Bloom) after its origin.


Asunto(s)
ADN Viral/genética , Genoma Viral , Myoviridae/genética , Trichodesmium/virología , Eutrofización , Gammaproteobacteria/virología , Metagenómica , Myoviridae/aislamiento & purificación , Nueva Caledonia
12.
Plant Physiol ; 172(4): 2314-2326, 2016 12.
Artículo en Inglés | MEDLINE | ID: mdl-27729470

RESUMEN

Phytoene synthase (PSY) catalyzes the highly regulated, frequently rate-limiting synthesis of the first biosynthetically formed carotene. While PSY constitutes a small gene family in most plant taxa, the Brassicaceae, including Arabidopsis (Arabidopsis thaliana), predominantly possess a single PSY gene. This monogenic situation is compensated by the differential expression of two alternative splice variants (ASV), which differ in length and in the exon/intron retention of their 5'UTRs. ASV1 contains a long 5'UTR (untranslated region) and is involved in developmentally regulated carotenoid formation, such as during deetiolation. ASV2 contains a short 5'UTR and is preferentially induced when an immediate increase in the carotenoid pathway flux is required, such as under salt stress or upon sudden light intensity changes. We show that the long 5'UTR of ASV1 is capable of attenuating the translational activity in response to high carotenoid pathway fluxes. This function resides in a defined 5'UTR stretch with two predicted interconvertible RNA conformations, as known from riboswitches, which might act as a flux sensor. The translation-inhibitory structure is absent from the short 5'UTR of ASV2 allowing to bypass translational inhibition under conditions requiring rapidly increased pathway fluxes. The mechanism is not found in the rice (Oryza sativa) PSY1 5'UTR, consistent with the prevalence of transcriptional control mechanisms in taxa with multiple PSY genes. The translational control mechanism identified is interpreted in terms of flux adjustments needed in response to retrograde signals stemming from intermediates of the plastid-localized carotenoid biosynthesis pathway.


Asunto(s)
Regiones no Traducidas 5'/genética , Empalme Alternativo/genética , Proteínas de Arabidopsis/genética , Arabidopsis/enzimología , Arabidopsis/genética , Carotenoides/biosíntesis , Complejos Multienzimáticos/genética , Biosíntesis de Proteínas/genética , Proteínas de Arabidopsis/metabolismo , Carotenoides/genética , Carotenoides/metabolismo , Biología Computacional , Regulación de la Expresión Génica de las Plantas , Genes Reporteros , Glucuronidasa/metabolismo , Modelos Biológicos , Complejos Multienzimáticos/metabolismo , Hojas de la Planta/enzimología , Hojas de la Planta/genética
13.
Bioinformatics ; 32(22): 3525-3527, 2016 11 15.
Artículo en Inglés | MEDLINE | ID: mdl-27412091

RESUMEN

MOTIVATION: We developed VisualGraphX, a web-based, interactive visualization tool for large-scale graphs. Current graph visualization tools that follow the rich-internet paradigm lack an interactive and scalable visualization of graph-based data. VisualGraphX aims to provide a universal graph visualization tool that empowers the users to efficiently explore the data for themselves at a large scale. It is available as a visualization plugin for the Galaxy platform, such that VisualGraphX can be integrated into custom analysis pipelines. AVAILABILITY AND IMPLEMENTATION: VisualGraphX has been released as a visualization plugin for the Galaxy platform under AFL 3.0 and is available with instructions and application data at http://gitlab.com/comptrans/VisualGraphX/ CONTACT: bjoern.voss@ibvt.uni-stuttgart.de.


Asunto(s)
Procesamiento de Imagen Asistido por Computador , Internet
14.
Sci Rep ; 5: 9560, 2015 Apr 22.
Artículo en Inglés | MEDLINE | ID: mdl-25902393

RESUMEN

In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5'UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5'UTR. Such an sRNA/mRNA structure, which we name 'actuaton', represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation.


Asunto(s)
Adaptación Fisiológica/genética , Synechocystis/genética , Transcriptoma , Regiones no Traducidas 5' , Proteínas Bacterianas/química , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Secuencia de Bases , Datos de Secuencia Molecular , Conformación de Ácido Nucleico , Polimorfismo de Nucleótido Simple , Regiones Promotoras Genéticas , ARN sin Sentido/química , ARN sin Sentido/metabolismo , ARN Mensajero/química , ARN Mensajero/metabolismo , Alineación de Secuencia , Factores de Transcripción/química , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Sitio de Iniciación de la Transcripción
15.
Front Genet ; 6: 43, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25750651

RESUMEN

The visualization of massive datasets, such as those resulting from comparative metatranscriptome analyses or the analysis of microbial population structures using ribosomal RNA sequences, is a challenging task. We developed a new method called CoVennTree (Comparative weighted Venn Tree) that simultaneously compares up to three multifarious datasets by aggregating and propagating information from the bottom to the top level and produces a graphical output in Cytoscape. With the introduction of weighted Venn structures, the contents and relationships of various datasets can be correlated and simultaneously aggregated without losing information. We demonstrate the suitability of this approach using a dataset of 16S rDNA sequences obtained from microbial populations at three different depths of the Gulf of Aqaba in the Red Sea. CoVennTree has been integrated into the Galaxy ToolShed and can be directly downloaded and integrated into the user instance.

16.
Genome Announc ; 2(4)2014 Jul 31.
Artículo en Inglés | MEDLINE | ID: mdl-25081267

RESUMEN

Synechocystis sp. strain PCC 6714 is a unicellular cyanobacterium closely related to the popular model organism Synechocystis sp. strain PCC 6803. A combination of PacBio SMRT and Illumina GAIIx data results in a highly accurate finished genome sequence that provides a reliable resource for further comparative analyses.

17.
BMC Genomics ; 15: 698, 2014 Aug 20.
Artículo en Inglés | MEDLINE | ID: mdl-25142710

RESUMEN

BACKGROUND: Marine diatoms constitute a major component of eukaryotic phytoplankton and stand at the crossroads of several evolutionary lineages. These microalgae possess peculiar genomic features and novel combinations of genes acquired from bacterial, animal and plant ancestors. Furthermore, they display both DNA methylation and gene silencing activities. Yet, the biogenesis and regulatory function of small RNAs (sRNAs) remain ill defined in diatoms. RESULTS: Here we report the first comprehensive characterization of the sRNA landscape and its correlation with genomic and epigenomic information in Phaeodactylum tricornutum. The majority of sRNAs is 25 to 30 nt-long and maps to repetitive and silenced Transposable Elements marked by DNA methylation. A subset of this population also targets DNA methylated protein-coding genes, suggesting that gene body methylation might be sRNA-driven in diatoms. Remarkably, 25-30 nt sRNAs display a well-defined and unprecedented 180 nt-long periodic distribution at several highly methylated regions that awaits characterization. While canonical miRNAs are not detectable, other 21-25 nt sRNAs of unknown origin are highly expressed. Besides, non-coding RNAs with well-described function, namely tRNAs and U2 snRNA, constitute a major source of 21-25 nt sRNAs and likely play important roles under stressful environmental conditions. CONCLUSIONS: P. tricornutum has evolved diversified sRNA pathways, likely implicated in the regulation of largely still uncharacterized genetic and epigenetic processes. These results uncover an unexpected complexity of diatom sRNA population and previously unappreciated features, providing new insights into the diversification of sRNA-based processes in eukaryotes.


Asunto(s)
Diatomeas/genética , Diatomeas/metabolismo , ARN Pequeño no Traducido/genética , ARN Pequeño no Traducido/metabolismo , Mapeo Cromosómico , Biología Computacional , Metilación de ADN , Elementos Transponibles de ADN , Regulación de la Expresión Génica , Genoma , Estudio de Asociación del Genoma Completo , Genómica , Anotación de Secuencia Molecular , Conformación de Ácido Nucleico , ARN Nuclear Pequeño/genética , ARN Nuclear Pequeño/metabolismo , ARN Pequeño no Traducido/química
18.
DNA Res ; 21(5): 527-39, 2014 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-24935866

RESUMEN

RNA-seq and especially differential RNA-seq-type transcriptomic analyses (dRNA-seq) are powerful analytical tools, as they not only provide insights into gene expression changes but also provide detailed information about all promoters active at a given moment, effectively giving a deep insight into the transcriptional landscape. Synechocystis sp. PCC 6803 (Synechocystis 6803) is a unicellular model cyanobacterium that is widely used in research fields from ecology, photophysiology to systems biology, modelling and biotechnology. Here, we analysed the response of the Synechocystis 6803 primary transcriptome to different, environmentally relevant stimuli. We established genome-wide maps of the transcriptional start sites active under 10 different conditions relevant for photosynthetic growth and identified 4,091 transcriptional units, which provide information about operons, 5' and 3' untranslated regions (UTRs). Based on a unique expression factor, we describe regulons and relevant promoter sequences at single-nucleotide resolution. Finally, we report several sRNAs with an intriguing expression pattern and therefore likely function, specific for carbon depletion (CsiR1), nitrogen depletion (NsiR4), phosphate depletion (PsiR1), iron stress (IsaR1) or photosynthesis (PsrR1). This dataset is accompanied by comprehensive information providing extensive visualization and data access to allow an easy-to-use approach for the design of experiments, the incorporation into modelling studies of the regulatory system and for comparative analyses.


Asunto(s)
Synechocystis/genética , Transcriptoma , Regiones no Traducidas 3' , Regiones no Traducidas 5' , Regulación Bacteriana de la Expresión Génica , Genoma Bacteriano , Operón , Fotosíntesis/genética , ARN Bacteriano/genética , ARN Bacteriano/metabolismo , Synechocystis/metabolismo , Sitio de Iniciación de la Transcripción
19.
BMC Bioinformatics ; 15: 122, 2014 Apr 29.
Artículo en Inglés | MEDLINE | ID: mdl-24780064

RESUMEN

BACKGROUND: RNA-seq and its variant differential RNA-seq (dRNA-seq) are today routine methods for transcriptome analysis in bacteria. While expression profiling and transcriptional start site prediction are standard tasks today, the problem of identifying transcriptional units in a genome-wide fashion is still not solved for prokaryotic systems. RESULTS: We present RNAseg, an algorithm for the prediction of transcriptional units based on dRNA-seq data. A key feature of the algorithm is that, based on the data, it distinguishes between transcribed and un-transcribed genomic segments. Furthermore, the program provides many different predictions in a single run, which can be used to infer the significance of transcriptional units in a consensus procedure. We show the performance of our method based on a well-studied dRNA-seq data set for Helicobacter pylori. CONCLUSIONS: With our algorithm it is possible to identify operons and 5'- and 3'-UTRs in an automated fashion. This alleviates the need for labour intensive manual inspection and enables large-scale studies in the area of comparative transcriptomics.


Asunto(s)
Algoritmos , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia de ARN/métodos , Genómica , Helicobacter pylori/genética , Operón , Sitio de Iniciación de la Transcripción , Regiones no Traducidas
20.
ISME J ; 8(10): 2056-68, 2014 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-24739626

RESUMEN

Prochlorococcus is a genus of abundant and ecologically important marine cyanobacteria. Here, we present a comprehensive comparison of the structure and composition of the transcriptomes of two Prochlorococcus strains, which, despite their similarities, have adapted their gene pool to specific environmental constraints. We present genome-wide maps of transcriptional start sites (TSS) for both organisms, which are representatives of the two most diverse clades within the two major ecotypes adapted to high- and low-light conditions, respectively. Our data suggest antisense transcription for three-quarters of all genes, which is substantially more than that observed in other bacteria. We discovered hundreds of TSS within genes, most notably within 16 of the 29 prochlorosin genes, in strain MIT9313. A direct comparison revealed very little conservation in the location of TSS and the nature of non-coding transcripts between both strains. We detected extremely short 5' untranslated regions with a median length of only 27 and 29 nt for MED4 and MIT9313, respectively, and for 8% of all protein-coding genes the median distance to the start codon is only 10 nt or even shorter. These findings and the absence of an obvious Shine-Dalgarno motif suggest that leaderless translation and ribosomal protein S1-dependent translation constitute alternative mechanisms for translation initiation in Prochlorococcus. We conclude that genome-wide antisense transcription is a major component of the transcriptional output from these relatively small genomes and that a hitherto unrecognized high degree of complexity and variability of gene expression exists in their transcriptional architecture.


Asunto(s)
Prochlorococcus/genética , Transcriptoma , Regiones no Traducidas 5' , Perfilación de la Expresión Génica , Regulación Bacteriana de la Expresión Génica , Luz , Fotosíntesis/genética , Prochlorococcus/metabolismo , ARN sin Sentido/química , ARN no Traducido/análisis , Sitio de Iniciación de la Transcripción
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...