Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 40
Filtrar
1.
PLoS Genet ; 20(3): e1011188, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38442125

RESUMO

Bacteria have evolved complex transcriptional regulatory networks, as well as many diverse regulatory strategies at the RNA level, to enable more efficient use of metabolic resources and a rapid response to changing conditions. However, most RNA-based regulatory mechanisms are not well conserved across different bacterial species despite controlling genes important for virulence or essential biosynthetic processes. Here, we characterize the activity of, and assess the fitness benefit conferred by, twelve cis-acting regulatory RNAs (including several riboswitches and a T-box), in the opportunistic pathogen Streptococcus pneumoniae TIGR4. By evaluating native locus mutants of each regulator that result in constitutively active or repressed expression, we establish that growth defects in planktonic culture are associated with constitutive repression of gene expression, while constitutive activation of gene expression is rarely deleterious. In contrast, in mouse nasal carriage and pneumonia models, strains with either constitutively active and repressed gene expression are significantly less fit than matched control strains. Furthermore, two RNA-regulated pathways, FMN synthesis/transport and pyrimidine synthesis/transport display exceptional sensitivity to mis-regulation or constitutive gene repression in both planktonic culture and in vivo environments. Thus, despite lack of obvious phenotypes associated with constitutive gene expression in vitro, the fitness benefit conferred on bacteria via fine-tuned metabolic regulation through cis-acting regulatory RNAs is substantial in vivo, and therefore easily sufficient to drive the evolution and maintenance of diverse RNA regulatory mechanisms.


Assuntos
RNA , Streptococcus pneumoniae , Animais , Camundongos , Streptococcus pneumoniae/genética , RNA/metabolismo , Virulência/genética , Fenótipo , Regulação Bacteriana da Expressão Gênica , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo
2.
Brief Bioinform ; 24(3)2023 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-36951499

RESUMO

Riboswitches are conserved structural ribonucleic acid (RNA) sensors that are mainly found to regulate a large number of genes/operons in bacteria. Presently, >50 bacterial riboswitch classes have been discovered, but only the thiamine pyrophosphate riboswitch class is detected in a few eukaryotes like fungi, plants and algae. One of the most important challenges in riboswitch research is to discover existing riboswitch classes in eukaryotes and to understand the evolution of bacterial riboswitches. However, traditional search methods for riboswitch detection have failed to detect eukaryotic riboswitches besides just one class and any distant structural homologs of riboswitches. We developed a novel approach based on inverse RNA folding that attempts to find sequences that match the shape of the target structure with minimal sequence conservation based on key nucleotides that interact directly with the ligand. Then, to support our matched candidates, we expanded the results into a covariance model representing similar sequences preserving the structure. Our method transforms a structure-based search into a sequence-based search that considers the conservation of secondary structure shape and ligand-binding residues. This method enables us to identify a potential structural candidate in fungi that could be the distant homolog of bacterial purine riboswitches. Further, phylogenomic analysis and evolutionary distribution of this structural candidate indicate that the most likely point of origin of this structural candidate in these organisms is associated with the loss of traditional purine riboswitches. The computational approach could be applicable to other domains and problems in RNA research.


Assuntos
Riboswitch , Riboswitch/genética , Dobramento de RNA , RNA , Ligantes , Bactérias/genética , Fungos/genética , Purinas , RNA Bacteriano/genética , Conformação de Ácido Nucleico
3.
J Mol Evol ; 89(3): 165-171, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33216148

RESUMO

Over twenty years ago Galtier and Lobry published a manuscript entitled "Relationships between Genomic G + C Content, RNA Secondary Structure, and Optimal Growth Temperature" in the Journal of Molecular Evolution that showcased the lack of a relationship between genomic G + C content and optimal growth temperature (OGT) in a set of about 200 prokaryotes. Galtier and Lobry also assessed the relationship between RNA secondary structures (rRNA stems, tRNAs) and OGT, and in this case a clear relationship emerged. Increasing structured RNA G + C content (particularly in regions that are double-stranded) correlates with increased OGT. Both of these fundamental relationships have withstood test of many additional sequences and spawned a variety of different applications that include prediction of OGT from rRNA sequence and computational ncRNA identification approaches. In this work, I present the motivation behind Galtier and Lobry's original paper and the larger questions addressed by the work, how these questions have evolved over the last two decades, and the impact of Galtier and Lobry's manuscript in fields beyond these questions.


Assuntos
Genoma , RNA , Genômica , Conformação de Ácido Nucleico , Células Procarióticas , RNA/genética , Temperatura
4.
J Mol Evol ; 88(3): 227, 2020 04.
Artigo em Inglês | MEDLINE | ID: mdl-31993689

RESUMO

The tRNA is a critical component in all modern translation systems as well as an important intermediate in models of early protein coding systems. In the following works, proponents for each of the major hypotheses for tRNA origin and evolution engage in discussion of the merits for each model.


Assuntos
Evolução Molecular , Modelos Genéticos , RNA de Transferência/genética
5.
RNA ; 24(9): 1133-1143, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-29925569

RESUMO

In many bacteria, ribosomal proteins autogenously repress their own expression by interacting with RNA structures typically located in the 5'-UTRs of their mRNA transcripts. This regulation is necessary to maintain a balance between ribosomal proteins and rRNA to ensure proper ribosome production. Despite advances in noncoding RNA discovery and validation of RNA-protein regulatory interactions, the selective pressures that govern the formation and maintenance of such RNA cis-regulators in the context of an organism remain largely undetermined. To examine the impact disruptions to this regulation have on bacterial fitness, we introduced point mutations that abolish ribosomal protein binding and regulation into the RNA structure that controls expression of ribosomal proteins L20 and L35 within the Bacillus subtilis genome. Our studies indicate that removing this regulation results in reduced log phase growth, improper rRNA maturation, and the accumulation of a kinetically trapped or misassembled ribosomal particle at low temperatures, suggesting defects in ribosome synthesis. Such work emphasizes the important role regulatory RNAs play in the stoichiometric production of ribosomal components for proper ribosome composition and overall organism viability and reinforces the potential of targeting ribosomal protein production and ribosome assembly with novel antimicrobials.


Assuntos
Bacillus subtilis/crescimento & desenvolvimento , Proteínas de Bactérias/metabolismo , Mutação Puntual , RNA Mensageiro/metabolismo , Proteínas Ribossômicas/metabolismo , Bacillus subtilis/genética , Bacillus subtilis/metabolismo , Proteínas de Bactérias/química , Proteínas de Bactérias/genética , Aptidão Genética , Viabilidade Microbiana , Conformação de Ácido Nucleico , Ligação Proteica , RNA Bacteriano/metabolismo , RNA Mensageiro/química , RNA Mensageiro/genética , RNA Ribossômico/metabolismo , Proteínas Ribossômicas/química , Proteínas Ribossômicas/genética
6.
PLoS Pathog ; 14(12): e1007461, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30517198

RESUMO

Efficient and highly organized regulation of transcription is fundamental to an organism's ability to survive, proliferate, and quickly respond to its environment. Therefore, precise mapping of transcriptional units and understanding their regulation is crucial to determining how pathogenic bacteria cause disease and how they may be inhibited. In this study, we map the transcriptional landscape of the bacterial pathogen Streptococcus pneumoniae TIGR4 by applying a combination of high-throughput RNA-sequencing techniques. We successfully map 1864 high confidence transcription termination sites (TTSs), 790 high confidence transcription start sites (TSSs) (742 primary, and 48 secondary), and 1360 low confidence TSSs (74 secondary and 1286 primary) to yield a total of 2150 TSSs. Furthermore, our study reveals a complex transcriptome wherein environment-respondent alternate transcriptional units are observed within operons stemming from internal TSSs and TTSs. Additionally, we identify many putative cis-regulatory RNA elements and riboswitches within 5'-untranslated regions (5'-UTR). By integrating TSSs and TTSs with independently collected RNA-Seq datasets from a variety of conditions, we establish the response of these regulators to changes in growth conditions and validate several of them. Furthermore, to demonstrate the importance of ribo-regulation by 5'-UTR elements for in vivo virulence, we show that the pyrR regulatory element is essential for survival, successful colonization and infection in mice suggesting that such RNA elements are potential drug targets. Importantly, we show that our approach of combining high-throughput sequencing with in vivo experiments can reconstruct a global understanding of regulation, but also pave the way for discovery of compounds that target (ribo-)regulators to mitigate virulence and antibiotic resistance.


Assuntos
Streptococcus pneumoniae/genética , Streptococcus pneumoniae/patogenicidade , Virulência/genética , Animais , Genes Bacterianos/genética , Sequenciamento de Nucleotídeos em Larga Escala , Camundongos , Óperon/genética , Transcrição Gênica
7.
Biochem Soc Trans ; 48(5): 1941-1951, 2020 10 30.
Artigo em Inglês | MEDLINE | ID: mdl-32869842

RESUMO

Structured cis-regulatory RNAs have evolved across all domains of life, highlighting the utility and plasticity of RNA as a regulatory molecule. Homologous RNA sequences and structures often have similar functions, but homology may also be deceiving. The challenges that derive from trying to assign function to structure and vice versa are not trivial. Bacterial riboswitches, viral and eukaryotic IRESes, CITEs, and 3' UTR elements employ an array of mechanisms to exert their effects. Bioinformatic searches coupled with biochemical and functional validation have elucidated some shared and many unique ways cis-regulators are employed in mRNA transcripts. As cis-regulatory RNAs are resolved in greater detail, it is increasingly apparent that shared homology can mask the full spectrum of mRNA cis-regulator functional diversity. Furthermore, similar functions may be obscured by lack of obvious sequence similarity. Thus looking beyond homology is crucial for furthering our understanding of RNA-based regulation.


Assuntos
Bactérias/genética , Biologia Computacional , Conformação de Ácido Nucleico , RNA Bacteriano/genética , Riboswitch/genética , Regiões 3' não Traduzidas , Sítios de Ligação , Evolução Molecular , Perfilação da Expressão Gênica , Glicina/genética , Humanos , Sítios Internos de Entrada Ribossomal , Ligantes , Funções Verossimilhança , Filogenia , RNA/metabolismo
8.
PLoS Comput Biol ; 15(12): e1007564, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31860665

RESUMO

In comparison to protein coding sequences, the impact of mutation and natural selection on the sequence and function of non-coding (ncRNA) genes is not well understood. Many ncRNA genes are narrowly distributed to only a few organisms, and appear to be rapidly evolving. Compared to protein coding sequences, there are many challenges associated with assessment of ncRNAs that are not well addressed by conventional phylogenetic approaches, including: short sequence length, lack of primary sequence conservation, and the importance of secondary structure for biological function. Riboswitches are structured ncRNAs that directly interact with small molecules to regulate gene expression in bacteria. They typically consist of a ligand-binding domain (aptamer) whose folding changes drive changes in gene expression. The glycine riboswitch is among the most well-studied due to the widespread occurrence of a tandem aptamer arrangement (tandem), wherein two homologous aptamers interact with glycine and each other to regulate gene expression. However, a significant proportion of glycine riboswitches are comprised of single aptamers (singleton). Here we use graph clustering to circumvent the limitations of traditional phylogenetic analysis when studying the relationship between the tandem and singleton glycine aptamers. Graph clustering enables a broader range of pairwise comparison measures to be used to assess aptamer similarity. Using this approach, we show that one aptamer of the tandem glycine riboswitch pair is typically much more highly conserved, and that which aptamer is conserved depends on the regulated gene. Furthermore, our analysis also reveals that singleton aptamers are more similar to either the first or second tandem aptamer, again based on the regulated gene. Taken together, our findings suggest that tandem glycine riboswitches degrade into functional singletons, with the regulated gene(s) dictating which glycine-binding aptamer is conserved.


Assuntos
Aptâmeros de Nucleotídeos/química , Aptâmeros de Nucleotídeos/genética , Glicina/química , Riboswitch/genética , Bacillaceae/classificação , Bacillaceae/genética , Biologia Computacional , Evolução Molecular , Genoma Bacteriano , Modelos Genéticos , Modelos Moleculares , Conformação de Ácido Nucleico , Filogenia , Vibrionaceae/classificação , Vibrionaceae/genética
10.
Nucleic Acids Res ; 44(19): 9331-9341, 2016 Nov 02.
Artigo em Inglês | MEDLINE | ID: mdl-27580716

RESUMO

There are several natural examples of distinct RNA structures that interact with the same ligand to regulate the expression of homologous genes in different organisms. One essential question regarding this phenomenon is whether such RNA regulators are the result of convergent or divergent evolution. Are the RNAs derived from some common ancestor and diverged to the point where we cannot identify the similarity, or have multiple solutions to the same biological problem arisen independently? A key variable in assessing these alternatives is how frequently such regulators arise within sequence space. Ribosomal protein S15 is autogenously regulated via an RNA regulator in many bacterial species; four apparently distinct regulators have been functionally validated in different bacterial phyla. Here, we explore how frequently such regulators arise within a partially randomized sequence population. We find many RNAs that interact specifically with ribosomal protein S15 from Geobacillus kaustophilus with biologically relevant dissociation constants. Furthermore, of the six sequences we characterize, four show regulatory activity in an Escherichia coli reporter assay. Subsequent footprinting and mutagenesis analysis indicates that protein binding proximal to regulatory features such as the Shine-Dalgarno sequence is sufficient to enable regulation, suggesting that regulation in response to S15 is relatively easily acquired.


Assuntos
RNA/genética , RNA/metabolismo , Proteínas Ribossômicas/metabolismo , Aptâmeros de Nucleotídeos , Sequência de Bases , Sítios de Ligação , Regulação da Expressão Gênica , Mutação , Conformação de Ácido Nucleico , Óperon , Ligação Proteica , RNA/química , RNA Bacteriano/química , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , RNA Mensageiro/química , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA Ribossômico/química , RNA Ribossômico/genética , RNA Ribossômico/metabolismo , Sequências Reguladoras de Ácido Ribonucleico
11.
PLoS Genet ; 11(12): e1005720, 2015 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-26675164

RESUMO

RNA-protein interactions are critical in many biological processes, yet how such interactions affect the evolution of both partners is still unknown. RNA and protein structures are impacted very differently by mechanisms of genomic change. While most protein families are identifiable at the nucleotide level across large phylogenetic distances, RNA families display far less nucleotide similarity and are often only shared by closely related bacterial species. Ribosomal protein S15 has two RNA binding functions. First, it is a ribosomal protein responsible for organizing the rRNA during ribosome assembly. Second, in many bacterial species S15 also interacts with a structured portion of its own transcript to negatively regulate gene expression. While the first interaction is conserved in most bacteria, the second is not. Four distinct mRNA structures interact with S15 to enable regulation, each of which appears to be independently derived in different groups of bacteria. With the goal of understanding how protein-binding specificity may influence the evolution of such RNA regulatory structures, we examine whether examples of these mRNA structures are able to interact with, and regulate in response to, S15 homologs from organisms containing distinct mRNA structures. We find that despite their shared RNA binding function in the rRNA, S15 homologs have distinct RNA recognition profiles. We present a model to explain the specificity patterns observed, and support this model by with further mutagenesis. After analyzing the patterns of conservation for the S15 protein coding sequences, we also identified amino acid changes that alter the binding specificity of an S15 homolog. In this work we demonstrate that homologous RNA-binding proteins have different specificity profiles, and minor changes to amino acid sequences, or to RNA structural motifs, can have large impacts on RNA-protein recognition.


Assuntos
Filogenia , RNA Mensageiro/genética , Proteínas de Ligação a RNA/genética , Proteínas Ribossômicas/genética , Sequência de Bases , Sítios de Ligação , Escherichia coli , Regulação da Expressão Gênica , Conformação de Ácido Nucleico , RNA Mensageiro/química , RNA Ribossômico/química , RNA Ribossômico/genética , Proteínas de Ligação a RNA/química , Proteínas Ribossômicas/química , Ribossomos/genética
12.
BMC Bioinformatics ; 18(1): 298, 2017 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-28587636

RESUMO

BACKGROUND: Proteins recognize many different aspects of RNA ranging from single stranded regions to discrete secondary or tertiary structures. High-throughput sequencing (HTS) of in vitro selected populations offers a large scale method to study RNA-proteins interactions. However, most existing analysis methods require that the binding motifs are enriched in the population relative to earlier rounds, and that motifs are found in a loop or single stranded region of the potential RNA secondary structure. Such methods do not generalize to all RNA-protein interaction as some RNA binding proteins specifically recognize more complex structures such as double stranded RNA. RESULTS: In this study, we use HT-SELEX derived populations to study the landscape of RNAs that interact with Geobacillus kaustophilus ribosomal protein S15. Our data show high sequence and structure diversity and proved intractable to existing methods. Conventional programs identified some sequence motifs, but these are found in less than 5-10% of the total sequence pool. Therefore, we developed a novel framework to analyze HT-SELEX data. Our process accounts for both sequence and structure components by abstracting the overall secondary structure into smaller substructures composed of a single base-pair stack, which allows us to leverage existing approaches already used in k-mer analysis to identify enriched motifs. By focusing on secondary structure motifs composed of specific two base-pair stacks, we identified significantly enriched or depleted structure motifs relative to earlier rounds. CONCLUSIONS: Discrete substructures are likely to be important to RNA-protein interactions, but they are difficult to elucidate. Substructures can help make highly diverse sequence data more tractable. The structure motifs provide limited accuracy in predicting enrichment suggesting that G. kaustophilus S15 can either recognize many different secondary structure motifs or some aspects of the interaction are not captured by the analysis. This highlights the importance of considering secondary and tertiary structure elements and their role in RNA-protein interactions.


Assuntos
Algoritmos , Proteínas Ribossômicas/metabolismo , Pareamento de Bases , Sequência de Bases , Análise por Conglomerados , Geobacillus/genética , Sequenciamento de Nucleotídeos em Larga Escala , Modelos Logísticos , Conformação de Ácido Nucleico , Ligação Proteica , Proteínas de Ligação a RNA/metabolismo , Proteínas Ribossômicas/química , Proteínas Ribossômicas/genética , Análise de Sequência de RNA
13.
RNA ; 21(12): 2039-46, 2015 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-26447183

RESUMO

More than half of the ribosomal protein operons in Escherichia coli are regulated by structures within the mRNA transcripts that interact with specific ribosomal proteins to inhibit further protein expression. This regulation is accomplished using a variety of mechanisms and the RNA structures responsible for regulation are often not conserved across bacterial phyla. A widely conserved mRNA structure preceding the ribosomal protein operon containing rpsF and rpsR (encoding S6 and S18) was recently identified through comparative genomics. Examples of this RNA from both E. coli and Bacillus subtilis were shown to interact in vitro with an S6:S18 complex. In this work, we demonstrate that in E. coli, this RNA structure regulates gene expression in response to the S6:S18 complex. ß-galactosidase activity from a lacZ reporter translationally fused to the 5' UTR and first nine codons of E. coli rpsF is reduced fourfold by overexpression of a genomic fragment encoding both S6 and S18 but not by overexpression of either protein individually. Mutations to the mRNA structure, as well as to the RNA-binding site of S18 and the S6-S18 interaction surfaces of S6 and S18, are sufficient to derepress ß-galactosidase activity, indicating that the S6:S18 complex is the biologically active effector. Measurement of transcript levels shows that although reporter levels do not change upon protein overexpression, levels of the native transcript are reduced fourfold, suggesting that the mRNA regulator prevents translation and this effect is amplified on the native transcript by other mechanisms.


Assuntos
Proteínas de Escherichia coli/genética , Escherichia coli/genética , Biossíntese de Proteínas , Proteínas Ribossômicas/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/biossíntese , Regulação Bacteriana da Expressão Gênica , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Sequências Reguladoras de Ácido Ribonucleico , Proteínas Ribossômicas/biossíntese
14.
RNA ; 20(2): 168-76, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24310371

RESUMO

Approximately half the transcripts encoding ribosomal proteins in Escherichia coli include a structured RNA motif that interacts with a specific ribosomal protein to inhibit gene expression, thus allowing stoichiometric production of ribosome components. However, many of these RNA structures are not widely distributed across bacterial phyla. It is increasingly common for RNA motifs associated with ribosomal protein genes to be identified using comparative genomic methods, yet these are rarely experimentally validated. In this work, we characterize one such motif that precedes operons containing rpsF and rpsR, which encode ribosomal proteins S6 and S18. This RNA structure is widely distributed across many phyla of bacteria despite differences within the downstream operon, and examples are present in both E. coli and Bacillus subtilis. We demonstrate a direct interaction between an example of the RNA from B. subtilis and an S6:S18 complex using in vitro binding assays, verify our predicted secondary structure, and identify a putative protein-binding site. The proposed binding site bears a strong resemblance to the S18 binding site within the 16S rRNA, suggesting molecular mimicry. This interaction is a valuable addition to the canon of ribosomal protein mRNA interactions. This work shows how experimental verification translates computational results into concrete knowledge of biological systems.


Assuntos
Bacillus subtilis/genética , Proteínas de Bactérias/genética , RNA Bacteriano/genética , RNA Ribossômico/genética , Proteína S6 Ribossômica/genética , Regiões 5' não Traduzidas , Bacillus subtilis/metabolismo , Proteínas de Bactérias/química , Pareamento de Bases , Sequência de Bases , Sítios de Ligação , Sequências Repetidas Invertidas , Dados de Sequência Molecular , Óperon , Ligação Proteica , RNA Bacteriano/metabolismo , Proteína S6 Ribossômica/química , Proteínas Ribossômicas/química
15.
Nucleic Acids Res ; 42(18): 11752-62, 2014 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-25209235

RESUMO

Nanotechnology and synthetic biology currently constitute one of the most innovative, interdisciplinary fields of research, poised to radically transform society in the 21st century. This paper concerns the synthetic design of ribonucleic acid molecules, using our recent algorithm, RNAiFold, which can determine all RNA sequences whose minimum free energy secondary structure is a user-specified target structure. Using RNAiFold, we design ten cis-cleaving hammerhead ribozymes, all of which are shown to be functional by a cleavage assay. We additionally use RNAiFold to design a functional cis-cleaving hammerhead as a modular unit of a synthetic larger RNA. Analysis of kinetics on this small set of hammerheads suggests that cleavage rate of computationally designed ribozymes may be correlated with positional entropy, ensemble defect, structural flexibility/rigidity and related measures. Artificial ribozymes have been designed in the past either manually or by SELEX (Systematic Evolution of Ligands by Exponential Enrichment); however, this appears to be the first purely computational design and experimental validation of novel functional ribozymes. RNAiFold is available at http://bioinformatics.bc.edu/clotelab/RNAiFold/.


Assuntos
RNA Catalítico/química , Algoritmos , Sequência de Bases , Biologia Computacional/métodos , Sequência Consenso , Clivagem do RNA , Dobramento de RNA , RNA Catalítico/metabolismo , Biologia Sintética/métodos
16.
BMC Genomics ; 16: 35, 2015 Feb 05.
Artigo em Inglês | MEDLINE | ID: mdl-25649229

RESUMO

BACKGROUND: Structured RNAs have many biological functions ranging from catalysis of chemical reactions to gene regulation. Yet, many homologous structured RNAs display most of their conservation at the secondary or tertiary structure level. As a result, strategies for structured RNA discovery rely heavily on identification of sequences sharing a common stable secondary structure. However, correctly distinguishing structured RNAs from surrounding genomic sequence remains challenging, especially during de novo discovery. RNA also has a long history as a computational model for evolution due to the direct link between genotype (sequence) and phenotype (structure). From these studies it is clear that evolved RNA structures, like protein structures, can be considered robust to point mutations. In this context, an RNA sequence is considered robust if its neutrality (extent to which single mutant neighbors maintain the same secondary structure) is greater than that expected for an artificial sequence with the same minimum free energy structure. RESULTS: In this work, we bring concepts from evolutionary biology to bear on the structured RNA de novo discovery process. We hypothesize that alignments corresponding to structured RNAs should consist of neutral sequences. We evaluate several measures of neutrality for their ability to distinguish between alignments of structured RNA sequences drawn from Rfam and various decoy alignments. We also introduce a new measure of RNA structural neutrality, the structure ensemble neutrality (SEN). SEN seeks to increase the biological relevance of existing neutrality measures in two ways. First, it uses information from an alignment of homologous sequences to identify a conserved biologically relevant structure for comparison. Second, it only counts base-pairs of the original structure that are absent in the comparison structure and does not penalize the formation of additional base-pairs. CONCLUSION: We find that several measures of neutrality are effective at separating structured RNAs from decoy sequences, including both shuffled alignments and flanking genomic sequence. Furthermore, as an independent feature classifier to identify structured RNAs, SEN yields comparable performance to current approaches that consider a variety of features including stability and sequence identity. Finally, SEN outperforms other measures of neutrality at detecting mutational robustness in bacterial regulatory RNA structures.


Assuntos
Modelos Teóricos , Conformação de Ácido Nucleico , RNA Bacteriano/genética , RNA/genética , Bactérias/genética , Sequência de Bases , Mutação , RNA/química , RNA/classificação , RNA Bacteriano/química , Alinhamento de Sequência , Análise de Sequência de RNA , Homologia de Sequência do Ácido Nucleico , Relação Estrutura-Atividade
17.
Nature ; 462(7273): 656-9, 2009 Dec 03.
Artigo em Inglês | MEDLINE | ID: mdl-19956260

RESUMO

Estimates of the total number of bacterial species indicate that existing DNA sequence databases carry only a tiny fraction of the total amount of DNA sequence space represented by this division of life. Indeed, environmental DNA samples have been shown to encode many previously unknown classes of proteins and RNAs. Bioinformatics searches of genomic DNA from bacteria commonly identify new noncoding RNAs (ncRNAs) such as riboswitches. In rare instances, RNAs that exhibit more extensive sequence and structural conservation across a wide range of bacteria are encountered. Given that large structured RNAs are known to carry out complex biochemical functions such as protein synthesis and RNA processing reactions, identifying more RNAs of great size and intricate structure is likely to reveal additional biochemical functions that can be achieved by RNA. We applied an updated computational pipeline to discover ncRNAs that rival the known large ribozymes in size and structural complexity or that are among the most abundant RNAs in bacteria that encode them. These RNAs would have been difficult or impossible to detect without examining environmental DNA sequences, indicating that numerous RNAs with extraordinary size, structural complexity, or other exceptional characteristics remain to be discovered in unexplored sequence space.


Assuntos
Bactérias/genética , Genoma Bacteriano/genética , Genômica , Conformação de Ácido Nucleico , RNA Bacteriano/química , RNA Bacteriano/genética , RNA não Traduzido/genética , Bactérias/metabolismo , Sequência de Bases , Regulação Bacteriana da Expressão Gênica , Dados de Sequência Molecular , RNA não Traduzido/química
18.
Nucleic Acids Res ; 41(6): 3491-503, 2013 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-23396277

RESUMO

In Escherichia coli, 12 distinct RNA structures within the transcripts encoding ribosomal proteins interact with specific ribosomal proteins to allow autogenous regulation of expression from large multi-gene operons, thus coordinating ribosomal protein biosynthesis across multiple operons. However, these RNA structures are typically not represented in the RNA Families Database or annotated in genomic sequences databases, and their phylogenetic distribution is largely unknown. To investigate the extent to which these RNA structures are conserved across eubacterial phyla, we created multiple sequence alignments representing 10 of these messenger RNA (mRNA) structures in E. coli. We find that while three RNA structures are widely distributed across many phyla of bacteria, seven of the RNAs are narrowly distributed to a few orders of Gammaproteobacteria. To experimentally validate our computational predictions, we biochemically confirmed dual L1-binding sites identified in many Firmicute species. This work reveals that RNA-based regulation of ribosomal protein biosynthesis is used in nearly all eubacterial phyla, but the specific RNA structures that regulate ribosomal protein biosynthesis in E. coli are narrowly distributed. These results highlight the limits of our knowledge regarding ribosomal protein biosynthesis regulation outside of E. coli, and the potential for alternative RNA structures responsible for regulating ribosomal proteins in other eubacteria.


Assuntos
Proteínas de Escherichia coli/biossíntese , Escherichia coli/genética , Gammaproteobacteria/genética , RNA Bacteriano/química , RNA Mensageiro/química , Proteínas Ribossômicas/biossíntese , Sítios de Ligação , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Geobacillus/genética , Conformação de Ácido Nucleico , Filogenia , RNA Bacteriano/classificação , RNA Bacteriano/metabolismo , RNA Mensageiro/classificação , RNA Mensageiro/metabolismo , Proteína Ribossômica L10 , Proteínas Ribossômicas/genética , Proteínas Ribossômicas/metabolismo , Alinhamento de Sequência
19.
BMC Genomics ; 15: 657, 2014 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-25104606

RESUMO

BACKGROUND: Autogenous cis-regulators of ribosomal protein synthesis play a critical role in maintaining the stoichiometry of ribosome components. Structured portions within an mRNA transcript typically interact with specific ribosomal proteins to prevent expression of the entire operon, thus balancing levels of ribosomal proteins across transcriptional units. Three distinct RNA structures from different bacterial phyla have demonstrated interactions with S15 to regulate gene expression; however, these RNAs are distributed across a small fraction of bacterial diversity. RESULTS: We used comparative genomics in combination with analysis of existing transcriptomic data to identify three novel putative RNA structures associated with the S15 coding region in microbial genomes. These structures are completely distinct from those previously published and encompass potential regulatory regions including ribosome-binding sites. To validate the biological relevance of our findings, we demonstrate that an example of the Alphaproteobacterial RNA from Rhizobium radiobacter specifically interacts with S15 in vitro, and allows in vivo regulation of gene expression in an E. coli reporter system. In addition, structural probing and nuclease protection assays confirm the predicted secondary structure and indicate nucleotides required for protein interaction. CONCLUSIONS: This work illustrates the importance of integrating comparative genomic and transcriptomic approaches during de novo ncRNA identification and reveals a diversity of distinct natural RNA regulators that support analogous biological functions. Furthermore, this work indicates that many additional uncharacterized RNA regulators likely exist within bacterial genomes and that the plasticity of RNA structure allows unique, and likely independently derived, solutions to the same biological problem.


Assuntos
Agrobacterium tumefaciens/genética , Agrobacterium tumefaciens/metabolismo , Proteínas de Bactérias/metabolismo , Genômica , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , Proteínas Ribossômicas/metabolismo , Sequência de Bases , Sítios de Ligação , Mutação , Especificidade por Substrato
20.
Bioinformatics ; 29(3): 381-3, 2013 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-23172864

RESUMO

MOTIVATION: High-throughput biological research requires simultaneous visualization as well as analysis of genomic data, e.g. read alignments, variant calls and genomic annotations. Traditionally, such integrative analysis required desktop applications operating on locally stored data. Many current terabyte-size datasets generated by large public consortia projects, however, are already only feasibly stored at specialist genome analysis centers. As even small laboratories can afford very large datasets, local storage and analysis are becoming increasingly limiting, and it is likely that most such datasets will soon be stored remotely, e.g. in the cloud. These developments will require web-based tools that enable users to access, analyze and view vast remotely stored data with a level of sophistication and interactivity that approximates desktop applications. As rapidly dropping cost enables researchers to collect data intended to answer questions in very specialized contexts, developers must also provide software libraries that empower users to implement customized data analyses and data views for their particular application. Such specialized, yet lightweight, applications would empower scientists to better answer specific biological questions than possible with general-purpose genome browsers currently available. RESULTS: Using recent advances in core web technologies (HTML5), we developed Scribl, a flexible genomic visualization library specifically targeting coordinate-based data such as genomic features, DNA sequence and genetic variants. Scribl simplifies the development of sophisticated web-based graphical tools that approach the dynamism and interactivity of desktop applications. AVAILABILITY AND IMPLEMENTATION: Software is freely available online at http://chmille4.github.com/Scribl/ and is implemented in JavaScript with all modern browsers supported.


Assuntos
Gráficos por Computador , Genômica/métodos , Software , Cromossomos Humanos , Humanos , Internet
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa