ABSTRACT
In this issue of Cell, Pelechano et al. report that sequencing of mRNA decay intermediates shows surprisingly tight coupling of a major decay pathway to the movement of the last translating ribosome, revealing stress- and starvation-dependent modulation of translation elongation.
Subject(s)
Protein Biosynthesis , RNA Stability , Ribosomes/metabolism , Saccharomyces cerevisiae/metabolismABSTRACT
Heat causes protein misfolding and aggregation and, in eukaryotic cells, triggers aggregation of proteins and RNA into stress granules. We have carried out extensive proteomic studies to quantify heat-triggered aggregation and subsequent disaggregation in budding yeast, identifying >170 endogenous proteins aggregating within minutes of heat shock in multiple subcellular compartments. We demonstrate that these aggregated proteins are not misfolded and destined for degradation. Stable-isotope labeling reveals that even severely aggregated endogenous proteins are disaggregated without degradation during recovery from shock, contrasting with the rapid degradation observed for many exogenous thermolabile proteins. Although aggregation likely inactivates many cellular proteins, in the case of a heterotrimeric aminoacyl-tRNA synthetase complex, the aggregated proteins remain active with unaltered fidelity. We propose that most heat-induced aggregation of mature proteins reflects the operation of an adaptive, autoregulatory process of functionally significant aggregate assembly and disassembly that aids cellular adaptation to thermal stress.
Subject(s)
Heat-Shock Response , Saccharomyces cerevisiae/cytology , Saccharomyces cerevisiae/physiology , Cycloheximide/pharmacology , Cytoplasmic Granules/metabolism , Protein Aggregates , Protein Biosynthesis/drug effects , Protein Synthesis Inhibitors/pharmacology , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae Proteins/metabolismABSTRACT
Ssd1, a conserved fungal RNA-binding protein, is important in stress responses, cell division and virulence. Ssd1 is closely related to Dis3L2 of the RNase II family of nucleases, but lacks catalytic activity and likely suppresses translation of bound mRNAs. Previous studies identified RNA motifs enriched in Ssd1-associated transcripts, yet the sequence requirements for Ssd1 binding are not defined. Here, we identify precise binding sites of Ssd1 on RNA using in vivo cross-linking and cDNA analysis. These sites are enriched in 5' untranslated regions of a subset of mRNAs encoding cell wall proteins. We identified a conserved bipartite motif that binds Ssd1 with high affinity in vitro. Active RNase II enzymes have a characteristic, internal RNA binding path; the Ssd1 crystal structure at 1.9 Å resolution shows that remnants of regulatory sequences block this path. Instead, RNA binding activity has relocated to a conserved patch on the surface of the protein. Structure-guided mutations of this surface prevent Ssd1 from binding RNA in vitro and phenocopy Ssd1 deletion in vivo. These studies provide a new framework for understanding the function of a pleiotropic post-transcriptional regulator of gene expression and give insights into the evolution of regulatory and binding elements in the RNase II family.
Subject(s)
Exoribonucleases , Saccharomyces cerevisiae Proteins , Saccharomyces cerevisiae , 5' Untranslated Regions , Exoribonucleases/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/metabolismABSTRACT
MOTIVATION: Ribosome profiling, or Ribo-seq, is the state-of-the-art method for quantifying protein synthesis in living cells. Computational analysis of Ribo-seq data remains challenging due to the complexity of the procedure, as well as variations introduced for specific organisms or specialized analyses. RESULTS: We present riboviz 2, an updated riboviz package, for the comprehensive transcript-centric analysis and visualization of Ribo-seq data. riboviz 2 includes an analysis workflow built on the Nextflow workflow management system for end-to-end processing of Ribo-seq data. riboviz 2 has been extensively tested on diverse species and library preparation strategies, including multiplexed samples. riboviz 2 is flexible and uses open, documented file formats, allowing users to integrate new analyses with the pipeline. AVAILABILITY AND IMPLEMENTATION: riboviz 2 is freely available at github.com/riboviz/riboviz.
Subject(s)
Ribosome Profiling , Ribosomes , Ribosomes/genetics , Ribosomes/metabolism , Workflow , RNA, Messenger/metabolism , Data Analysis , Sequence Analysis, RNA/methodsABSTRACT
Over the last few decades, the nature of life sciences research has changed enormously, generating a need for a workforce with a variety of computational skills such as those required to store, manage, and analyse the large biological datasets produced by next-generation sequencing. Those with such expertise are increasingly in demand for employment in both research and industry. Despite this, bioinformatics education has failed to keep pace with advances in research. At secondary school level, computing is often taught in isolation from other sciences, and its importance in biological research is not fully realised, leaving pupils unprepared for the computational component of Higher Education and, subsequently, research in the life sciences. The 4273pi Bioinformatics at School project (https://4273pi.org) aims to address this issue by designing and delivering curriculum-linked, hands-on bioinformatics workshops for secondary school biology pupils, with an emphasis on equitable access. So far, we have reached over 180 schools across Scotland through visits or teacher events, and our open education resources are used internationally. Here, we describe our project, our aims and motivations, and the practical lessons we have learned from implementing a successful bioinformatics education project over the last 5 years.
Subject(s)
Computational Biology , Adolescent , Career Choice , Computational Biology/education , Computational Biology/organization & administration , Female , Humans , Male , Schools , Scotland , StudentsABSTRACT
The RNase II family of 3'-5' exoribonucleases is present in all domains of life, and eukaryotic family members Dis3 and Dis3L2 play essential roles in RNA degradation. Ascomycete yeasts contain both Dis3 and inactive RNase II-like "pseudonucleases." The latter function as RNA-binding proteins that affect cell growth, cytokinesis, and fungal pathogenicity. However, the evolutionary origins of these pseudonucleases are unknown: What sequence of events led to their novel function, and when did these events occur? Here, we show how RNase II pseudonuclease homologs, including Saccharomyces cerevisiae Ssd1, are descended from active Dis3L2 enzymes. During fungal evolution, active site mutations in Dis3L2 homologs have arisen at least four times, in some cases following gene duplication. In contrast, N-terminal cold-shock domains and regulatory features are conserved across diverse dikarya and mucoromycota, suggesting that the nonnuclease function requires these regions. In the basidiomycete pathogenic yeast Cryptococcus neoformans, the single Ssd1/Dis3L2 homolog is required for cytokinesis from polyploid "titan" growth stages. This phenotype of C. neoformans Ssd1/Dis3L2 deletion is consistent with those of inactive fungal pseudonucleases, yet the protein retains an active site sequence signature. We propose that a nuclease-independent function for Dis3L2 arose in an ancestral hyphae-forming fungus. This second function has been conserved across hundreds of millions of years, whereas the RNase activity was lost repeatedly in independent lineages.
Subject(s)
Ascomycota/genetics , Evolution, Molecular , Exoribonucleases/genetics , Multigene Family , Ascomycota/enzymology , Catalytic Domain/genetics , Cryptococcus neoformans/physiology , Cytokinesis , Phylogeny , Saccharomyces cerevisiae Proteins/geneticsABSTRACT
Workflow management systems represent, manage, and execute multistep computational analyses and offer many benefits to bioinformaticians. They provide a common language for describing analysis workflows, contributing to reproducibility and to building libraries of reusable components. They can support both incremental build and re-entrancy-the ability to selectively re-execute parts of a workflow in the presence of additional inputs or changes in configuration and to resume execution from where a workflow previously stopped. Many workflow management systems enhance portability by supporting the use of containers, high-performance computing (HPC) systems, and clouds. Most importantly, workflow management systems allow bioinformaticians to delegate how their workflows are run to the workflow management system and its developers. This frees the bioinformaticians to focus on what these workflows should do, on their data analyses, and on their science. RiboViz is a package to extract biological insight from ribosome profiling data to help advance understanding of protein synthesis. At the heart of RiboViz is an analysis workflow, implemented in a Python script. To conform to best practices for scientific computing which recommend the use of build tools to automate workflows and to reuse code instead of rewriting it, the authors reimplemented this workflow within a workflow management system. To select a workflow management system, a rapid survey of available systems was undertaken, and candidates were shortlisted: Snakemake, cwltool, Toil, and Nextflow. Each candidate was evaluated by quickly prototyping a subset of the RiboViz workflow, and Nextflow was chosen. The selection process took 10 person-days, a small cost for the assurance that Nextflow satisfied the authors' requirements. The use of prototyping can offer a low-cost way of making a more informed selection of software to use within projects, rather than relying solely upon reviews and recommendations by others.
Subject(s)
Computational Biology/education , Computing Methodologies , User-Computer Interface , Workflow , Algorithms , Data Analysis , Genomics , Language , Programming Languages , Reproducibility of Results , Ribosomes/physiology , SoftwareABSTRACT
Eukaryotic protein synthesis generally initiates at a start codon defined by an AUG and its surrounding Kozak sequence context, but the quantitative importance of this context in different species is unclear. We tested this concept in two pathogenic Cryptococcus yeast species by genome-wide mapping of translation and of mRNA 5' and 3' ends. We observed thousands of AUG-initiated upstream open reading frames (uORFs) that are a major contributor to translation repression. uORF use depends on the Kozak sequence context of its start codon, and uORFs with strong contexts promote nonsense-mediated mRNA decay. Transcript leaders in Cryptococcus and other fungi are substantially longer and more AUG-dense than in Saccharomyces. Numerous Cryptococcus mRNAs encode predicted dual-localized proteins, including many aminoacyl-tRNA synthetases, in which a leaky AUG start codon is followed by a strong Kozak context in-frame AUG, separated by mitochondrial-targeting sequence. Analysis of other fungal species shows that such dual-localization is also predicted to be common in the ascomycete mould, Neurospora crassa. Kozak-controlled regulation is correlated with insertions in translational initiation factors in fidelity-determining regions that contact the initiator tRNA. Thus, start codon context is a signal that quantitatively programs both the expression and the structures of proteins in diverse fungi.
Subject(s)
Codon, Initiator/chemistry , Cryptococcus/genetics , Gene Expression Regulation, Fungal , Genome, Fungal , Peptide Chain Initiation, Translational , Amino Acyl-tRNA Synthetases/genetics , Amino Acyl-tRNA Synthetases/metabolism , Candida albicans/genetics , Candida albicans/metabolism , Chromosome Mapping , Codon, Initiator/metabolism , Cryptococcus/metabolism , Neurospora crassa/genetics , Neurospora crassa/metabolism , Open Reading Frames , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Schizosaccharomyces/genetics , Schizosaccharomyces/metabolism , Species SpecificityABSTRACT
RNA splicing, an essential part of eukaryotic pre-messenger RNA processing, can be simultaneous with transcription by RNA polymerase II. Here, we compare and review independent next-generation sequencing methods that jointly quantify transcription and splicing in budding yeast. For many yeast transcripts, splicing is fast, taking place within seconds of intron transcription, while polymerase is within a few dozens of nucleotides of the 3' splice site. Ribosomal protein transcripts are spliced particularly fast and cotranscriptionally. However, some transcripts are spliced inefficiently or mainly post-transcriptionally. Intron-mediated regulation of some genes is likely to be cotranscriptional. We suggest that intermediates of the splicing reaction, missing from current data sets, may hold key information about splicing kinetics.
Subject(s)
RNA Splicing , Saccharomycetales/genetics , Transcription, Genetic , High-Throughput Nucleotide Sequencing , Introns , Kinetics , RNA Polymerase II/metabolism , RNA, Ribosomal/biosynthesis , RNA, Ribosomal/metabolism , Saccharomycetales/metabolism , Sequence Analysis, RNA , Transcription Elongation, GeneticABSTRACT
[This corrects the article DOI: 10.1371/journal.pbio.1002015.].
ABSTRACT
BACKGROUND: Using high-throughput sequencing to monitor translation in vivo, ribosome profiling can provide critical insights into the dynamics and regulation of protein synthesis in a cell. Since its introduction in 2009, this technique has played a key role in driving biological discovery, and yet it requires a rigorous computational toolkit for widespread adoption. DESCRIPTION: We have developed a database and a browser-based visualization tool, riboviz, that enables exploration and analysis of riboseq datasets. In implementation, riboviz consists of a comprehensive and flexible computational pipeline that allows the user to analyze private, unpublished datasets, along with a web application for comparison with published yeast datasets. Source code and detailed documentation are freely available from https://github.com/shahpr/RiboViz . The web-application is live at www.riboviz.org. CONCLUSIONS: riboviz provides a comprehensive database and analysis and visualization tool to enable comparative analyses of ribosome-profiling datasets. This toolkit will enable both the community of systems biologists who study genome-wide ribosome profiling data and also research groups focused on individual genes to identify patterns of transcriptional and translational regulation across different organisms and conditions.
Subject(s)
Databases, Genetic , Internet , Ribosomes/metabolism , High-Throughput Nucleotide Sequencing , User-Computer InterfaceABSTRACT
Natural selection favors efficient expression of encoded proteins, but the causes, mechanisms, and fitness consequences of evolved coding changes remain an area of aggressive inquiry. We report a large-scale reversal in the relative translational accuracy of codons across 12 fly species in the Drosophila/Sophophora genus. Because the reversal involves pairs of codons that are read by the same genomically encoded tRNAs, we hypothesize, and show by direct measurement, that a tRNA anticodon modification from guanosine to queuosine has coevolved with these genomic changes. Queuosine modification is present in most organisms but its function remains unclear. Modification levels vary across developmental stages in D. melanogaster, and, consistent with a causal effect, genes maximally expressed at each stage display selection for codons that are most accurate given stage-specific queuosine modification levels. In a kinetic model, the known increased affinity of queuosine-modified tRNA for ribosomes increases the accuracy of cognate codons while reducing the accuracy of near-cognate codons. Levels of queuosine modification in D. melanogaster reflect bioavailability of the precursor queuine, which eukaryotes scavenge from the tRNAs of bacteria and absorb in the gut. These results reveal a strikingly direct mechanism by which recoding of entire genomes results from changes in utilization of a nutrient.
Subject(s)
Drosophila/genetics , Genome, Insect , Nucleoside Q/metabolism , Open Reading Frames/genetics , Protein Biosynthesis , RNA, Transfer/metabolism , Animals , Anticodon/genetics , Base Sequence , Codon , Drosophila melanogaster/genetics , Kinetics , Models, Genetic , Molecular Sequence Data , Nucleoside Q/chemistry , Phylogeny , Selection, GeneticABSTRACT
Few biological systems permit rigorous testing of how changes in DNA sequence give rise to adaptive phenotypes. In this study, we sought a simplified experimental system with a detailed understanding of the genotype-to-phenotype relationship that could be altered by environmental perturbations. We focused on plasmid fitness, i.e., the ability of plasmids to be stably maintained in a bacterial population, which is dictated by the plasmid's replication and segregation machinery. Although plasmid replication depends on host proteins, the type II plasmid partitioning (Par) machinery is entirely plasmid encoded and relies solely on three components: parC, a centromere-like DNA sequence, ParR, a DNA-binding protein that interacts with parC, and ParM, which forms actin-like filaments that push two plasmids away from each other at cell division. Interactions between the Par operons of two related plasmids can cause incompatibility and the reduced transmission of one or both plasmids. We have identified segregation-dependent plasmid incompatibility between the highly divergent Par operons of plasmids pB171 and pCP301. Genetic and biochemical studies revealed that the incompatibility is due to the functional promiscuity of the DNA-binding protein ParRpB171, which interacts with both parC DNA sequences to direct plasmid segregation, indicating that the lack of DNA binding specificity is detrimental to plasmid fitness in this environment. This study therefore successfully utilized plasmid segregation to dissect the molecular interactions between genotype, phenotype, and fitness.
Subject(s)
Bacterial Proteins/metabolism , DNA, Bacterial/metabolism , DNA-Binding Proteins/metabolism , Escherichia coli/genetics , Escherichia coli/metabolism , Genomic Instability , Plasmids , Genotype , PhenotypeABSTRACT
A key goal in molecular evolution is to extract mechanistic insights from signatures of selection. A case study is codon usage, where despite many recent advances and hypotheses, two longstanding problems remain: the relative contribution of selection and mutation in determining codon frequencies and the relative contribution of translational speed and accuracy to selection. The relevant targets of selection--the rate of translation and of mistranslation of a codon per unit time in the cell--can only be related to mechanistic properties of the translational apparatus if the number of transcripts per cell is known, requiring use of gene expression measurements. Perhaps surprisingly, different gene-expression data sets yield markedly different estimates of selection. We show that this is largely due to measurement noise, notably due to differences between studies rather than instrument error or biological variability. We develop an analytical framework that explicitly models noise in expression in the context of the population-genetic model. Estimates of mutation and selection strength in budding yeast produced by this method are robust to the expression data set used and are substantially higher than estimates using a noise-blind approach. We introduce per-gene selection estimates that correlate well with previous scoring systems, such as the codon adaptation index, while now carrying an evolutionary interpretation. On average, selection for codon usage in budding yeast is weak, yet our estimates show that genes range from virtually unselected to average per-codon selection coefficients above the inverse population size. Our analytical framework may be generally useful for distinguishing biological signals from measurement noise in other applications that depend upon measurements of gene expression.
Subject(s)
Codon , Gene Expression , Models, Genetic , Selection, Genetic , Evolution, Molecular , Mutation , Protein Biosynthesis , Ribosomes/genetics , Ribosomes/metabolism , Saccharomycetales/genetics , Saccharomycetales/metabolismABSTRACT
Cryptococcus is a fungal pathogen whose virulence relies on proliferation in and dissemination to host sites, and on synthesis of a defensive yet metabolically costly polysaccharide capsule. Regulatory pathways required for Cryptococcus virulence include a GATA-like transcription factor, Gat201, that regulates Cryptococcal virulence in both capsule-dependent and capsule-independent ways. Here we show that Gat201 is part of a negative regulatory pathway that limits fungal survival. RNA-seq analysis found strong induction of GAT201 expression within minutes of transfer to host-like media at alkaline pH. Microscopy, growth curves, and colony forming units to test viability show that in host-like media at alkaline pH wild-type Cryptococcus neoformans yeast cells produce capsule but do not bud or maintain viability, while gat201Δ cells make buds and maintain viability, yet fail to produce capsule. GAT201 is required for transcriptional upregulation of a specific set of genes in host-like media, the majority of which are direct Gat201 targets. Evolutionary analysis shows that Gat201 is conserved within pathogenic fungi but lost in model yeasts. This work identifies the Gat201 pathway as controlling a trade-off between proliferation, which we showed is repressed by GAT201, and production of defensive capsule. The assays established here will allow characterisation of the mechanisms of action of the Gat201 pathway. Together, our findings urge improved understanding of the regulation of proliferation as a driver of fungal pathogenesis.
ABSTRACT
Stress-induced condensation of mRNA and proteins into stress granules is conserved across eukaryotes, yet the function, formation mechanisms, and relation to well-studied conserved transcriptional responses remain largely unresolved. Stress-induced exposure of ribosome-free mRNA following translational shutoff is thought to cause condensation by allowing new multivalent RNA-dependent interactions, with RNA length and associated interaction capacity driving increased condensation. Here we show that, in striking contrast, virtually all mRNA species condense in response to multiple unrelated stresses in budding yeast, length plays a minor role, and instead, stress-induced transcripts are preferentially excluded from condensates, enabling their selective translation. Using both endogenous genes and reporter constructs, we show that translation initiation blockade, rather than resulting ribosome-free RNA, causes condensation. These translation initiation-inhibited condensates (TIICs) are biochemically detectable even when stress granules, defined as microscopically visible foci, are absent or blocked. TIICs occur in unstressed yeast cells, and, during stress, grow before the appearance of visible stress granules. Stress-induced transcripts are excluded from TIICs primarily due to the timing of their expression, rather than their sequence features. Together, our results reveal a simple system by which cells redirect translational activity to newly synthesized transcripts during stress, with broad implications for cellular regulation in changing conditions.
ABSTRACT
Exons are regions of DNA that are transcribed to RNA and retained after introns are spliced out. However, the term "exon" is often misused as synonymous to "protein coding," including in some literature and textbook definitions. In contrast, only a fraction of exonic sequences are protein coding (<30% in humans). Both exons and introns are also present in untranslated regions (UTRs) and non-coding RNAs. Misuse of the term exon is problematic, for example, "whole-exome sequencing" technology targets <25% of the human exome, primarily regions that are protein coding. Here, we argue for the importance of the original definition of an exon for making functional distinctions in genetics and genomics. Further, we recommend the use of clearer language referring to coding exonic regions and non-coding exonic regions. We propose the use of coding exome sequencing, or CES, to more appropriately describe sequencing approaches that target primarily protein-coding regions rather than all transcribed regions.
ABSTRACT
Ribosome profiling quantifies translation genome-wide by sequencing ribosome-protected fragments, or footprints. Its single-codon resolution allows identification of translation regulation, such as ribosome stalls or pauses, on individual genes. However, enzyme preferences during library preparation lead to pervasive sequence artifacts that obscure translation dynamics. Widespread over- and under-representation of ribosome footprints can dominate local footprint densities and skew estimates of elongation rates by up to five fold. To address these biases and uncover true patterns of translation, we present choros, a computational method that models ribosome footprint distributions to provide bias-corrected footprint counts. choros uses negative binomial regression to accurately estimate two sets of parameters: (i) biological contributions from codon-specific translation elongation rates; and (ii) technical contributions from nuclease digestion and ligation efficiencies. We use these parameter estimates to generate bias correction factors that eliminate sequence artifacts. Applying choros to multiple ribosome profiling datasets, we are able to accurately quantify and attenuate ligation biases to provide more faithful measurements of ribosome distribution. We show that a pattern interpreted as pervasive ribosome pausing near the beginning of coding regions is likely to arise from technical biases. Incorporating choros into standard analysis pipelines will improve biological discovery from measurements of translation.
ABSTRACT
Pathogenic fungi hide from their hosts by camouflage, obscuring immunogenic cell wall components such as beta-glucan with innocuous coverings such as mannoproteins and alpha-glucan that are less readily recognised by the host. Attempts to understand how such processes are regulated have met with varying success. Typically studies focus on understanding the transcriptional response of fungi to either their reservoir environment or the host. However, such approaches do not fully address this research question, due to the layers of post-transcriptional and post-translational regulation that occur within a cell. Although in animals the impact of post-transcriptional and post-translational regulation has been well characterised, our knowledge of these processes in the fungal kingdom is more limited. Mutations in RNA-binding proteins, like Ssd1 and Candida albicans Slr1, affect cell wall composition and fungal virulence indicating that post-transcriptional regulation plays a key role in these processes. Here, we review the current state of knowledge of fungal post-transcriptional regulation, and link this to potential mechanisms of immune evasion by drawing on studies from model yeast and plant pathogenic fungi. We highlight several RNA-binding proteins that regulate cell wall synthesis and could be involved in local translation of cell wall components. Expanding our knowledge on post-transcriptional regulation in human fungal pathogens is essential to fully comprehend fungal virulence strategies and for the design of novel antifungal therapies.
ABSTRACT
An emerging principle of cell biology is the regulated conversion of macromolecules between soluble and condensed states. To screen for such regulation of the cyanobacterial proteome, we use quantitative mass spectrometry to identify proteins that change solubility during the day-night cycle. We find a set of night-insoluble proteins that includes many enzymes in essential metabolic pathways. Using time-lapse microscopy and isotope labeling, we show that these proteins reversibly transition between punctate structures at night and a soluble state during the day without substantial degradation. We find that the cyanobacterial circadian clock regulates the kinetics of puncta formation during the night and that the appearance of puncta indicates the metabolic status of the cell. Reversible condensation of specific enzymes is thus a regulated response to the day-night cycle and may reflect a general bacterial strategy used in fluctuating growth conditions.