RESUMEN
The assessment of transcriptome-wide ribosome binding to mRNAs is useful for studying the dynamic regulation of protein synthesis. Two methods frequently applied in eukaryotic cells that operate at different levels of resolution are polysome profiling, which reveals the distribution of ribosome loads across the transcriptome, and ribosome footprinting (also termed ribosome profiling or Ribo-Seq), which when combined with appropriate data on mRNA expression can reveal ribosome densities on individual transcripts. In this study we develop methods for relating the information content of these two methods to one another, by reconstructing theoretical polysome profiles from ribosome footprinting data. Our results validate both approaches as experimental tools. Although we show that both methods can yield highly consistent data, some published ribosome footprinting datasets give rise to reconstructed polysome profiles with non-physiological features. We trace these aberrant features to inconsistencies in RNA and Ribo-Seq data when compared to datasets yielding physiological polysome profiles, thereby demonstrating that modelled polysomes are useful for assessing global dataset properties such as its quality in a simple, visual approach. Aside from using polysome profile reconstructions on published datasets, we propose that this also provides a useful tool for validating new ribosome footprinting datasets in early stages of analyses.
Asunto(s)
Biosíntesis de Proteínas , Ribosomas , Ribosomas/genética , Ribosomas/metabolismo , Polirribosomas/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo , TranscriptomaRESUMEN
In vitro transcribed, modified messenger RNAs (IVTmRNAs) have been used to vaccinate billions of individuals against the SARS-CoV-2 virus, and are currently being developed for many additional therapeutic applications. IVTmRNAs must be translated into proteins with therapeutic activity by the same cellular machinery that also translates native endogenous transcripts. However, different genesis pathways and routes of entry into target cells as well as the presence of modified nucleotides mean that the way in which IVTmRNAs engage with the translational machinery, and the efficiency with which they are being translated, differs from native mRNAs. This review summarises our current knowledge of commonalities and differences in translation between IVTmRNAs and cellular mRNAs, which is key for the development of future design strategies that can generate IVTmRNAs with improved activity in therapeutic applications.
RESUMEN
The genetic code is necessarily degenerate with 64 possible nucleotide triplets being translated into 20 amino acids. Eighteen out of the 20 amino acids are encoded by multiple synonymous codons. While synonymous codons are clearly equivalent in terms of the information they carry, it is now well established that they are used in a biased fashion. There is currently no consensus as to the origin of this bias. Drawing on ideas from stochastic thermodynamics we derive from first principles a mathematical model describing the statistics of codon usage bias. We show that the model accurately describes the distribution of codon usage bias of genomes in the fungal and bacterial kingdoms. Based on it, we derive a new computational measure of codon usage bias-the distance D capturing two aspects of codon usage bias: (i) differences in the genome-wide frequency of codons and (ii) apparent non-random distributions of codons across mRNAs. By means of large scale computational analysis of over 900 species across two kingdoms of life, we demonstrate that our measure provides novel biological insights. Specifically, we show that while codon usage bias is clearly based on heritable traits and closely related species show similar degrees of bias, there is considerable variation in the magnitude of D within taxonomic classes suggesting that the contribution of sequence-level selection to codon bias varies substantially within relatively confined taxonomic groups. Interestingly, commonly used model organisms are near the median for values of D for their taxonomic class, suggesting that they may not be good representative models for species with more extreme D, which comprise organisms of medical and agricultural interest. We also demonstrate that amino acid specific patterns of codon usage are themselves quite variable between branches of the tree of life, and that some of this variability correlates with organismal tRNA content.
Asunto(s)
Uso de Codones , Código Genético , Aminoácidos/genética , Bacterias/genética , Codón/genéticaRESUMEN
Regulation of protein synthesis makes a major contribution to post-transcriptional control pathways. During disease, or under stress, cells initiate processes to reprogramme protein synthesis and thus orchestrate the appropriate cellular response. Recent data show that the elongation stage of protein synthesis is a key regulatory node for translational control in health and disease. There is a complex set of factors that individually affect the overall rate of elongation and, for the most part, these influence either transfer RNA (tRNA)- and eukaryotic elongation factor 1A (eEF1A)-dependent codon decoding, and/or elongation factor 2 (eEF2)-dependent ribosome translocation along the mRNA. Decoding speeds depend on the relative abundance of each tRNA, the cognate:near-cognate tRNA ratios and the degree of tRNA modification, whereas eEF2-dependent ribosome translocation is negatively regulated by phosphorylation on threonine-56 by eEF2 kinase. Additional factors that contribute to the control of the elongation rate include epigenetic modification of the mRNA, coding sequence variation and the expression of eIF5A, which stimulates peptide bond formation between proline residues. Importantly, dysregulation of elongation control is central to disease mechanisms in both tumorigenesis and neurodegeneration, making the individual key steps in this process attractive therapeutic targets. Here, we discuss the relative contribution of individual components of the translational apparatus (e.g. tRNAs, elongation factors and their modifiers) to the overall control of translation elongation and how their dysregulation contributes towards disease processes.