RESUMEN
The genetic code is one of the most highly conserved features across life. Only a few lineages have deviated from the "universal" genetic code. Amongst the few variants of the genetic code reported to date, the codons UAA and UAG virtually always have the same translation, suggesting that their evolution is coupled. Here, we report the genome and transcriptome sequencing of a novel uncultured ciliate, belonging to the Oligohymenophorea class, where the translation of the UAA and UAG stop codons have changed to specify different amino acids. Genomic and transcriptomic analyses revealed that UAA has been reassigned to encode lysine, while UAG has been reassigned to encode glutamic acid. We identified multiple suppressor tRNA genes with anticodons complementary to the reassigned codons. We show that the retained UGA stop codon is enriched in the 3'UTR immediately downstream of the coding region of genes, suggesting that there is functional drive to maintain tandem stop codons. Using a phylogenomics approach, we reconstructed the ciliate phylogeny and mapped genetic code changes, highlighting the remarkable number of independent genetic code changes within the Ciliophora group of protists. According to our knowledge, this is the first report of a genetic code variant where UAA and UAG encode different amino acids.
Asunto(s)
Aminoácidos , Cilióforos , Aminoácidos/genética , Secuencia de Aminoácidos , Código Genético , Cilióforos/genética , Codón de TerminaciónRESUMEN
Natural proteins are normally made by 20 canonical amino acids. Genetic code expansion (GCE) enables incorporation of diverse chemically synthesized noncanonical amino acids (ncAAs) by orthogonal aminoacyl-tRNA synthetase (aaRS)/tRNA pairs using nonsense codons, which could significantly expand new functionalities of proteins in both scientific and biomedical applications. Here, by hijacking the cysteine biosynthetic enzymes, we describe a method combining amino acid biosynthesis and GCE to introduce around 50 structurally novel ncAAs into proteins by supplementation of commercially available aromatic thiol precursors, thus eliminating the need to chemically synthesize these ncAAs. A screening method is also provided for improving the incorporation efficiency of a particular ncAA. Furthermore, we demonstrate bioorthogonal groups, such as azide and ketone, that are compatible with our system and can be easily introduced into protein for subsequent site-specific labeling.
Asunto(s)
Aminoácidos , Aminoacil-ARNt Sintetasas , Aminoácidos/química , Proteínas/metabolismo , Código Genético , Aminoacil-ARNt Sintetasas/metabolismo , ARN de Transferencia/genética , ARN de Transferencia/metabolismo , Biosíntesis de ProteínasRESUMEN
Nonribosomal peptide synthetases (NRPSs) are large, multifunctional enzymes that facilitate the stepwise synthesis of modified peptides, many of which serve as important pharmaceutical products. Typically, NRPSs contain one module for the incorporation of one amino acid into the growing peptide chain. A module consists of the domains required for activation, covalent binding, condensation, termination, and optionally modification of the aminoacyl or peptidyl moiety. We here describe a protocol using genetically encoded photo-cross-linking amino acids to probe the 3D architecture of NRPSs by determining spatial proximity constraints. p-benzoyl-L-phenylalanine (BpF) is incorporated at positions of presumed contact interfaces between domains. The covalent cross-link products are visualized by SDS-PAGE-based methods and precisely mapped by tandem mass spectrometry. Originally intended to study the communication (COM) domains, a special pair of docking domains of unknown structure between two interacting subunits of one NRPS system, this cross-linking approach was also found to be useful to interrogate the spatial proximity of domains that are not connected on the level of the primary structure. The presented photo-cross-linking technique thus provides structural insights complementary to those obtained by protein crystallography and reports on the protein in solution.
Asunto(s)
Péptido Sintasas , Péptidos , Péptidos/genética , Péptido Sintasas/química , Aminoácidos/química , Código GenéticoRESUMEN
The origin of the genetic code is probably the central problem of the studies on the origin of life. The key question to answer is the molecular mechanism that allows the association of the amino acids with their triplet codons. We proposed that the codon-anticodon duplex located in the acceptor stem of primitive tRNAs would facilitate the chemical reactions required to synthesize cognate amino acids from simple amino acids (glycine, valine, and aspartic acid) linked to the 3' acceptor end. In our view, various nucleotide-A-derived cofactors (with reactive chemical groups) may be attached to the codon-anticodon duplex, which allows group-transferring reactions from cofactors to simple amino acids, thereby producing the final amino acid. The nucleotide-A-derived cofactors could be incorporated into the RNA duplex (helix) by docking Adenosine (cofactor) into the minor groove via an interaction similar to the A-minor motif, forming a base triple between Adenosine and one complementary base pair of the duplex. Furthermore, we propose that this codon-anticodon duplex could initially catalyze a self-aminoacylation reaction with a simple amino acid. Therefore, the sequence of bases in the codon-anticodon duplex would determine the reactions that occurred during the formation of new amino acids for selective binding of nucleotide-A-derived cofactors.
Asunto(s)
Anticodón , Ácido Aspártico , Adenosina , Aminoácidos/química , Ácido Aspártico/genética , Codón , Código Genético , Glicina , Nucleótidos , ARN/química , ARN de Transferencia/química , ARN de Transferencia/genética , ValinaRESUMEN
A message such as mRNA, which consists of continuous characters without separators (such as commas or spaces), can easily be decoded incorrectly if it is read in the wrong reading frame. One construct to theoretically avoid these reading frame errors is the class of block codes. However, the first hypothesis of Watson and Crick (1953) that block codes are used as a tool to avoid reading frame errors in coding sequences already failed because the four periodical codons AAA, CCC, GGG and UUU seem to play an important role in protein coding sequences. Even the class of circular codes later discovered by Arquès and Michel (1996) in coding sequences cannot contain a periodic codon. However, by incorporating the interpretation of the message into the robustness of the reading frame, the extension of circular codes to include periodic codons is theoretically possible. In this work, we introduce the new class of I-circular codes. Unlike circular codes, these codes allow frame shifts, but only if the decoded interpretation of the message is identical to the intended interpretation. In the following, the formal definition of I-circular codes is introduced and the maximum and the maximal size of I-circular codes are given based on the standard genetic code table. These numbers are calculated using a new graph-theoretic approach derived from the classical one for the class of circular codes. Furthermore, we show that all 216 maximum self-complementary C3-codes (see Fimmel et al., 2015) can be extended to larger I-circular codes. We present the increased code coverage of the 216 newly constructed I-circular codes based on the human coding sequences in chromosome 1. In the last section of this paper, we use the polarity of amino acids as an interpretation table to construct I-circular codes. In an optimization process, two maximum I-circular codes of length 30 are found.
Asunto(s)
Código Genético , Modelos Genéticos , Aminoácidos , Codón/genética , Código Genético/genética , Humanos , Sistemas de LecturaRESUMEN
MOTIVATION: Synthesizing genes to be expressed in other organisms is an essential tool in biotechnology. While the many-to-one mapping from codons to amino acids makes the genetic code degenerate, codon usage in a particular organism is not random either. This bias in codon use may have a remarkable effect on the level of gene expression. A number of measures have been developed to quantify a given codon sequence's strength to express a gene in a host organism. Codon optimization aims to find a codon sequence that will optimize one or more of these measures. Efficient computational approaches are needed since the possible number of codon sequences grows exponentially as the number of amino acids increases. RESULTS: We develop a unifying modeling approach for codon optimization. With our mathematical formulations based on graph/network representations of amino acid sequences, any combination of measures can be optimized in the same framework by finding a path satisfying additional limitations in an acyclic layered network. We tested our approach on bi-objectives commonly used in the literature, namely, Codon Pair Bias versus Codon Adaptation Index and Relative Codon Pair Bias versus Relative Codon Bias. However, our framework is general enough to handle any number of objectives concurrently with certain restrictions or preferences on the use of specific nucleotide sequences. We implemented our models using Python's Gurobi interface and showed the efficacy of our approach even for the largest proteins available. We also provided experimentation showing that highly expressed genes have objective values close to the optimized values in the bi-objective codon design problem. AVAILABILITY AND IMPLEMENTATION: http://alpersen.bilkent.edu.tr/NetworkCodon.zip. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Aminoácidos , Código Genético , Codón , Secuencia de AminoácidosRESUMEN
The Canonical Table of the Genetic Code (CTGC) is constructed theoretically on the basis of the similarity of PFs (PF) of proteins with the conformation of 4-arc chain graphs (Karasev, 2019). Of the 64 conformations of the graph, specified by the position of the connectivity edges, and the matrices of 6 variables (x1 x6), xi = (0, 1), 4 blocks of 16 elements each were formed. Then they were coded in the form of triplets based on the correspondence of pairs of variables to four letters of the code: 00 = C, 01 = U, 10 = G, 11 = A, and supplemented based on the known triplet-amino acid assignment. The resulting table is compared with the Periodic Table of Chemical Elements (PTCE). As in the PTCE, this CTGC has an initial element - a triplet that encodes graphs with zero number of connected edges. Within each block, vacancies are filled with connectivity edges in two alternative ways, both in rows and in the columns. As we move from the initial block 00 to the final block 11, there is a sequential filling of vacancies for variables x3x4: 00, 01, 10, 11. In general, the CTGC can be considered as a periodic system of triplets. Comparison with the previously described variety of tables of the genetic code made it possible to conclude that the CTGC more adequately reflects the properties of the genetic code. Prospects for the possible application of this table are being discussed.
Asunto(s)
Aminoácidos , Código Genético , Aminoácidos/genética , Código Genético/genética , Proteínas/genéticaRESUMEN
A chaotic ribonucleic acid (RNA) and deoxyribonucleic acid (DNA) encryption scheme is firstly proposed for security OFDM-WDM-PON in this paper. We adopt a dynamic key agreement based on the messenger RNA (mRNA) codebook to distribute the key, and the security and randomness of this key are enhanced by a pre-sharing key parameter set instead of transmission of a key directly. Also, the security key can be dynamically updated in real-time according to the needs of the users. The real (I) and imaginary (Q) parts of the QAM symbol matrix after modulation are encrypted by the correspondence between transfer RNA (tRNA) and amino acids and the selection mapping of DNA base complementary rules. Also, we add cubic permutation to ensure all data security encryption. The encrypted signals of 35.29 Gb/s on different wavelength channels are successfully demonstrated over a 25-km standard single-mode fiber (SSMF) and a back-to-back (BTB) system. It is proved that the proposed security OFDM-WDM-PON encryption scheme is compatible with the traditional WDM system, which can make full use of bandwidth resources and enhance the security with a large key space.
Asunto(s)
Aminoácidos , Seguridad Computacional , ADN/química , Código Genético , ARN/química , Codón , Humanos , ARN Mensajero/químicaRESUMEN
Selenium is a fascinating element that has a long history, most of which documents it as a deleterious element to health. In more recent years, selenium has been found to be an essential element in the diet of humans, all other mammals, and many other life forms. It has many health benefits that include, for example, roles in preventing heart disease and certain forms of cancer, slowing AIDS progression in HIV patients, supporting male reproduction, inhibiting viral expression, and boosting the immune system, and it also plays essential roles in mammalian development. Elucidating the molecular biology of selenium over the past 40 years generated an entirely new field of science which encompassed the many novel features of selenium. These features were (1) how this element makes its way into protein as the 21st amino acid in the genetic code, selenocysteine (Sec); (2) the vast amount of machinery dedicated to synthesizing Sec uniquely on its tRNA; (3) the incorporation of Sec into protein; and (4) the roles of the resulting Sec-containing proteins (selenoproteins) in health and development. One of the research areas receiving the most attention regarding selenium in health has been its role in cancer prevention, but further research has also exposed the role of this element as a facilitator of various maladies, including cancer.
Asunto(s)
Selenio/administración & dosificación , Selenocisteína/metabolismo , Selenoproteínas/metabolismo , Animales , Dieta , Código Genético , Salud , Humanos , ARN de Transferencia Aminoácido-Específico/metabolismoRESUMEN
Genetic code expansion is one of the most powerful technologies in protein engineering. In addition to the 20 canonical amino acids, the expanded genetic code is supplemented by unnatural amino acids, which have artificial side chains that can be introduced into target proteins inâ vitro and inâ vivo. A wide range of chemical groups have been incorporated co-translationally into proteins in single cells and multicellular organisms by using genetic code expansion. Incorporated unnatural amino acids have been used for novel structure-function relationship studies, bioorthogonal labelling of proteins in cellulo for microscopy and inâ vivo for tissue-specific proteomics, the introduction of post-translational modifications and optical control of protein function, to name a few examples. In this Minireview, the development of genetic code expansion technology is briefly introduced, then its applications in neurobiology are discussed, with a focus on studies using mammalian cells and mice as model organisms.
Asunto(s)
Aminoácidos/genética , Código Genético , Neuronas/metabolismo , Proteínas/genética , Aminoácidos/química , Aminoácidos/metabolismo , Animales , Humanos , Ingeniería de Proteínas , Procesamiento Proteico-Postraduccional , Proteínas/química , Proteínas/metabolismoRESUMEN
Background: Codon directional asymmetry (CDA) classifies the 64 codons into palindromes (XYX, CDA = 0), and 5'- and 3'-dominant (YXX and XXY, CDA < 0 and CDA > 0, respectively). Previously, CDA was defined by the purine/pyrimidine divide (A,G/C,T), where X is either a purine or a pyrimidine. For the remaining codons with undefined CDA, CDA was defined by the 5' or 3' nucleotide complementary to Y. This CDA correlates with cognate amino acid tRNA synthetase classes, antiparallel beta sheet conformation index and the evolutionary order defined by the self-referential genetic code evolution model (CDA < 0: class I, high beta sheet index, late genetic code inclusion). Methods: We explore associations of CDAs defined by nucleotide classifications according to complementarity strengths (A:T, weak; C:G, strong) and keto-enol/amino-imino groupings (G,T/A,C), also after swapping 1st and 2nd codon positions with amino acid physicochemical and structural properties. Results: Here, analyses show that for the eight codons whose purine/pyrimidine-based CDA requires using the rule of complementarity with the midposition, using weak interactions to define CDA instead of complementarity increases associations with tRNA synthetase classes, antiparallel beta sheet index and genetic code evolutionary order. CDA defined by keto-enol/amino-imino groups, 1st and 2nd codon positions swapped, correlates with amino acid parallel beta sheet formation indices and Doolittle's hydropathicities. Conclusions: Results suggest (a) prebiotic swaps from N2N1N3 to N1N2N3 codon structures, (b) that tRNA-mediated translation replaced direct codon-amino acid interactions, and (c) links between codon structures and cognate amino acid properties.
Asunto(s)
Aminoácidos/genética , Aminoacil-ARNt Sintetasas/metabolismo , Codón , Purinas/química , Pirimidinas/química , Aminoácidos/química , Evolución Molecular , Código Genético , Enlace de Hidrógeno , Interacciones Hidrofóbicas e Hidrofílicas , Modelos Genéticos , Nucleótidos/química , Biosíntesis de ProteínasRESUMEN
The mechanism of unwinding catalyzed by the hepatitis C virus nonstructural protein 3 helicase (NS3h) has been a subject of considerable interest, with NS3h serving as a prototypical enzyme in the study of helicase function. Recent studies support an ATP-fueled, inchworm-like stepping of NS3h on the nucleic acid that would result in the displacement of the complementary strand of the duplex during unwinding. Here, we describe the screening of a site of incorporation of an unnatural amino acid in NS3h for fluorescent labeling of the enzyme to be used in single-molecule Förster resonance energy transfer (FRET) experiments. From the nine potential sites identified in NS3h for incorporation of the unnatural amino acid, only one allowed for expression and fluorescent labeling of the recombinant protein. Incorporation of the unnatural amino acid was confirmed via bulk assays to not interfere with unwinding activity of the helicase. Binding to four different dsDNA sequences bearing a ssDNA overhang segment of varying length (either minimal 6 or 7 base length overhang to ensure binding or a long 24 base overhang) and sequence was recorded with the new NS3h construct at the single-molecule level. Single-molecule fluorescence displayed time intervals with anticorrelated donor and acceptor emission fluctuations associated with protein binding to the substrates. An apparent FRET value was estimated from the binding events showing a single FRET value of â¼0.8 for the 6-7 base overhangs. A smaller mean value and a broad distribution was in turn recorded for the long ssDNA overhang, consistent with NS3h exploring a larger physical space while bound to the DNA construct. Notably, intervals where NS3h binding was recorded were exhibited at time periods where the acceptor dye reversibly bleached. Protein induced fluorescence intensity enhancement in the donor channel became apparent at these intervals. Overall, the site-specific fluorescent labeling of NS3h reported here provides a powerful tool for future studies to monitor the dynamics of enzyme translocation during unwinding by single-molecule FRET.
Asunto(s)
Hepacivirus/enzimología , Imagen Individual de Molécula/métodos , Proteínas no Estructurales Virales/genética , Proteínas no Estructurales Virales/metabolismo , Azidas/química , Sitios de Unión , ADN/metabolismo , ADN Helicasas/química , ADN Helicasas/metabolismo , Escherichia coli/genética , Transferencia Resonante de Energía de Fluorescencia , Código Genético , Hepacivirus/genética , Modelos Moleculares , Mutagénesis Sitio-Dirigida , Fenilalanina/análogos & derivados , Fenilalanina/química , Unión Proteica , Proteínas Recombinantes/química , Proteínas Recombinantes/metabolismo , Proteínas no Estructurales Virales/químicaRESUMEN
Current methods to expand the genetic code enable site-specific incorporation of non-canonical amino acids (ncAAs) into proteins in eukaryotic and prokaryotic cells. However, current methods are limited by the number of codons possible, their orthogonality, and possibly their effects on protein synthesis and folding. An alternative approach relies on unnatural base pairs to create a virtually unlimited number of genuinely new codons that are efficiently translated and highly orthogonal because they direct ncAA incorporation using forces other than the complementary hydrogen bonds employed by their natural counterparts. This review outlines progress and achievements made towards developing a functional unnatural base pair and its use to generate semi-synthetic organisms with an expanded genetic alphabet that serves as the basis of an expanded genetic code.
Asunto(s)
Aminoácidos/genética , ADN/genética , Código Genético , Ingeniería Genética/métodos , Aminoácidos/química , Animales , Emparejamiento Base , ADN/química , Humanos , Enlace de Hidrógeno , Interacciones Hidrofóbicas e HidrofílicasRESUMEN
Protein-protein interactions are central to many biological processes. A considerable challenge consists however in understanding and deciphering when and how proteins interact, and this can be particularly difficult when interactions are weak and transient. The site-specific incorporation of unnatural amino acids (UAAs) that crosslink with nearby molecules in response to light provides a powerful tool for mapping transient protein-protein interactions and for defining the structure and topology of protein complexes both inâ vitro and inâ vivo. Complementary strategies consist in site-specific incorporation of UAAs bearing electrophilic moieties that react with natural nucleophilic amino acids in a proximity-dependent manner, thereby chemically stabilizing low-affinity interactions and providing additional constraints on distances and geometries in protein complexes. Herein, we review how UAAs bearing fine-tuned chemical moieties that react with proteins in their vicinity can be utilized to map, study, and characterize weak and transient protein-protein interactions in living systems.
Asunto(s)
Código Genético , Unión Proteica , Proteínas/química , Aminoácidos/químicaRESUMEN
Super-resolution microscopy allows imaging of cellular structures at nanometer resolution. This comes with a demand for small labels which can be attached directly to the structures of interest. In the context of protein labeling, one way to achieve this is by using genetic code expansion (GCE) and click chemistry. With GCE, small labeling handles in the form of noncanonical amino acids (ncAAs) are site-specifically introduced into a target protein. In a subsequent step, these amino acids can be directly labeled with small organic dyes by click chemistry reactions. Click chemistry labeling can also be combined with other methods, such as DNA-PAINT in which a "clickable" oligonucleotide is first attached to the ncAA-bearing target protein and then labeled with complementary fluorescent oligonucleotides. This protocol will cover both aspects: I describe (1) how to encode ncAAs and perform intracellular click chemistry-based labeling with an improved GCE system for eukaryotic cells and (2) how to combine click chemistry-based labeling with DNA-PAINT super-resolution imaging. As an example, I show click-PAINT imaging of vimentin and low-abundance nuclear protein, nucleoporin 153.
Asunto(s)
Aminoácidos/química , Aminoácidos/genética , Química Clic , ADN , Código Genético , Imagen Molecular , Proteínas/química , Proteínas/metabolismo , Animales , Células COS , Chlorocebus aethiops , ADN/química , Microscopía Fluorescente , Mutagénesis , Coloración y Etiquetado , TransfecciónRESUMEN
Sense and antisense peptides, i.e. peptides specified by complementary DNA and RNA sequences, interact with increased probability. Biro, Blalock, Mekler, Root-Bernstein and Siemion investigated the recognition rules of peptide-peptide interaction based on the complementary coding of DNA and RNA sequences in 3'â5' and 5'â3' directions. After more than three decades of theoretical and experimental investigations, the efficiency of this approach to predict peptide-peptide binding has been experimentally verified for more than 50 ligand-receptor systems, and represents a promising field of research. The natural genetic coding algorithm for sense and antisense peptide interactions combines following elements: of amino acid physico-chemical properties, stereochemical interaction, and bidirectional transcription. The interplay of these factors influences the specificity of sense-antisense peptide interactions, and affects the selection and evolution of peptide ligand-receptor systems. Complementary mRNA codon-tRNA anticodon complexes, and recently discovered Carter-Wolfenden tRNA acceptor-stem code, provide the basis for the rational modeling of peptide interactions based on their hydrophobic and lipophilic amino acid physico-chemical properties. It is shown that the interactions of complementary amino acid pairs according to the hydrophobic and lipophilic properties strongly depend on the central (second) purine base of the mRNA codon and its pyrimidine complement of the tRNA anticodon. This enables the development of new algorithms for the analysis of structure, function and evolution of protein and nucleotide sequences that take into account the residue's tendency to leave water and enter a nonpolar condensed phase considering its mass, size and accessible surface area. The practical applications of the sense-antisense peptide modeling are illustrated using different interaction assay types based on: microscale thermophoresis (MST), tryptophan fluorescence spectroscopy (TFS), nuclear magnetic resonance spectroscopy (NMR), and magnetic particles enzyme immunoassay (MPEIA). Various binding events and circumstances were considered, e.g., in situations with-short antisense peptide ligand (MST), L- and D-enantiomer acceptors (TFS), in low affinity conditions (NMR), and with more than one antisense peptide targeting hormone (MPEIA).
Asunto(s)
Algoritmos , Elementos sin Sentido (Genética)/metabolismo , Código Genético/fisiología , Péptidos/metabolismo , Secuencia de Aminoácidos/fisiología , Aminoácidos/genética , Aminoácidos/metabolismo , Animales , Elementos sin Sentido (Genética)/genética , Humanos , Péptidos/genética , ARN Mensajero/genética , ARN Mensajero/metabolismoRESUMEN
Comma-free codes constitute a class of circular codes, which has been widely studied, in particular by Golomb et al. (Biologiske Meddelelser, Kongelige Danske Videnskabernes Selskab 23:1-34, 1958a, Can J Math 10:202-209, 1958b), Michel et al. (Comput Math Appl 55:989-996, 2008a, Theor Comput Sci 401:17-26, 2008b, Inf Comput 212:55-63, 2012), Michel and Pirillo (Int J Comb 2011:659567, 2011), and Fimmel and Strüngmann (J Theor Biol 389:206-213, 2016). Based on a recent approach using graph theory to study circular codes Fimmel et al. (Philos Trans R Soc 374:20150058, 2016), a new class of circular codes, called strong comma-free codes, is identified. These codes detect a frameshift during the translation process immediately after a reading window of at most two nucleotides. We describe several combinatorial properties of strong comma-free codes: enumeration, maximality, self-complementarity and [Formula: see text]-property (comma-free property in all the three possible frames). These combinatorial results also highlight some new properties of the genetic code and its evolution. Each amino acid in the standard genetic code is coded by at least one strong comma-free code of size 1. There are 9 amino acids [Formula: see text] among 20 such that for each amino acid from S, its synonymous trinucleotide set (excluding the necessary periodic trinucleotides [Formula: see text]) is a strong comma-free code. The primeval comma-free RNY code of Eigen and Schuster (Naturwissenschaften 65:341-369, 1978) is a self-complementary [Formula: see text]-code of size 16. Furthermore, it is the union of two strong comma-free codes of size 8 which are complementary to each other.
Asunto(s)
Aminoácidos , Código Genético , Modelos Genéticos , Codón , NucleótidosRESUMEN
The genetic code is degenerate. With the exception of two amino acids (Met and Trp), all other amino acid residues are each encoded by multiple, so-called synonymous codons. Synonymous codons were initially presumed to have entirely equivalent functions, however, the finding that synonymous codons are not present at equal frequencies in genes/genomes suggested that codon choice might have functional implications beyond amino acid coding. The pattern of non-uniform codon use (known as codon usage bias) varies between organisms and represents a unique feature of an organism. Organism-specific codon choice is related to organism-specific differences in populations of cognate tRNAs. This implies that, in a given organism, frequently used codons will be translated more rapidly than infrequently used ones and vice versa A theory of codon-tRNA co-evolution (necessary to balance accurate and efficient protein production) was put forward to explain the existence of codon usage bias. This model suggests that selection favours preferred (frequent) over un-preferred (rare) codons in order to sustain efficient protein production in cells and that a given un-preferred codon will have the same effect on an organism's fitness regardless of its position within an mRNA's open reading frame. However, many recent studies refute this prediction. Un-preferred codons have been found to have important functional roles and their effects appeared to be position-dependent. Synonymous codon usage affects the efficiency/stringency of mRNA decoding, mRNA biogenesis/stability, and protein secretion and folding. This review summarizes recent developments in the field that have identified novel functions of synonymous codons and their usage.
Asunto(s)
Codón/genética , Código Genético/genética , ARN Mensajero/genética , Evolución Molecular , Sistemas de Lectura Abierta/genética , Biosíntesis de Proteínas/genética , Estabilidad del ARN , ARN Mensajero/biosíntesis , ARN Mensajero/metabolismo , ARN de Transferencia/genéticaRESUMEN
The genetic code in most organisms codes for 20 proteinogenic amino acids or translation stop. In order to encode more than 20 amino acids in the coding system, one of stop codons is usually reprogrammed to encode a non-proteinogenic amino acid. Although this approach works, usually only one amino acid is added to the amino acid repertoire. In this study, we incorporated non-proteinogenic amino acids into a protein by using a sense codon. As all the codons are allocated in the universal genetic code, we destroyed all the tRNA(Arg) in a cell-free protein synthesis system by using a tRNA(Arg) -specific tRNase, colicinâ D. Then by supplementing the system with tRNACCU , the translation system was partially restored. Through this creative destruction, reprogrammable codons were successfully created in the system to encode modified lysines along with the 20 proteinogenic amino acids.
Asunto(s)
Arginina/genética , Evolución Molecular Dirigida , Código Genético , Codón , Colicinas/metabolismo , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas Fluorescentes Verdes/biosíntesis , Proteínas Fluorescentes Verdes/genética , Biosíntesis de Proteínas/genética , ARN de Transferencia de Arginina/genética , ARN de Transferencia de Arginina/metabolismoRESUMEN
The problem of retrieval and maintenance of the correct reading frame plays a significant role in RNA transcription. Circular codes, and especially comma-free codes, can help to understand the underlying mechanisms of error-detection in this process. In recent years much attention has been paid to the investigation of trinucleotide circular codes (see, for instance, Fimmel et al., 2014; Fimmel and Strüngmann, 2015a; Michel and Pirillo, 2012; Michel et al., 2012, 2008), while dinucleotide codes had been touched on only marginally, even though dinucleotides are associated to important biological functions. Recently, all maximal dinucleotide circular codes were classified (Fimmel et al., 2015; Michel and Pirillo, 2013). The present paper studies maximal dinucleotide comma-free codes and their close connection to maximal dinucleotide circular codes. We give a construction principle for such codes and provide a graphical representation that allows them to be visualized geometrically. Moreover, we compare the results for dinucleotide codes with the corresponding situation for trinucleotide maximal self-complementary C(3)-codes. Finally, the results obtained are discussed with respect to Crick׳s hypothesis about frame-shift-detecting codes without commas.