ABSTRACT
We tested the idea that ancestral class I and II aminoacyl-tRNA synthetases arose on opposite strands of the same gene. We assembled excerpted 94-residue Urgenes for class I tryptophanyl-tRNA synthetase (TrpRS) and class II histidyl-tRNA synthetase (HisRS) from a diverse group of species, by identifying and catenating three blocks coding for secondary structures that position the most highly conserved, active-site residues. The codon middle-base pairing frequency was 0.35 ± 0.0002 in all-by-all sense/antisense alignments for 211 TrpRS and 207 HisRS sequences, compared with frequencies between 0.22 ± 0.0009 and 0.27 ± 0.0005 for eight different representations of the null hypothesis. Clustering algorithms demonstrate further that profiles of middle-base pairing in the synthetase antisense alignments are correlated along the sequences from one species-pair to another, whereas this is not the case for similar operations on sets representing the null hypothesis. Most probable reconstructed sequences for ancestral nodes of maximum likelihood trees show that middle-base pairing frequency increases to approximately 0.42 ± 0.002 as bacterial trees approach their roots; ancestral nodes from trees including archaeal sequences show a less pronounced increase. Thus, contemporary and reconstructed sequences all validate important bioinformatic predictions based on descent from opposite strands of the same ancestral gene. They further provide novel evidence for the hypothesis that bacteria lie closer than archaea to the origin of translation. Moreover, the inverse polarity of genetic coding, together with a priori α-helix propensities, suggests that in-frame coding on opposite strands leads to similar secondary structures with opposite polarity, as observed in TrpRS and HisRS crystal structures.
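The middle-base pairing statistic reported above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the DNA alphabet, and the toy sequences are assumptions made for illustration. The sketch assumes two equal-length, in-frame coding sequences with no gaps. Codon i of the sense sequence is matched with codon n-1-i of the candidate antisense sequence, and a pair is counted when the two middle bases are Watson-Crick complements.

```python
# Hypothetical helper names; a sketch of the statistic, not the published method.
# Watson-Crick partners for a DNA alphabet (assumption: no ambiguity codes or gaps).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def codons(seq):
    """Split an in-frame coding sequence into a list of 3-base codons."""
    if len(seq) == 0 or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def reverse_complement(seq):
    """Return the reverse complement, read 5' to 3' on the opposite strand."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def middle_base_pairing_frequency(sense, antisense):
    """Fraction of antiparallel codon pairs whose middle bases are complementary.

    Both arguments are written 5' to 3'. Codon i of `sense` is compared with
    codon n-1-i of `antisense`, as it would be if the two genes were encoded
    in frame on opposite strands of one duplex.
    """
    a = codons(sense.upper())
    b = codons(antisense.upper())
    if len(a) != len(b):
        raise ValueError("sequences must contain the same number of codons")
    paired = sum(
        1 for x, y in zip(a, reversed(b)) if COMPLEMENT.get(x[1]) == y[1]
    )
    return paired / len(a)


if __name__ == "__main__":
    toy_sense = "ATGGCTTGGAAA"  # toy sequence, not real TrpRS data
    perfect = reverse_complement(toy_sense)
    # A true opposite strand pairs at every middle base.
    print(middle_base_pairing_frequency(toy_sense, perfect))
    # An unrelated toy sequence pairs at only some positions.
    print(middle_base_pairing_frequency("ATGAAA", "TTTGGG"))
```

For unrelated sequences with uniform base composition, one pairing in four would be expected by chance, which is in line with the 0.22 to 0.27 range quoted for the null representations.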
Subjects
Aminoacyl-tRNA Synthetases/genetics, Molecular Evolution, Histidine-tRNA Ligase/genetics, Tryptophan-tRNA Ligase/genetics, Bacteria/genetics, Base Sequence, Catalytic Domain, Codon, Protein Secondary Structure
ABSTRACT
How Nature discovered genetic coding is a largely ignored question, yet the answer is key to explaining the transition from biochemical building blocks to life. Other, related puzzles also fall inside the aegis enclosing the codes themselves. The peptide bond is unstable with respect to hydrolysis. So, it requires some form of chemical free energy to drive it. Amino acid activation and acyl transfer are also slow and must be catalyzed. All living things must thus also convert free energy and synchronize cellular chemistry. Most importantly, functional proteins occupy only small, isolated regions of sequence space. Nature evolved heritable symbolic data processing to seek out and use those sequences. That system has three parts: a memory of how amino acids behave in solution and inside proteins, a set of code keys to access that memory, and a scoring function. The code keys themselves are the genes for cognate pairs of tRNA and aminoacyl-tRNA synthetases, AARSs. The scoring function is the enzymatic specificity constant, kcat/kM, which measures both catalysis and specificity. The work described here deepens the evidence for and understanding of an unexpected consequence of ancestral bidirectional coding. Secondary structures occur in approximately the same places within antiparallel alignments of their gene products. However, the polar amino acids that define the molecular surface of one are reflected into core-defining non-polar side chains on the other. Proteins translated from base-paired coding strands fold up inside out. Bidirectional genes thus project an inverted structural duality into the proteome. I review how experimental data root the scoring functions responsible for the origins of coding and catalyzed activation of unfavorable chemical reactions in that duality.
ABSTRACT
Septins are GTP-binding proteins conserved across metazoans. They can polymerize into extended filaments and, hence, are considered a component of the cytoskeleton. The number of individual septins varies across the tree of life: yeast (Saccharomyces cerevisiae) has seven distinct subunits, a nematode (Caenorhabditis elegans) has two, and humans have 13. However, the overall geometric unit (an apolar hetero-octameric protomer and filaments assembled therefrom) has been conserved. To understand septin evolutionary variation, we focused on a related pair of yeast subunits (Cdc11 and Shs1) that appear to have arisen from gene duplication within the fungal clade. Either Cdc11 or Shs1 occupies the terminal position within a hetero-octamer, yet Cdc11 is essential for septin function and cell viability, whereas Shs1 is not. To discern the molecular basis of this divergence, we utilized ancestral gene reconstruction to predict, synthesize, and experimentally examine the most recent common ancestor ("Anc.11-S") of Cdc11 and Shs1. Anc.11-S was able to occupy the terminal position within an octamer, just like the modern subunits. Although Anc.11-S supplied many of the known functions of Cdc11, it was unable to replace the distinct function(s) of Shs1. To further evaluate the history of Shs1, additional intermediates along a proposed trajectory from Anc.11-S to yeast Shs1 were generated and tested. We demonstrate that multiple events contributed to the current properties of Shs1: (1) loss of Shs1-Shs1 self-association early after duplication, (2) co-evolution of heterotypic Cdc11-Shs1 interaction between neighboring hetero-octamers, and (3) eventual repurposing and acquisition of novel function(s) for its C-terminal extension domain. Thus, a pair of duplicated proteins, despite constraints imposed by assembly into a highly conserved multi-subunit structure, could evolve new functionality via a complex evolutionary pathway.
Subjects
Cell Cycle Proteins, Saccharomyces cerevisiae Proteins, Saccharomyces cerevisiae, Cell Cycle Proteins/metabolism, Cytoskeletal Proteins, Molecular Evolution, Protein Subunits/metabolism, Saccharomyces cerevisiae/metabolism, Saccharomyces cerevisiae Proteins/metabolism, Septins/metabolism
ABSTRACT
Genes encoding nuclear receptors (NRs) are attractive as candidates for investigating the evolution of gene regulation because they (1) have a direct effect on gene expression and (2) modulate many cellular processes that underlie development. We employed a three-phase investigation linking NR molecular evolution among primates with direct experimental assessment of NR function. Phase 1 was an analysis of NR domain evolution and the results were used to guide the design of phase 2, a codon-model-based survey for alterations of natural selection within the hominids. By using a series of reliability and robustness analyses we selected a single gene, NR2C1, as the best candidate for experimental assessment. We carried out assays to determine whether changes between the ancestral and extant NR2C1s could have impacted stem cell pluripotency (phase 3). We evaluated human, chimpanzee, and ancestral NR2C1 for transcriptional modulation of Oct4 and Nanog (key regulators of pluripotency and cell lineage commitment), promoter activity for Pepck (a proxy for differentiation in numerous cell types), and average size of embryological stem cell colonies (a proxy for the self-renewal capacity of pluripotent cells). Results supported the signal for alteration of natural selection identified in phase 2. We suggest that adaptive evolution of gene regulation has impacted several aspects of pluripotentiality within primates. Our study illustrates that the combination of targeted evolutionary surveys and experimental analysis is an effective strategy for investigating the evolution of gene regulation with respect to developmental phenotypes.