Search | VHL CLAP/WR-PAHO/WHO

1.

Selection on synonymous sites: the unwanted transcript hypothesis.

Radrizzani, Sofia; Kudla, Grzegorz; Izsvák, Zsuzsanna; Hurst, Laurence D.

Nat Rev Genet ; 25(6): 431-448, 2024 Jun.

Article in English | MEDLINE | ID: mdl-38297070

ABSTRACT

Although translational selection to favour codons that match the most abundant tRNAs is not readily observed in humans, there is nonetheless selection in humans on synonymous mutations. We hypothesize that much of this synonymous site selection can be explained in terms of protection against unwanted RNAs - spurious transcripts, mis-spliced forms or RNAs derived from transposable elements or viruses. We propose not only that selection on synonymous sites functions to reduce the rate of creation of unwanted transcripts (for example, through selection on exonic splice enhancers and cryptic splice sites) but also that high-GC content (but low-CpG content), together with intron presence and position, is both particular to functional native mRNAs and used to recognize transcripts as native. In support of this hypothesis, transcription, nuclear export, liquid phase condensation and RNA degradation have all recently been shown to promote GC-rich transcripts and suppress AU/CpG-rich ones. With such 'traps' being set against AU/CpG-rich transcripts, the codon usage of native genes has, in turn, evolved to avoid such suppression. That parallel filters against AU/CpG-rich transcripts also affect the endosomal import of RNAs further supports the unwanted transcript hypothesis of synonymous site selection and explains the similar design rules that have enabled the successful use of transgenes and RNA vaccines.

Subject(s)

RNA, Messenger , Humans , RNA, Messenger/genetics , RNA, Messenger/metabolism , Animals , Models, Genetic , Silent Mutation , Base Composition , Selection, Genetic , Transcription, Genetic

2.

People with more extreme attitudes towards science have self-confidence in their understanding of science, even if this is not justified.

Fonseca, Cristina; Pettitt, Jonathan; Woollard, Alison; Rutherford, Adam; Bickmore, Wendy; Ferguson-Smith, Anne; Hurst, Laurence D.

PLoS Biol ; 21(1): e3001915, 2023 01.

Article in English | MEDLINE | ID: mdl-36693040

ABSTRACT

People differ greatly in their attitudes towards well-evidenced science. What characterises this variation? Here, we consider this issue in the context of genetics and allied sciences. While most prior research has focused on the relationship between attitude to science and what people know about it, recent evidence suggests that individuals with strongly negative attitudes towards specific genetic technologies (genetic modification (GM) technology and vaccines) commonly do not objectively understand the science, but, importantly, believe that they do. Here, using data from a probability survey of United Kingdom adults, we extend this prior work in 2 regards. First, we ask whether people with more extreme attitudes, be they positive or negative, are more likely to believe that they understand the science. Second, as negativity to genetics is commonly framed around issues particular to specific technologies, we ask whether attitudinal trends are contingent on specification of technology. We find (1) that individuals with strongly positive or negative attitudes towards genetics more strongly believe that they well understand the science; but (2) only for those most positive to the science is this self-confidence warranted; and (3) these effects are not contingent on specification of any particular technologies. These results suggest a potentially general model to explain why people differ in their degree of acceptance or rejection of science, this being that the more someone believes they understand the science, the more confident they will be in their acceptance or rejection of it. While there are more technology nonspecific opponents who also oppose GM technology than expected by chance, most GM opponents fit a different demographic. For the most part, opposition to GM appears not to reflect a smokescreen concealing a broader underlying negativity.

Subject(s)

Attitude , Technology , Adult , Humans , United Kingdom , Surveys and Questionnaires

3.

A new human embryonic cell type associated with activity of young transposable elements allows definition of the inner cell mass.

Singh, Manvendra; Kondrashkina, Aleksandra M; Widmann, Thomas J; Cortes, Jose L; Bansal, Vikas; Wang, Jichang; Römer, Christine; Garcia-Canadas, Marta; Garcia-Perez, Jose L; Hurst, Laurence D; Izsvák, Zsuzsanna.

PLoS Biol ; 21(6): e3002162, 2023 06.

Article in English | MEDLINE | ID: mdl-37339119

ABSTRACT

There remains much that we do not understand about the earliest stages of human development. On a gross level, there is evidence for apoptosis, but the nature of the affected cell types is unknown. Perhaps most importantly, the inner cell mass (ICM), from which the foetus is derived and hence of interest in reproductive health and regenerative medicine, has proven hard to define. Here, we provide a multi-method analysis of the early human embryo to resolve these issues. Single-cell analysis (on multiple independent datasets), supported by embryo visualisation, uncovers a common previously uncharacterised class of cells lacking commitment markers that segregates after embryonic gene activation (EGA) and shortly after undergo apoptosis. The discovery of this cell type allows us to clearly define their viable ontogenetic sisters, these being the cells of the ICM. While ICM is characterised by the activity of an Old non-transposing endogenous retrovirus (HERVH) that acts to suppress Young transposable elements, the new cell type, by contrast, expresses transpositionally competent Young elements and DNA-damage response genes. As the Young elements are RetroElements and the cells are excluded from the developmental process, we dub these REject cells. With these and ICM being characterised by differential mobile element activities, the human embryo may be a "selection arena" in which one group of cells selectively die, while other less damaged cells persist.

Subject(s)

Blastocyst , DNA Transposable Elements , Humans , DNA Transposable Elements/genetics , Blastocyst/metabolism , Embryo, Mammalian

4.

Selfish centromeres and the wastefulness of human reproduction.

Hurst, Laurence D.

PLoS Biol ; 20(7): e3001671, 2022 07.

Article in English | MEDLINE | ID: mdl-35788750

ABSTRACT

Many human embryos die in utero owing to an excess or deficit of chromosomes, a phenomenon known as aneuploidy; this is largely a consequence of nondisjunction during maternal meiosis I. Asymmetries of this division render it vulnerable to selfish centromeres that promote their own transmission, these being thought to somehow underpin aneuploidy. In this essay, I suggest that these vulnerabilities provide only half the solution to the enigma. In mammals, as in utero and postnatal provisioning is continuous, the costs of early death are mitigated. With such reproductive compensation, selection can favour a centromere because it induces lethal aneuploidy: if, when taken towards the polar body, it instead kills the embryo via aneuploidy, it gains. The model is consistent with the observation that reduced dosage of a murine drive suppressor induces aneuploidy and with the fact that high aneuploidy rates in vertebrates are seen exclusively in mammals. I propose further tests of this idea. The wastefulness of human reproduction may be a price we pay for nurturing our offspring.

Subject(s)

Aneuploidy , Centromere , Animals , Embryo, Mammalian , Humans , Mammals , Meiosis , Mice , Reproduction

5.

Unusual mammalian usage of TGA stop codons reveals that sequence conservation need not imply purifying selection.

Ho, Alexander Thomas; Hurst, Laurence Daniel.

PLoS Biol ; 20(5): e3001588, 2022 05.

Article in English | MEDLINE | ID: mdl-35550630

ABSTRACT

The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GCâAT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAAâTGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA <-> TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAAâTGA rates in mammals are thus consistent with gBGC's predicted ability to "drive" deleterious mutations and supports the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.

Subject(s)

Evolution, Molecular , Gene Conversion , Animals , Base Composition , Codon, Terminator , Mammals/genetics , Selection, Genetic

6.

In rice splice variants that restore the reading frame after frameshifting indel introduction are common, often induced by the indels and sometimes lead to organism-level rescue.

Jia, Yanxiao; Qin, Chao; Traw, Milton Brian; Chen, Xiaonan; He, Ying; Kai, Jing; Yang, Sihai; Wang, Long; Hurst, Laurence D.

PLoS Genet ; 18(2): e1010071, 2022 02.

Article in English | MEDLINE | ID: mdl-35180223

ABSTRACT

The introduction of frameshifting non-3n indels enables the identification of gene-trait associations. However, it has been hypothesised that recovery of the original reading frame owing to usage of non-canonical splice forms could cause rescue. To date there is very little evidence for organism-level rescue by such a mechanism and it is unknown how commonly indels induce, or are otherwise associated with, frame-restoring splice forms. We perform CRISPR/Cas9 editing of randomly selected loci in rice to investigate these issues. We find that the majority of loci have a frame-restoring isoform. Importantly, three quarters of these isoforms are not seen in the absence of the indels, consistent with indels commonly inducing novel isoforms. This is supported by analysis in the context of NMD knockdowns. We consider in detail the two top rescue candidates, in wax deficient anther 1 (wda1) and brittle culm (bc10), finding that organismal-level rescue in both cases is strong but owing to different splice modification routes. More generally, however, as frame-restoring isoforms are low abundance and possibly too disruptive, such rescue we suggest to be the rare exception, not the rule. Nonetheless, assuming that indels commonly induce frame-restoring isoforms, these results emphasize the need to examine RNA level effects of non-3n indels and suggest that multiple non-3n indels in any given gene are advisable to probe a gene's trait associations.

Subject(s)

Oryza , INDEL Mutation/genetics , Oryza/genetics , Reading Frames

7.

Mitotic gene conversion can be as important as meiotic conversion in driving genetic variability in plants and other species without early germline segregation.

Jia, Xianqing; Zhang, Qijun; Jiang, Mengmeng; Huang, Ju; Yu, Luyao; Traw, Milton Brian; Tian, Dacheng; Hurst, Laurence D; Yang, Sihai.

PLoS Biol ; 19(3): e3001164, 2021 03.

Article in English | MEDLINE | ID: mdl-33750968

ABSTRACT

In contrast to common meiotic gene conversion, mitotic gene conversion, because it is so rare, is often ignored as a process influencing allelic diversity. We show that if there is a large enough number of premeiotic cell divisions, as seen in many organisms without early germline sequestration, such as plants, this is an unsafe position. From examination of 1.1 million rice plants, we determined that the rate of mitotic gene conversion events, per mitosis, is 2 orders of magnitude lower than the meiotic rate. However, owing to the large number of mitoses between zygote and gamete and because of long mitotic tract lengths, meiotic and mitotic gene conversion can be of approximately equivalent importance in terms of numbers of markers converted from zygote to gamete. This holds even if we assume a low number of premeiotic cell divisions (approximately 40) as witnessed in Arabidopsis. A low mitotic rate associated with long tracts is also seen in yeast, suggesting generality of results. For species with many mitoses between each meiotic event, mitotic gene conversion should not be overlooked.

Subject(s)

Gene Conversion/genetics , Genetic Variation/genetics , Oryza/genetics , Plants/genetics , Alleles , Gene Conversion/physiology , Genotype , Germ Cells/metabolism , Meiosis/genetics , Mitosis/genetics , Recombination, Genetic/genetics

8.

Genes for highly abundant proteins in Escherichia coli avoid 5' codons that promote ribosomal initiation.

Lewin, Loveday E; Daniels, Kate G; Hurst, Laurence D.

PLoS Comput Biol ; 19(10): e1011581, 2023 10.

Article in English | MEDLINE | ID: mdl-37878567

ABSTRACT

In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such "translationally optimal" codons has little to no effect on the resultant protein level. By contrast, consistent with the view that ribosomal initiation is rate limiting, synonymous codon usage following the 5' ATG greatly influences protein levels, at least in part by modifying RNA stability. For the design of bacterial transgenes, for simple codon based in silico inference of protein levels and for understanding selection on synonymous mutations, it would be valuable to computationally determine initiation optimality (IO) scores for codons for any given species. One attractive approach is to characterize the 5' codon enrichment of HEGs compared with the most lowly expressed genes, just as translational optimality scores of codons have been similarly defined employing the full gene body. Here we determine the viability of this approach employing a unique opportunity: for Escherichia coli there is both the most extensive protein abundance data for native genes and a unique large-scale transgene codon randomization experiment enabling objective definition of the 5' codons that cause, rather than just correlate with, high protein abundance (that we equate with initiation optimality, broadly defined). Surprisingly, the 5' ends of native genes that specify highly abundant proteins avoid such initiation optimal codons. We find that this is probably owing to conflicting selection pressures particular to native HEGs, including selection favouring low initiation rates, this potentially enabling high efficiency of ribosomal usage and low noise. While the classical HEG enrichment approach does not work, rendering simple prediction of native protein abundance from 5' codon content futile, we report evidence that initiation optimality scores derived from the transgene experiment may hold relevance for in silico transgene design for a broad spectrum of bacteria.

Subject(s)

Escherichia coli , Protein Biosynthesis , Escherichia coli/genetics , Protein Biosynthesis/genetics , Codon/genetics , Ribosomes/genetics , Codon Usage , Bacteria/genetics

9.

Re-evaluating evidence for adaptive mutation rate variation.

Wang, Long; Ho, Alexander T; Hurst, Laurence D; Yang, Sihai.

Nature ; 619(7971): E52-E56, 2023 Jul.

Article in English | MEDLINE | ID: mdl-37495884

Subject(s)

Arabidopsis , Mutation Rate , Mutation , Selection, Genetic

10.

Variation in Release Factor Abundance Is Not Needed to Explain Trends in Bacterial Stop Codon Usage.

Ho, Alexander T; Hurst, Laurence D.

Mol Biol Evol ; 39(1)2022 01 07.

Article in English | MEDLINE | ID: mdl-34751397

ABSTRACT

In bacteria stop codons are recognized by one of two class I release factors (RF1) recognizing TAG, RF2 recognizing TGA, and TAA being recognized by both. Variation across bacteria in the relative abundance of RF1 and RF2 is thus hypothesized to select for different TGA/TAG usage. This has been supported by correlations between TAG:TGA ratios and RF1:RF2 ratios across multiple bacterial species, potentially also explaining why TAG usage is approximately constant despite extensive variation in GC content. It is, however, possible that stop codon trends are determined by other forces and that RF ratios adapt to stop codon usage, rather than vice versa. Here, we determine which direction of the causal arrow is the more parsimonious. Our results support the notion that RF1/RF2 ratios become adapted to stop codon usage as the same trends, notably the anomalous TAG behavior, are seen in contexts where RF1:RF2 ratios cannot be, or are unlikely to be, causative, that is, at 3'untranslated sites never used for translation termination, in intragenomic analyses, and across archaeal species (that possess only one RF1). We conclude that specifics of RF biology are unlikely to fully explain TGA/TAG relative usage. We discuss why the causal relationships for the evolution of synonymous stop codon usage might be different from those affecting synonymous sense codon usage, noting that transitions between TGA and TAG require two-point mutations one of which is likely to be deleterious.

Subject(s)

Bacteria , Codon Usage , Codon, Terminator , Peptide Termination Factors , Bacteria/genetics , Base Composition , Peptide Termination Factors/genetics

11.

A Novel Gene Controls a New Structure: PiggyBac Transposable Element-Derived 1, Unique to Mammals, Controls Mammal-Specific Neuronal Paraspeckles.

Raskó, Tamás; Pande, Amit; Radscheit, Kathrin; Zink, Annika; Singh, Manvendra; Sommer, Christian; Wachtl, Gerda; Kolacsek, Orsolya; Inak, Gizem; Szvetnik, Attila; Petrakis, Spyros; Bunse, Mario; Bansal, Vikas; Selbach, Matthias; Orbán, Tamás I; Prigione, Alessandro; Hurst, Laurence D; Izsvák, Zsuzsanna.

Mol Biol Evol ; 39(10)2022 10 07.

Article in English | MEDLINE | ID: mdl-36205081

ABSTRACT

Although new genes can arrive from modes other than duplication, few examples are well characterized. Given high expression in some human brain subregions and a putative link to psychological disorders [e.g., schizophrenia (SCZ)], suggestive of brain functionality, here we characterize piggyBac transposable element-derived 1 (PGBD1). PGBD1 is nonmonotreme mammal-specific and under purifying selection, consistent with functionality. The gene body of human PGBD1 retains much of the original DNA transposon but has additionally captured SCAN and KRAB domains. Despite gene body retention, PGBD1 has lost transposition abilities, thus transposase functionality is absent. PGBD1 no longer recognizes piggyBac transposon-like inverted repeats, nonetheless PGBD1 has DNA binding activity. Genome scale analysis identifies enrichment of binding sites in and around genes involved in neuronal development, with association with both histone activating and repressing marks. We focus on one of the repressed genes, the long noncoding RNA NEAT1, also dysregulated in SCZ, the core structural RNA of paraspeckles. DNA binding assays confirm specific binding of PGBD1 both in the NEAT1 promoter and in the gene body. Depletion of PGBD1 in neuronal progenitor cells (NPCs) results in increased NEAT1/paraspeckles and differentiation. We conclude that PGBD1 has evolved core regulatory functionality for the maintenance of NPCs. As paraspeckles are a mammal-specific structure, the results presented here show a rare example of the evolution of a novel gene coupled to the evolution of a contemporaneous new structure.

Subject(s)

DNA Transposable Elements , RNA, Long Noncoding , Animals , Cell Nucleus/genetics , Histones/metabolism , Humans , Mammals/genetics , Mammals/metabolism , Nerve Tissue Proteins , Paraspeckles , RNA, Long Noncoding/metabolism , Transposases/genetics , Transposases/metabolism

12.

Transgene-design: a web application for the design of mammalian transgenes.

Mühlhausen, Stefanie; Hurst, Laurence D.

Bioinformatics ; 38(9): 2626-2627, 2022 04 28.

Article in English | MEDLINE | ID: mdl-35244144

ABSTRACT

SUMMARY: Transgene-design is a web application to help design transgenes for use in mammalian studies. It is predicated on the recent discovery that human intronless transgenes and native retrogenes can be expressed very effectively if the GC content at exonic synonymous sites is high. In addition, as exonic splice enhancers resident in intron containing genes may have different utility in intronless genes, these can be reduced or increased in density. Input can be a native gene or a commercially 'optimised' gene. The option to leave in the first intron and to protect or avoid other motifs is also permitted. AVAILABILITY AND IMPLEMENTATION: Transgene-design is based on a ruby for rails platform. The application is available at https://transgene-design.bath.ac.uk. The code is available under GNU General Public License from GitHub (https://github.com/smuehlh/transgenes). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

Mammals , Software , Animals , Humans , Introns , Exons , Transgenes , Mutation , Mammals/genetics

13.

Evidence in disease and non-disease contexts that nonsense mutations cause altered splicing via motif disruption.

Abrahams, Liam; Savisaar, Rosina; Mordstein, Christine; Young, Bethan; Kudla, Grzegorz; Hurst, Laurence D.

Nucleic Acids Res ; 49(17): 9665-9685, 2021 09 27.

Article in English | MEDLINE | ID: mdl-34469537

ABSTRACT

Transcripts containing premature termination codons (PTCs) can be subject to nonsense-associated alternative splicing (NAS). Two models have been evoked to explain this, scanning and splice motif disruption. The latter postulates that exonic cis motifs, such as exonic splice enhancers (ESEs), are disrupted by nonsense mutations. We employ genome-wide transcriptomic and k-mer enrichment methods to scrutinize this model. First, we show that ESEs are prone to disruptive nonsense mutations owing to their purine richness and paucity of TGA, TAA and TAG. The motif model correctly predicts that NAS rates should be low (we estimate 5-30%) and approximately in line with estimates for the rate at which random point mutations disrupt splicing (8-20%). Further, we find that, as expected, NAS-associated PTCs are predictable from nucleotide-based machine learning approaches to predict splice disruption and, at least for pathogenic variants, are enriched in ESEs. Finally, we find that both in and out of frame mutations to TAA, TGA or TAG are associated with exon skipping. While a higher relative frequency of such skip-inducing mutations in-frame than out of frame lends some credence to the scanning model, these results reinforce the importance of considering splice motif modulation to understand the etiology of PTC-associated disease.

Subject(s)

Alternative Splicing , Codon, Nonsense , Regulatory Sequences, Ribonucleic Acid , Codon, Terminator , Disease/genetics , Exons , HEK293 Cells , HeLa Cells , Humans , Nonsense Mediated mRNA Decay , Nucleotide Motifs , Nucleotides/chemistry

14.

Active and repressed biosynthetic gene clusters have spatially distinct chromosome states.

Nützmann, Hans-Wilhelm; Doerr, Daniel; Ramírez-Colmenero, América; Sotelo-Fonseca, Jesús Emiliano; Wegel, Eva; Di Stefano, Marco; Wingett, Steven W; Fraser, Peter; Hurst, Laurence; Fernandez-Valverde, Selene L; Osbourn, Anne.

Proc Natl Acad Sci U S A ; 117(24): 13800-13809, 2020 06 16.

Article in English | MEDLINE | ID: mdl-32493747

ABSTRACT

While colocalization within a bacterial operon enables coexpression of the constituent genes, the mechanistic logic of clustering of nonhomologous monocistronic genes in eukaryotes is not immediately obvious. Biosynthetic gene clusters that encode pathways for specialized metabolites are an exception to the classical eukaryote rule of random gene location and provide paradigmatic exemplars with which to understand eukaryotic cluster dynamics and regulation. Here, using 3C, Hi-C, and Capture Hi-C (CHi-C) organ-specific chromosome conformation capture techniques along with high-resolution microscopy, we investigate how chromosome topology relates to transcriptional activity of clustered biosynthetic pathway genes in Arabidopsis thaliana Our analyses reveal that biosynthetic gene clusters are embedded in local hot spots of 3D contacts that segregate cluster regions from the surrounding chromosome environment. The spatial conformation of these cluster-associated domains differs between transcriptionally active and silenced clusters. We further show that silenced clusters associate with heterochromatic chromosomal domains toward the periphery of the nucleus, while transcriptionally active clusters relocate away from the nuclear periphery. Examination of chromosome structure at unrelated clusters in maize, rice, and tomato indicates that integration of clustered pathway genes into distinct topological domains is a common feature in plant genomes. Our results shed light on the potential mechanisms that constrain coexpression within clusters of nonhomologous eukaryotic genes and suggest that gene clustering in the one-dimensional chromosome is accompanied by compartmentalization of the 3D chromosome.

Subject(s)

Arabidopsis/genetics , Chromosomes, Plant/genetics , Multigene Family , Plant Proteins/genetics , Solanum lycopersicum/genetics , Zea mays/genetics , Arabidopsis/metabolism , Chromosomes, Plant/metabolism , Genome, Plant , Solanum lycopersicum/metabolism , Oryza/genetics , Oryza/metabolism , Plant Proteins/metabolism , Zea mays/metabolism

15.

Effective Population Size Predicts Local Rates but Not Local Mitigation of Read-through Errors.

Ho, Alexander T; Hurst, Laurence D.

Mol Biol Evol ; 38(1): 244-262, 2021 01 04.

Article in English | MEDLINE | ID: mdl-32797190

ABSTRACT

In correctly predicting that selection efficiency is positively correlated with the effective population size (Ne), the nearly neutral theory provides a coherent understanding of between-species variation in numerous genomic parameters, including heritable error (germline mutation) rates. Does the same theory also explain variation in phenotypic error rates and in abundance of error mitigation mechanisms? Translational read-through provides a model to investigate both issues as it is common, mostly nonadaptive, and has good proxy for rate (TAA being the least leaky stop codon) and potential error mitigation via "fail-safe" 3' additional stop codons (ASCs). Prior theory of translational read-through has suggested that when population sizes are high, weak selection for local mitigation can be effective thus predicting a positive correlation between ASC enrichment and Ne. Contra to prediction, we find that ASC enrichment is not correlated with Ne. ASC enrichment, although highly phylogenetically patchy, is, however, more common both in unicellular species and in genes expressed in unicellular modes in multicellular species. By contrast, Ne does positively correlate with TAA enrichment. These results imply that local phenotypic error rates, not local mitigation rates, are consistent with a drift barrier/nearly neutral model.

Subject(s)

Codon, Terminator , Evolution, Molecular , Mutation Rate , Selection, Genetic , Arabidopsis , Dictyostelium , Population Density

16.

Inferring Adaptive Codon Preference to Understand Sources of Selection Shaping Codon Usage Bias.

de Oliveira, Janaina Lima; Morales, Atahualpa Castillo; Hurst, Laurence D; Urrutia, Araxi O; Thompson, Christopher R L; Wolf, Jason B.

Mol Biol Evol ; 38(8): 3247-3266, 2021 07 29.

Article in English | MEDLINE | ID: mdl-33871580

ABSTRACT

Alternative synonymous codons are often used at unequal frequencies. Classically, studies of such codon usage bias (CUB) attempted to separate the impact of neutral from selective forces by assuming that deviations from a predicted neutral equilibrium capture selection. However, GC-biased gene conversion (gBGC) can also cause deviation from a neutral null. Alternatively, selection has been inferred from CUB in highly expressed genes, but the accuracy of this approach has not been extensively tested, and gBGC can interfere with such extrapolations (e.g., if expression and gene conversion rates covary). It is therefore critical to examine deviations from a mutational null in a species with no gBGC. To achieve this goal, we implement such an analysis in the highly AT rich genome of Dictyostelium discoideum, where we find no evidence of gBGC. We infer neutral CUB under mutational equilibrium to quantify "adaptive codon preference," a nontautologous genome wide quantitative measure of the relative selection strength driving CUB. We observe signatures of purifying selection consistent with selection favoring adaptive codon preference. Preferred codons are not GC rich, underscoring the independence from gBGC. Expression-associated "preference" largely matches adaptive codon preference but does not wholly capture the influence of selection shaping patterns across all genes, suggesting selective constraints associated specifically with high expression. We observe patterns consistent with effects on mRNA translation and stability shaping adaptive codon preference. Thus, our approach to quantifying adaptive codon preference provides a framework for inferring the sources of selection that shape CUB across different contexts within the genome.

Subject(s)

Codon Usage , Dictyostelium/genetics , Selection, Genetic , Adaptation, Biological , Base Composition , Protein Biosynthesis , RNA, Transfer/metabolism

17.

Evidence for Strong Mutation Bias toward, and Selection against, U Content in SARS-CoV-2: Implications for Vaccine Design.

Rice, Alan M; Castillo Morales, Atahualpa; Ho, Alexander T; Mordstein, Christine; Mühlhausen, Stefanie; Watson, Samir; Cano, Laura; Young, Bethan; Kudla, Grzegorz; Hurst, Laurence D.

Mol Biol Evol ; 38(1): 67-83, 2021 01 04.

Article in English | MEDLINE | ID: mdl-32687176

ABSTRACT

Large-scale re-engineering of synonymous sites is a promising strategy to generate vaccines either through synthesis of attenuated viruses or via codon-optimized genes in DNA vaccines. Attenuation typically relies on deoptimization of codon pairs and maximization of CpG dinucleotide frequencies. So as to formulate evolutionarily informed attenuation strategies that aim to force nucleotide usage against the direction favored by selection, here, we examine available whole-genome sequences of SARS-CoV-2 to infer patterns of mutation and selection on synonymous sites. Analysis of mutational profiles indicates a strong mutation bias toward U. In turn, analysis of observed synonymous site composition implicates selection against U. Accounting for dinucleotide effects reinforces this conclusion, observed UU content being a quarter of that expected under neutrality. Possible mechanisms of selection against U mutations include selection for higher expression, for high mRNA stability or lower immunogenicity of viral genes. Consistent with gene-specific selection against CpG dinucleotides, we observe systematic differences of CpG content between SARS-CoV-2 genes. We propose an evolutionarily informed approach to attenuation that, unusually, seeks to increase usage of the already most common synonymous codons. Comparable analysis of H1N1 and Ebola finds that GC3 deviated from neutral equilibrium is not a universal feature, cautioning against generalization of results.

Subject(s)

COVID-19 Vaccines/genetics , COVID-19/genetics , Genome, Viral , Mutation , SARS-CoV-2/genetics , Selection, Genetic , COVID-19/prevention & control , Humans , RNA Stability/genetics , RNA, Messenger/genetics , RNA, Viral/genetics , Uracil

18.

The architecture of intra-organism mutation rate variation in plants.

Wang, Long; Ji, Yilun; Hu, Yingwen; Hu, Huaying; Jia, Xianqin; Jiang, Mengmeng; Zhang, Xiaohui; Zhao, Lina; Zhang, Yanchun; Jia, Yanxiao; Qin, Chao; Yu, Luyao; Huang, Ju; Yang, Sihai; Hurst, Laurence D; Tian, Dacheng.

PLoS Biol ; 17(4): e3000191, 2019 04.

Article in English | MEDLINE | ID: mdl-30964866

ABSTRACT

Given the disposability of somatic tissue, selection can favor a higher mutation rate in the early segregating soma than in germline, as seen in some animals. Although in plants intra-organismic mutation rate heterogeneity is poorly resolved, the same selectionist logic can predict a lower rate in shoot than in root and in longer-lived terminal tissues (e.g., leaves) than in ontogenetically similar short-lived ones (e.g., petals), and that mutation rate heterogeneity should be deterministic with no significant differences between biological replicates. To address these expectations, we sequenced 754 genomes from various tissues of eight plant species. Consistent with a selectionist model, the rate of mutation accumulation per unit time in shoot apical meristem is lower than that in root apical tissues in perennials, in which a high proportion of mutations in shoots are themselves transmissible, but not in annuals, in which somatic mutations tend not to be transmissible. Similarly, the number of mutations accumulated in leaves is commonly lower than that within a petal of the same plant, and there is no more heterogeneity in accumulation rates between replicate branches than expected by chance. High mutation accumulation in runners of strawberry is, we argue, the exception that proves the rule, as mutation transmission patterns indicate that runner has a restricted germline. However, we also find that in vitro callus tissue has a higher mutation rate (per unit time) than the wild-grown comparator, suggesting nonadaptive mutational "fragility". As mutational fragility does not obviously explain why the shoot-root difference varies with plant longevity, we conclude that some mutation rate variation between tissues is consistent with selectionist theory but that a mechanistic null of mutational fragility should be considered.

Subject(s)

Gene Expression Regulation, Plant/genetics , Mutation Rate , Plants/genetics , Genes, Plant/genetics , Germ Cells , Mutation/genetics , Mutation Accumulation , Plant Leaves/genetics , Plant Roots/genetics , Plant Shoots/genetics

19.

Open questions in the study of de novo genes: what, how and why.

McLysaght, Aoife; Hurst, Laurence D.

Nat Rev Genet ; 17(9): 567-78, 2016 09.

Article in English | MEDLINE | ID: mdl-27452112

ABSTRACT

The study of de novo protein-coding genes is maturing from the ad hoc reporting of individual cases to the systematic analysis of extensive genomic data from several species. We identify three key challenges for this emerging field: understanding how best to identify de novo genes, how they arise and why they spread. We highlight the intellectual challenges of understanding how a de novo gene becomes integrated into pre-existing functions and becomes essential. We suggest that, as with protein sequence evolution, antagonistic co-evolution may be key to de novo gene evolution, particularly for new essential genes and new cancer-associated genes.

Subject(s)

Evolution, Molecular , Genomics/methods , Open Reading Frames/genetics , Proteins/genetics , Humans , Phylogeny , Proteins/metabolism

20.

In eubacteria, unlike eukaryotes, there is no evidence for selection favouring fail-safe 3' additional stop codons.

Ho, Alexander T; Hurst, Laurence D.

PLoS Genet ; 15(9): e1008386, 2019 09.

Article in English | MEDLINE | ID: mdl-31527909

ABSTRACT

Errors throughout gene expression are likely deleterious, hence genomes are under selection to ameliorate their consequences. Additional stop codons (ASCs) are in-frame nonsense 'codons' downstream of the primary stop which may be read by translational machinery should the primary stop have been accidentally read through. Prior evidence in several eukaryotes suggests that ASCs are selected to prevent potentially-deleterious consequences of read-through. We extend this evidence showing that enrichment of ASCs is common but not universal for single cell eukaryotes. By contrast, there is limited evidence as to whether the same is true in other taxa. Here, we provide the first systematic test of the hypothesis that ASCs act as a fail-safe mechanism in eubacteria, a group with high read-through rates. Contra to the predictions of the hypothesis we find: there is paucity, not enrichment, of ASCs downstream; substitutions that degrade stops are more frequent in-frame than out-of-frame in 3' sequence; highly expressed genes are no more likely to have ASCs than lowly expressed genes; usage of the leakiest primary stop (TGA) in highly expressed genes does not predict ASC enrichment even compared to usage of non-leaky stops (TAA) in lowly expressed genes, beyond downstream codon +1. Any effect at the codon immediately proximal to the primary stop can be accounted for by a preference for a T/U residue immediately following the stop, although if anything, TT- and TC- starting codons are preferred. We conclude that there is no compelling evidence for ASC selection in eubacteria. This presents an unusual case in which the same error could be solved by the same mechanism in eukaryotes and prokaryotes but is not. We discuss two possible explanations: that, owing to the absence of nonsense mediated decay, bacteria may solve read-through via gene truncation and in eukaryotes certain prion states cause raised read-through rates.

Subject(s)

3' Untranslated Regions/genetics , Bacteria/genetics , Codon, Terminator/genetics , Codon, Nonsense/genetics , Eukaryota/genetics , Evolution, Molecular , Gene Expression/genetics , Genome , Nonsense Mediated mRNA Decay/genetics , Open Reading Frames/genetics

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL