Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 185
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nat Rev Genet ; 25(6): 431-448, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38297070

RESUMO

Although translational selection to favour codons that match the most abundant tRNAs is not readily observed in humans, there is nonetheless selection in humans on synonymous mutations. We hypothesize that much of this synonymous site selection can be explained in terms of protection against unwanted RNAs - spurious transcripts, mis-spliced forms or RNAs derived from transposable elements or viruses. We propose not only that selection on synonymous sites functions to reduce the rate of creation of unwanted transcripts (for example, through selection on exonic splice enhancers and cryptic splice sites) but also that high-GC content (but low-CpG content), together with intron presence and position, is both particular to functional native mRNAs and used to recognize transcripts as native. In support of this hypothesis, transcription, nuclear export, liquid phase condensation and RNA degradation have all recently been shown to promote GC-rich transcripts and suppress AU/CpG-rich ones. With such 'traps' being set against AU/CpG-rich transcripts, the codon usage of native genes has, in turn, evolved to avoid such suppression. That parallel filters against AU/CpG-rich transcripts also affect the endosomal import of RNAs further supports the unwanted transcript hypothesis of synonymous site selection and explains the similar design rules that have enabled the successful use of transgenes and RNA vaccines.


Assuntos
RNA Mensageiro , Humanos , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Animais , Modelos Genéticos , Mutação Silenciosa , Composição de Bases , Seleção Genética , Transcrição Gênica
2.
PLoS Biol ; 21(1): e3001915, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36693040

RESUMO

People differ greatly in their attitudes towards well-evidenced science. What characterises this variation? Here, we consider this issue in the context of genetics and allied sciences. While most prior research has focused on the relationship between attitude to science and what people know about it, recent evidence suggests that individuals with strongly negative attitudes towards specific genetic technologies (genetic modification (GM) technology and vaccines) commonly do not objectively understand the science, but, importantly, believe that they do. Here, using data from a probability survey of United Kingdom adults, we extend this prior work in 2 regards. First, we ask whether people with more extreme attitudes, be they positive or negative, are more likely to believe that they understand the science. Second, as negativity to genetics is commonly framed around issues particular to specific technologies, we ask whether attitudinal trends are contingent on specification of technology. We find (1) that individuals with strongly positive or negative attitudes towards genetics more strongly believe that they well understand the science; but (2) only for those most positive to the science is this self-confidence warranted; and (3) these effects are not contingent on specification of any particular technologies. These results suggest a potentially general model to explain why people differ in their degree of acceptance or rejection of science, this being that the more someone believes they understand the science, the more confident they will be in their acceptance or rejection of it. While there are more technology nonspecific opponents who also oppose GM technology than expected by chance, most GM opponents fit a different demographic. For the most part, opposition to GM appears not to reflect a smokescreen concealing a broader underlying negativity.


Assuntos
Atitude , Tecnologia , Adulto , Humanos , Reino Unido , Inquéritos e Questionários
3.
PLoS Biol ; 21(6): e3002162, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37339119

RESUMO

There remains much that we do not understand about the earliest stages of human development. On a gross level, there is evidence for apoptosis, but the nature of the affected cell types is unknown. Perhaps most importantly, the inner cell mass (ICM), from which the foetus is derived and hence of interest in reproductive health and regenerative medicine, has proven hard to define. Here, we provide a multi-method analysis of the early human embryo to resolve these issues. Single-cell analysis (on multiple independent datasets), supported by embryo visualisation, uncovers a common previously uncharacterised class of cells lacking commitment markers that segregates after embryonic gene activation (EGA) and shortly after undergo apoptosis. The discovery of this cell type allows us to clearly define their viable ontogenetic sisters, these being the cells of the ICM. While ICM is characterised by the activity of an Old non-transposing endogenous retrovirus (HERVH) that acts to suppress Young transposable elements, the new cell type, by contrast, expresses transpositionally competent Young elements and DNA-damage response genes. As the Young elements are RetroElements and the cells are excluded from the developmental process, we dub these REject cells. With these and ICM being characterised by differential mobile element activities, the human embryo may be a "selection arena" in which one group of cells selectively die, while other less damaged cells persist.


Assuntos
Blastocisto , Elementos de DNA Transponíveis , Humanos , Elementos de DNA Transponíveis/genética , Blastocisto/metabolismo , Embrião de Mamíferos
4.
PLoS Biol ; 20(7): e3001671, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35788750

RESUMO

Many human embryos die in utero owing to an excess or deficit of chromosomes, a phenomenon known as aneuploidy; this is largely a consequence of nondisjunction during maternal meiosis I. Asymmetries of this division render it vulnerable to selfish centromeres that promote their own transmission, these being thought to somehow underpin aneuploidy. In this essay, I suggest that these vulnerabilities provide only half the solution to the enigma. In mammals, as in utero and postnatal provisioning is continuous, the costs of early death are mitigated. With such reproductive compensation, selection can favour a centromere because it induces lethal aneuploidy: if, when taken towards the polar body, it instead kills the embryo via aneuploidy, it gains. The model is consistent with the observation that reduced dosage of a murine drive suppressor induces aneuploidy and with the fact that high aneuploidy rates in vertebrates are seen exclusively in mammals. I propose further tests of this idea. The wastefulness of human reproduction may be a price we pay for nurturing our offspring.


Assuntos
Aneuploidia , Centrômero , Animais , Embrião de Mamíferos , Humanos , Mamíferos , Meiose , Camundongos , Reprodução
5.
PLoS Genet ; 18(2): e1010071, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-35180223

RESUMO

The introduction of frameshifting non-3n indels enables the identification of gene-trait associations. However, it has been hypothesised that recovery of the original reading frame owing to usage of non-canonical splice forms could cause rescue. To date there is very little evidence for organism-level rescue by such a mechanism and it is unknown how commonly indels induce, or are otherwise associated with, frame-restoring splice forms. We perform CRISPR/Cas9 editing of randomly selected loci in rice to investigate these issues. We find that the majority of loci have a frame-restoring isoform. Importantly, three quarters of these isoforms are not seen in the absence of the indels, consistent with indels commonly inducing novel isoforms. This is supported by analysis in the context of NMD knockdowns. We consider in detail the two top rescue candidates, in wax deficient anther 1 (wda1) and brittle culm (bc10), finding that organismal-level rescue in both cases is strong but owing to different splice modification routes. More generally, however, as frame-restoring isoforms are low abundance and possibly too disruptive, such rescue we suggest to be the rare exception, not the rule. Nonetheless, assuming that indels commonly induce frame-restoring isoforms, these results emphasize the need to examine RNA level effects of non-3n indels and suggest that multiple non-3n indels in any given gene are advisable to probe a gene's trait associations.


Assuntos
Oryza , Mutação INDEL/genética , Oryza/genética , Fases de Leitura
6.
PLoS Biol ; 19(3): e3001164, 2021 03.
Artigo em Inglês | MEDLINE | ID: mdl-33750968

RESUMO

In contrast to common meiotic gene conversion, mitotic gene conversion, because it is so rare, is often ignored as a process influencing allelic diversity. We show that if there is a large enough number of premeiotic cell divisions, as seen in many organisms without early germline sequestration, such as plants, this is an unsafe position. From examination of 1.1 million rice plants, we determined that the rate of mitotic gene conversion events, per mitosis, is 2 orders of magnitude lower than the meiotic rate. However, owing to the large number of mitoses between zygote and gamete and because of long mitotic tract lengths, meiotic and mitotic gene conversion can be of approximately equivalent importance in terms of numbers of markers converted from zygote to gamete. This holds even if we assume a low number of premeiotic cell divisions (approximately 40) as witnessed in Arabidopsis. A low mitotic rate associated with long tracts is also seen in yeast, suggesting generality of results. For species with many mitoses between each meiotic event, mitotic gene conversion should not be overlooked.


Assuntos
Conversão Gênica/genética , Variação Genética/genética , Oryza/genética , Plantas/genética , Alelos , Conversão Gênica/fisiologia , Genótipo , Células Germinativas/metabolismo , Meiose/genética , Mitose/genética , Recombinação Genética/genética
7.
PLoS Comput Biol ; 19(10): e1011581, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37878567

RESUMO

In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such "translationally optimal" codons has little to no effect on the resultant protein level. By contrast, consistent with the view that ribosomal initiation is rate limiting, synonymous codon usage following the 5' ATG greatly influences protein levels, at least in part by modifying RNA stability. For the design of bacterial transgenes, for simple codon based in silico inference of protein levels and for understanding selection on synonymous mutations, it would be valuable to computationally determine initiation optimality (IO) scores for codons for any given species. One attractive approach is to characterize the 5' codon enrichment of HEGs compared with the most lowly expressed genes, just as translational optimality scores of codons have been similarly defined employing the full gene body. Here we determine the viability of this approach employing a unique opportunity: for Escherichia coli there is both the most extensive protein abundance data for native genes and a unique large-scale transgene codon randomization experiment enabling objective definition of the 5' codons that cause, rather than just correlate with, high protein abundance (that we equate with initiation optimality, broadly defined). Surprisingly, the 5' ends of native genes that specify highly abundant proteins avoid such initiation optimal codons. We find that this is probably owing to conflicting selection pressures particular to native HEGs, including selection favouring low initiation rates, this potentially enabling high efficiency of ribosomal usage and low noise. While the classical HEG enrichment approach does not work, rendering simple prediction of native protein abundance from 5' codon content futile, we report evidence that initiation optimality scores derived from the transgene experiment may hold relevance for in silico transgene design for a broad spectrum of bacteria.


Assuntos
Escherichia coli , Biossíntese de Proteínas , Escherichia coli/genética , Biossíntese de Proteínas/genética , Códon/genética , Ribossomos/genética , Uso do Códon , Bactérias/genética
9.
Mol Biol Evol ; 39(1)2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34751397

RESUMO

In bacteria stop codons are recognized by one of two class I release factors (RF1) recognizing TAG, RF2 recognizing TGA, and TAA being recognized by both. Variation across bacteria in the relative abundance of RF1 and RF2 is thus hypothesized to select for different TGA/TAG usage. This has been supported by correlations between TAG:TGA ratios and RF1:RF2 ratios across multiple bacterial species, potentially also explaining why TAG usage is approximately constant despite extensive variation in GC content. It is, however, possible that stop codon trends are determined by other forces and that RF ratios adapt to stop codon usage, rather than vice versa. Here, we determine which direction of the causal arrow is the more parsimonious. Our results support the notion that RF1/RF2 ratios become adapted to stop codon usage as the same trends, notably the anomalous TAG behavior, are seen in contexts where RF1:RF2 ratios cannot be, or are unlikely to be, causative, that is, at 3'untranslated sites never used for translation termination, in intragenomic analyses, and across archaeal species (that possess only one RF1). We conclude that specifics of RF biology are unlikely to fully explain TGA/TAG relative usage. We discuss why the causal relationships for the evolution of synonymous stop codon usage might be different from those affecting synonymous sense codon usage, noting that transitions between TGA and TAG require two-point mutations one of which is likely to be deleterious.


Assuntos
Bactérias , Uso do Códon , Códon de Terminação , Fatores de Terminação de Peptídeos , Bactérias/genética , Composição de Bases , Fatores de Terminação de Peptídeos/genética
10.
Mol Biol Evol ; 39(10)2022 10 07.
Artigo em Inglês | MEDLINE | ID: mdl-36205081

RESUMO

Although new genes can arrive from modes other than duplication, few examples are well characterized. Given high expression in some human brain subregions and a putative link to psychological disorders [e.g., schizophrenia (SCZ)], suggestive of brain functionality, here we characterize piggyBac transposable element-derived 1 (PGBD1). PGBD1 is nonmonotreme mammal-specific and under purifying selection, consistent with functionality. The gene body of human PGBD1 retains much of the original DNA transposon but has additionally captured SCAN and KRAB domains. Despite gene body retention, PGBD1 has lost transposition abilities, thus transposase functionality is absent. PGBD1 no longer recognizes piggyBac transposon-like inverted repeats, nonetheless PGBD1 has DNA binding activity. Genome scale analysis identifies enrichment of binding sites in and around genes involved in neuronal development, with association with both histone activating and repressing marks. We focus on one of the repressed genes, the long noncoding RNA NEAT1, also dysregulated in SCZ, the core structural RNA of paraspeckles. DNA binding assays confirm specific binding of PGBD1 both in the NEAT1 promoter and in the gene body. Depletion of PGBD1 in neuronal progenitor cells (NPCs) results in increased NEAT1/paraspeckles and differentiation. We conclude that PGBD1 has evolved core regulatory functionality for the maintenance of NPCs. As paraspeckles are a mammal-specific structure, the results presented here show a rare example of the evolution of a novel gene coupled to the evolution of a contemporaneous new structure.


Assuntos
Elementos de DNA Transponíveis , RNA Longo não Codificante , Animais , Núcleo Celular/genética , Histonas/metabolismo , Humanos , Mamíferos/genética , Mamíferos/metabolismo , Proteínas do Tecido Nervoso , Paraspeckles , RNA Longo não Codificante/metabolismo , Transposases/genética , Transposases/metabolismo
11.
Bioinformatics ; 38(9): 2626-2627, 2022 04 28.
Artigo em Inglês | MEDLINE | ID: mdl-35244144

RESUMO

SUMMARY: Transgene-design is a web application to help design transgenes for use in mammalian studies. It is predicated on the recent discovery that human intronless transgenes and native retrogenes can be expressed very effectively if the GC content at exonic synonymous sites is high. In addition, as exonic splice enhancers resident in intron containing genes may have different utility in intronless genes, these can be reduced or increased in density. Input can be a native gene or a commercially 'optimised' gene. The option to leave in the first intron and to protect or avoid other motifs is also permitted. AVAILABILITY AND IMPLEMENTATION: Transgene-design is based on a ruby for rails platform. The application is available at https://transgene-design.bath.ac.uk. The code is available under GNU General Public License from GitHub (https://github.com/smuehlh/transgenes). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Mamíferos , Software , Animais , Humanos , Íntrons , Éxons , Transgenes , Mutação , Mamíferos/genética
12.
Nucleic Acids Res ; 49(17): 9665-9685, 2021 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-34469537

RESUMO

Transcripts containing premature termination codons (PTCs) can be subject to nonsense-associated alternative splicing (NAS). Two models have been evoked to explain this, scanning and splice motif disruption. The latter postulates that exonic cis motifs, such as exonic splice enhancers (ESEs), are disrupted by nonsense mutations. We employ genome-wide transcriptomic and k-mer enrichment methods to scrutinize this model. First, we show that ESEs are prone to disruptive nonsense mutations owing to their purine richness and paucity of TGA, TAA and TAG. The motif model correctly predicts that NAS rates should be low (we estimate 5-30%) and approximately in line with estimates for the rate at which random point mutations disrupt splicing (8-20%). Further, we find that, as expected, NAS-associated PTCs are predictable from nucleotide-based machine learning approaches to predict splice disruption and, at least for pathogenic variants, are enriched in ESEs. Finally, we find that both in and out of frame mutations to TAA, TGA or TAG are associated with exon skipping. While a higher relative frequency of such skip-inducing mutations in-frame than out of frame lends some credence to the scanning model, these results reinforce the importance of considering splice motif modulation to understand the etiology of PTC-associated disease.


Assuntos
Processamento Alternativo , Códon sem Sentido , Sequências Reguladoras de Ácido Ribonucleico , Códon de Terminação , Doença/genética , Éxons , Células HEK293 , Células HeLa , Humanos , Degradação do RNAm Mediada por Códon sem Sentido , Motivos de Nucleotídeos , Nucleotídeos/química
13.
Mol Biol Evol ; 38(1): 244-262, 2021 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-32797190

RESUMO

In correctly predicting that selection efficiency is positively correlated with the effective population size (Ne), the nearly neutral theory provides a coherent understanding of between-species variation in numerous genomic parameters, including heritable error (germline mutation) rates. Does the same theory also explain variation in phenotypic error rates and in abundance of error mitigation mechanisms? Translational read-through provides a model to investigate both issues as it is common, mostly nonadaptive, and has good proxy for rate (TAA being the least leaky stop codon) and potential error mitigation via "fail-safe" 3' additional stop codons (ASCs). Prior theory of translational read-through has suggested that when population sizes are high, weak selection for local mitigation can be effective thus predicting a positive correlation between ASC enrichment and Ne. Contra to prediction, we find that ASC enrichment is not correlated with Ne. ASC enrichment, although highly phylogenetically patchy, is, however, more common both in unicellular species and in genes expressed in unicellular modes in multicellular species. By contrast, Ne does positively correlate with TAA enrichment. These results imply that local phenotypic error rates, not local mitigation rates, are consistent with a drift barrier/nearly neutral model.


Assuntos
Códon de Terminação , Evolução Molecular , Taxa de Mutação , Seleção Genética , Arabidopsis , Dictyostelium , Densidade Demográfica
14.
Mol Biol Evol ; 38(8): 3247-3266, 2021 07 29.
Artigo em Inglês | MEDLINE | ID: mdl-33871580

RESUMO

Alternative synonymous codons are often used at unequal frequencies. Classically, studies of such codon usage bias (CUB) attempted to separate the impact of neutral from selective forces by assuming that deviations from a predicted neutral equilibrium capture selection. However, GC-biased gene conversion (gBGC) can also cause deviation from a neutral null. Alternatively, selection has been inferred from CUB in highly expressed genes, but the accuracy of this approach has not been extensively tested, and gBGC can interfere with such extrapolations (e.g., if expression and gene conversion rates covary). It is therefore critical to examine deviations from a mutational null in a species with no gBGC. To achieve this goal, we implement such an analysis in the highly AT rich genome of Dictyostelium discoideum, where we find no evidence of gBGC. We infer neutral CUB under mutational equilibrium to quantify "adaptive codon preference," a nontautologous genome wide quantitative measure of the relative selection strength driving CUB. We observe signatures of purifying selection consistent with selection favoring adaptive codon preference. Preferred codons are not GC rich, underscoring the independence from gBGC. Expression-associated "preference" largely matches adaptive codon preference but does not wholly capture the influence of selection shaping patterns across all genes, suggesting selective constraints associated specifically with high expression. We observe patterns consistent with effects on mRNA translation and stability shaping adaptive codon preference. Thus, our approach to quantifying adaptive codon preference provides a framework for inferring the sources of selection that shape CUB across different contexts within the genome.


Assuntos
Uso do Códon , Dictyostelium/genética , Seleção Genética , Adaptação Biológica , Composição de Bases , Biossíntese de Proteínas , RNA de Transferência/metabolismo
15.
Mol Biol Evol ; 38(1): 67-83, 2021 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-32687176

RESUMO

Large-scale re-engineering of synonymous sites is a promising strategy to generate vaccines either through synthesis of attenuated viruses or via codon-optimized genes in DNA vaccines. Attenuation typically relies on deoptimization of codon pairs and maximization of CpG dinucleotide frequencies. So as to formulate evolutionarily informed attenuation strategies that aim to force nucleotide usage against the direction favored by selection, here, we examine available whole-genome sequences of SARS-CoV-2 to infer patterns of mutation and selection on synonymous sites. Analysis of mutational profiles indicates a strong mutation bias toward U. In turn, analysis of observed synonymous site composition implicates selection against U. Accounting for dinucleotide effects reinforces this conclusion, observed UU content being a quarter of that expected under neutrality. Possible mechanisms of selection against U mutations include selection for higher expression, for high mRNA stability or lower immunogenicity of viral genes. Consistent with gene-specific selection against CpG dinucleotides, we observe systematic differences of CpG content between SARS-CoV-2 genes. We propose an evolutionarily informed approach to attenuation that, unusually, seeks to increase usage of the already most common synonymous codons. Comparable analysis of H1N1 and Ebola finds that GC3 deviated from neutral equilibrium is not a universal feature, cautioning against generalization of results.


Assuntos
Vacinas contra COVID-19/genética , COVID-19/genética , Genoma Viral , Mutação , SARS-CoV-2/genética , Seleção Genética , COVID-19/prevenção & controle , Humanos , Estabilidade de RNA/genética , RNA Mensageiro/genética , RNA Viral/genética , Uracila
16.
PLoS Biol ; 17(4): e3000191, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30964866

RESUMO

Given the disposability of somatic tissue, selection can favor a higher mutation rate in the early segregating soma than in germline, as seen in some animals. Although in plants intra-organismic mutation rate heterogeneity is poorly resolved, the same selectionist logic can predict a lower rate in shoot than in root and in longer-lived terminal tissues (e.g., leaves) than in ontogenetically similar short-lived ones (e.g., petals), and that mutation rate heterogeneity should be deterministic with no significant differences between biological replicates. To address these expectations, we sequenced 754 genomes from various tissues of eight plant species. Consistent with a selectionist model, the rate of mutation accumulation per unit time in shoot apical meristem is lower than that in root apical tissues in perennials, in which a high proportion of mutations in shoots are themselves transmissible, but not in annuals, in which somatic mutations tend not to be transmissible. Similarly, the number of mutations accumulated in leaves is commonly lower than that within a petal of the same plant, and there is no more heterogeneity in accumulation rates between replicate branches than expected by chance. High mutation accumulation in runners of strawberry is, we argue, the exception that proves the rule, as mutation transmission patterns indicate that runner has a restricted germline. However, we also find that in vitro callus tissue has a higher mutation rate (per unit time) than the wild-grown comparator, suggesting nonadaptive mutational "fragility". As mutational fragility does not obviously explain why the shoot-root difference varies with plant longevity, we conclude that some mutation rate variation between tissues is consistent with selectionist theory but that a mechanistic null of mutational fragility should be considered.


Assuntos
Regulação da Expressão Gênica de Plantas/genética , Taxa de Mutação , Plantas/genética , Genes de Plantas/genética , Células Germinativas , Mutação/genética , Acúmulo de Mutações , Folhas de Planta/genética , Raízes de Plantas/genética , Brotos de Planta/genética
17.
Nat Rev Genet ; 17(9): 567-78, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27452112

RESUMO

The study of de novo protein-coding genes is maturing from the ad hoc reporting of individual cases to the systematic analysis of extensive genomic data from several species. We identify three key challenges for this emerging field: understanding how best to identify de novo genes, how they arise and why they spread. We highlight the intellectual challenges of understanding how a de novo gene becomes integrated into pre-existing functions and becomes essential. We suggest that, as with protein sequence evolution, antagonistic co-evolution may be key to de novo gene evolution, particularly for new essential genes and new cancer-associated genes.


Assuntos
Evolução Molecular , Genômica/métodos , Fases de Leitura Aberta/genética , Proteínas/genética , Humanos , Filogenia , Proteínas/metabolismo
18.
PLoS Genet ; 15(9): e1008386, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31527909

RESUMO

Errors throughout gene expression are likely deleterious, hence genomes are under selection to ameliorate their consequences. Additional stop codons (ASCs) are in-frame nonsense 'codons' downstream of the primary stop which may be read by translational machinery should the primary stop have been accidentally read through. Prior evidence in several eukaryotes suggests that ASCs are selected to prevent potentially-deleterious consequences of read-through. We extend this evidence showing that enrichment of ASCs is common but not universal for single cell eukaryotes. By contrast, there is limited evidence as to whether the same is true in other taxa. Here, we provide the first systematic test of the hypothesis that ASCs act as a fail-safe mechanism in eubacteria, a group with high read-through rates. Contra to the predictions of the hypothesis we find: there is paucity, not enrichment, of ASCs downstream; substitutions that degrade stops are more frequent in-frame than out-of-frame in 3' sequence; highly expressed genes are no more likely to have ASCs than lowly expressed genes; usage of the leakiest primary stop (TGA) in highly expressed genes does not predict ASC enrichment even compared to usage of non-leaky stops (TAA) in lowly expressed genes, beyond downstream codon +1. Any effect at the codon immediately proximal to the primary stop can be accounted for by a preference for a T/U residue immediately following the stop, although if anything, TT- and TC- starting codons are preferred. We conclude that there is no compelling evidence for ASC selection in eubacteria. This presents an unusual case in which the same error could be solved by the same mechanism in eukaryotes and prokaryotes but is not. We discuss two possible explanations: that, owing to the absence of nonsense mediated decay, bacteria may solve read-through via gene truncation and in eukaryotes certain prion states cause raised read-through rates.


Assuntos
Regiões 3' não Traduzidas/genética , Bactérias/genética , Códon de Terminação/genética , Códon sem Sentido/genética , Eucariotos/genética , Evolução Molecular , Expressão Gênica/genética , Genoma , Degradação do RNAm Mediada por Códon sem Sentido/genética , Fases de Leitura Aberta/genética
19.
Mol Biol Evol ; 37(4): 1148-1164, 2020 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-31841162

RESUMO

Although the constraints on a gene's sequence are often assumed to reflect the functioning of that gene, here we propose transfer selection, a constraint operating on one class of genes transferred to another, mediated by shared binding factors. We show that such transfer can explain an otherwise paradoxical depletion of stop codons in long intergenic noncoding RNAs (lincRNAs). Serine/arginine-rich proteins direct the splicing machinery by binding exonic splice enhancers (ESEs) in immature mRNA. As coding exons cannot contain stop codons in one reading frame, stop codons should be rare within ESEs. We confirm that the stop codon density (SCD) in ESE motifs is low, even accounting for nucleotide biases. Given that serine/arginine-rich proteins binding ESEs also facilitate lincRNA splicing, a low SCD could transfer to lincRNAs. As predicted, multiexon lincRNA exons are depleted in stop codons, a result not explained by open reading frame (ORF) contamination. Consistent with transfer selection, stop codon depletion in lincRNAs is most acute in exonic regions with the highest ESE density, disappears when ESEs are masked, is consistent with stop codon usage skews in ESEs, and is diminished in both single-exon lincRNAs and introns. Owing to low SCD, the maximum lengths of pseudo-ORFs frequently exceed null expectations. This has implications for ORF annotation and the evolution of de novo protein-coding genes from lincRNAs. We conclude that not all constraints operating on genes need be explained by the functioning of the gene but may instead be transferred owing to shared binding factors.


Assuntos
Códon de Terminação , Modelos Genéticos , RNA Longo não Codificante , Sequências Reguladoras de Ácido Ribonucleico , Seleção Genética , Humanos , Fases de Leitura Aberta
20.
Genome Res ; 28(10): 1442-1454, 2018 10.
Artigo em Inglês | MEDLINE | ID: mdl-30143596

RESUMO

What proportion of coding sequence nucleotides have roles in splicing, and how strong is the selection that maintains them? Despite a large body of research into exonic splice regulatory signals, these questions have not been answered. This is because, to our knowledge, previous investigations have not explicitly disentangled the frequency of splice regulatory elements from the strength of the evolutionary constraint under which they evolve. Current data are consistent both with a scenario of weak and diffuse constraint, enveloping large swaths of sequence, as well as with well-defined pockets of strong purifying selection. In the former case, natural selection on exonic splice enhancers (ESEs) might primarily act as a slight modifier of codon usage bias. In the latter, mutations that disrupt ESEs are likely to have large fitness and, potentially, clinical effects. To distinguish between these scenarios, we used several different methods to determine the distribution of selection coefficients for new mutations within ESEs. The analyses converged to suggest that ∼15%-20% of fourfold degenerate sites are part of functional ESEs. Most of these sites are under strong evolutionary constraint. Therefore, exonic splice regulation does not simply impose a weak bias that gently nudges coding sequence evolution in a particular direction. Rather, the selection to preserve these motifs is a strong force that severely constrains the evolution of a substantial proportion of coding nucleotides. Thus synonymous mutations that disrupt ESEs should be considered as a potentially common cause of single-locus genetic disorders.


Assuntos
Sítios de Splice de RNA , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Mutação Silenciosa , Elementos Facilitadores Genéticos , Evolução Molecular , Éxons , Regulação da Expressão Gênica , Humanos , Polimorfismo Genético , Splicing de RNA , Seleção Genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA