Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 185
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Nat Rev Genet ; 25(6): 431-448, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38297070

RESUMEN

Although translational selection to favour codons that match the most abundant tRNAs is not readily observed in humans, there is nonetheless selection in humans on synonymous mutations. We hypothesize that much of this synonymous site selection can be explained in terms of protection against unwanted RNAs - spurious transcripts, mis-spliced forms or RNAs derived from transposable elements or viruses. We propose not only that selection on synonymous sites functions to reduce the rate of creation of unwanted transcripts (for example, through selection on exonic splice enhancers and cryptic splice sites) but also that high-GC content (but low-CpG content), together with intron presence and position, is both particular to functional native mRNAs and used to recognize transcripts as native. In support of this hypothesis, transcription, nuclear export, liquid phase condensation and RNA degradation have all recently been shown to promote GC-rich transcripts and suppress AU/CpG-rich ones. With such 'traps' being set against AU/CpG-rich transcripts, the codon usage of native genes has, in turn, evolved to avoid such suppression. That parallel filters against AU/CpG-rich transcripts also affect the endosomal import of RNAs further supports the unwanted transcript hypothesis of synonymous site selection and explains the similar design rules that have enabled the successful use of transgenes and RNA vaccines.


Asunto(s)
ARN Mensajero , Humanos , ARN Mensajero/genética , ARN Mensajero/metabolismo , Animales , Modelos Genéticos , Mutación Silenciosa , Composición de Base , Selección Genética , Transcripción Genética
2.
PLoS Biol ; 21(1): e3001915, 2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36693040

RESUMEN

People differ greatly in their attitudes towards well-evidenced science. What characterises this variation? Here, we consider this issue in the context of genetics and allied sciences. While most prior research has focused on the relationship between attitude to science and what people know about it, recent evidence suggests that individuals with strongly negative attitudes towards specific genetic technologies (genetic modification (GM) technology and vaccines) commonly do not objectively understand the science, but, importantly, believe that they do. Here, using data from a probability survey of United Kingdom adults, we extend this prior work in 2 regards. First, we ask whether people with more extreme attitudes, be they positive or negative, are more likely to believe that they understand the science. Second, as negativity to genetics is commonly framed around issues particular to specific technologies, we ask whether attitudinal trends are contingent on specification of technology. We find (1) that individuals with strongly positive or negative attitudes towards genetics more strongly believe that they well understand the science; but (2) only for those most positive to the science is this self-confidence warranted; and (3) these effects are not contingent on specification of any particular technologies. These results suggest a potentially general model to explain why people differ in their degree of acceptance or rejection of science, this being that the more someone believes they understand the science, the more confident they will be in their acceptance or rejection of it. While there are more technology nonspecific opponents who also oppose GM technology than expected by chance, most GM opponents fit a different demographic. For the most part, opposition to GM appears not to reflect a smokescreen concealing a broader underlying negativity.


Asunto(s)
Actitud , Tecnología , Adulto , Humanos , Reino Unido , Encuestas y Cuestionarios
3.
PLoS Biol ; 21(6): e3002162, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-37339119

RESUMEN

There remains much that we do not understand about the earliest stages of human development. On a gross level, there is evidence for apoptosis, but the nature of the affected cell types is unknown. Perhaps most importantly, the inner cell mass (ICM), from which the foetus is derived and hence of interest in reproductive health and regenerative medicine, has proven hard to define. Here, we provide a multi-method analysis of the early human embryo to resolve these issues. Single-cell analysis (on multiple independent datasets), supported by embryo visualisation, uncovers a common previously uncharacterised class of cells lacking commitment markers that segregates after embryonic gene activation (EGA) and shortly after undergo apoptosis. The discovery of this cell type allows us to clearly define their viable ontogenetic sisters, these being the cells of the ICM. While ICM is characterised by the activity of an Old non-transposing endogenous retrovirus (HERVH) that acts to suppress Young transposable elements, the new cell type, by contrast, expresses transpositionally competent Young elements and DNA-damage response genes. As the Young elements are RetroElements and the cells are excluded from the developmental process, we dub these REject cells. With these and ICM being characterised by differential mobile element activities, the human embryo may be a "selection arena" in which one group of cells selectively die, while other less damaged cells persist.


Asunto(s)
Blastocisto , Elementos Transponibles de ADN , Humanos , Elementos Transponibles de ADN/genética , Blastocisto/metabolismo , Embrión de Mamíferos
4.
PLoS Biol ; 20(7): e3001671, 2022 07.
Artículo en Inglés | MEDLINE | ID: mdl-35788750

RESUMEN

Many human embryos die in utero owing to an excess or deficit of chromosomes, a phenomenon known as aneuploidy; this is largely a consequence of nondisjunction during maternal meiosis I. Asymmetries of this division render it vulnerable to selfish centromeres that promote their own transmission, these being thought to somehow underpin aneuploidy. In this essay, I suggest that these vulnerabilities provide only half the solution to the enigma. In mammals, as in utero and postnatal provisioning is continuous, the costs of early death are mitigated. With such reproductive compensation, selection can favour a centromere because it induces lethal aneuploidy: if, when taken towards the polar body, it instead kills the embryo via aneuploidy, it gains. The model is consistent with the observation that reduced dosage of a murine drive suppressor induces aneuploidy and with the fact that high aneuploidy rates in vertebrates are seen exclusively in mammals. I propose further tests of this idea. The wastefulness of human reproduction may be a price we pay for nurturing our offspring.


Asunto(s)
Aneuploidia , Centrómero , Animales , Embrión de Mamíferos , Humanos , Mamíferos , Meiosis , Ratones , Reproducción
5.
PLoS Genet ; 18(2): e1010071, 2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-35180223

RESUMEN

The introduction of frameshifting non-3n indels enables the identification of gene-trait associations. However, it has been hypothesised that recovery of the original reading frame owing to usage of non-canonical splice forms could cause rescue. To date there is very little evidence for organism-level rescue by such a mechanism and it is unknown how commonly indels induce, or are otherwise associated with, frame-restoring splice forms. We perform CRISPR/Cas9 editing of randomly selected loci in rice to investigate these issues. We find that the majority of loci have a frame-restoring isoform. Importantly, three quarters of these isoforms are not seen in the absence of the indels, consistent with indels commonly inducing novel isoforms. This is supported by analysis in the context of NMD knockdowns. We consider in detail the two top rescue candidates, in wax deficient anther 1 (wda1) and brittle culm (bc10), finding that organismal-level rescue in both cases is strong but owing to different splice modification routes. More generally, however, as frame-restoring isoforms are low abundance and possibly too disruptive, such rescue we suggest to be the rare exception, not the rule. Nonetheless, assuming that indels commonly induce frame-restoring isoforms, these results emphasize the need to examine RNA level effects of non-3n indels and suggest that multiple non-3n indels in any given gene are advisable to probe a gene's trait associations.


Asunto(s)
Oryza , Mutación INDEL/genética , Oryza/genética , Sistemas de Lectura
6.
PLoS Biol ; 19(3): e3001164, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33750968

RESUMEN

In contrast to common meiotic gene conversion, mitotic gene conversion, because it is so rare, is often ignored as a process influencing allelic diversity. We show that if there is a large enough number of premeiotic cell divisions, as seen in many organisms without early germline sequestration, such as plants, this is an unsafe position. From examination of 1.1 million rice plants, we determined that the rate of mitotic gene conversion events, per mitosis, is 2 orders of magnitude lower than the meiotic rate. However, owing to the large number of mitoses between zygote and gamete and because of long mitotic tract lengths, meiotic and mitotic gene conversion can be of approximately equivalent importance in terms of numbers of markers converted from zygote to gamete. This holds even if we assume a low number of premeiotic cell divisions (approximately 40) as witnessed in Arabidopsis. A low mitotic rate associated with long tracts is also seen in yeast, suggesting generality of results. For species with many mitoses between each meiotic event, mitotic gene conversion should not be overlooked.


Asunto(s)
Conversión Génica/genética , Variación Genética/genética , Oryza/genética , Plantas/genética , Alelos , Conversión Génica/fisiología , Genotipo , Células Germinativas/metabolismo , Meiosis/genética , Mitosis/genética , Recombinación Genética/genética
7.
PLoS Comput Biol ; 19(10): e1011581, 2023 10.
Artículo en Inglés | MEDLINE | ID: mdl-37878567

RESUMEN

In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such "translationally optimal" codons has little to no effect on the resultant protein level. By contrast, consistent with the view that ribosomal initiation is rate limiting, synonymous codon usage following the 5' ATG greatly influences protein levels, at least in part by modifying RNA stability. For the design of bacterial transgenes, for simple codon based in silico inference of protein levels and for understanding selection on synonymous mutations, it would be valuable to computationally determine initiation optimality (IO) scores for codons for any given species. One attractive approach is to characterize the 5' codon enrichment of HEGs compared with the most lowly expressed genes, just as translational optimality scores of codons have been similarly defined employing the full gene body. Here we determine the viability of this approach employing a unique opportunity: for Escherichia coli there is both the most extensive protein abundance data for native genes and a unique large-scale transgene codon randomization experiment enabling objective definition of the 5' codons that cause, rather than just correlate with, high protein abundance (that we equate with initiation optimality, broadly defined). Surprisingly, the 5' ends of native genes that specify highly abundant proteins avoid such initiation optimal codons. We find that this is probably owing to conflicting selection pressures particular to native HEGs, including selection favouring low initiation rates, this potentially enabling high efficiency of ribosomal usage and low noise. While the classical HEG enrichment approach does not work, rendering simple prediction of native protein abundance from 5' codon content futile, we report evidence that initiation optimality scores derived from the transgene experiment may hold relevance for in silico transgene design for a broad spectrum of bacteria.


Asunto(s)
Escherichia coli , Biosíntesis de Proteínas , Escherichia coli/genética , Biosíntesis de Proteínas/genética , Codón/genética , Ribosomas/genética , Uso de Codones , Bacterias/genética
9.
Mol Biol Evol ; 39(1)2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34751397

RESUMEN

In bacteria stop codons are recognized by one of two class I release factors (RF1) recognizing TAG, RF2 recognizing TGA, and TAA being recognized by both. Variation across bacteria in the relative abundance of RF1 and RF2 is thus hypothesized to select for different TGA/TAG usage. This has been supported by correlations between TAG:TGA ratios and RF1:RF2 ratios across multiple bacterial species, potentially also explaining why TAG usage is approximately constant despite extensive variation in GC content. It is, however, possible that stop codon trends are determined by other forces and that RF ratios adapt to stop codon usage, rather than vice versa. Here, we determine which direction of the causal arrow is the more parsimonious. Our results support the notion that RF1/RF2 ratios become adapted to stop codon usage as the same trends, notably the anomalous TAG behavior, are seen in contexts where RF1:RF2 ratios cannot be, or are unlikely to be, causative, that is, at 3'untranslated sites never used for translation termination, in intragenomic analyses, and across archaeal species (that possess only one RF1). We conclude that specifics of RF biology are unlikely to fully explain TGA/TAG relative usage. We discuss why the causal relationships for the evolution of synonymous stop codon usage might be different from those affecting synonymous sense codon usage, noting that transitions between TGA and TAG require two-point mutations one of which is likely to be deleterious.


Asunto(s)
Bacterias , Uso de Codones , Codón de Terminación , Factores de Terminación de Péptidos , Bacterias/genética , Composición de Base , Factores de Terminación de Péptidos/genética
10.
Mol Biol Evol ; 39(10)2022 10 07.
Artículo en Inglés | MEDLINE | ID: mdl-36205081

RESUMEN

Although new genes can arrive from modes other than duplication, few examples are well characterized. Given high expression in some human brain subregions and a putative link to psychological disorders [e.g., schizophrenia (SCZ)], suggestive of brain functionality, here we characterize piggyBac transposable element-derived 1 (PGBD1). PGBD1 is nonmonotreme mammal-specific and under purifying selection, consistent with functionality. The gene body of human PGBD1 retains much of the original DNA transposon but has additionally captured SCAN and KRAB domains. Despite gene body retention, PGBD1 has lost transposition abilities, thus transposase functionality is absent. PGBD1 no longer recognizes piggyBac transposon-like inverted repeats, nonetheless PGBD1 has DNA binding activity. Genome scale analysis identifies enrichment of binding sites in and around genes involved in neuronal development, with association with both histone activating and repressing marks. We focus on one of the repressed genes, the long noncoding RNA NEAT1, also dysregulated in SCZ, the core structural RNA of paraspeckles. DNA binding assays confirm specific binding of PGBD1 both in the NEAT1 promoter and in the gene body. Depletion of PGBD1 in neuronal progenitor cells (NPCs) results in increased NEAT1/paraspeckles and differentiation. We conclude that PGBD1 has evolved core regulatory functionality for the maintenance of NPCs. As paraspeckles are a mammal-specific structure, the results presented here show a rare example of the evolution of a novel gene coupled to the evolution of a contemporaneous new structure.


Asunto(s)
Elementos Transponibles de ADN , ARN Largo no Codificante , Animales , Núcleo Celular/genética , Histonas/metabolismo , Humanos , Mamíferos/genética , Mamíferos/metabolismo , Proteínas del Tejido Nervioso , Paraspeckles , ARN Largo no Codificante/metabolismo , Transposasas/genética , Transposasas/metabolismo
11.
Bioinformatics ; 38(9): 2626-2627, 2022 04 28.
Artículo en Inglés | MEDLINE | ID: mdl-35244144

RESUMEN

SUMMARY: Transgene-design is a web application to help design transgenes for use in mammalian studies. It is predicated on the recent discovery that human intronless transgenes and native retrogenes can be expressed very effectively if the GC content at exonic synonymous sites is high. In addition, as exonic splice enhancers resident in intron containing genes may have different utility in intronless genes, these can be reduced or increased in density. Input can be a native gene or a commercially 'optimised' gene. The option to leave in the first intron and to protect or avoid other motifs is also permitted. AVAILABILITY AND IMPLEMENTATION: Transgene-design is based on a ruby for rails platform. The application is available at https://transgene-design.bath.ac.uk. The code is available under GNU General Public License from GitHub (https://github.com/smuehlh/transgenes). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Mamíferos , Programas Informáticos , Animales , Humanos , Intrones , Exones , Transgenes , Mutación , Mamíferos/genética
12.
Nucleic Acids Res ; 49(17): 9665-9685, 2021 09 27.
Artículo en Inglés | MEDLINE | ID: mdl-34469537

RESUMEN

Transcripts containing premature termination codons (PTCs) can be subject to nonsense-associated alternative splicing (NAS). Two models have been evoked to explain this, scanning and splice motif disruption. The latter postulates that exonic cis motifs, such as exonic splice enhancers (ESEs), are disrupted by nonsense mutations. We employ genome-wide transcriptomic and k-mer enrichment methods to scrutinize this model. First, we show that ESEs are prone to disruptive nonsense mutations owing to their purine richness and paucity of TGA, TAA and TAG. The motif model correctly predicts that NAS rates should be low (we estimate 5-30%) and approximately in line with estimates for the rate at which random point mutations disrupt splicing (8-20%). Further, we find that, as expected, NAS-associated PTCs are predictable from nucleotide-based machine learning approaches to predict splice disruption and, at least for pathogenic variants, are enriched in ESEs. Finally, we find that both in and out of frame mutations to TAA, TGA or TAG are associated with exon skipping. While a higher relative frequency of such skip-inducing mutations in-frame than out of frame lends some credence to the scanning model, these results reinforce the importance of considering splice motif modulation to understand the etiology of PTC-associated disease.


Asunto(s)
Empalme Alternativo , Codón sin Sentido , Secuencias Reguladoras de Ácido Ribonucleico , Codón de Terminación , Enfermedad/genética , Exones , Células HEK293 , Células HeLa , Humanos , Degradación de ARNm Mediada por Codón sin Sentido , Motivos de Nucleótidos , Nucleótidos/química
13.
Mol Biol Evol ; 38(1): 244-262, 2021 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-32797190

RESUMEN

In correctly predicting that selection efficiency is positively correlated with the effective population size (Ne), the nearly neutral theory provides a coherent understanding of between-species variation in numerous genomic parameters, including heritable error (germline mutation) rates. Does the same theory also explain variation in phenotypic error rates and in abundance of error mitigation mechanisms? Translational read-through provides a model to investigate both issues as it is common, mostly nonadaptive, and has good proxy for rate (TAA being the least leaky stop codon) and potential error mitigation via "fail-safe" 3' additional stop codons (ASCs). Prior theory of translational read-through has suggested that when population sizes are high, weak selection for local mitigation can be effective thus predicting a positive correlation between ASC enrichment and Ne. Contra to prediction, we find that ASC enrichment is not correlated with Ne. ASC enrichment, although highly phylogenetically patchy, is, however, more common both in unicellular species and in genes expressed in unicellular modes in multicellular species. By contrast, Ne does positively correlate with TAA enrichment. These results imply that local phenotypic error rates, not local mitigation rates, are consistent with a drift barrier/nearly neutral model.


Asunto(s)
Codón de Terminación , Evolución Molecular , Tasa de Mutación , Selección Genética , Arabidopsis , Dictyostelium , Densidad de Población
14.
Mol Biol Evol ; 38(8): 3247-3266, 2021 07 29.
Artículo en Inglés | MEDLINE | ID: mdl-33871580

RESUMEN

Alternative synonymous codons are often used at unequal frequencies. Classically, studies of such codon usage bias (CUB) attempted to separate the impact of neutral from selective forces by assuming that deviations from a predicted neutral equilibrium capture selection. However, GC-biased gene conversion (gBGC) can also cause deviation from a neutral null. Alternatively, selection has been inferred from CUB in highly expressed genes, but the accuracy of this approach has not been extensively tested, and gBGC can interfere with such extrapolations (e.g., if expression and gene conversion rates covary). It is therefore critical to examine deviations from a mutational null in a species with no gBGC. To achieve this goal, we implement such an analysis in the highly AT rich genome of Dictyostelium discoideum, where we find no evidence of gBGC. We infer neutral CUB under mutational equilibrium to quantify "adaptive codon preference," a nontautologous genome wide quantitative measure of the relative selection strength driving CUB. We observe signatures of purifying selection consistent with selection favoring adaptive codon preference. Preferred codons are not GC rich, underscoring the independence from gBGC. Expression-associated "preference" largely matches adaptive codon preference but does not wholly capture the influence of selection shaping patterns across all genes, suggesting selective constraints associated specifically with high expression. We observe patterns consistent with effects on mRNA translation and stability shaping adaptive codon preference. Thus, our approach to quantifying adaptive codon preference provides a framework for inferring the sources of selection that shape CUB across different contexts within the genome.


Asunto(s)
Uso de Codones , Dictyostelium/genética , Selección Genética , Adaptación Biológica , Composición de Base , Biosíntesis de Proteínas , ARN de Transferencia/metabolismo
15.
Mol Biol Evol ; 38(1): 67-83, 2021 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-32687176

RESUMEN

Large-scale re-engineering of synonymous sites is a promising strategy to generate vaccines either through synthesis of attenuated viruses or via codon-optimized genes in DNA vaccines. Attenuation typically relies on deoptimization of codon pairs and maximization of CpG dinucleotide frequencies. So as to formulate evolutionarily informed attenuation strategies that aim to force nucleotide usage against the direction favored by selection, here, we examine available whole-genome sequences of SARS-CoV-2 to infer patterns of mutation and selection on synonymous sites. Analysis of mutational profiles indicates a strong mutation bias toward U. In turn, analysis of observed synonymous site composition implicates selection against U. Accounting for dinucleotide effects reinforces this conclusion, observed UU content being a quarter of that expected under neutrality. Possible mechanisms of selection against U mutations include selection for higher expression, for high mRNA stability or lower immunogenicity of viral genes. Consistent with gene-specific selection against CpG dinucleotides, we observe systematic differences of CpG content between SARS-CoV-2 genes. We propose an evolutionarily informed approach to attenuation that, unusually, seeks to increase usage of the already most common synonymous codons. Comparable analysis of H1N1 and Ebola finds that GC3 deviated from neutral equilibrium is not a universal feature, cautioning against generalization of results.


Asunto(s)
Vacunas contra la COVID-19/genética , COVID-19/genética , Genoma Viral , Mutación , SARS-CoV-2/genética , Selección Genética , COVID-19/prevención & control , Humanos , Estabilidad del ARN/genética , ARN Mensajero/genética , ARN Viral/genética , Uracilo
16.
PLoS Biol ; 17(4): e3000191, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30964866

RESUMEN

Given the disposability of somatic tissue, selection can favor a higher mutation rate in the early segregating soma than in germline, as seen in some animals. Although in plants intra-organismic mutation rate heterogeneity is poorly resolved, the same selectionist logic can predict a lower rate in shoot than in root and in longer-lived terminal tissues (e.g., leaves) than in ontogenetically similar short-lived ones (e.g., petals), and that mutation rate heterogeneity should be deterministic with no significant differences between biological replicates. To address these expectations, we sequenced 754 genomes from various tissues of eight plant species. Consistent with a selectionist model, the rate of mutation accumulation per unit time in shoot apical meristem is lower than that in root apical tissues in perennials, in which a high proportion of mutations in shoots are themselves transmissible, but not in annuals, in which somatic mutations tend not to be transmissible. Similarly, the number of mutations accumulated in leaves is commonly lower than that within a petal of the same plant, and there is no more heterogeneity in accumulation rates between replicate branches than expected by chance. High mutation accumulation in runners of strawberry is, we argue, the exception that proves the rule, as mutation transmission patterns indicate that runner has a restricted germline. However, we also find that in vitro callus tissue has a higher mutation rate (per unit time) than the wild-grown comparator, suggesting nonadaptive mutational "fragility". As mutational fragility does not obviously explain why the shoot-root difference varies with plant longevity, we conclude that some mutation rate variation between tissues is consistent with selectionist theory but that a mechanistic null of mutational fragility should be considered.


Asunto(s)
Regulación de la Expresión Génica de las Plantas/genética , Tasa de Mutación , Plantas/genética , Genes de Plantas/genética , Células Germinativas , Mutación/genética , Acumulación de Mutaciones , Hojas de la Planta/genética , Raíces de Plantas/genética , Brotes de la Planta/genética
17.
Nat Rev Genet ; 17(9): 567-78, 2016 09.
Artículo en Inglés | MEDLINE | ID: mdl-27452112

RESUMEN

The study of de novo protein-coding genes is maturing from the ad hoc reporting of individual cases to the systematic analysis of extensive genomic data from several species. We identify three key challenges for this emerging field: understanding how best to identify de novo genes, how they arise and why they spread. We highlight the intellectual challenges of understanding how a de novo gene becomes integrated into pre-existing functions and becomes essential. We suggest that, as with protein sequence evolution, antagonistic co-evolution may be key to de novo gene evolution, particularly for new essential genes and new cancer-associated genes.


Asunto(s)
Evolución Molecular , Genómica/métodos , Sistemas de Lectura Abierta/genética , Proteínas/genética , Humanos , Filogenia , Proteínas/metabolismo
18.
PLoS Genet ; 15(9): e1008386, 2019 09.
Artículo en Inglés | MEDLINE | ID: mdl-31527909

RESUMEN

Errors throughout gene expression are likely deleterious, hence genomes are under selection to ameliorate their consequences. Additional stop codons (ASCs) are in-frame nonsense 'codons' downstream of the primary stop which may be read by translational machinery should the primary stop have been accidentally read through. Prior evidence in several eukaryotes suggests that ASCs are selected to prevent potentially-deleterious consequences of read-through. We extend this evidence showing that enrichment of ASCs is common but not universal for single cell eukaryotes. By contrast, there is limited evidence as to whether the same is true in other taxa. Here, we provide the first systematic test of the hypothesis that ASCs act as a fail-safe mechanism in eubacteria, a group with high read-through rates. Contra to the predictions of the hypothesis we find: there is paucity, not enrichment, of ASCs downstream; substitutions that degrade stops are more frequent in-frame than out-of-frame in 3' sequence; highly expressed genes are no more likely to have ASCs than lowly expressed genes; usage of the leakiest primary stop (TGA) in highly expressed genes does not predict ASC enrichment even compared to usage of non-leaky stops (TAA) in lowly expressed genes, beyond downstream codon +1. Any effect at the codon immediately proximal to the primary stop can be accounted for by a preference for a T/U residue immediately following the stop, although if anything, TT- and TC- starting codons are preferred. We conclude that there is no compelling evidence for ASC selection in eubacteria. This presents an unusual case in which the same error could be solved by the same mechanism in eukaryotes and prokaryotes but is not. We discuss two possible explanations: that, owing to the absence of nonsense mediated decay, bacteria may solve read-through via gene truncation and in eukaryotes certain prion states cause raised read-through rates.


Asunto(s)
Regiones no Traducidas 3'/genética , Bacterias/genética , Codón de Terminación/genética , Codón sin Sentido/genética , Eucariontes/genética , Evolución Molecular , Expresión Génica/genética , Genoma , Degradación de ARNm Mediada por Codón sin Sentido/genética , Sistemas de Lectura Abierta/genética
19.
Mol Biol Evol ; 37(4): 1148-1164, 2020 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-31841162

RESUMEN

Although the constraints on a gene's sequence are often assumed to reflect the functioning of that gene, here we propose transfer selection, a constraint operating on one class of genes transferred to another, mediated by shared binding factors. We show that such transfer can explain an otherwise paradoxical depletion of stop codons in long intergenic noncoding RNAs (lincRNAs). Serine/arginine-rich proteins direct the splicing machinery by binding exonic splice enhancers (ESEs) in immature mRNA. As coding exons cannot contain stop codons in one reading frame, stop codons should be rare within ESEs. We confirm that the stop codon density (SCD) in ESE motifs is low, even accounting for nucleotide biases. Given that serine/arginine-rich proteins binding ESEs also facilitate lincRNA splicing, a low SCD could transfer to lincRNAs. As predicted, multiexon lincRNA exons are depleted in stop codons, a result not explained by open reading frame (ORF) contamination. Consistent with transfer selection, stop codon depletion in lincRNAs is most acute in exonic regions with the highest ESE density, disappears when ESEs are masked, is consistent with stop codon usage skews in ESEs, and is diminished in both single-exon lincRNAs and introns. Owing to low SCD, the maximum lengths of pseudo-ORFs frequently exceed null expectations. This has implications for ORF annotation and the evolution of de novo protein-coding genes from lincRNAs. We conclude that not all constraints operating on genes need be explained by the functioning of the gene but may instead be transferred owing to shared binding factors.


Asunto(s)
Codón de Terminación , Modelos Genéticos , ARN Largo no Codificante , Secuencias Reguladoras de Ácido Ribonucleico , Selección Genética , Humanos , Sistemas de Lectura Abierta
20.
Genome Res ; 28(10): 1442-1454, 2018 10.
Artículo en Inglés | MEDLINE | ID: mdl-30143596

RESUMEN

What proportion of coding sequence nucleotides have roles in splicing, and how strong is the selection that maintains them? Despite a large body of research into exonic splice regulatory signals, these questions have not been answered. This is because, to our knowledge, previous investigations have not explicitly disentangled the frequency of splice regulatory elements from the strength of the evolutionary constraint under which they evolve. Current data are consistent both with a scenario of weak and diffuse constraint, enveloping large swaths of sequence, as well as with well-defined pockets of strong purifying selection. In the former case, natural selection on exonic splice enhancers (ESEs) might primarily act as a slight modifier of codon usage bias. In the latter, mutations that disrupt ESEs are likely to have large fitness and, potentially, clinical effects. To distinguish between these scenarios, we used several different methods to determine the distribution of selection coefficients for new mutations within ESEs. The analyses converged to suggest that ∼15%-20% of fourfold degenerate sites are part of functional ESEs. Most of these sites are under strong evolutionary constraint. Therefore, exonic splice regulation does not simply impose a weak bias that gently nudges coding sequence evolution in a particular direction. Rather, the selection to preserve these motifs is a strong force that severely constrains the evolution of a substantial proportion of coding nucleotides. Thus synonymous mutations that disrupt ESEs should be considered as a potentially common cause of single-locus genetic disorders.


Asunto(s)
Sitios de Empalme de ARN , ARN Mensajero/genética , ARN Mensajero/metabolismo , Mutación Silenciosa , Elementos de Facilitación Genéticos , Evolución Molecular , Exones , Regulación de la Expresión Génica , Humanos , Polimorfismo Genético , Empalme del ARN , Selección Genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA