Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 185
Filtrar
1.
mBio ; : e0058124, 2024 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-38683013

RESUMO

Recombination of short DNA fragments via horizontal gene transfer (HGT) can introduce beneficial alleles, create genomic disharmony through negative epistasis, and create adaptive gene combinations through positive epistasis. For non-core (accessory) genes, the negative epistatic cost is likely to be minimal because the incoming genes have not co-evolved with the recipient genome and are frequently observed as tightly linked cassettes with major effects. By contrast, interspecific recombination in the core genome is expected to be rare because disruptive allelic replacement is likely to introduce negative epistasis. Why then is homologous recombination common in the core of bacterial genomes? To understand this enigma, we take advantage of an exceptional model system, the common enteric pathogens Campylobacter jejuni and C. coli that are known for very high magnitude interspecies gene flow in the core genome. As expected, HGT does indeed disrupt co-adapted allele pairings, indirect evidence of negative epistasis. However, multiple HGT events enable recovery of the genome's co-adaption between introgressing alleles, even in core metabolism genes (e.g., formate dehydrogenase). These findings demonstrate that, even for complex traits, genetic coalitions can be decoupled, transferred, and independently reinstated in a new genetic background-facilitating transition between fitness peaks. In this example, the two-step recombinational process is associated with C. coli that are adapted to the agricultural niche.IMPORTANCEGenetic exchange among bacteria shapes the microbial world. From the acquisition of antimicrobial resistance genes to fundamental questions about the nature of bacterial species, this powerful evolutionary force has preoccupied scientists for decades. However, the mixing of genes between species rests on a paradox: 0n one hand, promoting adaptation by conferring novel functionality; on the other, potentially introducing disharmonious gene combinations (negative epistasis) that will be selected against. Taking an interdisciplinary approach to analyze natural populations of the enteric bacteria Campylobacter, an ideal example of long-range admixture, we demonstrate that genes can independently transfer across species boundaries and rejoin in functional networks in a recipient genome. The positive impact of two-gene interactions appears to be adaptive by expanding metabolic capacity and facilitating niche shifts through interspecific hybridization. This challenges conventional ideas and highlights the possibility of multiple-step evolution of multi-gene traits by interspecific introgression.

2.
Nat Rev Genet ; 2024 Jan 31.
Artigo em Inglês | MEDLINE | ID: mdl-38297070

RESUMO

Although translational selection to favour codons that match the most abundant tRNAs is not readily observed in humans, there is nonetheless selection in humans on synonymous mutations. We hypothesize that much of this synonymous site selection can be explained in terms of protection against unwanted RNAs - spurious transcripts, mis-spliced forms or RNAs derived from transposable elements or viruses. We propose not only that selection on synonymous sites functions to reduce the rate of creation of unwanted transcripts (for example, through selection on exonic splice enhancers and cryptic splice sites) but also that high-GC content (but low-CpG content), together with intron presence and position, is both particular to functional native mRNAs and used to recognize transcripts as native. In support of this hypothesis, transcription, nuclear export, liquid phase condensation and RNA degradation have all recently been shown to promote GC-rich transcripts and suppress AU/CpG-rich ones. With such 'traps' being set against AU/CpG-rich transcripts, the codon usage of native genes has, in turn, evolved to avoid such suppression. That parallel filters against AU/CpG-rich transcripts also affect the endosomal import of RNAs further supports the unwanted transcript hypothesis of synonymous site selection and explains the similar design rules that have enabled the successful use of transgenes and RNA vaccines.

3.
PLoS Comput Biol ; 19(10): e1011581, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37878567

RESUMO

In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such "translationally optimal" codons has little to no effect on the resultant protein level. By contrast, consistent with the view that ribosomal initiation is rate limiting, synonymous codon usage following the 5' ATG greatly influences protein levels, at least in part by modifying RNA stability. For the design of bacterial transgenes, for simple codon based in silico inference of protein levels and for understanding selection on synonymous mutations, it would be valuable to computationally determine initiation optimality (IO) scores for codons for any given species. One attractive approach is to characterize the 5' codon enrichment of HEGs compared with the most lowly expressed genes, just as translational optimality scores of codons have been similarly defined employing the full gene body. Here we determine the viability of this approach employing a unique opportunity: for Escherichia coli there is both the most extensive protein abundance data for native genes and a unique large-scale transgene codon randomization experiment enabling objective definition of the 5' codons that cause, rather than just correlate with, high protein abundance (that we equate with initiation optimality, broadly defined). Surprisingly, the 5' ends of native genes that specify highly abundant proteins avoid such initiation optimal codons. We find that this is probably owing to conflicting selection pressures particular to native HEGs, including selection favouring low initiation rates, this potentially enabling high efficiency of ribosomal usage and low noise. While the classical HEG enrichment approach does not work, rendering simple prediction of native protein abundance from 5' codon content futile, we report evidence that initiation optimality scores derived from the transgene experiment may hold relevance for in silico transgene design for a broad spectrum of bacteria.


Assuntos
Escherichia coli , Biossíntese de Proteínas , Escherichia coli/genética , Biossíntese de Proteínas/genética , Códon/genética , Ribossomos/genética , Uso do Códon , Bactérias/genética
5.
J Clin Invest ; 133(14)2023 07 17.
Artigo em Inglês | MEDLINE | ID: mdl-37463444

RESUMO

Cell senescence suppresses tumors by arresting cells at risk of becoming malignant. However, this process in turn can affect the microenvironment, leading to acquisition of a senescence-associated secretory phenotype (SASP) that renders senescent cells proinflammatory and results in tumor progression. But how is SASP controlled? In this issue of the JCI, Attig and Pape et al. describe the role of chimeric calbindin 1 (CALB1) transcripts, which are driven by an upstream human endogenous retrovirus subfamily H (HERVH) element. The authors propose that in lung squamous cell carcinoma (LUSC), HERVH-driven isoforms of calbindin (HERVH-CALB1) counteract SASP. As an alternative promoter, HERVH drove calbindin isoforms that prevented cancer cell senescence and associated inflammation, which was associated with better patient survival. We comment on the similarities between HERVH-CALB1-related cellular fitness in cancer and early embryogenesis and discuss the potential benefits of HERVH-driven chimeric transcripts.


Assuntos
Carcinoma de Células Escamosas , Retrovirus Endógenos , Humanos , Carcinoma de Células Escamosas/genética , Carcinoma de Células Escamosas/patologia , Senescência Celular/genética , Retrovirus Endógenos/genética , Fenótipo , Microambiente Tumoral
6.
PLoS Biol ; 21(6): e3002162, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37339119

RESUMO

There remains much that we do not understand about the earliest stages of human development. On a gross level, there is evidence for apoptosis, but the nature of the affected cell types is unknown. Perhaps most importantly, the inner cell mass (ICM), from which the foetus is derived and hence of interest in reproductive health and regenerative medicine, has proven hard to define. Here, we provide a multi-method analysis of the early human embryo to resolve these issues. Single-cell analysis (on multiple independent datasets), supported by embryo visualisation, uncovers a common previously uncharacterised class of cells lacking commitment markers that segregates after embryonic gene activation (EGA) and shortly after undergo apoptosis. The discovery of this cell type allows us to clearly define their viable ontogenetic sisters, these being the cells of the ICM. While ICM is characterised by the activity of an Old non-transposing endogenous retrovirus (HERVH) that acts to suppress Young transposable elements, the new cell type, by contrast, expresses transpositionally competent Young elements and DNA-damage response genes. As the Young elements are RetroElements and the cells are excluded from the developmental process, we dub these REject cells. With these and ICM being characterised by differential mobile element activities, the human embryo may be a "selection arena" in which one group of cells selectively die, while other less damaged cells persist.


Assuntos
Blastocisto , Elementos de DNA Transponíveis , Humanos , Elementos de DNA Transponíveis/genética , Blastocisto/metabolismo , Embrião de Mamíferos
7.
PLoS One ; 18(3): e0278169, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36952552

RESUMO

While attempts to promote acceptance of well-evidenced science have historically focused on increasing scientific knowledge, it is now thought that for acceptance of science, trust in, rather than simply knowledge of, science is foundational. Here we employ the COVID-19 pandemic as a natural experiment on trust modulation as it has enabled unprecedented exposure of science. We ask whether trust in science has on the average altered, whether trust has changed the same way for all and, if people have responded differently, what predicts these differences? We 1) categorize the nature of self-reported change in trust in "scientists" in a random sample of over 2000 UK adults after the introduction of the first COVID vaccines, 2) ask whether any reported change is likely to be real through consideration of both a negative control and through experiment, and 3) address what predicts change in trust considering sex, educational attainment, religiosity, political attitude, age and pre-pandemic reported trust. We find that many more (33%) report increased trust towards "scientists" than report decreased trust (7%), effects of this magnitude not being seen in negative controls. Only age and prior degree of trust predict change in trust, the older population increasing trust more. The prior degree of trust effect is such that those who say they did not trust science prior to the pandemic are more likely to report becoming less trusting, indicative of both trust polarization and a backfire effect. Since change in trust is predictive of willingness to have a COVID-19 vaccine, it is likely that these changes have public health consequences.


Assuntos
Sucesso Acadêmico , COVID-19 , Adulto , Humanos , COVID-19/epidemiologia , COVID-19/prevenção & controle , Vacinas contra COVID-19 , Pandemias , Autorrelato
8.
PLoS Biol ; 21(1): e3001915, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36693040

RESUMO

People differ greatly in their attitudes towards well-evidenced science. What characterises this variation? Here, we consider this issue in the context of genetics and allied sciences. While most prior research has focused on the relationship between attitude to science and what people know about it, recent evidence suggests that individuals with strongly negative attitudes towards specific genetic technologies (genetic modification (GM) technology and vaccines) commonly do not objectively understand the science, but, importantly, believe that they do. Here, using data from a probability survey of United Kingdom adults, we extend this prior work in 2 regards. First, we ask whether people with more extreme attitudes, be they positive or negative, are more likely to believe that they understand the science. Second, as negativity to genetics is commonly framed around issues particular to specific technologies, we ask whether attitudinal trends are contingent on specification of technology. We find (1) that individuals with strongly positive or negative attitudes towards genetics more strongly believe that they well understand the science; but (2) only for those most positive to the science is this self-confidence warranted; and (3) these effects are not contingent on specification of any particular technologies. These results suggest a potentially general model to explain why people differ in their degree of acceptance or rejection of science, this being that the more someone believes they understand the science, the more confident they will be in their acceptance or rejection of it. While there are more technology nonspecific opponents who also oppose GM technology than expected by chance, most GM opponents fit a different demographic. For the most part, opposition to GM appears not to reflect a smokescreen concealing a broader underlying negativity.


Assuntos
Atitude , Tecnologia , Adulto , Humanos , Reino Unido , Inquéritos e Questionários
9.
Mol Biol Evol ; 39(10)2022 10 07.
Artigo em Inglês | MEDLINE | ID: mdl-36205081

RESUMO

Although new genes can arrive from modes other than duplication, few examples are well characterized. Given high expression in some human brain subregions and a putative link to psychological disorders [e.g., schizophrenia (SCZ)], suggestive of brain functionality, here we characterize piggyBac transposable element-derived 1 (PGBD1). PGBD1 is nonmonotreme mammal-specific and under purifying selection, consistent with functionality. The gene body of human PGBD1 retains much of the original DNA transposon but has additionally captured SCAN and KRAB domains. Despite gene body retention, PGBD1 has lost transposition abilities, thus transposase functionality is absent. PGBD1 no longer recognizes piggyBac transposon-like inverted repeats, nonetheless PGBD1 has DNA binding activity. Genome scale analysis identifies enrichment of binding sites in and around genes involved in neuronal development, with association with both histone activating and repressing marks. We focus on one of the repressed genes, the long noncoding RNA NEAT1, also dysregulated in SCZ, the core structural RNA of paraspeckles. DNA binding assays confirm specific binding of PGBD1 both in the NEAT1 promoter and in the gene body. Depletion of PGBD1 in neuronal progenitor cells (NPCs) results in increased NEAT1/paraspeckles and differentiation. We conclude that PGBD1 has evolved core regulatory functionality for the maintenance of NPCs. As paraspeckles are a mammal-specific structure, the results presented here show a rare example of the evolution of a novel gene coupled to the evolution of a contemporaneous new structure.


Assuntos
Elementos de DNA Transponíveis , RNA Longo não Codificante , Animais , Núcleo Celular/genética , Histonas/metabolismo , Humanos , Mamíferos/genética , Mamíferos/metabolismo , Proteínas do Tecido Nervoso , Paraspeckles , RNA Longo não Codificante/metabolismo , Transposases/genética , Transposases/metabolismo
10.
PLoS Biol ; 20(7): e3001671, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35788750

RESUMO

Many human embryos die in utero owing to an excess or deficit of chromosomes, a phenomenon known as aneuploidy; this is largely a consequence of nondisjunction during maternal meiosis I. Asymmetries of this division render it vulnerable to selfish centromeres that promote their own transmission, these being thought to somehow underpin aneuploidy. In this essay, I suggest that these vulnerabilities provide only half the solution to the enigma. In mammals, as in utero and postnatal provisioning is continuous, the costs of early death are mitigated. With such reproductive compensation, selection can favour a centromere because it induces lethal aneuploidy: if, when taken towards the polar body, it instead kills the embryo via aneuploidy, it gains. The model is consistent with the observation that reduced dosage of a murine drive suppressor induces aneuploidy and with the fact that high aneuploidy rates in vertebrates are seen exclusively in mammals. I propose further tests of this idea. The wastefulness of human reproduction may be a price we pay for nurturing our offspring.


Assuntos
Aneuploidia , Centrômero , Animais , Embrião de Mamíferos , Humanos , Mamíferos , Meiose , Camundongos , Reprodução
11.
Cell Stem Cell ; 29(7): 1031-1050.e12, 2022 07 07.
Artigo em Inglês | MEDLINE | ID: mdl-35803225

RESUMO

Reprogramming of H3K9me3-dependent heterochromatin is required for early development. How H3K9me3 is involved in early human development remains, however, largely unclear. Here, we resolve the temporal landscape of H3K9me3 during human preimplantation development and its regulation for diverse hominoid-specific retrotransposons. At the 8-cell stage, H3K9me3 reprogramming at hominoid-specific retrotransposons termed SINE-VNTR-Alu (SVA) facilitates interaction between certain promoters and SVA-derived enhancers, promoting the zygotic genome activation. In trophectoderm, de novo H3K9me3 domains prevent pluripotent transcription factors from binding to hominoid-specific retrotransposons-derived regulatory elements for inner cell mass (ICM)-specific genes. H3K9me3 re-establishment at SVA elements in the ICM is associated with higher transcription of DNA repair genes, when compared with naive human pluripotent stem cells. Our data demonstrate that species-specific reorganization of H3K9me3-dependent heterochromatin at hominoid-specific retrotransposons plays important roles during early human development, shedding light on how the epigenetic regulation for early development has evolved in mammals.


Assuntos
Heterocromatina , Retroelementos , Elementos Alu , Animais , Desenvolvimento Embrionário/genética , Epigênese Genética , Humanos , Mamíferos , Retroelementos/genética
12.
Genome Biol Evol ; 14(8)2022 08 03.
Artigo em Inglês | MEDLINE | ID: mdl-35867377

RESUMO

Protein coding genes terminate with one of three stop codons (TAA, TGA, or TAG) that, like synonymous codons, are not employed equally. With TGA and TAG having identical nucleotide content, analysis of their differential usage provides an unusual window into the forces operating on what are ostensibly functionally identical residues. Across genomes and between isochores within the human genome, TGA usage increases with G + C content but, with a common G + C → A + T mutation bias, this cannot be explained by mutation bias-drift equilibrium. Increased usage of TGA in G + C-rich genomes or genomic regions is also unlikely to reflect selection for the optimal stop codon, as TAA appears to be universally optimal, probably because it has the lowest read-through rate. Despite TAA being favored by selection and mutation bias, as with codon usage bias G + C pressure is the prime determinant of between-species TGA usage trends. In species with strong G + C-biased gene conversion (gBGC), such as mammals and birds, the high usage and conservation of TGA is best explained by an A + T → G + C repair bias. How to explain TGA enrichment in other G + C-rich genomes is less clear. Enigmatically, across bacterial and archaeal species and between human isochores TAG usage is mostly unresponsive to G + C pressure. This unresponsiveness we dub the TAG paradox as currently no mutational, selective, or gBGC model provides a well-supported explanation. That TAG does increase with G + C usage across eukaryotes makes the usage elsewhere yet more enigmatic. We suggest resolution of the TAG paradox may provide insights into either an unknown but common selective preference (probably at the DNA/RNA level) or an unrecognized complexity to the action of gBGC.


Assuntos
Uso do Códon , Conversão Gênica , Animais , Códon de Terminação , Evolução Molecular , Humanos , Isocoros , Mamíferos/genética , Seleção Genética
13.
Bioinformatics ; 38(9): 2626-2627, 2022 04 28.
Artigo em Inglês | MEDLINE | ID: mdl-35244144

RESUMO

SUMMARY: Transgene-design is a web application to help design transgenes for use in mammalian studies. It is predicated on the recent discovery that human intronless transgenes and native retrogenes can be expressed very effectively if the GC content at exonic synonymous sites is high. In addition, as exonic splice enhancers resident in intron containing genes may have different utility in intronless genes, these can be reduced or increased in density. Input can be a native gene or a commercially 'optimised' gene. The option to leave in the first intron and to protect or avoid other motifs is also permitted. AVAILABILITY AND IMPLEMENTATION: Transgene-design is based on a ruby for rails platform. The application is available at https://transgene-design.bath.ac.uk. The code is available under GNU General Public License from GitHub (https://github.com/smuehlh/transgenes). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Mamíferos , Software , Animais , Humanos , Íntrons , Éxons , Transgenes , Mutação , Mamíferos/genética
14.
PLoS Genet ; 18(2): e1010071, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-35180223

RESUMO

The introduction of frameshifting non-3n indels enables the identification of gene-trait associations. However, it has been hypothesised that recovery of the original reading frame owing to usage of non-canonical splice forms could cause rescue. To date there is very little evidence for organism-level rescue by such a mechanism and it is unknown how commonly indels induce, or are otherwise associated with, frame-restoring splice forms. We perform CRISPR/Cas9 editing of randomly selected loci in rice to investigate these issues. We find that the majority of loci have a frame-restoring isoform. Importantly, three quarters of these isoforms are not seen in the absence of the indels, consistent with indels commonly inducing novel isoforms. This is supported by analysis in the context of NMD knockdowns. We consider in detail the two top rescue candidates, in wax deficient anther 1 (wda1) and brittle culm (bc10), finding that organismal-level rescue in both cases is strong but owing to different splice modification routes. More generally, however, as frame-restoring isoforms are low abundance and possibly too disruptive, such rescue we suggest to be the rare exception, not the rule. Nonetheless, assuming that indels commonly induce frame-restoring isoforms, these results emphasize the need to examine RNA level effects of non-3n indels and suggest that multiple non-3n indels in any given gene are advisable to probe a gene's trait associations.


Assuntos
Oryza , Mutação INDEL/genética , Oryza/genética , Fases de Leitura
15.
Genome Biol Evol ; 14(1)2022 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-35018456

RESUMO

The faster rate of evolution of duplicated genes relative to singletons has been well documented in multiple lineages. This observation has generally been attributed to a presumed release from constraint following creation of a redundant, duplicate copy. However, it is not obvious that the relationship operates in this direction. An alternative possibility-that the faster rate of evolution predates the duplication event and the observed differences result from a higher propensity to duplicate in fast-evolving genes-has been tested in primates and in insects. However, these studies arrived at different conclusions and clarity is needed on whether these contrasting results relate to differences in methodology or legitimate biological differences between the lineages selected. Here, we test whether duplicable genes are faster evolving independent of duplication in the Drosophila lineage and find that our results support the conclusion that faster evolving genes are more likely to duplicate, in agreement with previous work in primates. Our findings indicate that this characteristic of gene duplication is not restricted to a single lineage and has broad implications for the interpretation of the impact of gene duplication. We identify a subset of "singletons" which defy the general trends and appear to be faster evolving. Further investigation implicates homology detection failure and suggests that these may be duplicable genes with unidentifiable paralogs.


Assuntos
Drosophila , Evolução Molecular , Animais , Drosophila/genética , Duplicação Gênica , Genes Duplicados , Primatas/genética
16.
Mol Biol Evol ; 39(1)2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34751397

RESUMO

In bacteria stop codons are recognized by one of two class I release factors (RF1) recognizing TAG, RF2 recognizing TGA, and TAA being recognized by both. Variation across bacteria in the relative abundance of RF1 and RF2 is thus hypothesized to select for different TGA/TAG usage. This has been supported by correlations between TAG:TGA ratios and RF1:RF2 ratios across multiple bacterial species, potentially also explaining why TAG usage is approximately constant despite extensive variation in GC content. It is, however, possible that stop codon trends are determined by other forces and that RF ratios adapt to stop codon usage, rather than vice versa. Here, we determine which direction of the causal arrow is the more parsimonious. Our results support the notion that RF1/RF2 ratios become adapted to stop codon usage as the same trends, notably the anomalous TAG behavior, are seen in contexts where RF1:RF2 ratios cannot be, or are unlikely to be, causative, that is, at 3'untranslated sites never used for translation termination, in intragenomic analyses, and across archaeal species (that possess only one RF1). We conclude that specifics of RF biology are unlikely to fully explain TGA/TAG relative usage. We discuss why the causal relationships for the evolution of synonymous stop codon usage might be different from those affecting synonymous sense codon usage, noting that transitions between TGA and TAG require two-point mutations one of which is likely to be deleterious.


Assuntos
Bactérias , Uso do Códon , Códon de Terminação , Fatores de Terminação de Peptídeos , Bactérias/genética , Composição de Bases , Fatores de Terminação de Peptídeos/genética
18.
Nucleic Acids Res ; 49(17): 9665-9685, 2021 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-34469537

RESUMO

Transcripts containing premature termination codons (PTCs) can be subject to nonsense-associated alternative splicing (NAS). Two models have been evoked to explain this, scanning and splice motif disruption. The latter postulates that exonic cis motifs, such as exonic splice enhancers (ESEs), are disrupted by nonsense mutations. We employ genome-wide transcriptomic and k-mer enrichment methods to scrutinize this model. First, we show that ESEs are prone to disruptive nonsense mutations owing to their purine richness and paucity of TGA, TAA and TAG. The motif model correctly predicts that NAS rates should be low (we estimate 5-30%) and approximately in line with estimates for the rate at which random point mutations disrupt splicing (8-20%). Further, we find that, as expected, NAS-associated PTCs are predictable from nucleotide-based machine learning approaches to predict splice disruption and, at least for pathogenic variants, are enriched in ESEs. Finally, we find that both in and out of frame mutations to TAA, TGA or TAG are associated with exon skipping. While a higher relative frequency of such skip-inducing mutations in-frame than out of frame lends some credence to the scanning model, these results reinforce the importance of considering splice motif modulation to understand the etiology of PTC-associated disease.


Assuntos
Processamento Alternativo , Códon sem Sentido , Sequências Reguladoras de Ácido Ribonucleico , Códon de Terminação , Doença/genética , Éxons , Células HEK293 , Células HeLa , Humanos , Degradação do RNAm Mediada por Códon sem Sentido , Motivos de Nucleotídeos , Nucleotídeos/química
19.
Genome Biol Evol ; 13(10)2021 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-34427640

RESUMO

Owing to a lag between a deleterious mutation's appearance and its selective removal, gold-standard methods for mutation rate estimation assume no meaningful loss of mutations between parents and offspring. Indeed, from analysis of closely related lineages, in SARS-CoV-2, the Ka/Ks ratio was previously estimated as 1.008, suggesting no within-host selection. By contrast, we find a higher number of observed SNPs at 4-fold degenerate sites than elsewhere and, allowing for the virus's complex mutational and compositional biases, estimate that the mutation rate is at least 49-67% higher than would be estimated based on the rate of appearance of variants in sampled genomes. Given the high Ka/Ks one might assume that the majority of such intrahost selection is the purging of nonsense mutations. However, we estimate that selection against nonsense mutations accounts for only ∼10% of all the "missing" mutations. Instead, classical protein-level selective filters (against chemically disparate amino acids and those predicted to disrupt protein functionality) account for many missing mutations. It is less obvious why for an intracellular parasite, amino acid cost parameters, notably amino acid decay rate, is also significant. Perhaps most surprisingly, we also find evidence for real-time selection against synonymous mutations that move codon usage away from that of humans. We conclude that there is common intrahost selection on SARS-CoV-2 that acts on nonsense, missense, and possibly synonymous mutations. This has implications for methods of mutation rate estimation, for determining times to common ancestry and the potential for intrahost evolution including vaccine escape.


Assuntos
COVID-19/virologia , Mutação , SARS-CoV-2/genética , Uso do Códon , Códon sem Sentido , Evolução Molecular , Humanos , Modelos Genéticos , Taxa de Mutação , Mutação de Sentido Incorreto , Polimorfismo de Nucleotídeo Único , Seleção Genética , Mutação Silenciosa
20.
Genome Biol Evol ; 13(9)2021 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-33988683

RESUMO

The nucleotide composition, dinucleotide composition, and codon usage of many viruses differ from their hosts. These differences arise because viruses are subject to unique mutation and selection pressures that do not apply to host genomes; however, the molecular mechanisms that underlie these evolutionary forces are unclear. Here, we analyzed the patterns of codon usage in 1,520 vertebrate-infecting viruses, focusing on parameters known to be under selection and associated with gene regulation. We find that GC content, dinucleotide content, and splicing and m6A modification-related sequence motifs are associated with the type of genetic material (DNA or RNA), strandedness, and replication compartment of viruses. In an experimental follow-up, we find that the effects of GC content on gene expression depend on whether the genetic material is delivered to the cell as DNA or mRNA, whether it is transcribed by endogenous or exogenous RNA polymerase, and whether transcription takes place in the nucleus or cytoplasm. Our results suggest that viral codon usage cannot be explained by a simple adaptation to the codon usage of the host-instead, it reflects the combination of multiple selective and mutational pressures, including the need for efficient transcription, export, and immune evasion.


Assuntos
Uso do Códon , Vírus , Códon/genética , Evolução Molecular , Genoma Viral , Evasão da Resposta Imune , RNA Mensageiro/genética , Vírus/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...