RESUMEN
Large-scale re-engineering of synonymous sites is a promising strategy to generate vaccines either through synthesis of attenuated viruses or via codon-optimized genes in DNA vaccines. Attenuation typically relies on deoptimization of codon pairs and maximization of CpG dinucleotide frequencies. So as to formulate evolutionarily informed attenuation strategies that aim to force nucleotide usage against the direction favored by selection, here, we examine available whole-genome sequences of SARS-CoV-2 to infer patterns of mutation and selection on synonymous sites. Analysis of mutational profiles indicates a strong mutation bias toward U. In turn, analysis of observed synonymous site composition implicates selection against U. Accounting for dinucleotide effects reinforces this conclusion, observed UU content being a quarter of that expected under neutrality. Possible mechanisms of selection against U mutations include selection for higher expression, for high mRNA stability or lower immunogenicity of viral genes. Consistent with gene-specific selection against CpG dinucleotides, we observe systematic differences of CpG content between SARS-CoV-2 genes. We propose an evolutionarily informed approach to attenuation that, unusually, seeks to increase usage of the already most common synonymous codons. Comparable analysis of H1N1 and Ebola finds that GC3 deviated from neutral equilibrium is not a universal feature, cautioning against generalization of results.
Asunto(s)
Vacunas contra la COVID-19/genética , COVID-19/genética , Genoma Viral , Mutación , SARS-CoV-2/genética , Selección Genética , COVID-19/prevención & control , Humanos , Estabilidad del ARN/genética , ARN Mensajero/genética , ARN Viral/genética , UraciloRESUMEN
BACKGROUND: During early development of the nervous system, gene expression patterns are known to vary widely depending on the specific developmental trajectories of different structures. Observable changes in gene expression profiles throughout development are determined by an underlying network of precise regulatory interactions between individual genes. Elucidating the organizing principles that shape this gene regulatory network is one of the central goals of developmental biology. Whether the developmental programme is the result of a dynamic driven by a fixed architecture of regulatory interactions, or alternatively, the result of waves of regulatory reorganization is not known. RESULTS: Here we contrast these two alternative models by examining existing expression data derived from the developing human brain in prenatal and postnatal stages. We reveal a sharp change in gene expression profiles at birth across brain areas. This sharp division between foetal and postnatal profiles is not the result of pronounced changes in level of expression of existing gene networks. Instead we demonstrate that the perinatal transition is marked by the widespread regulatory rearrangement within and across existing gene clusters, leading to the emergence of new functional groups. This rearrangement is itself organized into discrete blocks of genes, each targeted by a distinct set of transcriptional regulators and associated to specific biological functions. CONCLUSIONS: Our results provide evidence of an acute modular reorganization of the regulatory architecture of the brain transcriptome occurring at birth, reflecting the reassembly of new functional associations required for the normal transition from prenatal to postnatal brain development.
Asunto(s)
Encéfalo/crecimiento & desarrollo , Encéfalo/metabolismo , Genes Reguladores , Adolescente , Encéfalo/anatomía & histología , Encéfalo/embriología , Niño , Preescolar , Femenino , Perfilación de la Expresión Génica , Humanos , Lactante , Modelos Biológicos , EmbarazoRESUMEN
Genomic and transcriptomics analyses have revealed human head and body lice to be almost genetically identical; although con-specific, they nevertheless occupy distinct ecological niches and have differing feeding patterns. Most importantly, while head lice are not known to be vector competent, body lice can transmit three serious bacterial diseases; epidemictyphus, trench fever, and relapsing fever. In order to gain insights into the molecular bases for these differences, we analyzed alternative splicing (AS) using next-generation sequencing data for one strain of head lice and one strain of body lice. We identified a total of 3,598 AS events which were head or body lice specific. Exon skipping AS events were overrepresented among both head and body lice, whereas intron retention events were underrepresented in both. However, both the enrichment of exon skipping and the underrepresentation of intron retention are significantly stronger in body lice compared with head lice. Genes containing body louse-specific AS events were found to be significantly enriched for functions associated with development of the nervous system, salivary gland, trachea, and ovarian follicle cells, as well as regulation of transcription. In contrast, no functional categories were overrepresented among genes with head louse-specific AS events. Together, our results constitute the first evidence for transcript pool differences in head and body lice, providing insights into molecular adaptations that enabled human lice to adapt to clothing, and representing a powerful illustration of the pivotal role AS can play in functional adaptation.
Asunto(s)
Empalme Alternativo/genética , Phthiraptera/genética , Animales , Ontología de Genes , Genes de Insecto , Humanos , Pediculus/genéticaRESUMEN
The sequencing of multiple genomes of the same plant species has revealed polymorphic gene and exon loss. Genes associated with disease resistance are overrepresented among those showing structural variations, suggesting an adaptive role for gene and exon presence-absence variation (PAV). To shed light on the possible functional relevance of polymorphic coding region loss and the mechanisms driving this process, we characterized genes that have lost entire exons or their whole coding regions in 17 fully sequenced Arabidopsis thaliana accessions. We found that although a significant enrichment in genes associated with certain functional categories is observed, PAV events are largely restricted to genes with signatures of reduced essentiality: PAV genes tend to be newer additions to the genome, tissue specific, and lowly expressed. In addition, PAV genes are located in regions of lower gene density and higher transposable element density. Partial coding region PAV events were associated with only a marginal reduction in gene expression level in the affected accession and occurred in genes with higher levels of alternative splicing in the Col-0 accession. Together, these results suggest that although adaptive scenarios cannot be ruled out, PAV events can be explained without invoking them.
Asunto(s)
Arabidopsis/genética , Evolución Molecular , Genes de Plantas , Variación Genética , Empalme Alternativo , Elementos Transponibles de ADN , Exones , Expresión Génica , Sistemas de Lectura Abierta , Análisis de Secuencia de ADN , Transducción de SeñalRESUMEN
What at the genomic level underlies organism complexity? Although several genomic features have been associated with organism complexity, in the case of alternative splicing, which has long been proposed to explain the variation in complexity, no such link has been established. Here, we analyzed over 39 million expressed sequence tags available for 47 eukaryotic species with fully sequenced genomes to obtain a comparable index of alternative splicing estimates, which corrects for the distorting effect of a variable number of transcripts per species--an important obstacle for comparative studies of alternative splicing. We find that alternative splicing has steadily increased over the last 1,400 My of eukaryotic evolution and is strongly associated with organism complexity, assayed as the number of cell types. Importantly, this association is not explained as a by-product of covariance between alternative splicing with other variables previously linked to complexity including gene content, protein length, proteome disorder, and protein interactivity. In addition, we found no evidence to suggest that the relationship of alternative splicing to cell type number is explained by drift due to reduced N(e) in more complex species. Taken together, our results firmly establish alternative splicing as a significant predictor of organism complexity and are, in principle, consistent with an important role of transcript diversification through alternative splicing as a means of determining a genome's functional information capacity.
Asunto(s)
Empalme Alternativo , Evolución Biológica , Eucariontes/clasificación , Eucariontes/genética , Animales , Evolución Molecular , Etiquetas de Secuencia Expresada , Humanos , Filogenia , Especificidad de la Especie , TranscriptomaRESUMEN
In female mammals most X-linked genes are subject to X-inactivation. However, in humans some X-linked genes escape silencing, these escapees being candidates for the phenotypic aberrations seen in polyX karyotypes. These escape genes have been reported to be under stronger purifying selection than other X-linked genes. Although it is known that escape from X-inactivation is much more common in humans than in mice, systematic assays of escape in humans have to date employed only interspecies somatic cell hybrids. Here we provide the first systematic next-generation sequencing analysis of escape in a human cell line. We analyzed RNA and genotype sequencing data obtained from B lymphocyte cell lines derived from Europeans (CEU) and Yorubans (YRI). By replicated detection of heterozygosis in the transcriptome, we identified 114 escaping genes, including 76 not previously known to be escapees. The newly described escape genes cluster on the X chromosome in the same chromosomal regions as the previously known escapees. There is an excess of escaping genes associated with mental retardation, consistent with this being a common phenotype of polyX phenotypes. We find both differences between populations and between individuals in the propensity to escape. Indeed, we provide the first evidence for there being both hyper- and hypo-escapee females in the human population, consistent with the highly variable phenotypic presentation of polyX karyotypes. Considering also prior data, we reclassify genes as being always, never, and sometimes escape genes. We fail to replicate the prior claim that genes that escape X-inactivation are under stronger purifying selection than others.
Asunto(s)
Expresión Génica , Genes Ligados a X , Discapacidad Intelectual/genética , Inactivación del Cromosoma X , Animales , Pueblo Asiatico/genética , Línea Celular , Evolución Molecular , Femenino , Variación Genética , Humanos , Masculino , Ratones , Tasa de Mutación , Fenotipo , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ARN , Población Blanca/genética , Cromosoma XRESUMEN
Genomic determinants underlying increased encephalization across mammalian lineages are unknown. Whole genome comparisons have revealed large and frequent changes in the size of gene families, and it has been proposed that these variations could play a major role in shaping morphological and physiological differences among species. Using a genome-wide comparative approach, we examined changes in gene family size (GFS) and degree of encephalization in 39 fully sequenced mammalian species and found a significant over-representation of GFS variations in line with increased encephalization in mammals. We found that this relationship is not accounted for by known correlates of brain size such as maximum lifespan or body size and is not explained by phylogenetic relatedness. Genes involved in chemotaxis, immune regulation and cell signalling-related functions are significantly over-represented among those gene families most highly correlated with encephalization. Genes within these families are prominently expressed in the human brain, particularly the cortex, and organized in co-expression modules that display distinct temporal patterns of expression in the developing cortex. Our results suggest that changes in GFS associated with encephalization represent an evolutionary response to the specific functional requirements underlying increased brain size in mammals.
Asunto(s)
Encéfalo/anatomía & histología , Quimiotaxis/genética , Mamíferos/anatomía & histología , Animales , Evolución Biológica , Evolución Molecular , Genómica , Humanos , Familia de Multigenes , Tamaño de los Órganos , Filogenia , Análisis de Secuencia de ADN , Transducción de Señal/genética , Especificidad de la EspecieRESUMEN
In mammals, sexual size dimorphism often reflects the intensity of sexual selection, yet its connection to genomic evolution remains unexplored. Gene family size evolution can reflect shifts in the relative importance of different molecular functions. Here, we investigate the associate between brain development gene repertoire to sexual size dimorphism using 124 mammalian species. We reveal significant changes in gene family size associations with sexual size dimorphism. High levels of dimorphism correlate with an expansion of gene families enriched in olfactory sensory perception and a contraction of gene families associated with brain development functions, many of which exhibited particularly high expression in the human adult brain. These findings suggest a relationship between intense sexual selection and alterations in gene family size. These insights illustrate the complex interplay between sexual dimorphism, gene family size evolution, and their roles in mammalian brain development and function, offering a valuable understanding of mammalian genome evolution.
Asunto(s)
Encéfalo , Mamíferos , Caracteres Sexuales , Animales , Mamíferos/genética , Encéfalo/crecimiento & desarrollo , Humanos , Femenino , Masculino , Familia de Multigenes , Evolución Molecular , Selección Sexual/genéticaRESUMEN
Importance: The APOE ε2 and APOE ε4 alleles are the strongest protective and risk-increasing, respectively, genetic variants for late-onset Alzheimer disease (AD). However, the mechanisms linking APOE to AD-particularly the apoE protein's role in AD pathogenesis and how this is affected by APOE variants-remain poorly understood. Identifying missense variants in addition to APOE ε2 and APOE ε4 could provide critical new insights, but given the low frequency of additional missense variants, AD genetic cohorts have previously been too small to interrogate this question robustly. Objective: To determine whether rare missense variants on APOE are associated with AD risk. Design, Setting, and Participants: Association with case-control status was tested in a sequenced discovery sample (stage 1) and followed up in several microarray imputed cohorts as well as the UK Biobank whole-exome sequencing resource using a proxy-AD phenotype (stages 2 and 3). This study combined case-control, family-based, population-based, and longitudinal AD-related cohorts that recruited referred and volunteer participants. Stage 1 included 37â¯409 nonunique participants of European or admixed European ancestry, with 11â¯868 individuals with AD and 11â¯934 controls passing analysis inclusion criteria. In stages 2 and 3, 475â¯473 participants were considered across 8 cohorts, of which 84â¯513 individuals with AD and proxy-AD and 328â¯372 controls passed inclusion criteria. Selection criteria were cohort specific, and this study was performed a posteriori on individuals who were genotyped. Among the available genotypes, 76â¯195 were excluded. All data were retrieved between September 2015 and November 2021 and analyzed between April and November 2021. Main Outcomes and Measures: In primary analyses, the AD risk associated with each missense variant was estimated, as appropriate, with either linear mixed-model regression or logistic regression. In secondary analyses, associations were estimated with age at onset using linear mixed-model regression and risk of conversion to AD using competing-risk regression. Results: A total of 544â¯384 participants were analyzed in the primary case-control analysis; 312â¯476 (57.4%) were female, and the mean (SD; range) age was 64.9 (15.2; 40-110) years. Two missense variants were associated with a 2-fold to 3-fold decreased AD risk: APOE ε4 (R251G) (odds ratio, 0.44; 95% CI, 0.33-0.59; P = 4.7 × 10-8) and APOE ε3 (V236E) (odds ratio, 0.37; 95% CI, 0.25-0.56; P = 1.9 × 10-6). Additionally, the cumulative incidence of AD in carriers of these variants was found to grow more slowly with age compared with noncarriers. Conclusions and Relevance: In this genetic association study, a novel variant associated with AD was identified: R251G always coinherited with ε4 on the APOE gene, which mitigates the ε4-associated AD risk. The protective effect of the V236E variant, which is always coinherited with ε3 on the APOE gene, was also confirmed. The location of these variants confirms that the carboxyl-terminal portion of apoE plays an important role in AD pathogenesis. The large risk reductions reported here suggest that protein chemistry and functional assays of these variants should be pursued, as they have the potential to guide drug development targeting APOE.
Asunto(s)
Enfermedad de Alzheimer , Edad de Inicio , Alelos , Enfermedad de Alzheimer/epidemiología , Enfermedad de Alzheimer/genética , Apolipoproteína E2/genética , Apolipoproteína E4/genética , Apolipoproteínas E/genética , Femenino , Genotipo , Humanos , MasculinoRESUMEN
Alzheimer's disease (AD)-related degenerative decline is associated to the presence of amyloid beta (Aß) plaque lesions and neuro fibrillary tangles (NFT). However, the precise molecular mechanisms linking Aß deposition and neurological decline are still unclear. Here we combine genome-wide transcriptional profiling of the insular cortex of 3xTg-AD mice and control littermates from early through to late adulthood (2-14 months of age), with behavioral and biochemical profiling in the same animals to identify transcriptional determinants of functional decline specifically associated to build-up of Aß deposits. Differential expression analysis revealed differentially expressed genes (DEGs) in the cortex long before observed onset of behavioral symptoms in this model. Using behavioral and biochemical data derived from the same mice and samples, we found that down but not up-regulated DEGs show a stronger average association with learning performance than random background genes in control not seen in AD mice. Conversely, these same genes were found to have a stronger association with Aß deposition than background genes in AD but not in control mice, thereby identifying these genes as potential intermediaries between abnormal Aß/NFT deposition and functional decline. Using a complementary approach, gene ontology analysis revealed a highly significant enrichment of learning and memory, associative, memory, and cognitive functions only among down-regulated, but not up-regulated, DEGs. Our results demonstrate wider transcriptional changes triggered by the abnormal deposition of Aß/NFT occurring well before behavioral decline and identify a distinct set of genes specifically associated to abnormal Aß protein deposition and cognitive decline.
RESUMEN
Different cell types have different postmitotic maintenance requirements. Nerve cells, however, are unique in this respect as they need to survive and preserve their functional complexity for the entire lifetime of the organism, and failure at any level of their supporting mechanisms leads to a wide range of neurodegenerative conditions. Whether these differences across tissues arise from the activation of distinct cell type-specific maintenance mechanisms or the differential activation of a common molecular repertoire is not known. To identify the transcriptional signature of postmitotic cellular longevity (PMCL), we compared whole-genome transcriptome data from human tissues ranging in longevity from 120 days to over 70 years and found a set of 81 genes whose expression levels are closely associated with increased cell longevity. Using expression data from 10 independent sources, we found that these genes are more highly coexpressed in longer-living tissues and are enriched in specific biological processes and transcription factor targets compared with randomly selected gene samples. Crucially, we found that PMCL-associated genes are downregulated in the cerebral cortex and substantia nigra of patients with Alzheimer's and Parkinson's disease, respectively, as well as Hutchinson-Gilford progeria-derived fibroblasts, and that this downregulation is specifically linked to their underlying association with cellular longevity. Moreover, we found that sexually dimorphic brain expression of PMCL-associated genes reflects sexual differences in lifespan in humans and macaques. Taken together, our results suggest that PMCL-associated genes are part of a generalized machinery of postmitotic maintenance and functional stability in both neural and non-neural cells and support the notion of a common molecular repertoire differentially engaged in different cell types with different survival requirements.
Asunto(s)
Supervivencia Celular/genética , Mitosis/genética , Neuronas , Transcripción Genética/genética , Animales , Corteza Cerebral/metabolismo , Regulación hacia Abajo , Fibroblastos/patología , Expresión Génica , Estudio de Asociación del Genoma Completo , Humanos , Macaca , Enfermedades Neurodegenerativas/etiología , Enfermedades Neurodegenerativas/genética , Enfermedades Neurodegenerativas/patología , Caracteres Sexuales , Sustancia Negra/metabolismo , Transcriptoma/genéticaRESUMEN
Increased brain size is thought to have played an important role in the evolution of mammals and is a highly variable trait across lineages. Variations in brain size are closely linked to corresponding variations in the size of the neocortex, a distinct mammalian evolutionary innovation. The genomic features that explain and/or accompany variations in the relative size of the neocortex remain unknown. By comparing the genomes of 28 mammalian species, we show that neocortical expansion relative to the rest of the brain is associated with variations in gene family size (GFS) of gene families that are significantly enriched in biological functions associated with chemotaxis, cell-cell signalling and immune response. Importantly, we find that previously reported GFS variations associated with increased brain size are largely accounted for by the stronger link between neocortex expansion and variations in the size of gene families. Moreover, genes within these families are more prominently expressed in the human neocortex during early compared with adult development. These results suggest that changes in GFS underlie morphological adaptations during brain evolution in mammalian lineages.
Asunto(s)
Quimiotaxis/genética , Mamíferos/genética , Familia de Multigenes , Neocórtex/anatomía & histología , Animales , Comunicación Celular , Regulación del Desarrollo de la Expresión Génica , Ontología de Genes , Genómica , Humanos , Mamíferos/anatomía & histología , Neocórtex/crecimiento & desarrollo , Tamaño de los Órganos , FilogeniaRESUMEN
During development, the nervous system (NS) is assembled and sculpted through a concerted series of neurodevelopmental events orchestrated by a complex genetic programme. While neural-specific gene expression plays a critical part in this process, in recent years, a number of immune-related signaling and regulatory components have also been shown to play key physiological roles in the developing and adult NS. While the involvement of individual immune-related signaling components in neural functions may reflect their ubiquitous character, it may also reflect a much wider, as yet undescribed, genetic network of immune-related molecules acting as an intrinsic component of the neural-specific regulatory machinery that ultimately shapes the NS. In order to gain insights into the scale and wider functional organization of immune-related genetic networks in the NS, we examined the large scale pattern of expression of these genes in the brain. Our results show a highly significant correlated expression and transcriptional clustering among immune-related genes in the developing and adult brain, and this correlation was the highest in the brain when compared to muscle, liver, kidney and endothelial cells. We experimentally tested the regulatory clustering of immune system (IS) genes by using microarray expression profiling in cultures of dissociated neurons stimulated with the pro-inflammatory cytokine TNF-alpha, and found a highly significant enrichment of immune system-related genes among the resulting differentially expressed genes. Our findings strongly suggest a coherent recruitment of entire immune-related genetic regulatory modules by the neural-specific genetic programme that shapes the NS.