RESUMO
Mutations in glycosylation pathways, such as N-linked glycosylation, O-linked glycosylation, and GPI anchor synthesis, lead to Congenital Disorders of Glycosylation (CDG). CDG typically present with seizures, hypotonia, and developmental delay but display large clinical variability with symptoms affecting every system in the body. This variability suggests modifier genes might influence the phenotypes. Because of the similar physiology and clinical symptoms, there are likely common genetic modifiers between CDG. Here, we use evolution as a tool to identify common modifiers between CDG and glycosylation genes. Protein glycosylation is evolutionarily conserved from yeast to mammals. Evolutionary rate covariation (ERC) identifies proteins with similar evolutionary rates that indicate shared biological functions and pathways. Using ERC, we identified strong evolutionary rate signatures between proteins in the same and different glycosylation pathways. Genome-wide analysis of proteins showing significant ERC with GPI anchor synthesis proteins revealed strong signatures with ncRNA modification proteins and DNA repair proteins. We also identified strong patterns of ERC based on cellular sub-localization of the GPI anchor synthesis enzymes. Functional testing of the highest scoring candidates validated genetic interactions and identified novel genetic modifiers of CDG genes. ERC analysis of disease genes and biological pathways allows for rapid prioritization of potential genetic modifiers, which can provide a better understanding of disease pathophysiology and novel therapeutic targets.
Assuntos
Defeitos Congênitos da Glicosilação , Evolução Molecular , Glicosilação , Humanos , Defeitos Congênitos da Glicosilação/genética , Defeitos Congênitos da Glicosilação/metabolismo , Animais , Mutação , Glicosilfosfatidilinositóis/metabolismo , Glicosilfosfatidilinositóis/genética , FenótipoRESUMO
Tribe Plantagineae (Plantaginaceae) comprises ~ 270 species in three currently recognized genera (Aragoa, Littorella, Plantago), of which Plantago is most speciose. Plantago plastomes exhibit several atypical features including large inversions, expansions of the inverted repeat, increased repetitiveness, intron losses, and gene-specific increases in substitution rate, but the prevalence of these plastid features among species and subgenera is unknown. To assess phylogenetic relationships and plastomic evolutionary dynamics among Plantagineae genera and Plantago subgenera, we generated 25 complete plastome sequences and compared them with existing plastome sequences from Plantaginaceae. Using whole plastome and partitioned alignments, our phylogenomic analyses provided strong support for relationships among major Plantagineae lineages. General plastid features-including size, GC content, intron content, and indels-provided additional support that reinforced major Plantagineae subdivisions. Plastomes from Plantago subgenera Plantago and Coronopus have synapomorphic expansions and inversions affecting the size and gene order of the inverted repeats, and particular genes near the inversion breakpoints exhibit accelerated nucleotide substitution rates, suggesting localized hypermutation associated with rearrangements. The Littorella plastome lacks functional copies of ndh genes, which may be related to an amphibious lifestyle and partial reliance on CAM photosynthesis.
Assuntos
Evolução Molecular , Genes de Plantas/genética , Genomas de Plastídeos , Mutagênese , NADH Desidrogenase/genética , Filogenia , Plantaginaceae/genética , Fotossíntese , Plantago/genética , Plastídeos/genéticaRESUMO
Identifying genomic elements underlying phenotypic adaptations is an important problem in evolutionary biology. Comparative analyses learning from convergent evolution of traits are gaining momentum in accurately detecting such elements. We previously developed a method for predicting phenotypic associations of genetic elements by contrasting patterns of sequence evolution in species showing a phenotype with those that do not. Using this method, we successfully demonstrated convergent evolutionary rate shifts in genetic elements associated with two phenotypic adaptations, namely the independent subterranean and marine transitions of terrestrial mammalian lineages. Our original method calculates gene-specific rates of evolution on branches of phylogenetic trees using linear regression. These rates represent the extent of sequence divergence on a branch after removing the expected divergence on the branch due to background factors. The rates calculated using this regression analysis exhibit an important statistical limitation, namely heteroscedasticity. We observe that the rates on branches that are longer on average show higher variance, and describe how this problem adversely affects the confidence with which we can make inferences about rate shifts. Using a combination of data transformation and weighted regression, we have developed an updated method that corrects this heteroscedasticity in the rates. We additionally illustrate the improved performance offered by the updated method at robust detection of convergent rate shifts in phylogenetic trees of protein-coding genes across mammals, as well as using simulated tree data sets. Overall, we present an important extension to our evolutionary-rates-based method that performs more robustly and consistently at detecting convergent shifts in evolutionary rates.
Assuntos
Evolução Molecular , Técnicas Genéticas , Algoritmos , Fenótipo , Filogenia , SoftwareRESUMO
MOTIVATION: When different lineages of organisms independently adapt to similar environments, selection often acts repeatedly upon the same genes, leading to signatures of convergent evolutionary rate shifts at these genes. With the increasing availability of genome sequences for organisms displaying a variety of convergent traits, the ability to identify genes with such convergent rate signatures would enable new insights into the molecular basis of these traits. RESULTS: Here we present the R package RERconverge, which tests for association between relative evolutionary rates of genes and the evolution of traits across a phylogeny. RERconverge can perform associations with binary and continuous traits, and it contains tools for visualization and enrichment analyses of association results. AVAILABILITY AND IMPLEMENTATION: RERconverge source code, documentation and a detailed usage walk-through are freely available at https://github.com/nclark-lab/RERconverge. Datasets for mammals, Drosophila and yeast are available at https://bit.ly/2J2QBnj. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Genoma , Software , Animais , Estudo de Associação Genômica Ampla , Fenótipo , FilogeniaRESUMO
Coronavirus disease 2019 (COVID-19) and influenza are respiratory illnesses caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and influenza viruses, respectively. Both diseases share symptoms and clinical risk factors1, but the extent to which these conditions have a common genetic etiology is unknown. This is partly because host genetic risk factors are well characterized for COVID-19 but not for influenza, with the largest published genome-wide association studies for these conditions including >2 million individuals2 and about 1,000 individuals3-6, respectively. Shared genetic risk factors could point to targets to prevent or treat both infections. Through a genetic study of 18,334 cases with a positive test for influenza and 276,295 controls, we show that published COVID-19 risk variants are not associated with influenza. Furthermore, we discovered and replicated an association between influenza infection and noncoding variants in B3GALT5 and ST6GAL1, neither of which was associated with COVID-19. In vitro small interfering RNA knockdown of ST6GAL1-an enzyme that adds sialic acid to the cell surface, which is used for viral entry-reduced influenza infectivity by 57%. These results mirror the observation that variants that downregulate ACE2, the SARS-CoV-2 receptor, protect against COVID-19 (ref. 7). Collectively, these findings highlight downregulation of key cell surface receptors used for viral entry as treatment opportunities to prevent COVID-19 and influenza.
Assuntos
COVID-19 , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Influenza Humana , SARS-CoV-2 , Humanos , Influenza Humana/genética , Influenza Humana/epidemiologia , Influenza Humana/virologia , COVID-19/genética , COVID-19/virologia , Fatores de Risco , SARS-CoV-2/genética , Masculino , Feminino , Polimorfismo de Nucleotídeo Único , Estudos de Casos e Controles , Pessoa de Meia-IdadeRESUMO
Gamete fusion is a critical event of mammalian fertilization. A random one-bead one-compound combinatorial peptide library represented synthetic human egg mimics and identified a previously unidentified ligand as Fc receptor-like 3, named MAIA after the mythological goddess intertwined with JUNO. This immunoglobulin super family receptor was expressed on human oolemma and played a major role during sperm-egg adhesion and fusion. MAIA forms a highly stable interaction with the known IZUMO1/JUNO sperm-egg complex, permitting specific gamete fusion. The complexity of the MAIA isotype may offer a cryptic sexual selection mechanism to avoid genetic incompatibility and achieve favorable fitness outcomes.
RESUMO
Multiple COVID-19 genome-wide association studies (GWASs) have identified reproducible genetic associations indicating that there is a genetic component to susceptibility and severity risk. To complement these studies, we collected deep coronavirus disease 2019 (COVID-19) phenotype data from a survey of 736,723 AncestryDNA research participants. With these data, we defined eight phenotypes related to COVID-19 outcomes: four phenotypes that align with previously studied COVID-19 definitions and four 'expanded' phenotypes that focus on susceptibility given exposure, mild clinical manifestations and an aggregate score of symptom severity. We performed a replication analysis of 12 previously reported COVID-19 genetic associations with all eight phenotypes in a trans-ancestry meta-analysis of AncestryDNA research participants. In this analysis, we show distinct patterns of association at the 12 loci with the eight outcomes that we assessed. We also performed a genome-wide discovery analysis of all eight phenotypes, which did not yield new genome-wide significant loci but did suggest that three of the four 'expanded' COVID-19 phenotypes have enhanced power to capture protective genetic associations relative to the previously studied phenotypes. Thus, we conclude that continued large-scale ascertainment of deep COVID-19 phenotype data would likely represent a boon for COVID-19 therapeutic target identification.
Assuntos
COVID-19 , Estudo de Associação Genômica Ampla , COVID-19/genética , Predisposição Genética para Doença , Humanos , Fenótipo , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) enters human host cells via angiotensin-converting enzyme 2 (ACE2) and causes coronavirus disease 2019 (COVID-19). Here, through a genome-wide association study, we identify a variant (rs190509934, minor allele frequency 0.2-2%) that downregulates ACE2 expression by 37% (P = 2.7 × 10-8) and reduces the risk of SARS-CoV-2 infection by 40% (odds ratio = 0.60, P = 4.5 × 10-13), providing human genetic evidence that ACE2 expression levels influence COVID-19 risk. We also replicate the associations of six previously reported risk variants, of which four were further associated with worse outcomes in individuals infected with the virus (in/near LZTFL1, MHC, DPP9 and IFNAR2). Lastly, we show that common variants define a risk score that is strongly associated with severe disease among cases and modestly improves the prediction of disease severity relative to demographic and clinical factors alone.
Assuntos
COVID-19 , Enzima de Conversão de Angiotensina 2/genética , COVID-19/genética , Estudo de Associação Genômica Ampla , Humanos , Fatores de Risco , SARS-CoV-2/genéticaRESUMO
Although lifespan in mammals varies over 100-fold, the precise evolutionary mechanisms underlying variation in longevity remain unknown. Species-specific genetic changes have been observed in long-lived species including the naked mole-rat, bats, and the bowhead whale, but these adaptations do not generalize to other mammals. We present a novel method to identify associations between rates of protein evolution and continuous phenotypes across the entire mammalian phylogeny. Unlike previous analyses that focused on individual species, we treat absolute and relative longevity as quantitative traits and demonstrate that these lifespan traits affect the evolutionary constraint on hundreds of genes. Specifically, we find that genes related to cell cycle, DNA repair, cell death, the IGF1 pathway, and immunity are under increased evolutionary constraint in large and long-lived mammals. For mammals exceptionally long-lived for their body size, we find increased constraint in inflammation, DNA repair, and NFKB-related pathways. Strikingly, these pathways have considerable overlap with those that have been previously reported to have potentially adaptive changes in single-species studies, and thus would be expected to show decreased constraint in our analysis. This unexpected finding of increased constraint in many longevity-associated pathways underscores the power of our quantitative approach to detect patterns that generalize across the mammalian phylogeny.
Assuntos
Evolução Molecular , Longevidade/genética , Mamíferos/genética , Animais , Gatos , Bovinos , Cricetinae , Cães , Cobaias , Humanos , Camundongos , Filogenia , Coelhos , RatosRESUMO
N-Glycanase 1 (NGLY1) is a cytoplasmic deglycosylating enzyme. Loss-of-function mutations in the NGLY1 gene cause NGLY1 deficiency, which is characterized by developmental delay, seizures, and a lack of sweat and tears. To model the phenotypic variability observed among patients, we crossed a Drosophila model of NGLY1 deficiency onto a panel of genetically diverse strains. The resulting progeny showed a phenotypic spectrum from 0 to 100% lethality. Association analysis on the lethality phenotype, as well as an evolutionary rate covariation analysis, generated lists of modifying genes, providing insight into NGLY1 function and disease. The top association hit was Ncc69 (human NKCC1/2), a conserved ion transporter. Analyses in NGLY1-/- mouse cells demonstrated that NKCC1 has an altered average molecular weight and reduced function. The misregulation of this ion transporter may explain the observed defects in secretory epithelium function in NGLY1 deficiency patients.
Assuntos
Defeitos Congênitos da Glicosilação/metabolismo , Peptídeo-N4-(N-acetil-beta-glucosaminil) Asparagina Amidase/deficiência , Membro 2 da Família 12 de Carreador de Soluto/metabolismo , Animais , Modelos Animais de Doenças , Drosophila melanogaster , Camundongos , Camundongos Knockout , Peptídeo-N4-(N-acetil-beta-glucosaminil) Asparagina Amidase/metabolismo , FenótipoRESUMO
Mammals diversified by colonizing drastically different environments, with each transition yielding numerous molecular changes, including losses of protein function. Though not initially deleterious, these losses could subsequently carry deleterious pleiotropic consequences. We have used phylogenetic methods to identify convergent functional losses across independent marine mammal lineages. In one extreme case, Paraoxonase 1 (PON1) accrued lesions in all marine lineages, while remaining intact in all terrestrial mammals. These lesions coincide with PON1 enzymatic activity loss in marine species' blood plasma. This convergent loss is likely explained by parallel shifts in marine ancestors' lipid metabolism and/or bloodstream oxidative environment affecting PON1's role in fatty acid oxidation. PON1 loss also eliminates marine mammals' main defense against neurotoxicity from specific man-made organophosphorus compounds, implying potential risks in modern environments.
Assuntos
Arildialquilfosfatase/sangue , Arildialquilfosfatase/genética , Cetáceos , Evolução Molecular , Metabolismo dos Lipídeos , Desintoxicação Metabólica Fase I , Compostos Organofosforados/metabolismo , Adaptação Biológica , Animais , Cetáceos/sangue , Cetáceos/classificação , Cetáceos/genética , Exposição Ambiental , Aptidão Genética , Lipoproteínas HDL/metabolismo , Lipoproteínas LDL/metabolismo , Compostos Organofosforados/toxicidade , Oxirredução , Filogenia , Risco , Seleção GenéticaRESUMO
The underground environment imposes unique demands on life that have led subterranean species to evolve specialized traits, many of which evolved convergently. We studied convergence in evolutionary rate in subterranean mammals in order to associate phenotypic evolution with specific genetic regions. We identified a strong excess of vision- and skin-related genes that changed at accelerated rates in the subterranean environment due to relaxed constraint and adaptive evolution. We also demonstrate that ocular-specific transcriptional enhancers were convergently accelerated, whereas enhancers active outside the eye were not. Furthermore, several uncharacterized genes and regulatory sequences demonstrated convergence and thus constitute novel candidate sequences for congenital ocular disorders. The strong evidence of convergence in these species indicates that evolution in this environment is recurrent and predictable and can be used to gain insights into phenotype-genotype relationships.
Assuntos
Adaptação Biológica , Evolução Biológica , Ecossistema , Olho , Mamíferos , Fenômenos Fisiológicos Oculares , AnimaisRESUMO
Robustness and evolvability are highly intertwined properties of biological systems. The relationship between these properties determines how biological systems are able to withstand mutations and show variation in response to them. Computational studies have explored the relationship between these two properties using neutral networks of RNA sequences (genotype) and their secondary structures (phenotype) as a model system. However, these studies have assumed every mutation to a sequence to be equally likely; the differences in the likelihood of the occurrence of various mutations, and the consequence of probabilistic nature of the mutations in such a system have previously been ignored. Associating probabilities to mutations essentially results in the weighting of genotype space. We here perform a comparative analysis of weighted and unweighted neutral networks of RNA sequences, and subsequently explore the relationship between robustness and evolvability. We show that assuming an equal likelihood for all mutations (as in an unweighted network), underestimates robustness and overestimates evolvability of a system. In spite of discarding this assumption, we observe that a negative correlation between sequence (genotype) robustness and sequence evolvability persists, and also that structure (phenotype) robustness promotes structure evolvability, as observed in earlier studies using unweighted networks. We also study the effects of base composition bias on robustness and evolvability. Particularly, we explore the association between robustness and evolvability in a sequence space that is AU-rich-sequences with an AU content of 80% or higher, compared to a normal (unbiased) sequence space. We find that evolvability of both sequences and structures in an AU-rich space is lesser compared to the normal space, and robustness higher. We also observe that AU-rich populations evolving on neutral networks of phenotypes, can access less phenotypic variation compared to normal populations evolving on neutral networks.