RESUMEN
The accurate and complete assembly of both haplotype sequences of a diploid organism is essential to understanding the role of variation in genome functions, phenotypes and diseases1. Here, using a trio-binning approach, we present a high-quality, diploid reference genome, with both haplotypes assembled independently at the chromosome level, for the common marmoset (Callithrix jacchus), an primate model system that is widely used in biomedical research2,3. The full spectrum of heterozygosity between the two haplotypes involves 1.36% of the genome-much higher than the 0.13% indicated by the standard estimation based on single-nucleotide heterozygosity alone. The de novo mutation rate is 0.43 × 10-8 per site per generation, and the paternal inherited genome acquired twice as many mutations as the maternal. Our diploid assembly enabled us to discover a recent expansion of the sex-differentiation region and unique evolutionary changes in the marmoset Y chromosome. In addition, we identified many genes with signatures of positive selection that might have contributed to the evolution of Callithrix biological features. Brain-related genes were highly conserved between marmosets and humans, although several genes experienced lineage-specific copy number variations or diversifying selection, with implications for the use of marmosets as a model system.
Asunto(s)
Callithrix/genética , Diploidia , Evolución Molecular , Genoma/genética , Genómica/normas , Animales , Investigación Biomédica , Variaciones en el Número de Copia de ADN , Femenino , Mutación de Línea Germinal/genética , Haplotipos/genética , Heterocigoto , Humanos , Mutación INDEL/genética , Masculino , Estándares de Referencia , Selección Genética , Diferenciación Sexual/genética , Cromosoma Y/genéticaRESUMEN
Whether adaptation is limited by the beneficial mutation supply is a long-standing question of evolutionary genetics, which is more generally related to the determination of the adaptive substitution rate and its relationship with species effective population size (Ne) and genetic diversity. Empirical evidence reported so far is equivocal, with some but not all studies supporting a higher adaptive substitution rate in large-Ne than in small-Ne species. We gathered coding sequence polymorphism data and estimated the adaptive amino-acid substitution rate ωa, in 50 species from ten distant groups of animals with markedly different population mutation rate θ. We reveal the existence of a complex, timescale dependent relationship between species adaptive substitution rate and genetic diversity. We find a positive relationship between ωa and θ among closely related species, indicating that adaptation is indeed limited by the mutation supply, but this was only true in relatively low-θ taxa. In contrast, we uncover no significant correlation between ωa and θ at a larger taxonomic scale, suggesting that the proportion of beneficial mutations scales negatively with species' long-term Ne.
Asunto(s)
Adaptación Fisiológica , Tasa de Mutación , Polimorfismo Genético , Animales , Aves/genética , Insectos/genética , Mamíferos/genética , Modelos Genéticos , Moluscos/genética , Sistemas de Lectura Abierta , TiempoRESUMEN
Recombination is expected to affect functional sequence evolution in several ways. On the one hand, recombination is thought to improve the efficiency of multilocus selection by dissipating linkage disequilibrium. On the other hand, natural selection can be counteracted by recombination-associated transmission distorters such as GC-biased gene conversion (gBGC), which tends to promote G and C alleles irrespective of their fitness effect in high-recombining regions. It has been suggested that gBGC might impact coding sequence evolution in vertebrates, and particularly the ratio of nonsynonymous to synonymous substitution rates (dN/dS). However, distinctive gBGC patterns have been reported in mammals and birds, maybe reflecting the documented contrasts in evolutionary dynamics of recombination rate between these two taxa. Here, we explore how recombination and gBGC affect coding sequence evolution in mammals and birds by analyzing proteome-wide data in six species of Galloanserae (fowls) and six species of catarrhine primates. We estimated the dN/dS ratio and rates of adaptive and nonadaptive evolution in bins of genes of increasing recombination rate, separately analyzing AT â GC, GC â AT, and G â C/A â T mutations. We show that in both taxa, recombination and gBGC entail a decrease in dN/dS. Our analysis indicates that recombination enhances the efficiency of purifying selection by lowering Hill-Robertson effects, whereas gBGC leads to an overestimation of the adaptive rate of AT â GC mutations. Finally, we report a mutagenic effect of recombination, which is independent of gBGC.
Asunto(s)
Evolución Molecular , Conversión Génica , Aves de Corral/genética , Primates/genética , AnimalesRESUMEN
Selection on codon usage bias is well documented in a number of microorganisms. Whether codon usage is also generally shaped by natural selection in large organisms, despite their relatively small effective population size (Ne), is unclear. In animals, the population genetics of codon usage bias has only been studied in a handful of model organisms so far, and can be affected by confounding, nonadaptive processes such as GC-biased gene conversion and experimental artefacts. Using population transcriptomics data, we analyzed the relationship between codon usage, gene expression, allele frequency distribution, and recombination rate in 30 nonmodel species of animals, each from a different family, covering a wide range of effective population sizes. We disentangled the effects of translational selection and GC-biased gene conversion on codon usage by separately analyzing GC-conservative and GC-changing mutations. We report evidence for effective translational selection on codon usage in large-Ne species of animals, but not in small-Ne ones, in agreement with the nearly neutral theory of molecular evolution. C- and T-ending codons tend to be preferred over synonymous G- and A-ending ones, for reasons that remain to be determined. In contrast, we uncovered a conspicuous effect of GC-biased gene conversion, which is widespread in animals and the main force determining the fate of ATâGC mutations. Intriguingly, the strength of its effect was uncorrelated with Ne.
Asunto(s)
Codón , Conversión Génica , Insectos/genética , Selección Genética , Mutación Silenciosa , Animales , Composición de Base , Densidad de PoblaciónRESUMEN
Sex chromosomes have different evolutionary properties compared to autosomes due to their hemizygous nature. In particular, recessive mutations are more readily exposed to selection, which can lead to faster rates of molecular evolution. Here, we report patterns of gene expression and molecular evolution for a group of butterflies. First, we improve the completeness of the Heliconius melpomene reference annotation, a neotropical butterfly with a ZW sex determination system. Then, we analyse RNA from male and female whole abdomens and sequence female ovary and gut tissue to identify sex- and tissue-specific gene expression profiles in H. melpomene. Using these expression profiles, we compare (a) sequence divergence and polymorphism; (b) the strength of positive and negative selection; and (c) rates of adaptive evolution, for Z and autosomal genes between two species of Heliconius butterflies, H. melpomene and H. erato. We show that the rate of adaptive substitutions is higher for Z than autosomal genes, but contrary to expectation, it is also higher for male-biased than female-biased genes. Additionally, we find no significant increase in the rate of adaptive evolution or purifying selection on genes expressed in ovary tissue, a heterogametic-specific tissue. Our results contribute to a growing body of literature from other ZW systems that also provide mixed evidence for a fast-Z effect where hemizygosity influences the rate of adaptive substitutions.
Asunto(s)
Adaptación Biológica , Mariposas Diurnas/genética , Evolución Molecular , Selección Genética , Cromosomas Sexuales , Animales , Femenino , Tracto Gastrointestinal/metabolismo , Expresión Génica , Masculino , Ovario/metabolismo , Caracteres Sexuales , TranscriptomaRESUMEN
Estimating the proportion of adaptive substitutions (α) is of primary importance to uncover the determinants of adaptation in comparative genomic studies. Several methods have been proposed to estimate α from patterns polymorphism and divergence in coding sequences. However, estimators of α can be biased when the underlying assumptions are not met. Here we focus on a potential source of bias, i.e. variation through time in the long-term population size (N) of the considered species. We show via simulations that ancient demographic fluctuations can generate severe overestimations of α, and this is irrespective of the recent population history.
Asunto(s)
Adaptación Fisiológica/genética , Evolución Molecular , Densidad de Población , Clima , Simulación por Computador , Genoma , Modelos Genéticos , Tasa de Mutación , Polimorfismo GenéticoRESUMEN
Incomplete lineage sorting (ILS) causes the phylogeny of some parts of the genome to differ from the species tree. In this work, we investigate the frequencies and determinants of ILS in 29 major ancestral nodes across the entire primate phylogeny. We find up to 64% of the genome affected by ILS at individual nodes. We exploit ILS to reconstruct speciation times and ancestral population sizes. Estimated speciation times are much more recent than genomic divergence times and are in good agreement with the fossil record. We show extensive variation of ILS along the genome, mainly driven by recombination but also by the distance to genes, highlighting a major impact of selection on variation along the genome. In many nodes, ILS is reduced more on the X chromosome compared with autosomes than expected under neutrality, which suggests higher impacts of natural selection on the X chromosome. Finally, we show an excess of ILS in genes with immune functions and a deficit of ILS in housekeeping genes. The extensive ILS in primates discovered in this study provides insights into the speciation times, ancestral population sizes, and patterns of natural selection that shape primate evolution.
Asunto(s)
Especiación Genética , Variación Genética , Genoma , Primates , Animales , Genómica , Filogenia , Primates/genéticaRESUMEN
The Y chromosome usually plays a critical role in determining male sex and comprises sequence classes that have experienced unique evolutionary trajectories. Here we generated 19 new primate sex chromosome assemblies, analysed them with 10 existing assemblies and report rapid evolution of the Y chromosome across primates. The pseudoautosomal boundary has shifted at least six times during primate evolution, leading to the formation of a Simiiformes-specific evolutionary stratum and to the independent start of young strata in Catarrhini and Platyrrhini. Different primate lineages experienced different rates of gene loss and structural and chromatin change on their Y chromosomes. Selection on several Y-linked genes has contributed to the evolution of male developmental traits across the primates. Additionally, lineage-specific expansions of ampliconic regions have further increased the diversification of the structure and gene composition of the Y chromosome. Overall, our comprehensive analysis has broadened our knowledge of the evolution of the primate Y chromosome.
Asunto(s)
Evolución Molecular , Cromosoma Y , Animales , Masculino , Cromosoma Y/genética , Primates/genéticaRESUMEN
The rich diversity of morphology and behavior displayed across primate species provides an informative context in which to study the impact of genomic diversity on fundamental biological processes. Analysis of that diversity provides insight into long-standing questions in evolutionary and conservation biology and is urgent given severe threats these species are facing. Here, we present high-coverage whole-genome data from 233 primate species representing 86% of genera and all 16 families. This dataset was used, together with fossil calibration, to create a nuclear DNA phylogeny and to reassess evolutionary divergence times among primate clades. We found within-species genetic diversity across families and geographic regions to be associated with climate and sociality, but not with extinction risk. Furthermore, mutation rates differ across species, potentially influenced by effective population sizes. Lastly, we identified extensive recurrence of missense mutations previously thought to be human specific. This study will open a wide range of research avenues for future primate genomic research.
Asunto(s)
Evolución Biológica , Variación Genética , Primates , Animales , Humanos , Genoma , Tasa de Mutación , Filogenia , Primates/genética , Densidad de PoblaciónRESUMEN
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 million common protein-altering variants with orthologs in humans. We show that these variants can be inferred to have nondeleterious effects in humans based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases.
Asunto(s)
Variación Genética , Primates , Animales , Humanos , Secuencia de Bases , Frecuencia de los Genes , Primates/genética , Secuenciación Completa del GenomaRESUMEN
Due to their limited ranges and inherent isolation, island species have long been recognized as crucial systems for tackling a range of evolutionary questions, including in the early study of speciation.1,2 Such species have been less studied in the understanding of the evolutionary forces driving DNA sequence evolution. Island species usually have lower census population sizes (N) than continental species and, supposedly, lower effective population sizes (Ne). Given that both the rates of change caused by genetic drift and by selection are dependent upon Ne, island species are theoretically expected to exhibit (1) lower genetic diversity, (2) less effective natural selection against slightly deleterious mutations,3,4 and (3) a lower rate of adaptive evolution.5-8 Here, we have used a large set of newly sequenced and published whole-genome sequences of Passerida species (14 insular and 11 continental) to test these predictions. We confirm that island species exhibit lower census size and Ne, supporting the hypothesis that the smaller area available on islands constrains the upper bound of Ne. In the insular species, we find lower nucleotide diversity in coding regions, higher ratios of non-synonymous to synonymous polymorphisms, and lower adaptive substitution rates. Our results provide robust evidence that the lower Ne experienced by island species has affected both the ability of natural selection to efficiently remove weakly deleterious mutations and also the adaptive potential of island species, therefore providing considerable empirical support for the nearly neutral theory. We discuss the implications for both evolutionary and conservation biology.
Asunto(s)
Evolución Molecular , Genética de Población , Pájaros Cantores , Animales , Flujo Genético , Variación Genética , Densidad de Población , Selección Genética , Pájaros Cantores/genéticaRESUMEN
Genetic drift is an important evolutionary force of strength inversely proportional to Ne , the effective population size. The impact of drift on genome diversity and evolution is known to vary among species, but quantifying this effect is a difficult task. Here we assess the magnitude of variation in drift power among species of animals via its effect on the mutation load - which implies also inferring the distribution of fitness effects of deleterious mutations. To this aim, we analyze the nonsynonymous (amino-acid changing) and synonymous (amino-acid conservative) allele frequency spectra in a large sample of metazoan species, with a focus on the primates vs. fruit flies contrast. We show that a Gamma model of the distribution of fitness effects is not suitable due to strong differences in estimated shape parameters among taxa, while adding a class of lethal mutations essentially solves the problem. Using the Gamma + lethal model and assuming that the mean deleterious effects of nonsynonymous mutations is shared among species, we estimate that the power of drift varies by a factor of at least 500 between large-Ne and small-Ne species of animals, i.e., an order of magnitude more than the among-species variation in genetic diversity. Our results are relevant to Lewontin's paradox while further questioning the meaning of the Ne parameter in population genomics.
Asunto(s)
Flujo Genético , Modelos Genéticos , Tasa de Mutación , Animales , Aptitud Genética , Humanos , Mutación MissenseRESUMEN
The fixation probability of a recessive beneficial mutation is increased on the X or Z chromosome, relative to autosomes, because recessive alleles carried by X or Z are exposed to selection in the heterogametic sex. This leads to an increased dN/dS ratio on sex chromosomes relative to autosomes, a pattern called the "fast-X" or "fast-Z" effect. Besides positive selection, the strength of genetic drift and the efficacy of purifying selection, which affect the rate of molecular evolution, might differ between sex chromosomes and autosomes. Disentangling the complex effects of these distinct forces requires the genome-wide analysis of polymorphism, divergence and gene expression data in a variety of taxa. Here we study the influence of hemizygosity of the Z chromosome in Maniola jurtina and Pyronia tithonus, two species of butterflies (Lepidoptera, Nymphalidae, Satyrinae). Using transcriptome data, we compare the strength of positive and negative selection between Z and autosomes accounting for sex-specific gene expression. We show that M. jurtina and P. tithonus do not experience a faster, but rather a slightly slower evolutionary rate on the Z than on autosomes. Our analysis failed to detect a significant difference in adaptive evolutionary rate between Z and autosomes, but comparison of male-biased, unbiased and female-biased Z-linked genes revealed an increased efficacy of purifying selection against recessive deleterious mutations in female-biased Z-linked genes. This probably contributes to the lack of fast-Z evolution of satyrines. We suggest that the effect of hemizygosity on the fate of recessive deleterious mutations should be taken into account when interpreting patterns of molecular evolution in sex chromosomes vs. autosomes.