RESUMO
BACKGROUND: Regulation of transcription by DNA methylation in 5'-CpG-3' context is a widespread mechanism allowing differential expression of genetically identical cells to persist throughout development. Consequently, differences in DNA methylation can reinforce variation in gene expression among cells, tissues, populations, and species. Despite a surge in studies on DNA methylation, we know little about the importance of DNA methylation in population differentiation and speciation. Here we investigate the regulatory and evolutionary impact of DNA methylation in five tissues of two Ficedula flycatcher species and their naturally occurring F1 hybrids. RESULTS: We show that the density of CpG in the promoters of genes determines the strength of the association between DNA methylation and gene expression. The impact of DNA methylation on gene expression varies among tissues with the brain showing unique patterns. Differentially expressed genes between parental species are predicted by genetic and methylation differentiation in CpG-rich promoters. However, both these factors fail to predict hybrid misexpression suggesting that promoter mismethylation is not a main determinant of hybrid misexpression in Ficedula flycatchers. Using allele-specific methylation estimates in hybrids, we also determine the genome-wide contribution of cis- and trans effects in DNA methylation differentiation. These distinct mechanisms are roughly balanced in all tissues except the brain, where trans differences predominate. CONCLUSIONS: Overall, this study provides insight on the regulatory and evolutionary impact of DNA methylation in songbirds.
Assuntos
Ilhas de CpG , Metilação de DNA , Regiões Promotoras Genéticas , Aves Canoras , Animais , Aves Canoras/genética , Ilhas de CpG/genética , Hibridização Genética , Evolução Molecular , Evolução Biológica , Regulação da Expressão GênicaRESUMO
Recombination is a central evolutionary process that reshuffles combinations of alleles along chromosomes, and consequently is expected to influence the efficacy of direct selection via Hill-Robertson interference. Additionally, the indirect effects of selection on neutral genetic diversity are expected to show a negative relationship with recombination rate, as background selection and genetic hitchhiking are stronger when recombination rate is low. However, owing to the limited availability of recombination rate estimates across divergent species, the impact of evolutionary changes in recombination rate on genomic signatures of selection remains largely unexplored. To address this question, we estimate recombination rate in two Ficedula flycatcher species, the taiga flycatcher (Ficedula albicilla) and collared flycatcher (Ficedula albicollis). We show that recombination rate is strongly correlated with signatures of indirect selection, and that evolutionary changes in recombination rate between species have observable impacts on this relationship. Conversely, signatures of direct selection on coding sequences show little to no relationship with recombination rate, even when restricted to genes where recombination rate is conserved between species. Thus, using measures of indirect and direct selection that bridge micro- and macro-evolutionary timescales, we demonstrate that the role of recombination rate and its dynamics varies for different signatures of selection.
Assuntos
Passeriformes , Aves Canoras , Animais , Aves Canoras/genética , Seleção Genética , Genoma , Passeriformes/genética , Recombinação GenéticaRESUMO
The sex chromosomes have been hypothesized to play a key role in driving adaptation and speciation across many taxa. The reason for this is thought to be the hemizygosity of the heteromorphic part of sex chromosomes in the heterogametic sex, which exposes recessive mutations to natural and sexual selection. The exposure of recessive beneficial mutations increases their rate of fixation on the sex chromosomes, which results in a faster rate of evolution. In addition, genetic incompatibilities between sex-linked loci are exposed faster in the genomic background of hybrids of divergent lineages, which makes sex chromosomes contribute disproportionately to reproductive isolation. However, in birds, which show a Z/W sex determination system, the role of adaptation versus genetic drift as the driving force of the faster differentiation of the Z chromosome (fast-Z effect) and the disproportionate role of the Z chromosome in reproductive isolation (large-Z effect) are still debated. Here, we address this debate in the bird genus Ficedula flycatchers based on population-level whole-genome sequencing data of six species. Our analysis provides evidence for both faster lineage sorting and reduced gene flow on the Z chromosome than the autosomes. However, these patterns appear to be driven primarily by the increased role of genetic drift on the Z chromosome, rather than an increased rate of adaptive evolution. Genomic scans of selective sweeps and fixed differences in fact suggest a reduced action of positive selection on the Z chromosome.
RESUMO
We introduce a multi-allele Wright-Fisher model with mutation and selection such that allele frequencies at a single locus are traced by the path of a hybrid jump-diffusion process. The state space of the process is given by the vertices and edges of a topological graph, i.e. edges are unit intervals. Vertices represent monomorphic population states and positions on the edges mark the biallelic proportions of ancestral and derived alleles during polymorphic segments. In this setting, mutations can only occur at monomorphic loci. We derive the stationary distribution in mutation-selection-drift equilibrium and obtain the expected allele frequency spectrum under large population size scaling. For the extended model with multiple independent loci we derive rigorous upper bounds for a wide class of associated measures of genetic variation. Within this framework we present mathematically precise arguments to conclude that the presence of directional selection reduces the magnitude of genetic variation, as constrained by the bounds for neutral evolution.
Assuntos
Frequência do Gene , Variação Genética , Genética Populacional , Modelos Genéticos , Seleção Genética , Mutação , Alelos , Deriva Genética , Humanos , Densidade DemográficaRESUMO
Changes in interacting cis- and trans-regulatory elements are important candidates for Dobzhansky-Muller hybrid incompatibilities and may contribute to hybrid dysfunction by giving rise to misexpression in hybrids. To gain insight into the molecular mechanisms and determinants of gene expression evolution in natural populations, we analyzed the transcriptome from multiple tissues of two recently diverged Ficedula flycatcher species and their naturally occurring F1 hybrids. Differential gene expression analysis revealed that the extent of differentiation between species and the set of differentially expressed genes varied across tissues. Common to all tissues, a higher proportion of Z-linked genes than autosomal genes showed differential expression, providing evidence for a fast-Z effect. We further found clear signatures of hybrid misexpression in brain, heart, kidney, and liver. However, while testis showed the highest divergence of gene expression among tissues, it showed no clear signature of misexpression in F1 hybrids, even though these hybrids were found to be sterile. It is therefore unlikely that incompatibilities between cis-trans regulatory changes explain the observed sterility. Instead, we found evidence that cis-regulatory changes play a significant role in the evolution of gene expression in testis, which illustrates the tissue-specific nature of cis-regulatory evolution bypassing constraints associated with pleiotropic effects of genes.
Assuntos
Proteínas Aviárias/genética , Perfilação da Expressão Gênica/métodos , Aves Canoras/genética , Testículo/metabolismo , Animais , Encéfalo/metabolismo , Evolução Molecular , Regulação da Expressão Gênica , Rim/metabolismo , Fígado/metabolismo , Masculino , Miocárdio/metabolismo , Especificidade de Órgãos , Análise de Sequência de RNA , Aves Canoras/fisiologia , Especificidade da EspécieRESUMO
The ratio of nonsynonymous over synonymous sequence divergence, dN/dS, is a widely used estimate of the nonsynonymous over synonymous fixation rate ratio ω, which measures the extent to which natural selection modulates protein sequence evolution. Its computation is based on a phylogenetic approach and computes sequence divergence of protein-coding DNA between species, traditionally using a single representative DNA sequence per species. This approach ignores the presence of polymorphisms and relies on the indirect assumption that new mutations fix instantaneously, an assumption which is generally violated and reasonable only for distantly related species. The violation of the underlying assumption leads to a time-dependence of sequence divergence, and biased estimates of ω in particular for closely related species, where the contribution of ancestral and lineage-specific polymorphisms to sequence divergence is substantial. We here use a time-dependent Poisson random field model to derive an analytical expression of dN/dS as a function of divergence time and sample size. We then extend our framework to the estimation of the proportion of adaptive protein evolution α. This mathematical treatment enables us to show that the joint usage of polymorphism and divergence data can assist the inference of selection for closely related species. Moreover, our analytical results provide the basis for a protocol for the estimation of ω and α for closely related species. We illustrate the performance of this protocol by studying a population data set of four corvid species, which involves the estimation of ω and α at different time-scales and for several choices of sample sizes.
Assuntos
Evolução Molecular , Técnicas Genéticas , Modelos Genéticos , Mutação Silenciosa , Animais , Corvos/genética , Polimorfismo GenéticoRESUMO
The rate of recombination impacts on rates of protein evolution for at least two reasons: it affects the efficacy of selection due to linkage and influences sequence evolution through the process of GC-biased gene conversion (gBGC). We studied how recombination, via gBGC, affects inferences of selection in gene sequences using comparative genomic and population genomic data from the collared flycatcher (Ficedula albicollis). We separately analyzed different mutation categories ("strong"-to-"weak," "weak-to-strong," and GC-conservative changes) and found that gBGC impacts on the distribution of fitness effects of new mutations, and leads to that the rate of adaptive evolution and the proportion of adaptive mutations among nonsynonymous substitutions are underestimated by 22-33%. It also biases inferences of demographic history based on the site frequency spectrum. In light of this impact, we suggest that inferences of selection (and demography) in lineages with pronounced gBGC should be based on GC-conservative changes only. Doing so, we estimate that 10% of nonsynonymous mutations are effectively neutral and that 27% of nonsynonymous substitutions have been fixed by positive selection in the flycatcher lineage. We also find that gene expression level, sex-bias in expression, and the number of protein-protein interactions, but not Hill-Robertson interference (HRI), are strong determinants of selective constraint and rate of adaptation of collared flycatcher genes. This study therefore illustrates the importance of disentangling the effects of different evolutionary forces and genetic factors in interpretation of sequence data, and from that infer the role of natural selection in DNA sequence evolution.
Assuntos
Evolução Molecular , Conversão Gênica , Seleção Genética , Aves Canoras/genética , Animais , Feminino , MasculinoRESUMO
Recombination is an engine of genetic diversity and therefore constitutes a key process in evolutionary biology and genetics. While the outcome of crossover recombination can readily be detected as shuffled alleles by following the inheritance of markers in pedigreed families, the more precise location of both crossover and non-crossover recombination events has been difficult to pinpoint. As a consequence, we lack a detailed portrait of the recombination landscape for most organisms and knowledge on how this landscape impacts on sequence evolution at a local scale. To localize recombination events with high resolution in an avian system, we performed whole-genome re-sequencing at high coverage of a complete three-generation collared flycatcher pedigree. We identified 325 crossovers at a median resolution of 1.4 kb, with 86% of the events localized to <10 kb intervals. Observed crossover rates were in excellent agreement with data from linkage mapping, were 52% higher in male (3.56 cM/Mb) than in female meiosis (2.28 cM/Mb), and increased towards chromosome ends in male but not female meiosis. Crossover events were non-randomly distributed in the genome with several distinct hot-spots and a concentration to genic regions, with the highest density in promoters and CpG islands. We further identified 267 non-crossovers, whose location was significantly associated with crossover locations. We detected a significant transmission bias (0.18) in favour of 'strong' (G, C) over 'weak' (A, T) alleles at non-crossover events, providing direct evidence for the process of GC-biased gene conversion in an avian system. The approach taken in this study should be applicable to any species and would thereby help to provide a more comprehensive portray of the recombination landscape across organism groups.
Assuntos
Troca Genética , Recombinação Genética , Aves Canoras/genética , Animais , Mapeamento Cromossômico , Feminino , Variação Genética , Genoma , Sequenciamento de Nucleotídeos em Larga Escala , Masculino , LinhagemRESUMO
Speciation is a continuous process during which genetic changes gradually accumulate in the genomes of diverging species. Recent studies have documented highly heterogeneous differentiation landscapes, with distinct regions of elevated differentiation ("differentiation islands") widespread across genomes. However, it remains unclear which processes drive the evolution of differentiation islands; how the differentiation landscape evolves as speciation advances; and ultimately, how differentiation islands are related to speciation. Here, we addressed these questions based on population genetic analyses of 200 resequenced genomes from 10 populations of four Ficedula flycatcher sister species. We show that a heterogeneous differentiation landscape starts emerging among populations within species, and differentiation islands evolve recurrently in the very same genomic regions among independent lineages. Contrary to expectations from models that interpret differentiation islands as genomic regions involved in reproductive isolation that are shielded from gene flow, patterns of sequence divergence (d(xy) and relative node depth) do not support a major role of gene flow in the evolution of the differentiation landscape in these species. Instead, as predicted by models of linked selection, genome-wide variation in diversity and differentiation can be explained by variation in recombination rate and the density of targets for selection. We thus conclude that the heterogeneous landscape of differentiation in Ficedula flycatchers evolves mainly as the result of background selection and selective sweeps in genomic regions of low recombination. Our results emphasize the necessity of incorporating linked selection as a null model to identify genome regions involved in adaptation and speciation.
Assuntos
Especiação Genética , Passeriformes/classificação , Passeriformes/genética , Recombinação Genética , Seleção Genética , Animais , Feminino , Fluxo Gênico , Genética Populacional , Genoma , Genômica , Técnicas de Genotipagem , Masculino , Polimorfismo de Nucleotídeo Único , Isolamento Reprodutivo , Análise de Sequência de DNA , Especificidade da EspécieRESUMO
Theoretical work suggests that sexual conflict should promote the maintenance of genetic diversity by the opposing directions of selection on males and females. If such conflict is pervasive, it could potentially lead to genomic heterogeneity in levels of genetic diversity an idea that so far has not been empirically tested on a genomewide scale. We used large-scale population genomic and transcriptomic data from the collared flycatcher (Ficedula albicollis) to analyse how sexual conflict, for which we use sex-biased gene expression as a proxy, relates to genetic variability. Here, we demonstrate that the extent of sex-biased gene expression of both male-biased and female-biased genes is significantly correlated with levels of nucleotide diversity in gene sequences and that this correlation extends to diversity levels also in intergenic DNA and introns. We find signatures of balancing selection in sex-biased genes but also note that relaxed purifying selection could potentially explain part of the observed patterns. The finding of significant genetic differentiation between males and females for male-biased (and gonad-specific) genes indicates ongoing sexual conflict and sex-specific viability selection, potentially driven by sexual selection. Our results thus indicate that sexual antagonism could potentially be considered as one viable explanation to the long-standing question in evolutionary biology of how genomes can remain so genetically variable in face of strong natural and sexual selection.
Assuntos
Variação Genética , Seleção Genética , Caracteres Sexuais , Aves Canoras/genética , Animais , DNA Intergênico/genética , Feminino , Expressão Gênica , Genética Populacional , Genoma , Íntrons , Masculino , TranscriptomaRESUMO
The ratio of nonsynonymous to synonymous substitution rates (ω) is often used to measure the strength of natural selection. However, ω may be influenced by linkage among different targets of selection, that is, Hill-Robertson interference (HRI), which reduces the efficacy of selection. Recombination modulates the extent of HRI but may also affect ω by means of GC-biased gene conversion (gBGC), a process leading to a preferential fixation of G:C ("strong," S) over A:T ("weak," W) alleles. As HRI and gBGC can have opposing effects on ω, it is essential to understand their relative impact to make proper inferences of ω. We used a model that separately estimated S-to-S, S-to-W, W-to-S, and W-to-W substitution rates in 8,423 avian genes in the Ficedula flycatcher lineage. We found that the W-to-S substitution rate was positively, and the S-to-W rate negatively, correlated with recombination rate, in accordance with gBGC but not predicted by HRI. The W-to-S rate further showed the strongest impact on both dN and dS. However, since the effects were stronger at 4-fold than at 0-fold degenerated sites, likely because the GC content of these sites is farther away from its equilibrium, ω slightly decreases with increasing recombination rate, which could falsely be interpreted as a consequence of HRI. We corroborated this hypothesis analytically and demonstrate that under particular conditions, ω can decrease with increasing recombination rate. Analyses of the site-frequency spectrum showed that W-to-S mutations were skewed toward high, and S-to-W mutations toward low, frequencies, consistent with a prevalent gBGC-driven fixation bias.
Assuntos
Composição de Bases/genética , Aves/genética , Evolução Molecular , Conversão Gênica/genética , Recombinação Genética/genética , Animais , Variação Genética , Modelos Genéticos , Mutação , Polimorfismo de Nucleotídeo Único/genética , Seleção GenéticaRESUMO
Closely related species may show similar levels of genetic diversity in homologous regions of the genome owing to shared ancestral variation still segregating in the extant species. However, after completion of lineage sorting, such covariation is not necessarily expected. On the other hand, if the processes that govern genetic diversity are conserved, diversity may potentially covary even among distantly related species. We mapped regions of conserved synteny between the genomes of two divergent bird species-collared flycatcher and hooded crow-and identified more than 600 Mb of homologous regions (66% of the genome). From analyses of whole-genome resequencing data in large population samples of both species we found nucleotide diversity in 200 kb windows to be well correlated (Spearman's ρ = 0.407). The correlation remained highly similar after excluding coding sequences. To explain this covariation, we suggest that a stable avian karyotype and a conserved landscape of recombination rate variation render the diversity-reducing effects of linked selection similar in divergent bird lineages. Principal component regression analysis of several potential explanatory variables driving heterogeneity in flycatcher diversity levels revealed the strongest effects from recombination rate variation and density of coding sequence targets for selection, consistent with linked selection. It is also possible that a stable karyotype is associated with a conserved genomic mutation environment contributing to covariation in diversity levels between lineages. Our observations imply that genetic diversity is to some extent predictable.
Assuntos
Corvos/genética , Genoma , Nucleotídeos/genética , Aves Canoras/genética , Animais , Evolução Molecular , Cariótipo , Recombinação Genética , SinteniaRESUMO
Recombination rate is heterogeneous across the genome of various species and so are genetic diversity and differentiation as a consequence of linked selection. However, we still lack a clear picture of the underlying mechanisms for regulating recombination. Here we estimated fine-scale population recombination rate based on the patterns of linkage disequilibrium across the genomes of multiple populations of two closely related flycatcher species (Ficedula albicollis and F. hypoleuca). This revealed an overall conservation of the recombination landscape between these species at the scale of 200 kb, but we also identified differences in the local rate of recombination despite their recent divergence (<1 million years). Genetic diversity and differentiation were associated with recombination rate in a lineage-specific manner, indicating differences in the extent of linked selection between species. We detected 400-3,085 recombination hotspots per population. Location of hotspots was conserved between species, but the intensity of hotspot activity varied between species. Recombination hotspots were primarily associated with CpG islands (CGIs), regardless of whether CGIs were at promoter regions or away from genes. Recombination hotspots were also associated with specific transposable elements (TEs), but this association appears indirect due to shared preferences of the transposition machinery and the recombination machinery for accessible open chromatin regions. Our results suggest that CGIs are a major determinant of the localization of recombination hotspots, and we propose that both the distribution of TEs and fine-scale variation in recombination rate may be associated with the evolution of the epigenetic landscape.
Assuntos
Genética Populacional , Desequilíbrio de Ligação , Recombinação Genética , Aves Canoras/genética , Animais , Ilhas de CpG , Elementos de DNA Transponíveis , Epigênese Genética , Variação Genética , Genoma , Regiões Promotoras GenéticasRESUMO
The origin and evolutionary dynamics of the spatial heterogeneity in genomic base composition have been debated since its discovery in the 1970s. With the recent availability of numerous genome sequences from a wide range of species it has been possible to address this question from a comparative perspective, and similarities and differences in base composition between groups of organisms are becoming evident. Ample evidence suggests that the contrasting dynamics of base composition are driven by GC-biased gene conversion (gBGC), a process that is associated with meiotic recombination. In line with this hypothesis, base composition is associated with the rate of recombination and the evolutionary dynamics of the recombination landscape, therefore, governs base composition. In addition, and at first sight perhaps surprisingly, the relationship between demography and genomic base composition is in agreement with the gBGC hypothesis: organisms with larger populations have higher GC content than those with smaller populations.
Assuntos
Composição de Bases/genética , Conversão Gênica/genética , Recombinação Genética/genética , Animais , Demografia/métodos , Evolução Molecular , Genoma/genética , Genômica/métodos , Humanos , Meiose/genéticaRESUMO
Relatively little is known about the character of gene expression evolution as species diverge. It is for instance unclear if gene expression generally evolves in a clock-like manner (by stabilizing selection or neutral evolution) or if there are frequent episodes of directional selection. To gain insights into the evolutionary divergence of gene expression, we sequenced and compared the transcriptomes of multiple organs from population samples of collared (Ficedula albicollis) and pied flycatchers (F. hypoleuca), two species which diverged less than one million years ago. Ordination analysis separated samples by organ rather than by species. Organs differed in their degrees of expression variance within species and expression divergence between species. Variance was negatively correlated with expression breadth and protein interactivity, suggesting that pleiotropic constraints reduce gene expression variance within species. Variance was correlated with between-species divergence, consistent with a pattern expected from stabilizing selection and neutral evolution. Using an expression PST approach, we identified genes differentially expressed between species and found 16 genes uniquely expressed in one of the species. For one of these, DPP7, uniquely expressed in collared flycatcher, the absence of expression in pied flycatcher could be associated with a ≈20-kb deletion including 11 of 13 exons. This study of a young vertebrate speciation model system expands our knowledge of how gene expression evolves as natural populations become reproductively isolated.
Assuntos
Evolução Biológica , Deriva Genética , Seleção Genética , Aves Canoras/classificação , Animais , Feminino , Expressão Gênica , Pleiotropia Genética , Genética Populacional , Masculino , Modelos Genéticos , Aves Canoras/genética , Especificidade da Espécie , SuéciaRESUMO
In population genetic studies, the allele frequency spectrum (AFS) efficiently summarizes genome-wide polymorphism data and shapes a variety of allele frequency-based summary statistics. While existing theory typically features equilibrium conditions, emerging methodology requires an analytical understanding of the build-up of the allele frequencies over time. In this work, we use the framework of Poisson random fields to derive new representations of the non-equilibrium AFS for the case of a Wright-Fisher population model with selection. In our approach, the AFS is a scaling-limit of the expectation of a Poisson stochastic integral and the representation of the non-equilibrium AFS arises in terms of a fixation time probability distribution. The known duality between the Wright-Fisher diffusion process and a birth and death process generalizing Kingman's coalescent yields an additional representation. The results carry over to the setting of a random sample drawn from the population and provide the non-equilibrium behavior of sample statistics. Our findings are consistent with and extend a previous approach where the non-equilibrium AFS solves a partial differential forward equation with a non-traditional boundary condition. Moreover, we provide a bridge to previous coalescent-based work, and hence tie several frameworks together. Since frequency-based summary statistics are widely used in population genetics, for example, to identify candidate loci of adaptive evolution, to infer the demographic history of a population, or to improve our understanding of the underlying mechanics of speciation events, the presented results are potentially useful for a broad range of topics.
Assuntos
Frequência do Gene , Genética Populacional , Modelos Genéticos , Polimorfismo GenéticoRESUMO
The ratio of divergence at nonsynonymous and synonymous sites, dN/dS, is a widely used measure in evolutionary genetic studies to investigate the extent to which selection modulates gene sequence evolution. Originally tailored to codon sequences of distantly related lineages, dN/dS represents the ratio of fixed nonsynonymous to synonymous differences. The impact of ancestral and lineage-specific polymorphisms on dN/dS, which we here show to be substantial for closely related lineages, is generally neglected in estimation techniques of dN/dS. To address this issue, we formulate a codon model that is firmly anchored in population genetic theory, derive analytical expressions for the dN/dS measure by Poisson random field approximation in a Markovian framework and validate the derivations by simulations. In good agreement, simulations and analytical derivations demonstrate that dN/dS is biased by polymorphisms at short time scales and that it can take substantial time for the expected value to settle at its time limit where only fixed differences are considered. We further show that in any attempt to estimate the dN/dS ratio from empirical data the effect of the intrinsic fluctuations of a ratio of stochastic variables, can even under neutrality yield extreme values of dN/dS at short time scales or in regions of low mutation rate. Taken together, our results have significant implications for the interpretation of dN/dS estimates, the McDonald-Kreitman test and other related statistics, in particular for closely related lineages.
Assuntos
Códon/genética , Evolução Molecular , Modelos Genéticos , Substituição de Aminoácidos , Códon/metabolismo , Genética Populacional , Mutação , Filogenia , Polimorfismo Genético , Seleção Genética , Fatores de TempoRESUMO
The genomes of many vertebrates show a characteristic heterogeneous distribution of GC content, the so-called GC isochore structure. The origin of isochores has been explained via the mechanism of GC-biased gene conversion (gBGC). However, although the isochore structure is declining in many mammalian genomes, the heterogeneity in GC content is being reinforced in the avian genome. Despite this discrepancy, which remains unexplained, examinations of individual substitution frequencies in mammals and birds are both consistent with the gBGC model of isochore evolution. On the other hand, a negative correlation between substitution and recombination rate found in the chicken genome is inconsistent with the gBGC model. It should therefore be important to consider along with gBGC other consequences of recombination on the origin and fate of mutations, as well as to account for relationships between recombination rate and other genomic features. We therefore developed an analytical model to describe the substitution patterns found in the chicken genome, and further investigated the relationships between substitution patterns and several genomic features in a rigorous statistical framework. Our analysis indicates that GC content itself, either directly or indirectly via interrelations to other genomic features, has an impact on the substitution pattern. Further, we suggest that this phenomenon is particularly visible in avian genomes due to their unusually low rate of chromosomal evolution. Because of this, interrelations between GC content and other genomic features are being reinforced, and are as such more pronounced in avian genomes as compared with other vertebrate genomes with a less stable karyotype.
Assuntos
Cromossomos/genética , Evolução Molecular , Conversão Gênica , Cariótipo , Animais , Composição de Bases , Galinhas , Genoma , Isocoros/genética , Mamíferos/genética , Recombinação Genética , Vertebrados/genéticaRESUMO
Detailed linkage and recombination rate maps are necessary to use the full potential of genome sequencing and population genomic analyses. We used a custom collared flycatcher 50 K SNP array to develop a high-density linkage map with 37 262 markers assigned to 34 linkage groups in 33 autosomes and the Z chromosome. The best-order map contained 4215 markers, with a total distance of 3132 cM and a mean genetic distance between markers of 0.12 cM. Facilitated by the array being designed to include markers from most scaffolds, we obtained a second-generation assembly of the flycatcher genome that approaches full chromosome sequences (N50 super-scaffold size 20.2 Mb and with 1.042 Gb (of 1.116 Gb) anchored to and mostly ordered and oriented along chromosomes). We found that flycatcher and zebra finch chromosomes are entirely syntenic but that inversions at mean rates of 1.5-2.0 event (6.6-7.5 Mb) per My have changed the organization within chromosomes, rates high enough for inversions to potentially have been involved with many speciation events during avian evolution. The mean recombination rate was 3.1 cM/Mb and correlated closely with chromosome size, from 2 cM/Mb for chromosomes >100 Mb to >10 cM/Mb for chromosomes <10 Mb. This size dependence seemed entirely due to an obligate recombination event per chromosome; if 50 cM was subtracted from the genetic lengths of chromosomes, the rate per physical unit DNA was constant across chromosomes. Flycatcher recombination rate showed similar variation along chromosomes as chicken but lacked the large interior recombination deserts characteristic of zebra finch chromosomes.
Assuntos
Evolução Molecular , Ligação Genética , Recombinação Genética , Aves Canoras/genética , Animais , Galinhas , Mapeamento Cromossômico , Feminino , Tentilhões , Genoma , Técnicas de Genotipagem , Masculino , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , SinteniaRESUMO
BACKGROUND: Polymorphism is key to the evolutionary potential of populations. Understanding which factors shape levels of genetic diversity within genomes forms a central question in evolutionary genomics and is of importance for the possibility to infer episodes of adaptive evolution from signs of reduced diversity. There is an on-going debate on the relative role of mutation and selection in governing diversity levels. This question is also related to the role of recombination because recombination is expected to indirectly affect polymorphism via the efficacy of selection. Moreover, recombination might itself be mutagenic and thereby assert a direct effect on diversity levels. RESULTS: We used whole-genome re-sequencing data from domestic chicken (broiler and layer breeds) and its wild ancestor (the red jungle fowl) to study the relationship between genetic diversity and several genomic parameters. We found that recombination rate had the largest effect on local levels of nucleotide diversity. The fact that divergence (a proxy for mutation rate) and recombination rate were negatively correlated argues against a mutagenic role of recombination. Furthermore, divergence had limited influence on polymorphism. CONCLUSIONS: Overall, our results are consistent with a selection model, in which regions within a short distance from loci under selection show reduced polymorphism levels. This conclusion lends further support from the observations of strong correlations between intergenic levels of diversity and diversity at synonymous as well as non-synonymous sites. Our results also demonstrate differences between the two domestic breeds and red jungle fowl, where the domestic breeds show a stronger relationship between intergenic diversity levels and diversity at synonymous and non-synonymous sites. This finding, together with overall lower diversity levels in domesticates compared to red jungle fowl, seem attributable to artificial selection during domestication.