RESUMEN
Understanding the extent to which ecological divergence is repeatable is essential for predicting responses of biodiversity to environmental change. Here we test the predictability of evolution, from genotype to phenotype, by studying parallel evolution in a salmonid fish, Arctic charr (Salvelinus alpinus), across eleven replicate sympatric ecotype pairs (benthivorous-planktivorous and planktivorous-piscivorous) and two evolutionary lineages. We found considerable variability in eco-morphological divergence, with several traits related to foraging (eye diameter, pectoral fin length) being highly parallel even across lineages. This suggests repeated and predictable adaptation to environment. Consistent with ancestral genetic variation, hundreds of loci were associated with ecotype divergence within lineages of which eight were shared across lineages. This shared genetic variation was maintained despite variation in evolutionary histories, ranging from postglacial divergence in sympatry (ca. 10-15kya) to pre-glacial divergence (ca. 20-40kya) with postglacial secondary contact. Transcriptome-wide gene expression (44,102 genes) was highly parallel across replicates, involved biological processes characteristic of ecotype morphology and physiology, and revealed parallelism at the level of regulatory networks. This expression divergence was not only plastic but in part genetically controlled by parallel cis-eQTL. Lastly, we found that the magnitude of phenotypic divergence was largely correlated with the genetic differentiation and gene expression divergence. In contrast, the direction of phenotypic change was mostly determined by the interplay of adaptive genetic variation, gene expression, and ecosystem size. Ecosystem size further explained variation in putatively adaptive, ecotype-associated genomic patterns within and across lineages, highlighting the role of environmental variation and stochasticity in parallel evolution. Together, our findings demonstrate the parallel evolution of eco-morphology and gene expression within and across evolutionary lineages, which is controlled by the interplay of environmental stochasticity and evolutionary contingencies, largely overcoming variable evolutionary histories and genomic backgrounds.
Asunto(s)
Ecotipo , Evolución Molecular , Peces/anatomía & histología , Peces/genética , Expresión Génica , Variación Genética , Genoma/genética , Animales , Ecología , Femenino , Flujo Genético , Especiación Genética , Genética de Población , Genómica , Masculino , SimpatríaRESUMEN
AbstractThe potentially significant genetic consequences associated with the loss of migratory capacity of diadromous fishes that have become landlocked in freshwater are poorly understood. Consistent selective pressures associated with freshwater residency may drive repeated differentiation both between allopatric landlocked and anadromous populations and within landlocked populations (resulting in sympatric morphs). Alternatively, the strong genetic drift anticipated in isolated landlocked populations could hinder consistent adaptation, limiting genetic parallelism. Understanding the degree of genetic parallelism underlying differentiation has implications for both the predictability of evolution and management practices. We employed an 87k single-nucleotide polymorphism (SNP) array to examine the genetic characteristics of landlocked and anadromous Arctic char (Salvelinus alpinus) populations from five drainages within Labrador, Canada. One gene was detected as an outlier between sympatric, size-differentiated morphs in each of two landlocked lakes. While no single locus differentiated all replicate pairs of landlocked and anadromous populations, several SNPs, genes, and paralogs were consistently detected as outliers in at least 70% of these pairwise comparisons. A significant C-score suggested that the amount of shared outlier SNPs across all paired landlocked and anadromous populations was greater than expected by chance. Our results indicate that despite their isolation, selection due to the loss of diadromy may drive consistent genetic responses in landlocked populations.
Asunto(s)
Lagos , Trucha , Animales , Regiones Árticas , Genoma , Genómica , Trucha/genéticaRESUMEN
The whole-genome duplication 80 million years ago of the common ancestor of salmonids (salmonid-specific fourth vertebrate whole-genome duplication, Ss4R) provides unique opportunities to learn about the evolutionary fate of a duplicated vertebrate genome in 70 extant lineages. Here we present a high-quality genome assembly for Atlantic salmon (Salmo salar), and show that large genomic reorganizations, coinciding with bursts of transposon-mediated repeat expansions, were crucial for the post-Ss4R rediploidization process. Comparisons of duplicate gene expression patterns across a wide range of tissues with orthologous genes from a pre-Ss4R outgroup unexpectedly demonstrate far more instances of neofunctionalization than subfunctionalization. Surprisingly, we find that genes that were retained as duplicates after the teleost-specific whole-genome duplication 320 million years ago were not more likely to be retained after the Ss4R, and that the duplicate retention was not influenced to a great extent by the nature of the predicted protein interactions of the gene products. Finally, we demonstrate that the Atlantic salmon assembly can serve as a reference sequence for the study of other salmonids for a range of purposes.
Asunto(s)
Diploidia , Evolución Molecular , Duplicación de Gen/genética , Genes Duplicados/genética , Genoma/genética , Salmo salar/genética , Animales , Elementos Transponibles de ADN/genética , Femenino , Genómica , Masculino , Modelos Genéticos , Mutagénesis/genética , Filogenia , Estándares de Referencia , Salmo salar/clasificación , Homología de SecuenciaRESUMEN
Copepods encompass numerous ecological roles including parasites, detrivores and phytoplankton grazers. Nonetheless, copepod genome assemblies remain scarce. Lepeophtheirus salmonis is an economically and ecologically important ectoparasitic copepod found on salmonid fish. We present the 695.4 Mbp L. salmonis genome assembly containing ≈60% repetitive regions and 13,081 annotated protein-coding genes. The genome comprises 14 autosomes and a ZZ-ZW sex chromosome system. Assembly assessment identified 92.4% of the expected arthropod genes. Transcriptomics supported annotation and indicated a marked shift in gene expression after host attachment, including apparent downregulation of genes related to circadian rhythm coinciding with abandoning diurnal migration. The genome shows evolutionary signatures including loss of genes needed for peroxisome biogenesis, presence of numerous FNII domains, and an incomplete heme homeostasis pathway suggesting heme proteins to be obtained from the host. Despite repeated development of resistance against chemical treatments L. salmonis exhibits low numbers of many genes involved in detoxification.
Asunto(s)
Copépodos , Enfermedades de los Peces , Parásitos , Aclimatación , Animales , Copépodos/genética , Copépodos/parasitología , Enfermedades de los Peces/genética , Parásitos/genética , TranscriptomaRESUMEN
The post-glacial colonization of Gander Lake in Newfoundland, Canada, by Arctic Charr (Salvelinus alpinus) provides the opportunity to study the genomic basis of adaptation to extreme deep-water environments. Colonization of deep-water (>50 m) habitats often requires extensive adaptation to cope with novel environmental challenges from high hydrostatic pressure, low temperature, and low light, but the genomic mechanisms underlying evolution in these environments are rarely known. Here, we compare genomic divergence between a deep-water morph adapted to depths of up to 288 m and a larger, piscivorous pelagic morph occupying shallower depths. Using both a SNP array and resequencing of whole nuclear and mitochondrial genomes, we find clear genetic divergence (FST = 0.11-0.15) between deep and shallow water morphs, despite an absence of morph divergence across the mitochondrial genome. Outlier analyses identified many diverged genomic regions containing genes enriched for processes such as gene expression and DNA repair, cardiac function, and membrane transport. Detection of putative copy number variants (CNVs) uncovered 385 genes with CNVs distinct to piscivorous morphs, and 275 genes with CNVs distinct to deep-water morphs, enriched for processes associated with synapse assembly. Demographic analyses identified evidence for recent and local morph divergence, and ongoing reductions in diversity consistent with postglacial colonization. Together, these results show that Arctic Charr morph divergence has occurred through genome-wide differentiation and elevated divergence of genes underlying multiple cellular and physiological processes, providing insight into the genomic basis of adaptation in a deep-water habitat following postglacial recolonization.
Asunto(s)
Trucha , Agua , Adaptación Fisiológica/genética , Animales , Genoma , Genómica , Trucha/genéticaRESUMEN
The genetic underpinnings of incipient speciation, including the genomic mechanisms which contribute to morphological and ecological differentiation and reproductive isolation, remain poorly understood. The repeated evolution of consistently, phenotypically distinct morphs of Arctic Charr (Salvelinus alpinus) within the Quaternary period offer an ideal model to study the repeatability of evolution at the genomic level. Sympatric morphs of Arctic Charr are found across this species' circumpolar distribution. However, the specific genetic mechanisms driving this morph differentiation are largely unknown despite the cultural and economic importance of the anadromous morph. We used a newly designed 87k SNP chip to investigate the character and consistency of the genomic differences among sympatric morphs within three recently deglaciated and geographically proximate lakes in Labrador, Canada. We found genetically distinct small and large morph Arctic Charr in all three lakes consistent with resident and anadromous morphs, respectively. A degree of reproductive isolation among sympatric morphs is likely given genome-wide distributions of outlier SNPs and high genome-wide FST s. Across all lakes, outlier SNPs were largely nonoverlapping suggesting a lack of genetic parallelism driving morph differentiation. Alternatively, several genes and paralogous copies of the same gene consistently differentiated morphs across multiple lakes suggesting their importance to the manifestation of morphs. Our results confirm the utility of Arctic Charr as a model for investigating the predictability of evolution and support the importance of both genetic parallelism and nonparallelism to the incipient speciation of Arctic Charr morphs.
Asunto(s)
Lagos , Trucha , Animales , Regiones Árticas , Canadá , Terranova y Labrador , Trucha/genéticaRESUMEN
Wild stocks of Pacific salmonids have experienced sharp declines in abundance over the past century. Consequently, billions of fish are released each year for enhancing abundance and sustaining fisheries. However, the beneficial role of this widely used management practice is highly debated since fitness decrease of hatchery-origin fish in the wild has been documented. Artificial selection in hatcheries has often been invoked as the most likely explanation for reduced fitness, and most studies to date have focused on finding signatures of hatchery-induced selection at the DNA level. We tested an alternative hypothesis, that captive rearing induces epigenetic reprogramming, by comparing genome-wide patterns of methylation and variation at the DNA level in hatchery-reared coho salmon (Oncorhynchus kisutch) with those of their wild counterparts in two geographically distant rivers. We found a highly significant proportion of epigenetic variation explained by the rearing environment that was as high as the one explained by the river of origin. The differentially methylated regions show enrichment for biological functions that may affect the capacity of hatchery-born smolts to migrate successfully in the ocean. Shared epigenetic variation between hatchery-reared salmon provides evidence for parallel epigenetic modifications induced by hatchery rearing in the absence of genetic differentiation between hatchery and natural-origin fish for each river. This study highlights epigenetic modifications induced by captive rearing as a potential explanatory mechanism for reduced fitness in hatchery-reared salmon.
Asunto(s)
Epigénesis Genética , Músculo Esquelético/metabolismo , Oncorhynchus/genética , Animales , Metilación de ADN , Proteínas de Peces/genética , Explotaciones Pesqueras , Ontología de Genes , Anotación de Secuencia Molecular , Músculo Esquelético/crecimiento & desarrollo , Oncorhynchus/crecimiento & desarrollo , Oncorhynchus/metabolismoRESUMEN
Much progress has been made regarding our understanding of aromatase regulation, estrogen synthesis partitioning and communication between the germinal and somatic compartments of the differentiating gonad. We now know that most of the enzymatic and signaling apparatus required for steroidogenesis is endogenously expressed within germ cells. However, less is known about the expression and localization of steroidogenic components within mature spermatozoa. We have assembled a sperm library presenting 197,015 putative transcripts. Co-expression clustering analysis revealed that 6687 genes were present at higher levels in sperm in comparison to fifteen other salmon tissue libraries. The sperm transcriptome is highly complex containing the highest proportion of unannotated genes (45%) of the tissues analyzed. Our analysis of highly expressed genes in late-stage sperm revealed dedication to tasks involving chromatin remodeling, flagellogenesis and proteolysis. In addition, using various different embedding and microscopic techniques, we examined the morphology of salmon spermatozoa and characterized expression and localization of several estrogenic regulatory and signaling proteins by immunohistochemistry. We provide evidence for the endogenous synthesis and localization of aromatase (CYP19A and CYP19B1) and potential mediators of estrogen [i.e., ER-alpha and soluble adenylyl cyclase (sAC)] or phosphate (i.e., CREB and FOXL2A) signaling. Partitioning of select transcripts that encode AR-beta, FSH and the LH receptor, but not AR-alpha, LH or the FSH receptor, further points to localized specificity of function in the steroidogenic circuitry of the sperm cell. These results open new avenues of investigation to further our understanding of the intra- and intercellular regulatory processes that guide sperm development and biology.
Asunto(s)
Estrógenos/metabolismo , Salmo salar/metabolismo , Espermatozoides/citología , Espermatozoides/metabolismo , Animales , Inmunohistoquímica , MasculinoRESUMEN
BACKGROUND: MHC class I (MHCI) molecules are the key presenters of peptides generated through the intracellular pathway to CD8-positive T-cells. In fish, MHCI genes were first identified in the early 1990's, but we still know little about their functional relevance. The expansion and presumed sub-functionalization of cod MHCI and access to many published fish genome sequences provide us with the incentive to undertake a comprehensive study of deduced teleost fish MHCI molecules. RESULTS: We expand the known MHCI lineages in teleosts to five with identification of a new lineage defined as P. The two lineages U and Z, which both include presumed peptide binding classical/typical molecules besides more derived molecules, are present in all teleosts analyzed. The U lineage displays two modes of evolution, most pronouncedly observed in classical-type alpha 1 domains; cod and stickleback have expanded on one of at least eight ancient alpha 1 domain lineages as opposed to many other teleosts that preserved a number of these ancient lineages. The Z lineage comes in a typical format present in all analyzed ray-finned fish species as well as lungfish. The typical Z format displays an unprecedented conservation of almost all 37 residues predicted to make up the peptide binding groove. However, also co-existing atypical Z sub-lineage molecules, which lost the presumed peptide binding motif, are found in some fish like carps and cavefish. The remaining three lineages, L, S and P, are not predicted to bind peptides and are lost in some species. CONCLUSIONS: Much like tetrapods, teleosts have polymorphic classical peptide binding MHCI molecules, a number of classical-similar non-classical MHCI molecules, and some members of more diverged MHCI lineages. Different from tetrapods, however, is that in some teleosts the classical MHCI polymorphism incorporates multiple ancient MHCI domain lineages. Also different from tetrapods is that teleosts have typical Z molecules, in which the residues that presumably form the peptide binding groove have been almost completely conserved for over 400 million years. The reasons for the uniquely teleost evolution modes of peptide binding MHCI molecules remain an enigma.
Asunto(s)
Peces/genética , Genes MHC Clase I , Animales , Peces/clasificación , Antígenos de Histocompatibilidad Clase I/química , Antígenos de Histocompatibilidad Clase I/genética , Filogenia , Análisis de Secuencia de ADNRESUMEN
Inherited symbionts are ubiquitous in insects and can have important consequences for the fitness of their hosts. Many inherited symbionts defend their hosts against parasites or other natural enemies; however, the means by which most symbionts confer protection is virtually unknown. We examine the mechanisms of defence in a recently discovered case of symbiont-mediated protection, where the bacterial symbiont Spiroplasma defends the fly Drosophila neotestacea from a virulent nematode parasite, Howardula aoronymphium. Using quantitative PCR of Spiroplasma infection intensities and whole transcriptome sequencing, we attempt to distinguish between the following modes of defence: symbiont-parasite competition, host immune priming and the production of toxic factors by Spiroplasma. Our findings do not support a model of exploitative competition between Howardula and Spiroplasma to mediate defence, nor do we find strong support for host immune priming during Spiroplasma infection. Interestingly, we recovered sequence for putative toxins encoded by Spiroplasma, including a novel putative ribosome-inactivating protein, transcripts of which are up-regulated in response to nematode exposure. Protection via the production of toxins may be a widely used and important mechanism in heritable defensive symbioses in insects.
Asunto(s)
Drosophila/microbiología , Drosophila/parasitología , Nematodos/patogenicidad , Spiroplasma/fisiología , Simbiosis , Animales , Toxinas Bacterianas/metabolismo , Drosophila/genética , Regulación de la Expresión Génica , Genes Bacterianos , TranscriptomaRESUMEN
The northern pike Esox lucius is a freshwater fish with low genetic diversity but ecological success throughout the Northern Hemisphere. Here, we generate an annotated chromosome-level genome assembly of 941 Mbp in length with 25 chromosome-length scaffolds. We then genotype 47 northern pike from Alaska through New Jersey at a genome-wide scale and characterize a striking decrease in genetic diversity along the sampling range. Individuals west of the North American Continental Divide have substantially higher diversity than those to the east (e.g. Interior Alaska and St. Lawrence River have on average 181 and 64K heterozygous SNPs per individual, or a heterozygous SNP every 5.2 and 14.6 kbp, respectively). Individuals clustered within each population with strong support, with numerous private alleles observed within each population. Evidence for recent population expansion was observed for a Manitoba hatchery and the St. Lawrence population (Tajima's D = -1.07 and -1.30, respectively). Several chromosomes have large regions with elevated diversity, including LG24, which holds amhby, the ancestral sex determining gene. As expected amhby was largely male-specific in Alaska and the Yukon and absent southeast to these populations, but we document some amhby(-) males in Alaska and amhby(+) males in the Columbia River, providing evidence for a patchwork of presence of this system in the western region. These results support the theory that northern pike recolonized North America from refugia in Alaska and expanded following deglaciation from west to east, with probable founder effects resulting in loss of both neutral and functional diversity (e.g. amhby).
Asunto(s)
Esocidae , Variación Genética , Polimorfismo de Nucleótido Simple , Secuenciación Completa del Genoma , Animales , Esocidae/genética , Masculino , Procesos de Determinación del Sexo/genética , Femenino , Genoma , América del Norte , Genética de Población , GenotipoRESUMEN
BACKGROUND: Classical major histocompatibility complex (MHC) class II molecules play an essential role in presenting peptide antigens to CD4+ T lymphocytes in the acquired immune system. The non-classical class II DM molecule, HLA-DM in the case of humans, possesses critical function in assisting the classical MHC class II molecules for proper peptide loading and is highly conserved in tetrapod species. Although the absence of DM-like genes in teleost fish has been speculated based on the results of homology searches, it has not been definitively clear whether the DM system is truly specific for tetrapods or not. To obtain a clear answer, we comprehensively searched class II genes in representative teleost fish genomes and analyzed those genes regarding the critical functional features required for the DM system. RESULTS: We discovered a novel ancient class II group (DE) in teleost fish and classified teleost fish class II genes into three major groups (DA, DB and DE). Based on several criteria, we investigated the classical/non-classical nature of various class II genes and showed that only one of three groups (DA) exhibits classical-type characteristics. Analyses of predicted class II molecules revealed that the critical tryptophan residue required for a classical class II molecule in the DM system could be found only in some non-classical but not in classical-type class II molecules of teleost fish. CONCLUSIONS: Teleost fish, a major group of vertebrates, do not possess the DM system for the classical class II peptide-loading and this sophisticated system has specially evolved in the tetrapod lineage.
Asunto(s)
Presentación de Antígeno , Proteínas de Peces/genética , Peces/genética , Peces/inmunología , Antígenos de Histocompatibilidad Clase II/genética , Secuencia de Aminoácidos , Animales , Proteínas de Peces/química , Proteínas de Peces/inmunología , Genes MHC Clase II , Antígenos de Histocompatibilidad Clase II/química , Antígenos de Histocompatibilidad Clase II/inmunología , Humanos , Datos de Secuencia Molecular , Péptidos/genética , Péptidos/inmunología , Filogenia , Alineación de Secuencia , Vertebrados/genética , Vertebrados/inmunologíaRESUMEN
BACKGROUND: The sablefish (order: Scorpaeniformes) is an economically important species in commercial fisheries of the North Pacific and an emerging species in aquaculture. Aside from a handful of sequences in NCBI and a few published microsatellite markers, little is known about the genetics of this species. The development of genetic tools, including polymorphic markers and a linkage map will allow for the successful development of future broodstock and mapping of phenotypes of interest. The significant sexual dimorphism between females and males makes a genetic test for early identification of sex desirable. RESULTS: A full mitochondrial genome is presented and the resulting phylogenetic analysis verifies the placement of the sablefish within the Scorpaeniformes. Nearly 35,000 assembled transcript sequences are used to identify genes and obtain polymorphic SNP and microsatellite markers. 360 transcribed polymorphic loci from two sablefish families produce a map of 24 linkage groups. The sex phenotype maps to sablefish LG14 of the male map. We show significant conserved synteny and conservation of gene-order between the threespine stickleback Gasterosteus aculeatus and sablefish. An additional 1843 polymorphic SNP markers are identified through next-generation sequencing techniques. Sex-specific markers and sequence insertions are identified immediately upstream of the gene gonadal-soma derived factor (gsdf), the master sex determinant locus in the medaka species Oryzias luzonensis. CONCLUSIONS: The first genomic resources for sablefish provide a foundation for further studies. Over 35,000 transcripts are presented, and the genetic map represents, as far as we can determine, the first linkage map for a member of the Scorpaeniformes. The observed level of conserved synteny and comparative mapping will allow the use of the stickleback genome in future genetic studies on sablefish and other related fish, particularly as a guide to whole-genome assembly. The identification of sex-specific insertions immediately upstream of a known master sex determinant implicates gsdf as an excellent candidate for the master sex determinant for sablefish.
Asunto(s)
Mapeo Cromosómico , Peces/genética , Genómica , Mitocondrias/genética , Filogenia , Caracteres Sexuales , Procesos de Determinación del Sexo/genética , Animales , Femenino , Peces/fisiología , Marcadores Genéticos/genética , Genoma Mitocondrial/genética , Técnicas de Genotipaje , Masculino , Fenotipo , Smegmamorpha/genética , Sintenía/genéticaRESUMEN
Coho salmon (Oncorhynchus kisutch) are a culturally and economically important species that return from multiyear ocean migrations to spawn in rivers that flow to the Northern Pacific Ocean. Southern stocks of coho salmon in Canada and the United States have significantly declined over the past quarter century, and unfortunately, conservation efforts have not reversed this trend. To assist in stock management and conservation efforts, we generated a chromosome-level genome assembly. We also resequenced the genomes of 83 coho salmon across the North American range to identify nucleotide variants and understand the demographic histories of these salmon by modeling effective population size from genome-wide data. From demographic history modeling, we observed reductions in effective population sizes between 3,750 and 8,000 years ago for several northern sampling sites, which may correspond to bottleneck events during recolonization after glacial retreat.
Asunto(s)
Oncorhynchus kisutch , Animales , Oncorhynchus kisutch/genética , Densidad de Población , GenomaRESUMEN
BACKGROUND: There are a growing number of genomes sequenced with tentative functions assigned to a large proportion of the individual genes. Model organisms in laboratory settings form the basis for the assignment of gene function, and the ecological context of gene function is lacking. This work addresses this shortcoming by investigating expressed genes of sockeye salmon (Oncorhynchus nerka) muscle tissue. We compared morphology and gene expression in natural juvenile sockeye populations related to river and lake habitats. Based on previously documented divergent morphology, feeding strategy, and predation in association with these distinct environments, we expect that burst swimming is favored in riverine population and continuous swimming is favored in lake-type population. In turn we predict that morphology and expressed genes promote burst swimming in riverine sockeye and continuous swimming in lake-type sockeye. RESULTS: We found the riverine sockeye population had deep, robust bodies and lake-type had shallow, streamlined bodies. Gene expression patterns were measured using a 16 k microarray, discovering 141 genes with significant differential expression. Overall, the identity and function of these genes was consistent with our hypothesis. In addition, Gene Ontology (GO) enrichment analyses with a larger set of differentially expressed genes found the "biosynthesis" category enriched for the riverine population and the "metabolism" category enriched for the lake-type population. CONCLUSIONS: This study provides a framework for understanding sockeye life history from a transcriptomic perspective and a starting point for more extensive, targeted studies determining the ecological context of genes.
Asunto(s)
Metabolismo Energético/genética , Lagos , Salmón/genética , Animales , Ambiente , Proteínas de Peces/genética , Expresión Génica , Perfilación de la Expresión Génica , Músculos/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Ríos , Salmón/fisiología , NataciónRESUMEN
BACKGROUND: Salmonids are one of the most intensely studied fish, in part due to their economic and environmental importance, and in part due to a recent whole genome duplication in the common ancestor of salmonids. This duplication greatly impacts species diversification, functional specialization, and adaptation. Extensive new genomic resources have recently become available for Atlantic salmon (Salmo salar), but documentation of allelic versus duplicate reference genes remains a major uncertainty in the complete characterization of its genome and its evolution. RESULTS: From existing expressed sequence tag (EST) resources and three new full-length cDNA libraries, 9,057 reference quality full-length gene insert clones were identified for Atlantic salmon. A further 1,365 reference full-length clones were annotated from 29,221 northern pike (Esox lucius) ESTs. Pairwise dN/dS comparisons within each of 408 sets of duplicated salmon genes using northern pike as a diploid out-group show asymmetric relaxation of selection on salmon duplicates. CONCLUSIONS: 9,057 full-length reference genes were characterized in S. salar and can be used to identify alleles and gene family members. Comparisons of duplicated genes show that while purifying selection is the predominant force acting on both duplicates, consistent with retention of functionality in both copies, some relaxation of pressure on gene duplicates can be identified. In addition, there is evidence that evolution has acted asymmetrically on paralogs, allowing one of the pair to diverge at a faster rate.
Asunto(s)
ADN Complementario/genética , Esocidae/genética , Evolución Molecular , Genoma/genética , Poliploidía , Salmo salar/genética , Animales , Secuencia de Bases , Clonación Molecular , Mapeo Contig , Etiquetas de Secuencia Expresada , Duplicación de Gen , Perfilación de la Expresión Génica , Alineación de SecuenciaRESUMEN
The resiliency of populations and species to environmental change is dependent on the maintenance of genetic diversity, and as such, quantifying diversity is central to combating ongoing widespread reductions in biodiversity. With the advent of next-generation sequencing, several methods now exist for resolving fine-scale population structure, but the comparative performance of these methods for genetic assignment has rarely been tested. Here, we evaluate the performance of sequenced microsatellites and a single nucleotide polymorphism (SNP) array to resolve fine-scale population structure in a critically important salmonid in north eastern Canada, Arctic Charr (Salvelinus alpinus). We also assess the utility of sequenced microsatellites for fisheries applications by quantifying the spatial scales of movement and exploitation through genetic assignment of fishery samples to rivers of origin and comparing these results with a 29-year tagging dataset. Self-assignment and simulation-based analyses of 111 genome-wide microsatellite loci and 500 informative SNPs from 28 populations of Arctic Charr in north-eastern Canada identified largely river-specific genetic structure. Despite large differences (~4X) in the number of loci surveyed between panels, mean self-assignment accuracy was similar with the microsatellite loci and the SNP panel (>90%). Subsequent analysis of 996 fishery-collected samples using the microsatellite panel revealed that larger rivers contribute greater numbers of individuals to the fishery and that coastal fisheries largely exploit individuals originating from nearby rivers, corroborating results from traditional tagging experiments. Our results demonstrate the efficacy of sequence-based microsatellite genotyping to advance understanding of fine-scale population structure and harvest composition in northern and understudied species.
RESUMEN
The estimation of linkage disequilibrium between molecular markers within a population is critical when establishing the minimum number of markers required for association studies, genomic selection, and inferring historical events influencing different populations. This work aimed to evaluate the extent and decay of linkage disequilibrium in a coho salmon breeding population using a high-density SNP array. Linkage disequilibrium was estimated between a total of 93,502 SNPs found in 64 individuals (33 dams and 31 sires) from the breeding population. The markers encompass all 30 coho salmon chromosomes and comprise 1,684.62 Mb of the genome. The average density of markers per chromosome ranged from 48.31 to 66 per 1 Mb. The minor allele frequency averaged 0.26 (with a range from 0.22 to 0.27). The overall average linkage disequilibrium among SNPs pairs measured as r 2 was 0.10. The Average r 2 value decreased with increasing physical distance, with values ranging from 0.21 to 0.07 at a distance lower than 1 kb and up to 10 Mb, respectively. An r 2 threshold of 0.2 was reached at distance of approximately 40 Kb. Chromosomes Okis05, Okis15 and Okis28 showed high levels of linkage disequilibrium (>0.20 at distances lower than 1 Mb). Average r 2 values were lower than 0.15 for all chromosomes at distances greater than 4 Mb. An effective population size of 43 was estimated for the population 10 generations ago, and 325, for 139 generations ago. Based on the effective number of chromosome segments, we suggest that at least 74,000 SNPs would be necessary for an association mapping study and genomic predictions. Therefore, the SNP panel used allowed us to capture high-resolution information in the farmed coho salmon population. Furthermore, based on the contemporary N e, a new mate allocation strategy is suggested to increase the effective population size.
RESUMEN
We have generated a high-density, high-throughput genotyping array for characterizing genome-wide variation in Arctic charr (Salvelinus alpinus). Novel single nucleotide polymorphisms (SNPs) were identified in charr from the Fraser, Nauyuk and Tree River aquaculture strains, which originated from northern Canada and fish from Iceland using high coverage sequencing, reduced representation sequencing and RNA-seq datasets. The array was designed to capture genome-wide variation from a diverse suite of Arctic charr populations. Cross validation of SNPs from various sources and comparison with previously published Arctic charr SNP data provided a set of candidate SNPs that generalize across populations. Further candidate SNPs were identified based on minor allele frequency, association with RNA transcripts, even spacing across intergenic regions and association with the sex determining (sdY) gene. The performance of the 86,503 SNP array was assessed by genotyping Fraser, Nauyuk and Tree River strain individuals, as well as wild Icelandic Arctic charr. Overall, 63,060 of the SNPs were polymorphic within at least one group and 36.8% were unique to one of the four groups, suggesting that the array design allows for characterization of both within and across population genetic diversity. The concordance between sdY markers and known phenotypic sex indicated that the array can accurately determine the sex of individuals based on genotype alone. The Salp87k genotyping array provides researchers and breeders the opportunity to analyze genetic variation in Arctic charr at a more detailed level than previously possible.
Asunto(s)
ADN Intergénico/genética , Técnicas de Genotipaje , Análisis de Secuencia por Matrices de Oligonucleótidos , Polimorfismo de Nucleótido Simple , Trucha/genética , Animales , Canadá , Femenino , MasculinoRESUMEN
BACKGROUND: Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most widely studied groups of fish. RESULTS: 298,304 expressed sequence tags (ESTs) from Atlantic salmon (69% of the total), 11,664 chinook, 10,813 sockeye, 10,051 brook trout, 10,975 grayling, 8,630 lake whitefish, and 3,624 northern pike ESTs were obtained in this study and have been deposited into the public databases. Contigs were built and putative full-length Atlantic salmon clones have been identified. A database containing ESTs, assemblies, consensus sequences, open reading frames, gene predictions and putative annotation is available. The overall similarity between Atlantic salmon ESTs and those of rainbow trout, chinook, sockeye, brook trout, grayling, lake whitefish, northern pike and rainbow smelt is 93.4, 94.2, 94.6, 94.4, 92.5, 91.7, 89.6, and 86.2% respectively. An analysis of 78 transcript sets show Salmo as a sister group to Oncorhynchus and Salvelinus within Salmoninae, and Thymallinae as a sister group to Salmoninae and Coregoninae within Salmonidae. Extensive gene duplication is consistent with a genome duplication in the common ancestor of salmonids. Using all of the available EST data, a new expanded salmonid cDNA microarray of 32,000 features was created. Cross-species hybridizations to this cDNA microarray indicate that this resource will be useful for studies of all 68 salmonid species. CONCLUSION: An extensive collection and analysis of salmonid RNA putative transcripts indicate that Pacific salmon, Atlantic salmon and charr are 94-96% similar while the more distant whitefish, grayling, pike and smelt are 93, 92, 89 and 86% similar to salmon. The salmonid transcriptome reveals a complex history of gene duplication that is consistent with an ancestral salmonid genome duplication hypothesis. Genome resources, including a new 32 K microarray, provide valuable new tools to study salmonids.