RESUMEN
BACKGROUND: We have previously sequenced more than 500 kb of the duplicated MHC class I regions in Atlantic salmon. In the IA region we identified the loci for the MHC class I gene Sasa-UBA in addition to a soluble MHC class I molecule, Sasa-ULA. A pseudolocus for Sasa-UCA was identified in the nonclassical IB region. Both regions contained genes for antigen presentation, as wells as orthologues to other genes residing in the human MHC region. RESULTS: The genomic localisation of two MHC class I lineages (Z and S) has been resolved. 7 BACs were sequenced using a combination of standard Sanger and 454 sequencing. The new sequence data extended the IA region with 150 kb identifying the location of one Z-lineage locus, ZAA. The IB region was extended with 350 kb including three new Z-lineage loci, ZBA, ZCA and ZDA in addition to a UGA locus. An allelic version of the IB region contained a functional UDA locus in addition to the UCA pseudolocus. Additionally a BAC harbouring two MHC class I genes (UHA) was placed on linkage group 14, while a BAC containing the S-lineage locus SAA (previously known as UAA) was placed on LG10. Gene expression studies showed limited expression range for all class I genes with exception of UBA being dominantly expressed in gut, spleen and gills, and ZAA with high expression in blood. CONCLUSION: Here we describe the genomic organization of MHC class I loci from the U-, Z-, and S-lineages in Atlantic salmon. Nine of the described class I genes are located in the extension of the duplicated IA and IB regions, while three class I genes are found on two separate linkage groups. The gene organization of the two regions indicates that the IB region is evolving at a different pace than the IA region. Expression profiling, polymorphic content, peptide binding properties and phylogenetic relationship show that Atlantic salmon has only one MHC class Ia gene (UBA), in addition to a multitude of nonclassical MHC class I genes from the U-, S- and Z-lineages.
Asunto(s)
Proteínas de Peces/genética , Genes MHC Clase I , Salmo salar/genética , Alelos , Secuencia de Aminoácidos , Animales , Betahistina , Mapeo Cromosómico , Cromosomas Artificiales Bacterianos , Biología Computacional , Perfilación de la Expresión Génica , Orden Génico , Ligamiento Genético , Biblioteca Genómica , Genómica/métodos , Datos de Secuencia Molecular , Filogenia , Salmo salar/inmunología , Análisis de Secuencia de ADNRESUMEN
The Atlantic salmon (Salmo salar) T-cell receptor gamma (TCRgamma) gene has extensive diversity in its capacity for antigen recognition due to the V-J-C gene segments recombinational possibilities, and N-region diversity. This is the first report completely characterizing and annotating a TCRgamma gene locus in teleosts. We identified two different TCRgamma loci in Atlantic salmon. The first locus, TCRgamma 1, spans 260 kbp and contains four tandemly repeated clusters each of which consists of 1-4 variable (V) segments, 1-2 sets of a joining (J) segment and a constant (C) region. In total, 10 V segments, 5 J segments and 5 C regions were found in locus 1. In the second locus, TCRgamma 2, a single non-expressed V-J-C cluster was found. Surprisingly, the Atlantic salmon TCRgamma loci have a larger number of C regions relative to other teleosts, mouse and human. In addition, each Atlantic salmon TCRgamma C region has a different connecting peptide region that may result in distinct cellular responses. Expression data confirm the diverse repertoire found at the genomic level. At least 6 out of 7 functional V segments, all 5 J segments and all 5 C regions found in TCRgamma 1 were identified in TCRgamma transcription analysis. The identification and characterization of the functional TCRgamma 1 along with the associated TCR alpha/delta locus suggest that Atlantic salmon have a functional gammadelta T-cell immune component.
Asunto(s)
Variación Genética , Receptores de Antígenos de Linfocitos T gamma-delta/genética , Salmo salar/genética , Salmo salar/inmunología , Secuencia de Aminoácidos , Animales , Secuencia de Bases , Células Clonales , ADN Complementario/análisis , ADN Complementario/genética , Etiquetas de Secuencia Expresada , Perfilación de la Expresión Génica , Genoma , Datos de Secuencia Molecular , Filogenia , Receptores de Antígenos de Linfocitos T gamma-delta/química , Receptores de Antígenos de Linfocitos T gamma-delta/metabolismo , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: Growth hormone (GH) is an important regulator of skeletal growth, as well as other adapted processes in salmonids. The GH gene (gh) in salmonids is represented by duplicated, non-allelic isoforms designated as gh1 and gh2. We have isolated and characterized gh-containing bacterial artificial chromosomes (BACs) of both Atlantic and Chinook salmon (Salmo salar and Oncorhynchus tshawytscha) in order to further elucidate our understanding of the conservation and regulation of these loci. RESULTS: BACs containing gh1 and gh2 from both Atlantic and Chinook salmon were assembled, annotated, and compared to each other in their coding, intronic, regulatory, and flanking regions. These BACs also contain the genes for skeletal muscle sodium channel oriented in the same direction. The sequences of the genes for interferon alpha-1, myosin alkali light chain and microtubule associated protein Tau were also identified, and found in opposite orientations relative to gh1 and gh2. Viability of each of these genes was examined by PCR. We show that transposon insertions have occurred differently in the promoters of gh, within and between each species. Other differences within the promoters and intronic and 3'-flanking regions of the four gh genes provide evidence that they have distinct regulatory modes and possibly act to function differently and/or during different times of salmonid development. CONCLUSION: A core proximal promoter for transcription of both gh1 and gh2 is conserved between the two species of salmon. Nevertheless, transposon integration and regulatory element differences do exist between the promoters of gh1 and gh2. Additionally, organization of transposon families into the BACs containing gh1 and for the BACs containing gh2, are very similar within orthologous regions, but much less clear conservation is apparent in comparisons between the gh1- and gh2-containing paralogous BACs for the two fish species. This is consistent with the hypothesis that a burst of transposition activity occurred during the speciation events which led to Atlantic and Pacific salmon. The Chinook and other Oncorhynchus GH1s are strikingly different in comparison to the other GHs and this change is not apparent in the surrounding non-coding sequences.
Asunto(s)
Elementos Transponibles de ADN , Evolución Molecular , Hormona del Crecimiento/genética , Salmón/genética , Secuencia de Aminoácidos , Animales , Cromosomas Artificiales Bacterianos , Mapeo Contig , ADN Complementario/genética , Biblioteca de Genes , Intrones , Datos de Secuencia Molecular , Regiones Promotoras Genéticas , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Alineación de Secuencia , Análisis de Secuencia de ADN , Especificidad de la EspecieRESUMEN
BACKGROUND: Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most widely studied groups of fish. RESULTS: 298,304 expressed sequence tags (ESTs) from Atlantic salmon (69% of the total), 11,664 chinook, 10,813 sockeye, 10,051 brook trout, 10,975 grayling, 8,630 lake whitefish, and 3,624 northern pike ESTs were obtained in this study and have been deposited into the public databases. Contigs were built and putative full-length Atlantic salmon clones have been identified. A database containing ESTs, assemblies, consensus sequences, open reading frames, gene predictions and putative annotation is available. The overall similarity between Atlantic salmon ESTs and those of rainbow trout, chinook, sockeye, brook trout, grayling, lake whitefish, northern pike and rainbow smelt is 93.4, 94.2, 94.6, 94.4, 92.5, 91.7, 89.6, and 86.2% respectively. An analysis of 78 transcript sets show Salmo as a sister group to Oncorhynchus and Salvelinus within Salmoninae, and Thymallinae as a sister group to Salmoninae and Coregoninae within Salmonidae. Extensive gene duplication is consistent with a genome duplication in the common ancestor of salmonids. Using all of the available EST data, a new expanded salmonid cDNA microarray of 32,000 features was created. Cross-species hybridizations to this cDNA microarray indicate that this resource will be useful for studies of all 68 salmonid species. CONCLUSION: An extensive collection and analysis of salmonid RNA putative transcripts indicate that Pacific salmon, Atlantic salmon and charr are 94-96% similar while the more distant whitefish, grayling, pike and smelt are 93, 92, 89 and 86% similar to salmon. The salmonid transcriptome reveals a complex history of gene duplication that is consistent with an ancestral salmonid genome duplication hypothesis. Genome resources, including a new 32 K microarray, provide valuable new tools to study salmonids.
Asunto(s)
Bases de Datos Genéticas , Etiquetas de Secuencia Expresada , Duplicación de Gen , Filogenia , Salmonidae/genética , Animales , Mapeo Contig , Evolución Molecular , Perfilación de la Expresión Génica , Genoma , Análisis de Secuencia por Matrices de Oligonucleótidos , Análisis de Secuencia de ADN , Especificidad de la EspecieRESUMEN
The complete TCR alpha/delta locus of Atlantic salmon (Salmo salar) has been characterized and annotated. In the 900 kb TCR alpha/delta locus, 292 Valpha/delta segments and 123 Jalpha/delta segments were identified. Of these, 128 Valpha/delta, 113 Jalpha, and a Jdelta segment appeared to be functional as they lacked frame shifts or stop codons. This represents the largest repertoire of Valpha/delta and Jalpha segments of any organism to date. The 128 functional Valpha/delta segments could be grouped into 29 subgroups based upon 70% nucleotide similarity. Expression data confirmed the usage of the diverse repertoire found at the genomic level. At least 99 Valpha, 13 Vdelta 86 Jalpha, 1 Jdelta, and 2 Ddelta segments were used in TCR alpha or delta transcription, and 652 unique genes were identified from a sample of 759 TCRalpha cDNA clones. Cumulatively, the genomic and expression data suggest that the Atlantic salmon T-cell receptor has enormous capacity to recognize a wide diversity of antigens.
Asunto(s)
Genes Codificadores de la Cadena alfa de los Receptores de Linfocito T/genética , Genes Codificadores de la Cadena delta de los Receptores de Linfocito T/genética , Salmo salar/genética , Alelos , Animales , Secuencia de Bases , Regiones Determinantes de Complementariedad/genética , ADN Complementario/química , ADN Complementario/genética , Perfilación de la Expresión Génica , Biblioteca de Genes , Orden Génico , Genes Codificadores de la Cadena beta de los Receptores de Linfocito T/genética , Genes Codificadores de la Cadena gamma de los Receptores de Linfocito T/genética , Datos de Secuencia Molecular , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Salmo salar/inmunología , Análisis de Secuencia de ADN , Homología de Secuencia de Ácido NucleicoRESUMEN
Nuclear deoxyribonucleic acid sequences from approximately 15,000 salmon louse expressed sequence tags (ESTs), the complete mitochondrial genome (16,148bp) of salmon louse, and 16S ribosomal ribonucleic acid (rRNA) and cytochrome oxidase subunit I (COI) genes from 68 salmon lice collected from Japan, Alaska, and western Canada support a Pacific lineage of Lepeophtheirus salmonis that is distinct from that occurring in the Atlantic Ocean. On average, nuclear genes are 3.2% different, the complete mitochondrial genome is 7.1% different, and 16S rRNA and COI genes are 4.2% and 6.1% different, respectively. Reduced genetic diversity within the Pacific form of L. salmonis is consistent with an introduction into the Pacific from the Atlantic Ocean. The level of divergence is consistent with the hypothesis that the Pacific form of L. salmonis coevolved with Pacific salmon (Onchorhynchus spp.) and the Atlantic form coevolved with Atlantic salmonids (Salmo spp.) independently for the last 2.5-11 million years. The level of genetic divergence coincides with the opportunity for migration of fish between the Atlantic and Pacific Ocean basins via the Arctic Ocean with the opening of the Bering Strait, approximately 5 million years ago. The genetic differences may help explain apparent differences in pathogenicity and environmental sensitivity documented for the Atlantic and Pacific forms of L. salmonis.
Asunto(s)
Copépodos/genética , ADN Mitocondrial/genética , Etiquetas de Secuencia Expresada , Animales , Complejo IV de Transporte de Electrones/genética , Evolución Molecular , Biblioteca de Genes , Genes Mitocondriales , Variación Genética , Genética de Población , Genoma Mitocondrial , Mitocondrias/genética , Océano Pacífico , Filogenia , ARN Ribosómico 16S/genética , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: We have previously identified associations between major histocompatibility complex (MHC) class I and resistance towards bacterial and viral pathogens in Atlantic salmon. To evaluate if only MHC or also closely linked genes contributed to the observed resistance we ventured into sequencing of the duplicated MHC class I regions of Atlantic salmon. RESULTS: Nine BACs covering more than 500 kb of the two duplicated MHC class I regions of Atlantic salmon were sequenced and the gene organizations characterized. Both regions contained the proteasome components PSMB8, PSMB9, PSMB9-like and PSMB10 in addition to the transporter for antigen processing TAP2, as well as genes for KIFC1, ZBTB22, DAXX, TAPBP, BRD2, COL11A2, RXRB and SLC39A7. The IA region contained the recently reported MHC class I Sasa-ULA locus residing approximately 50 kb upstream of the major Sasa-UBA locus. The duplicated class IB region contained an MHC class I locus resembling the rainbow trout UCA locus, but although transcribed it was a pseudogene. No other MHC class I-like genes were detected in the two duplicated regions. Two allelic BACs spanning the UBA locus had 99.2% identity over 125 kb, while the IA region showed 82.5% identity over 136 kb to the IB region. The Atlantic salmon IB region had an insert of 220 kb in comparison to the IA region containing three chitin synthase genes. CONCLUSION: We have characterized the gene organization of more than 500 kb of the two duplicated MHC class I regions in Atlantic salmon. Although Atlantic salmon and rainbow trout are closely related, the gene organization of their IB region has undergone extensive gene rearrangements. The Atlantic salmon has only one class I UCA pseudogene in the IB region while trout contains the four MHC UCA, UDA, UEA and UFA class I loci. The large differences in gene content and most likely function of the salmon and trout class IB region clearly argues that sequencing of salmon will not necessarily provide information relevant for trout and vice versa.
Asunto(s)
Genes MHC Clase I , Salmo salar/genética , Secuencia de Aminoácidos , Animales , Presentación de Antígeno/genética , Evolución Molecular , Duplicación de Gen , Genoma , Datos de Secuencia Molecular , Regiones Promotoras Genéticas , Alineación de SecuenciaRESUMEN
We report 80,388 ESTs from 23 Atlantic salmon (Salmo salar) cDNA libraries (61,819 ESTs), 6 rainbow trout (Oncorhynchus mykiss) cDNA libraries (14,544 ESTs), 2 chinook salmon (Oncorhynchus tshawytscha) cDNA libraries (1317 ESTs), 2 sockeye salmon (Oncorhynchus nerka) cDNA libraries (1243 ESTs), and 2 lake whitefish (Coregonus clupeaformis) cDNA libraries (1465 ESTs). The majority of these are 3' sequences, allowing discrimination between paralogs arising from a recent genome duplication in the salmonid lineage. Sequence assembly reveals 28,710 different S. salar, 8981 O. mykiss, 1085 O. tshawytscha, 520 O. nerka, and 1176 C. clupeaformis putative transcripts. We annotate the submitted portion of our EST database by molecular function. Higher- and lower-molecular-weight fractions of libraries are shown to contain distinct gene sets, and higher rates of gene discovery are associated with higher-molecular weight libraries. Pyloric caecum library group annotations indicate this organ may function in redox control and as a barrier against systemic uptake of xenobiotics. A microarray is described, containing 7356 salmonid elements representing 3557 different cDNAs. Analyses of cross-species hybridizations to this cDNA microarray indicate that this resource may be used for studies involving all salmonids.