ABSTRACT
Chinese indicine cattle harbor a much higher genetic diversity compared with other domestic cattle, but their genome architecture remains uninvestigated. Using PacBio HiFi sequencing data from 10 Chinese indicine cattle across southern China, we assembled 20 high-quality partially phased genomes and integrated them into a multiassembly graph containing 148.5 Mb (5.6%) of novel sequence. We identified 156,009 high-confidence nonredundant structural variants (SVs) and 206 SV hotspots spanning ~195 Mb of gene-rich sequence. We detected 34,249 archaic introgressed fragments in Chinese indicine cattle covering 1.93 Gb (73.3%) of the genome. We inferred an average of 3.8%, 3.2%, 1.4%, and 0.5% of introgressed sequence originating, respectively, from banteng-like, kouprey-like, gayal-like, and gaur-like Bos species, as well as 0.6% of unknown origin. Introgression from multiple donors might have contributed to the genetic diversity of Chinese indicine cattle. Altogether, this study highlights the contribution of interspecies introgression to the genomic architecture of an important livestock population and shows how exotic genomic elements can contribute to the genetic variation available for selection.
Subject(s)
Cattle, Ruminants, Animals, Cattle/genetics, China, Genome, Genomics, Ruminants/genetics
ABSTRACT
Structural variations (SVs) are a major contributor to genetic diversity and phenotypic variations, but their prevalence and functions in domestic animals are largely unexplored. Here we generated high-quality genome assemblies for 15 individuals from genetically diverse sheep breeds using Pacific Biosciences (PacBio) high-fidelity sequencing, discovering 130.3 Mb nonreference sequences, from which 588 genes were annotated. A total of 149,158 biallelic insertions/deletions, 6531 divergent alleles, and 14,707 multiallelic variations with precise breakpoints were discovered. The SV spectrum is characterized by an excess of derived insertions compared to deletions (94,422 vs. 33,571), suggesting recent active LINE expansions in sheep. Nearly half of the SVs display low to moderate linkage disequilibrium with surrounding single-nucleotide polymorphisms (SNPs) and most SVs cannot be tagged by SNP probes from the widely used ovine 50K SNP chip. We identified 865 population-stratified SVs including 122 SVs possibly derived in the domestication process among 690 individuals from sheep breeds worldwide. A novel 168-bp insertion in the 5' untranslated region (5' UTR) of HOXB13 is found at high frequency in long-tailed sheep. Further genome-wide association study and gene expression analyses suggest that this mutation is causative for the long-tail trait. In summary, we have developed a panel of high-quality de novo assemblies and present a catalog of structural variations in sheep. Our data capture abundant candidate functional variations that were previously unexplored and provide a fundamental resource for understanding trait biology in sheep.
Subject(s)
Genome-Wide Association Study, Tail, Animals, Sheep/genetics, 5' Untranslated Regions, Alleles, Phenotype
ABSTRACT
A cattle pangenome representation was created based on the genome sequences of 898 cattle representing 57 breeds. The pangenome identified 83 Mb of sequence not found in the cattle reference genome, representing 3.1% novel sequence compared with the 2.71-Gb reference. A catalog of structural variants developed from this cattle population identified 3.3 million deletions, 0.12 million inversions, and 0.18 million duplications. Estimates of breed ancestry and hybridization between cattle breeds using insertion/deletions as markers were similar to those produced by single nucleotide polymorphism-based analysis. Hundreds of deletions were observed to have stratification based on subspecies and breed. For example, an insertion of a Bov-tA1 repeat element was identified in the first intron of the APPL2 gene and correlated with cattle breed geographic distribution. This insertion falls within a segment overlapping predicted enhancer and promoter regions of the gene, and could affect important traits such as immune response, olfactory functions, cell proliferation, and glucose metabolism in muscle. The results indicate that pangenomes are a valuable resource for studying diversity and evolutionary history, and help to delineate how domestication, trait-based breeding, and adaptive introgression have shaped the cattle genome.
ABSTRACT
The community-based breeding program (CBBP) is an innovative approach recommended for genetic improvement and sustainable use of animal genetic resources in extensive farming systems. Successful implementation of this approach requires an understanding of the characteristics of production systems, breeding objectives, and farmers' trait preferences. This study aimed to identify the selection criteria of goat farmers in rural areas of Burkina Faso and their potential implications for establishing CBBPs. Following focus group discussions, a well-structured questionnaire was designed and administered to 372 randomly selected goat farmers in two different agro-ecological zones. A list of traits obtained during focus group discussions was provided to farmers individually, and they were asked to rank the ones they preferentially use to select breeding animals. Statistical tests were conducted to compare data between the two agro-ecological zones. The results showed that the average goat flock size per household was higher (P < 0.05) in the Sudanian area (15.68 ± 13.76) than in the Sudano-Sahelian area (12.93 ± 13.3). Adult females were the dominant age-sex group in both areas. Reasons for culling, keeping breeding bucks, and castration practice were significantly different (P < 0.05) between agro-ecological zones. The most important common selection criteria in the two zones were body size, coat color, and growth rate for both bucks and does, while fertility (0.06) parameters including twinning ability (0.18), kidding frequency (0.11), and mothering ability (0.15) were additionally considered when selecting breeding does. These findings provide valuable insights for developing CBBPs tailored to goat production in the study areas.
Subject(s)
Breeding, Goats, Animals, Female, Humans, Burkina Faso, Farmers, Farms, Male
ABSTRACT
By uniformly analyzing 723 RNA-seq data from 91 tissues and cell types, we built a comprehensive gene atlas and studied tissue specificity of genes in cattle. We demonstrated that tissue-specific genes significantly reflected the tissue-relevant biology, showing distinct promoter methylation and evolution patterns (e.g., brain-specific genes evolve slowest, whereas testis-specific genes evolve fastest). Through integrative analyses of those tissue-specific genes with large-scale genome-wide association studies, we detected relevant tissues/cell types and candidate genes for 45 economically important traits in cattle, including blood/immune system (e.g., CCDC88C) for male fertility, brain (e.g., TRIM46 and RAB6A) for milk production, and multiple growth-related tissues (e.g., FGF6 and CCND2) for body conformation. We validated these findings by using epigenomic data across major somatic tissues and sperm. Collectively, our findings provided novel insights into the genetic and biological mechanisms underlying complex traits in cattle, and our transcriptome atlas can serve as a primary source for biological interpretation, functional validation, studies of adaptive evolution, and genomic improvement in livestock.
Subject(s)
Cattle/genetics, Transcriptome, Animals, Cattle/growth & development, Cattle/physiology, DNA Methylation, Female, Genes, Milk, Organ Specificity, RNA-Seq, Reproduction
ABSTRACT
BACKGROUND: The gaur (Bos gaurus) is the largest extant wild bovine species, native to South and Southeast Asia, with unique traits, and is listed as vulnerable by the International Union for Conservation of Nature (IUCN). RESULTS: We report the first gaur reference genome and identify three biological pathways including lysozyme activity, proton transmembrane transporter activity, and oxygen transport with significant changes in gene copy number in gaur compared to other mammals. These may reflect adaptation to challenges related to climate and nutrition. Comparative analyses with domesticated indicine (Bos indicus) and taurine (Bos taurus) cattle revealed genomic signatures of artificial selection, including the expansion of sperm odorant receptor genes in domesticated cattle, which may have important implications for understanding selection for male fertility. CONCLUSIONS: Apart from aiding dissection of economically important traits, the gaur genome will also provide the foundation to conserve the species.
Subject(s)
Odorant Receptors, Animals, Cattle/genetics, Genome, Genomics, Male, Mammals, Odorant Receptors/genetics, Spermatozoa, Zona Pellucida Glycoproteins
ABSTRACT
BACKGROUND: Copy number variation (CNV) has been routinely studied using bulk-cell sequencing. However, CNV is not well studied on the single-cell level except for humans and a few model organisms. RESULTS: We sequenced 143 single sperms of two Holstein bulls, from which we predicted CNV events using 14 single sperms with deep sequencing. We then compared the CNV results derived from single sperms with the bulk-cell sequencing of one bull's family trio of diploid genomes. As a known CNV hotspot, segmental duplications were also predicted using the bovine ARS-UCD1.2 genome. Although the trio CNVs validated only some single sperm CNVs, they still showed a distal chromosomal distribution pattern and significant associations with segmental duplications and satellite repeats. CONCLUSION: Our preliminary results pointed out future research directions and highlighted the importance of uniform whole genome amplification, deep sequence coverage, and dedicated software pipelines for CNV detection using single cell sequencing data.
Subject(s)
DNA Copy Number Variations, Genome, Animals, Cattle/genetics, Male, Genomic Segmental Duplications, DNA Sequence Analysis/methods, Spermatozoa
ABSTRACT
BACKGROUND: Meiotic recombination is one of the important phenomena contributing to gamete genome diversity. However, except for humans and a few model organisms, it is not well studied in livestock, including cattle. RESULTS: To investigate the distribution of recombination events in the cattle sperm genome, we sequenced 143 single sperms from two Holstein bulls. We mapped meiotic recombination events at high resolution based on phased heterozygous single nucleotide polymorphisms (SNPs). In the absence of evolutionary selection pressure in fertilization and survival, recombination events in sperm are enriched near distal chromosomal ends, revealing that such a pattern is intrinsic to the molecular mechanism of meiosis. We further validated these findings in single sperms using results derived from sequencing a family trio of diploid genomes and from our previous studies of recombination in cattle. CONCLUSIONS: To our knowledge, this is the first large-scale single sperm whole-genome sequencing effort in livestock, which provided useful information for future studies of recombination, genome instability, and male infertility.
Subject(s)
Meiosis, Genetic Recombination, Animals, Cattle/genetics, Chromosome Mapping, Male, Meiosis/genetics, Single Nucleotide Polymorphism, DNA Sequence Analysis/methods, Spermatozoa
ABSTRACT
By their paternal transmission, Y-chromosomal haplotypes are sensitive markers of population history and male-mediated introgression. Previous studies identified biallelic single-nucleotide variants in the SRY, ZFY and DDX3Y genes, which in domestic goats identified four major Y-chromosomal haplotypes, Y1A, Y1B, Y2A and Y2B, with a marked geographical partitioning. Here, we extracted goat Y-chromosomal variants from whole-genome sequences of 386 domestic goats (75 breeds) and seven wild goat species, which were generated by the VarGoats goat genome project. Phylogenetic analyses indicated domestic haplogroups corresponding to Y1B, Y2A and Y2B, respectively, whereas Y1A is split into Y1AA and Y1AB. All five haplogroups were detected in 26 ancient DNA samples from southeast Europe or Asia. Haplotypes from present-day bezoars are not shared with domestic goats and are attached to deep nodes of the trees and networks. Haplogroup distributions for 186 domestic breeds indicate ancient paternal population bottlenecks and expansions during migrations into northern Europe, eastern and southern Asia, and Africa south of the Sahara. In addition, sharing of haplogroups indicates male-mediated introgressions, most notably an early gene flow from Asian goats into Madagascar and the crossbreeding that in the 19th century resulted in the popular Boer and Anglo-Nubian breeds. More recent introgressions are those from European goats into the native Korean goat population and from Boer goat into Uganda, Kenya, Tanzania, Malawi and Zimbabwe. This study illustrates the power of the Y-chromosomal variants for reconstructing the history of domestic species with a wide geographical range.
Subject(s)
Mitochondrial DNA, Genetic Variation, Animals, Mitochondrial DNA/genetics, Goats/genetics, Haplotypes/genetics, Phylogeny, Y Chromosome/genetics
ABSTRACT
BACKGROUND: In evolutionary theory, divergence and speciation can arise from long periods of reproductive isolation, genetic mutation, selection and environmental adaptation. After divergence, alleles can persist in their initial state (ancestral alleles, AA), co-exist with a mutated state, or be replaced by it (derived alleles, DA). In this study, we aligned whole genome sequences of individuals from the Bovinae subfamily to the cattle reference genome (ARS.UCD-1.2) to define the ancestral alleles needed for studies of selection signatures. RESULTS: To accommodate the independent divergence of each lineage from the initial ancestral state, AA were defined as alleles fixed in at least two of three groups (yak, bison and gayal-gaur-banteng), resulting in ~32.4 million variants. Using non-overlapping scanning windows of 10 Kb, we counted the AA observed within taurine and zebu cattle. We focused on the extremes: regions in the top 0.1% of counts (high count) and regions without any occurrence of AA (null count). High count regions preserved gene functions from ancestral states that are still beneficial under current conditions, while null count regions were linked to mutated functions. For both cattle, high count regions were associated with basal lipid metabolism, essential for survival under various environmental pressures. Mutated regions were associated with productive traits in taurine (i.e., higher metabolism, cell development and behaviors) and with the immune response domain in zebu. CONCLUSIONS: Our findings suggest that the retention and loss of AA vary among regions and are species-specific, with possible overlap, depending on the selective pressures each species has experienced.
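The windowed counting described in this abstract can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the authors' pipeline: the function names, the 1-based position convention, and the tie handling are hypothetical, and only the 10 Kb window size and the 0.1% threshold come from the abstract.

```python
WINDOW = 10_000  # 10 Kb non-overlapping windows, as stated in the abstract


def window_counts(positions, chrom_length, window=WINDOW):
    """Count ancestral-allele (AA) positions per non-overlapping window.

    `positions` are assumed to be 1-based coordinates on one chromosome.
    """
    n_windows = (chrom_length + window - 1) // window  # ceiling division
    counts = [0] * n_windows
    for pos in positions:
        counts[(pos - 1) // window] += 1
    return counts


def extreme_windows(counts, top_fraction=0.001):
    """Return (high, null): indices of the top-fraction windows by AA count
    and indices of windows with no AA at all."""
    n_top = max(1, int(len(counts) * top_fraction))  # keep at least one window
    ranked = sorted(range(len(counts)), key=lambda i: counts[i], reverse=True)
    high = sorted(ranked[:n_top])
    null = [i for i, c in enumerate(counts) if c == 0]
    return high, null


# Toy usage with made-up positions on a hypothetical 50 Kb chromosome
counts = window_counts([1, 5, 10_000, 10_001, 35_000], chrom_length=50_000)
high, null = extreme_windows(counts)
```

On real data the threshold would be applied genome-wide rather than per chromosome, and ties at the cutoff would need an explicit rule.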
Subject(s)
Bison, Ruminants, Alleles, Animals, Biological Evolution, Bison/genetics, Cattle/genetics, Phenotype, Ruminants/genetics
ABSTRACT
BACKGROUND: Basenjis are considered an ancient dog breed of central African origins that still live and hunt with tribesmen in the African Congo. Nicknamed the barkless dog, Basenjis possess unique phylogeny, geographical origins and traits, making their genome structure of great interest. The increasing number of available canid reference genomes allows us to examine the impact of reference genome choice with regard to reference genome quality and breed relatedness. RESULTS: Here, we report two high quality de novo Basenji genome assemblies: a female, China (CanFam_Bas), and a male, Wags. We conduct pairwise comparisons and report structural variations between assembled genomes of three dog breeds: Basenji (CanFam_Bas), Boxer (CanFam3.1) and German Shepherd Dog (GSD) (CanFam_GSD). CanFam_Bas is superior to CanFam3.1 in terms of genome contiguity and comparable overall to the high quality CanFam_GSD assembly. By aligning short read data from 58 representative dog breeds to three reference genomes, we demonstrate how the choice of reference genome significantly impacts both read mapping and variant detection. CONCLUSIONS: The growing number of high-quality canid reference genomes means the choice of reference genome is an increasingly critical decision in subsequent canid variant analyses. The basal position of the Basenji makes it suitable for variant analysis for targeted applications of specific dog breeds. However, we believe more comprehensive analyses across the entire family of canids are better suited to a pangenome approach. Collectively, this work highlights the importance of the choice of reference genome in all variation studies.
Subject(s)
Wolves, Animals, China, Chromosomes, Dogs, Female, Genome, Genomics, Male, Wolves/genetics
ABSTRACT
BACKGROUND: Copy number variations (CNV) are a significant source of variation in the genome and are therefore essential to the understanding of genetic characterization. The aim of this study was to develop a fine-scaled copy number variation map for African goats. We used sequence data from multiple breeds and from multiple African countries. RESULTS: A total of 253,553 CNV (244,876 deletions and 8677 duplications) were identified, corresponding to an overall average of 1393 CNV per animal. The mean CNV length was 3.3 kb, with a median of 1.3 kb. There was substantial differentiation between the populations for some CNV, suggestive of the effect of population-specific selective pressures. A total of 6231 global CNV regions (CNVR) were found across all animals, representing 59.2 Mb (2.4%) of the goat genome. About 1.6% of the CNVR were present in all 34 breeds and 28.7% were present in all 5 geographical areas across Africa, where animals had been sampled. The CNVR had genes that were highly enriched in important biological functions, molecular functions, and cellular components including retrograde endocannabinoid signaling, glutamatergic synapse and circadian entrainment. CONCLUSIONS: This study presents the first fine CNV map of African goat based on WGS data and adds to the growing body of knowledge on the genetic characterization of goats.
Subject(s)
DNA Copy Number Variations, Goats, Africa, Animals, Genome, Goats/genetics
ABSTRACT
BACKGROUND: Reference genomes are essential in the analysis of genomic data. As the cost of sequencing decreases, multiple reference genomes are being produced within species to alleviate problems such as low mapping accuracy and reference allele bias in variant calling that can be associated with the alignment of divergent samples to a single reference individual. The latest reference sequence adopted by the scientific community for the analysis of cattle data is ARS_UCD1.2, built from the DNA of a Hereford cow (Bos taurus taurus-B. taurus). A complementary genome assembly, UOA_Brahman_1, was recently built to represent the other cattle subspecies (Bos taurus indicus-B. indicus) from a Brahman cow haplotype to further support analysis of B. indicus data. In this study, we aligned the sequence data of 15 B. taurus and B. indicus breeds to each of these references. RESULTS: The alignment of B. taurus individuals against UOA_Brahman_1 detected up to five million more single-nucleotide variants (SNVs) compared to that against ARS_UCD1.2. Similarly, the alignment of B. indicus individuals against ARS_UCD1.2 resulted in one and a half million more SNVs than that against UOA_Brahman_1. The number of SNVs with nearly fixed alternative alleles also increased in the alignments with cross-subspecies. Interestingly, the alignment of B. taurus cattle against UOA_Brahman_1 revealed regions with a smaller than expected number of counts of SNVs with nearly fixed alternative alleles. Since B. taurus introgression represents on average 10% of the genome of Brahman cattle, we suggest that these regions comprise taurine DNA as opposed to indicine DNA in the UOA_Brahman_1 reference genome. Principal component and admixture analyses using genotypes inferred from this region support these taurine-introgressed loci. Overall, the flagged taurine segments represent 13.7% of the UOA_Brahman_1 assembly. 
The genes located within these segments were previously reported to be under positive selection in Brahman cattle, and include functional candidate genes implicated in feed efficiency, development and immunity. CONCLUSIONS: We report a list of taurine segments that are in the UOA_Brahman_1 assembly, which will be useful for the interpretation of interesting genomic features (e.g., signatures of selection, runs of homozygosity, increased mutation rate, etc.) that could appear in future re-sequencing analysis of indicine cattle.
Subject(s)
Genotype, Animals, Cattle/genetics, Female
ABSTRACT
BACKGROUND: Since their domestication 10,500 years ago, goat populations with distinctive genetic backgrounds have adapted to a broad variety of environments and breeding conditions. The VarGoats project is an international 1000-genome resequencing program designed to understand the consequences of domestication and breeding on the genetic diversity of domestic goats and to elucidate how speciation and hybridization have modeled the genomes of a set of species representative of the genus Capra. FINDINGS: A dataset comprising 652 sequenced goats and 507 public goat sequences, including 35 animals representing eight wild species, has been collected worldwide. We identified 74,274,427 single nucleotide polymorphisms (SNPs) and 13,607,850 insertion-deletions (InDels) by aligning these sequences to the latest version of the goat reference genome (ARS1). A Neighbor-joining tree based on Reynolds genetic distances showed that goats from Africa, Asia and Europe tend to group into independent clusters. Because goat breeds from Oceania and Caribbean (Creole) all derive from imported animals, they are distributed along the tree according to their ancestral geographic origin. CONCLUSIONS: We report on an unprecedented international effort to characterize the genome-wide diversity of domestic goats. This large range of sequenced individuals represents a unique opportunity to ascertain how the demographic and selection processes associated with post-domestication history have shaped the diversity of this species. Data generated for the project will also be extremely useful to identify deleterious mutations and polymorphisms with causal effects on complex traits, and thus will contribute to new knowledge that could be used in genomic prediction and genome-wide association studies.
Subject(s)
Genome-Wide Association Study, Genome, Animals, Domestication, Genetic Variation, Genomics, Goats/genetics
ABSTRACT
Bison are an icon of the American West and an ecologically, commercially, and culturally important species. Despite numbering in the hundreds of thousands today, conservation concerns remain for the species, including the impact on genetic diversity of a severe bottleneck around the turn of the 20th century and genetic introgression from domestic cattle. Genetic diversity and admixture are best evaluated at genome-wide scale, for which a high-quality reference is necessary. Here, we use trio binning of long reads from a bison-Simmental cattle (Bos taurus taurus) male F1 hybrid to sequence and assemble the genome of the American plains bison (Bison bison bison). The male haplotype genome is chromosome-scale, with a total length of 2.65 Gb across 775 scaffolds (839 contigs) and a scaffold N50 of 87.8 Mb. Our bison genome is ~13× more contiguous overall and ~3400× more contiguous at the contig level than the current bison reference genome. The bison genome sequence presented here (ARS-UCSC_bison1.0) will enable new research into the evolutionary history of this iconic megafauna species and provide a new tool for the management of bison populations in federal and commercial herds.
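For readers unfamiliar with the contiguity statistic quoted above, scaffold (or contig) N50 is the length L such that sequences of length L or longer make up at least half of the total assembly length. A minimal sketch of that standard definition follows; the function name and the toy lengths are illustrative only and are not taken from the bison assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold or contig lengths.

    N50 is the largest length L such that sequences of length >= L
    together cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:  # reached half of the total length
            return length


# Toy usage with made-up scaffold lengths (not real assembly data)
toy_n50 = n50([2, 3, 4, 5, 6])
```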
Subject(s)
Bison/genetics, Genome, Animals, Cattle/genetics, Chromosome Mapping, Female, Genetic Variation, Haplotypes, Genetic Hybridization, Male
ABSTRACT
Genomics research has relied principally on the establishment and curation of a reference genome for the species. However, it is increasingly recognized that a single reference genome cannot fully describe the extent of genetic variation within many widely distributed species. Pangenome representations are based on high-quality genome assemblies of multiple individuals and intended to represent the broadest possible diversity within a species. A Bovine Pangenome Consortium (BPC) has recently been established to begin assembling genomes from more than 600 recognized breeds of cattle, together with other related species to provide information on ancestral alleles and haplotypes. Previously reported de novo genome assemblies for Angus, Brahman, Hereford, and Highland breeds of cattle are part of the initial BPC effort. The present report describes a complete single haplotype assembly at chromosome-scale for a fullblood Simmental cow from an F1 bison-cattle hybrid fetus by trio binning. Simmental cattle, also known as Fleckvieh due to their red and white spots, originated in central Europe in the 1830s as a triple-purpose breed selected for draught, meat, and dairy production. There are over 50 million Simmental cattle in the world, known today for their fast growth and beef yields. This assembly (ARS_Simm1.0) is similar in length to the other bovine assemblies at 2.86 Gb, with a scaffold N50 of 102 Mb (max scaffold 156.8 Mb) and meets or exceeds the continuity of the best Bos taurus reference assemblies to date.
Subject(s)
Cattle/genetics, Genome, Animals, Bison, Chromosome Mapping, Female, Haplotypes, Male
ABSTRACT
BACKGROUND: Efforts to improve animal health, and understand genetic bases for production, may benefit from a comprehensive analysis of animal genomes and epigenomes. Although DNA methylation has been well studied in humans and other model species, its distribution patterns and regulatory impacts in cattle are still largely unknown. Here, we present the largest collection of cattle DNA methylation epigenomic data to date. RESULTS: Using Holstein cattle, we generated 29 whole genome bisulfite sequencing (WGBS) datasets for 16 tissues, 47 corresponding RNA-seq datasets, and 2 whole genome sequencing datasets. We performed read mapping and DNA methylation calling based on two different cattle assemblies, demonstrating that the high quality of the long-read-based assembly markedly improved DNA methylation results. We observed large differences across cattle tissues in the methylation patterns of global CpG sites, partially methylated domains (PMDs), hypomethylated regions (HMRs), CG islands (CGIs), and common repeats. We detected that each tissue had a distinct set of PMDs, which showed tissue-specific patterns. Similar to human PMDs, cattle PMDs were often linked to a general decrease of gene expression and a decrease in active histone marks and related to long-range chromatin organizations, like topologically associated domains (TADs). We tested a classification of the HMRs based on their distributions relative to transcription start sites (TSSs) and detected tissue-specific TSS-HMRs and genes that showed strong tissue effects. When performing cross-species comparisons of paired genes (two opposite strand genes with their TSS located in the same HMR), we found that they were more consistently co-expressed among human, mouse, sheep, goat, yak, pig, and chicken, but showed lower consistent ratios in more divergent species. 
We further used these WGBS data to detect 50,023 experimentally supported CGIs across bovine tissues and found that they might function as a guard against C-to-T mutations for TSS-HMRs. Although common repeats were often heavily methylated, some young Bov-A2 repeats were hypomethylated in sperm and could affect the promoter structures by exposing potential transcription factor binding sites. CONCLUSIONS: This study provides a comprehensive resource for bovine epigenomic research and enables new discoveries about DNA methylation and its role in complex traits.
Subject(s)
DNA Methylation, Genetic Epigenesis, Genome, Animals, Cattle, CpG Islands, Epigenomics, Female, Male, Organ Specificity, Whole Genome Sequencing
ABSTRACT
Copy number variation (CNV) is a major type of genomic structural variation. We investigated the impact of CNVs on goat dairy traits using the CaprineSNP50 array. From 120 samples of five dairy goat breeds, we identified a total of 42 CNVs ranging from 56,044 bp to 4,337,625 bp. We found significant associations between two CNVs (CNV5 and CNV25) and two milk production traits (mean of milk fat yield and mean of milk protein yield) after false discovery rate (FDR) correction (P < 0.05). CNV5 overlaps the ADAMTS20 gene, which is involved in the differentiation of mammary cells and plays a crucial role in the lactogenic activity of bovine mammary epithelial cells. CNV25 overlaps PAPPA2, which has been found to be associated with bovine reproduction and milk production traits. Our results reveal that CNVs overlapping ADAMTS20 and PAPPA2 could be involved in goat dairy traits and could serve as candidate markers for further genetic selection.
Subject(s)
DNA Copy Number Variations, Goats/genetics, Milk, ADAMTS Proteins/genetics, Animals, Dairying, Female, Genotyping Techniques, Milk Proteins/analysis, Polymerase Chain Reaction, Pregnancy-Associated Plasma Protein-A/genetics
ABSTRACT
BACKGROUND: Mammalian X chromosomes are mainly euchromatic with a similar size and structure among species, whereas Y chromosomes are smaller, have undergone substantial evolutionary changes and have accumulated male-specific genes and genes involved in sex determination. The pseudoautosomal region (PAR) is conserved on the X and Y chromosomes and pairs during meiosis. The structure, evolution and function of mammalian sex chromosomes, particularly the Y chromosome, are still poorly understood because few species have high quality sex chromosome assemblies. RESULTS: Here we report the first bovine sex chromosome assemblies that include the complete PAR spanning 6.84 Mb and three Y chromosome X-degenerate (X-d) regions. The PAR comprises 31 genes, including genes that are missing from the X chromosome in current cattle, sheep and goat reference genomes. Twenty-nine PAR genes are single-copy genes and two are multi-copy gene families: OBP, which has 3 copies, and BDA20, which has 4 copies. The Y chromosome X-d1, 2a and 2b regions contain 11, 2 and 2 gametologs, respectively. CONCLUSIONS: The ruminant PAR comprises 31 genes and is similar to the PAR of pig and dog but extends further than those of human and horse. Differences in the pseudoautosomal boundaries are consistent with evolutionary divergence times. A Bovidae-specific expansion of members of the lipocalin gene family in the PAR reported here may affect immune-modulation and anti-inflammatory responses in ruminants. Comparison of the X-d regions of Y chromosomes across species revealed that five of the X-Y gametologs, which are known to be global regulators of gene activity and candidate sexual dimorphism genes, are conserved.
Subject(s)
Cattle/genetics, X Chromosome, Y Chromosome, Animals, Mammalian Chromosomes, Dogs, Molecular Evolution, Gene Order, Humans, Male, Whole Genome Sequencing
ABSTRACT
Duplicated sequences are an important source of gene evolution and structural variation within mammalian genomes. Using a read depth approach based on next-generation sequencing, we performed a genome-wide analysis of segmental duplications (SDs) and associated copy number variations (CNVs) in the water buffalo (Bubalus bubalis). By aligning short reads of Olimpia (the reference water buffalo) to the UMD3.1 cattle genome, we identified 1,038 segmental duplications comprising 44.6 Mb (equivalent to ~1.73% of the cattle genome) of the autosomal and X chromosomal sequence in the buffalo genome. We experimentally validated 70.3% (71/101) of these duplications using fluorescent in situ hybridization. We also detected a total of 1,344 CNV regions across 14 additional water buffaloes, amounting to 59.8 Mb of variable sequence or the equivalent of 2.2% of the cattle genome. The CNV regions overlap 1,245 genes that are significantly enriched for specific biological functions including immune response, oxygen transport, sensory system and signal transduction. Additionally, we performed array Comparative Genomic Hybridization (aCGH) experiments using the 14 water buffaloes as test samples and Olimpia as the reference. Using a linear regression model, a high Pearson correlation (r = 0.781) was observed between the log2 ratios of copy number estimates and the log2 ratios of aCGH probes. We further designed quantitative PCR assays to confirm CNV regions within or near annotated genes and found 74.2% agreement with our CNV predictions. These results confirm sub-chromosome-scale structural rearrangements present in cattle and water buffalo. The information on genome variation will be of value for evolutionary and phenotypic studies, and may be useful for selective breeding of both species.
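The comparison between sequencing-based and aCGH-based estimates mentioned above rests on two simple quantities: a log2 ratio of test to reference signal, and a Pearson correlation between the two sets of ratios. The following is a minimal sketch of those calculations, assuming plain lists of paired values; the function names and the toy numbers are hypothetical and do not reproduce the study's data or its r = 0.781.

```python
import math


def log2_ratio(test_value, reference_value):
    """log2 of test over reference (e.g. copy number or probe intensity)."""
    return math.log2(test_value / reference_value)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Toy usage: made-up copy numbers versus a diploid reference (2 copies),
# paired with made-up aCGH log2 ratios for the same regions
seq_ratios = [log2_ratio(cn, 2) for cn in (1, 2, 3, 4)]
acgh_ratios = [-0.8, 0.1, 0.5, 0.9]
r = pearson_r(seq_ratios, acgh_ratios)
```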