RESUMEN
BACKGROUND: Productive life (PL) of a cow is the time the cow remains in the milking herd from first calving to exit from the herd due to culling or death and is an important economic trait in U.S. Holstein cattle. The large samples of Holstein genomic evaluation data that have become available recently provided unprecedented statistical power to identify genetic factors affecting PL in Holstein cows using the approach of genome-wide association study (GWAS). METHODS: The GWAS analysis used 1,103,641 Holstein cows with phenotypic observations on PL and genotypes of 75,282 single nucleotide polymorphism (SNP) markers. The statistical tests and estimation of SNP additive and dominance effects used the approximate generalized least squares method implemented by the EPISNPmpi computer program. RESULTS: The GWAS detected 5390 significant additive effects of PL distributed over all 29 autosomes and the X-Y nonrecombining region of the X chromosome (Chr31). Two chromosome regions had the most significant and largest cluster of additive effects, the SLC4A4-GC-NPFFR2 (SGN) region of Chr06 with pleiotropic effects for PL, fertility, somatic cell score and milk yield; and the 32-52 Mb region of Chr10 with peak effects for PL in or near RASGRP1 with many important immunity functions. The dominance tests detected 38 significant dominance effects including 12 dominance effects with sharply negative homozygous recessive genotypes on Chr18, Chr05, Chr23 and Chr24. CONCLUSIONS: The GWAS results showed that highly significant genetic effects for PL were in chromosome regions known to have highly significant effects for fertility and health and a chromosome region with multiple genes with reproductive and immunity functions. SNPs with rare but sharply negative homozygous recessive genotypes for PL existed and should be used for eliminating heifers carrying those homozygous recessive genotypes.
Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Animales , Bovinos/genética , Estudio de Asociación del Genoma Completo/métodos , Estudio de Asociación del Genoma Completo/veterinaria , Femenino , Genotipo , Sitios de Carácter Cuantitativo , Estados Unidos , Fenotipo , Lactancia/genéticaRESUMEN
The U.S. Holstein cattle have unprecedentedly large samples for genomic evaluation with genotypes of Single Nucleotide Polymorphism (SNP) markers and phenotypic observations of dairy quantitative traits. Such large samples provided unprecedented opportunities for the discovery of genetic variants and mechanisms affecting quantitative traits in Holstein cattle. Recent studies using the Holstein large samples on finding genetic variants affecting quantitative traits included a fat percentage study and two studies on reproductive traits. The fat percentage study confirmed that a chromosome region interacted with all chromosomes and the reproductive studies detected sharply negative homozygous recessive genotypes that were recommended for heifer culling. These novel findings provided examples showing the power of large-sample genomic mining for quantitative traits.
RESUMEN
A genome-wide association study of resistance to retained placenta (RETP) using 632,212 Holstein cows and 74,747 SNPs identified 200 additive effects with p-values < 10-8 on thirteen chromosomes but no dominance effect was statistically significant. The regions of 87.61-88.74 Mb of Chr09 about 1.13 Mb in size had the most significant effect in LOC112448080 and other highly significant effects in CCDC170 and ESR1, and in or near RMND1 and AKAP12. Four non-ESR1 genes in this region were reported to be involved in ESR1 fusions in humans. Chr23 had the largest number of significant effects that peaked in SLC17A1, which was involved in urate metabolism and transport that could contribute to kidney disease. The PKHD1 gene contained seven significant effects and was downstream of another six significant effects. The ACOT13 gene also had a highly significant effect. Both PKHD1 and ACOT13 were associated with kidney disease. Another highly significant effect was upstream of BOLA-DQA2. The KITLG gene of Chr05 that acts in utero in germ cell and neural cell development, and hematopoiesis was upstream of a highly significant effect, contained a significant effect, and was between another two significant effects. The results of this study provided a new understanding of genetic factors underlying RETP in U.S. Holstein cows.
Asunto(s)
Enfermedades de los Bovinos , Estudio de Asociación del Genoma Completo , Retención de la Placenta , Polimorfismo de Nucleótido Simple , Bovinos , Animales , Femenino , Embarazo , Retención de la Placenta/genética , Retención de la Placenta/veterinaria , Enfermedades de los Bovinos/genética , Resistencia a la Enfermedad/genética , Predisposición Genética a la Enfermedad , Sitios de Carácter CuantitativoRESUMEN
A genome-wide association study (GWAS) of fat percentage (FPC) using 1,231,898 first lactation cows and 75,198 SNPs confirmed a previous result that a Chr14 region about 9.38 Mb in size (0.14-9.52 Mb) had significant inter-chromosome additive × additive (A×A) effects with all chromosomes and revealed many new such effects. This study divides this 9.38 Mb region into two sub-regions, Chr14a at 0.14-0.88 Mb (0.74 Mb in size) with 78% and Chr14b at 2.21-9.52 Mb (7.31 Mb in size) with 22% of the 2761 significant A×A effects. These two sub-regions were separated by a 1.3 Mb gap at 0.9-2.2 Mb without significant inter-chromosome A×A effects. The PPP1R16A-FOXH1-CYHR1-TONSL (PFCT) region of Chr14a (29 Kb in size) with four SNPs had the largest number of inter-chromosome A×A effects (1141 pairs) with all chromosomes, including the most significant inter-chromosome A×A effects. The SLC4A4-GC-NPFFR2 (SGN) region of Chr06, known to have highly significant additive effects for some production, fertility and health traits, specifically interacted with the PFCT region and a Chr14a region with CPSF1, ADCK5, SLC52A2, DGAT1, SMPD5 and PARP10 (CASDSP) known to have highly significant additive effects for milk production traits. The most significant effects were between an SNP in SGN and four SNPs in PFCT. The CASDSP region mostly interacted with the SGN region. In the Chr14b region, the 2.28-2.42 Mb region (138.46 Kb in size) lacking coding genes had the largest cluster of A×A effects, interacting with seventeen chromosomes. The results from this study provide high-confidence evidence towards the understanding of the genetic mechanism of FPC in Holstein cows.
Asunto(s)
Cromosomas Humanos Par 14 , Estudio de Asociación del Genoma Completo , Femenino , Humanos , Bovinos/genética , Animales , Fertilidad/genética , Lactancia , Fenotipo , FN-kappa B , Poli(ADP-Ribosa) Polimerasas , Proteínas Proto-OncogénicasRESUMEN
The accuracy of predicting seven human phenotypes of 3657-7564 individuals using global epistasis effects was evaluated and compared to the accuracy of haplotype genomic prediction using 380,705 SNPs and 10-fold cross-validation studies. The seven human phenotypes were the normality transformed high density lipoproteins (HDL), low density lipoproteins (LDL), total cholesterol (TC), triglycerides (TG), weight (WT), and the original phenotypic observations of height (HTo) and body mass index (BMIo). Fourth-order epistasis effects virtually had no contribution to the phenotypic variances, and third-order epistasis effects did not affect the prediction accuracy. Without haplotype effects in the prediction model, pairwise epistasis effects improved the prediction accuracy over the SNP models for six traits, with accuracy increases of 2.41%, 3.85%, 0.70%, 0.97%, 0.62% and 0.93% for HDL, LDL, TC, HTo, WT and BMIo respectively. However, none of the epistasis models had higher prediction accuracy than the haplotype models we previously reported. The epistasis model for TG decreased the prediction accuracy by 2.35% relative to the accuracy of the SNP model. The integrated models with epistasis and haplotype effects had slightly higher prediction accuracy than the haplotype models for two traits, HDL and BMIo. These two traits were the only traits where additive × dominance effects increased the prediction accuracy. These results indicated that haplotype effects containing local high-order epistasis effects had a tendency to be more important than global pairwise epistasis effects for the seven human phenotypes, and that the genetic mechanism of HDL and BMIo was more complex than that of the other traits.
Asunto(s)
Epistasis Genética , Genómica , Humanos , Haplotipos , Fenotipo , Triglicéridos , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
A genome-wide association study (GWAS) of the daughter pregnancy rate (DPR), cow conception rate (CCR), and heifer conception rate (HCR) using 1,001,374-1,194,736 first-lactation Holstein cows and 75,140-75,295 SNPs identified 7567, 3798, and 726 additive effects, as well as 22, 27, and 25 dominance effects for DPR, CCR, and HCR, respectively, with log10(1/p) > 8. Most of these effects were new effects, and some new effects were in or near genes known to affect reproduction including GNRHR, SHBG, and ESR1, and a gene cluster of pregnancy-associated glycoproteins. The confirmed effects included those in or near the SLC4A4-GC-NPFFR2 and AFF1 regions of Chr06 and the KALRN region of Chr01. Eleven SNPs in the CEBPG-PEPD-CHST8 region of Chr18, the AFF1-KLHL8 region of Chr06, and the CCDC14-KALRN region of Chr01 with sharply negative allelic effects and dominance values for the recessive homozygous genotypes were recommended for heifer culling. Two SNPs in and near the AGMO region of Chr04 that were sharply negative for HCR and age at first calving, but slightly positive for the yield traits could also be considered for heifer culling. The results from this study provided new evidence and understanding about the genetic variants and genome regions affecting the three fertility traits in U.S. Holstein cows.
Asunto(s)
Fertilidad , Estudio de Asociación del Genoma Completo , Embarazo , Bovinos/genética , Animales , Femenino , Fertilidad/genética , Reproducción/genética , Fertilización , LactanciaRESUMEN
A genome-wide association study (GWAS) of age at first calving (AFC) using 813,114 first lactation Holstein cows and 75,524 SNPs identified 2063 additive effects and 29 dominance effects with p-values < 10-8. Three chromosomes had highly significant additive effects in the regions of 7.86-8.12 Mb of Chr15, 27.07-27.48 Mb and 31.25-32.11 Mb of Chr19, and 26.92-32.60 Mb of Chr23. Two of the genes in those regions were reproductive hormone genes with known biological functions that should be relevant to AFC, the sex hormone binding globulin (SHBG) gene, and the progesterone receptor (PGR) gene. The most significant dominance effects were near or in EIF4B and AAAS of Chr05 and AFF1 and KLHL8 of Chr06. All dominance effects were positive overdominance effects where the heterozygous genotype had an advantage, and the homozygous recessive genotype of each SNP had a very negative dominance value. Results from this study provided new evidence and understanding about the genetic variants and genome regions affecting AFC in U.S. Holstein cows.
Asunto(s)
Fertilidad , Estudio de Asociación del Genoma Completo , Animales , Femenino , Bovinos , Fertilidad/genética , Leche , Lactancia/genética , Genotipo , Polimorfismo de Nucleótido SimpleRESUMEN
The impact of genomic epistasis effects on the accuracy of predicting the phenotypic values of residual feed intake (RFI) in U.S. Holstein cows was evaluated using 6215 Holstein cows and 78,964 SNPs. Two SNP models and seven epistasis models were initially evaluated. Heritability estimates and the accuracy of predicting the RFI phenotypic values from 10-fold cross-validation studies identified the model with SNP additive effects and additive × additive (A×A) epistasis effects (A + A×A model) to be the best prediction model. Under the A + A×A model, additive heritability was 0.141, and A×A heritability was 0.263 that consisted of 0.260 inter-chromosome A×A heritability and 0.003 intra-chromosome A×A heritability, showing that inter-chromosome A×A effects were responsible for the accuracy increases due to A×A. Under the SNP additive model (A-only model), the additive heritability was 0.171. In the 10 validation populations, the average accuracy for predicting the RFI phenotypic values was 0.246 (with range 0.197-0.333) under A + A×A model and was 0.231 (with range of 0.188-0.319) under the A-only model. The average increase in the accuracy of predicting the RFI phenotypic values by the A + A×A model over the A-only model was 6.49% (with range of 3.02-14.29%). Results in this study showed A×A epistasis effects had a positive impact on the accuracy of predicting the RFI phenotypic values when combined with additive effects in the prediction model.
RESUMEN
The rapid growth in genomic selection data provides unprecedented opportunities to discover and utilize complex genetic effects for improving phenotypes, but the methodology is lacking. Epistasis effects are interaction effects, and haplotype effects may contain local high-order epistasis effects. Multifactorial methods with SNP, haplotype, and epistasis effects up to the third-order are developed to investigate the contributions of global low-order and local high-order epistasis effects to the phenotypic variance and the accuracy of genomic prediction of quantitative traits. These methods include genomic best linear unbiased prediction (GBLUP) with associated reliability for individuals with and without phenotypic observations, including a computationally efficient GBLUP method for large validation populations, and genomic restricted maximum estimation (GREML) of the variance and associated heritability using a combination of EM-REML and AI-REML iterative algorithms. These methods were developed for two models, Model-I with 10 effect types and Model-II with 13 effect types, including intra- and inter-chromosome pairwise epistasis effects that replace the pairwise epistasis effects of Model-I. GREML heritability estimate and GBLUP effect estimate for each effect of an effect type are derived, except for third-order epistasis effects. The multifactorial models evaluate each effect type based on the phenotypic values adjusted for the remaining effect types and can use more effect types than separate models of SNP, haplotype, and epistasis effects, providing a methodology capability to evaluate the contributions of complex genetic effects to the phenotypic variance and prediction accuracy and to discover and utilize complex genetic effects for improving the phenotypes of quantitative traits.
RESUMEN
BACKGROUND: Genomic selection using single nucleotide polymorphism (SNP) markers has been widely used for genetic improvement of livestock, but most current methods of genomic selection are based on SNP models. In this study, we investigated the prediction accuracies of haplotype models based on fixed chromosome distances and gene boundaries compared to those of SNP models for genomic prediction of phenotypic values. We also examined the reasons for the successes and failures of haplotype genomic prediction. METHODS: We analyzed a swine population of 3195 Duroc boars with records on eight traits: body judging score (BJS), teat number (TN), age (AGW), loin muscle area (LMA), loin muscle depth (LMD) and back fat thickness (BF) at 100 kg live weight, and average daily gain (ADG) and feed conversion rate (FCR) from 30 to100 kg live weight. Ten-fold validation was used to evaluate the prediction accuracy of each SNP model and each multi-allelic haplotype model based on 488,124 autosomal SNPs from low-coverage sequencing. Haplotype blocks were defined using fixed chromosome distances or gene boundaries. RESULTS: Compared to the best SNP model, the accuracy of predicting phenotypic values using a haplotype model was greater by 7.4% for BJS, 7.1% for AGW, 6.6% for ADG, 4.9% for FCR, 2.7% for LMA, 1.9% for LMD, 1.4% for BF, and 0.3% for TN. The use of gene-based haplotype blocks resulted in the best prediction accuracy for LMA, LMD, and TN. Compared to estimates of SNP additive heritability, estimates of haplotype epistasis heritability were strongly correlated with the increase in prediction accuracy by haplotype models. The increase in prediction accuracy was largest for BJS, AGW, ADG, and FCR, which also had the largest estimates of haplotype epistasis heritability, 24.4% for BJS, 14.3% for AGW, 14.5% for ADG, and 17.7% for FCR. SNP and haplotype heritability profiles across the genome identified several genes with large genetic contributions to phenotypes: NUDT3 for LMA, LMD and BF, VRTN for TN, COL5A2 for BJS, BSND for ADG, and CARTPT for FCR. CONCLUSIONS: Haplotype prediction models improved the accuracy for genomic prediction of phenotypes in Duroc pigs. For some traits, the best prediction accuracy was obtained with haplotypes defined using gene regions, which provides evidence that functional genomic information can improve the accuracy of haplotype genomic prediction for certain traits.
Asunto(s)
Genoma , Genómica , Animales , Cromosomas/genética , Haplotipos , Masculino , Fenotipo , Polimorfismo de Nucleótido Simple , Porcinos/genéticaRESUMEN
Epistasis is widely considered important, but epistasis studies lag those of SNP effects. Genome-wide association study (GWAS) using 76,109 SNPs and 294,079 first-lactation Holstein cows was conducted for testing pairwise epistasis effects of five production traits and three fertility traits: milk yield (MY), fat yield (FY), protein yield (PY), fat percentage (FPC), protein percentage (PPC), and daughter pregnancy rate (DPR). Among the top 50,000 pairwise epistasis effects of each trait, the five production traits had large chromosome regions with intra-chromosome epistasis. The percentage of inter-chromosome epistasis effects was 1.9% for FPC, 1.6% for PPC, 10.6% for MY, 29.9% for FY, 39.3% for PY, and 84.2% for DPR. Of the 50,000 epistasis effects, the number of significant effects with log10(1/p) ≥ 12 was 50,000 for FPC and PPC, and 10,508, 4763, 4637 and 1 for MY, FY, PY and DPR, respectively, and A × A effects were the most frequent epistasis effects for all traits. Majority of the inter-chromosome epistasis effects of FPC across all chromosomes involved a Chr14 region containing DGAT1, indicating a potential regulatory role of this Chr14 region affecting all chromosomes for FPC. The epistasis results provided new understanding about the genetic mechanism underlying quantitative traits in Holstein cattle.
Asunto(s)
Epistasis Genética/genética , Regulación de la Expresión Génica/genética , Leche/metabolismo , Animales , Bovinos , Femenino , Fertilidad/genética , Expresión Génica/genética , Perfilación de la Expresión Génica/métodos , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Lactancia/genética , Leche/química , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Embarazo , Índice de Embarazo , Transcriptoma/genéticaRESUMEN
Genomic prediction using multi-allelic haplotype models improved the prediction accuracy for all seven human phenotypes, the normality transformed high density lipoproteins, low density lipoproteins, total cholesterol, triglycerides, weight, and the original height and body mass index without normality transformation. Eight SNP sets with 40,941-380,705 SNPs were evaluated. The increase in prediction accuracy due to haplotypes was 1.86-8.12%. Haplotypes using fixed chromosome distances had the best prediction accuracy for four phenotypes, fixed number of SNPs for two phenotypes, and gene-based haplotypes for high density lipoproteins and height (tied for best). Haplotypes of coding genes were more accurate than haplotypes of all autosome genes that included both coding and noncoding genes for triglycerides and weight, and nearly the same as haplotypes of all autosome genes for the other phenotypes. Haplotypes of noncoding genes (mostly lncRNAs) only improved the prediction accuracy over the SNP models for high density lipoproteins, total cholesterol, and height. ChIP-seq haplotypes had better prediction accuracy than gene-based haplotypes for total cholesterol, body mass index and low density lipoproteins. The accuracy of ChIP-seq haplotypes was most striking for low density lipoproteins, where all four haplotype models with ChIP-seq haplotypes had similarly high prediction accuracy over the best prediction model with gene-based haplotypes. Haplotype epistasis was shown to be the reason for the increased accuracy due to haplotypes. Low density lipoproteins had the largest haplotype epistasis heritability that explained 14.70% of the phenotypic variance and was 31.27% of the SNP additive heritability, and the largest increase in prediction accuracy relative to the best SNP model (8.12%). Relative to the SNP additive heritability of the same regions, noncoding genes had the highest haplotype epistasis heritability, followed by coding genes and ChIP-seq for the seven phenotypes. SNP and haplotype heritability profiles showed that the integration of SNP and haplotype additive values compensated the weakness of haplotypes in estimating SNP heritabilities for four phenotypes, whereas models with haplotype additive values fully accounted for SNP additive values for three phenotypes. These results showed that haplotype analysis can be a method to utilize functional and structural genomic information to improve the accuracy of genomic prediction.
RESUMEN
The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by portraying population structure in the calibration set. A set of 383 advanced lines and cultivars that represent the diversity of the University of Minnesota wheat breeding program was phenotyped for yield, test weight, and protein content and genotyped using the Illumina 90K SNP Assay. Population structure was confirmed using single SNPs. Haplotype blocks of 5, 10, 15, and 20 adjacent markers were constructed for all chromosomes. A multi-allelic haplotype prediction algorithm was implemented and compared with single SNPs using both k-fold cross validation and stratified sampling optimization. After confirming population structure, the stratified sampling improved the predictive ability compared with k-fold cross validation for yield and protein content, but reduced the predictive ability for test weight. In all cases, haplotype predictions outperformed single SNPs. Haplotypes of 15 adjacent markers showed the best improvement in accuracy for all traits; however, this was more pronounced in yield and protein content. The combined use of haplotypes of 15 adjacent markers and training population optimization significantly improved the predictive ability for yield and protein content by 14.3 (four percentage points) and 16.8% (seven percentage points), respectively, compared with using single SNPs and k-fold cross validation. These results emphasize the effectiveness of using haplotypes in genomic selection to increase genetic gain in self-fertilized crops.
Asunto(s)
Fitomejoramiento , Triticum , Genotipo , Haplotipos , Desequilibrio de Ligamiento , Fenotipo , Polimorfismo de Nucleótido Simple , Triticum/genéticaRESUMEN
Haplotype prediction models open many possibilities to improve the accuracy of genomic selection but require more data processing and computing time than single-SNP prediction models. To facilitate haplotype analysis for genomic prediction and estimation using structural and functional genomic information, we developed a computing pipeline to implement haplotype analysis with capabilities for preparation of input data for haplotype analysis, genomic prediction and estimation using GVCHAP, and analysis of GVCHAP results. Data preparation includes utility programs for haplotype imputing; defining haplotype blocks by a fixed number of SNPs, a fixed distance in base pairs per block, or user defined block lengths based on structural or functional genomic information or a mixture of both types of information; and defining haplotype genotypes within each haplotype block. GVCHAP is the main program for genomic prediction and estimation, calculates GREML (genomic restricted maximum likelihood) estimates of variance components and heritabilities, and calculates GBLUP (genomic best linear unbiased prediction) for additive and dominance values of single SNPs as well as additive values of haplotypes with reliability estimates for training and validation populations. A two-step strategy and a method of multi-node processing are implemented to remove the computing bottleneck due to the creation of genomic relationship matrices for large samples. The analysis of GVCHAP results includes calculation of observed prediction accuracies from validation studies and preparation of input files for graphical visualization of heritability estimates of haplotype blocks as well as estimates of SNP effects and heritabilities. The entire pipeline provides an efficient and versatile computing tool for identifying the most accurate haplotype model among many candidate haplotype models utilizing structural and functional genomic information for genomic selection.
RESUMEN
Genome-wide association study (GWAS) is a powerful approach to identify genomic regions and genetic variants associated with phenotypes. However, only limited mutual confirmation from different studies is available. We conducted a large-scale GWAS using 294,079 first-lactation Holstein cows and identified new additive and dominance effects on five production traits, three fertility traits, and somatic cell score. Four chromosomes had the most significant SNP effects on the five production traits, a Chr14 region containing DGAT1 mostly had positive effects on fat yield and negative effects on milk and protein yields, the 88.07-89.60 Mb region of Chr06 with SLC4A4, GC, NPFFR2, and ADAMTS3 for milk and protein yields, the 30.03-36.67 Mb region of Chr20 with C6 and GHR for milk yield, and the 88.19-88.88 Mb region with ABCC9 as well as the 91.13-94.62 Mb region of Chr05 with PLEKHA5, MGST1, SLC15A5, and EPS8 for fat yield. For fertility traits, the SNP in GC of Chr06, and the SNPs in the 65.02-69.43 Mb region of Chr01 with COX17, ILDR1, and KALRN had the most significant effects for daughter pregnancy rate and cow conception rate, whereas SNPs in AFF1 of Chr06, the 47.54-52.79 Mb region of Chr07, TSPAN4 of Chr29, and NPAS1 of Chr18 had the most significant effects for heifer conception rate. For somatic cell score, GC of Chr06 and PRLR of Chr20 had the most significant effects. A small number of dominance effects were detected for the production traits with far lower statistical significance than the additive effects and for fertility traits with similar statistical significance as the additive effects. Analysis of allelic effects revealed the presence of uni-allelic, asymmetric, and symmetric SNP effects and found the previously reported DGAT1 antagonism was an extreme antagonistic pleiotropy between fat yield and milk and protein yields among all SNPs in this study.
RESUMEN
BACKGROUND: The availability of a unique unselected Holstein line since 1964 provided a direct comparison between selected and unselected Holstein genomes whereas large Holstein samples provided unprecedented statistical power for identifying high-confidence SNP effects. Utilizing these unique resources, we aimed to identify genome changes affected by selection since 1964. RESULTS: Direct comparison of genome-wide SNP markers between a Holstein line unselected since 1964 and contemporary Holsteins showed that the 40 years of artificial selection since 1964 resulted in genome landscape changes. Among the regions affected by selection, the regions containing 198 genes with fertility functions had a larger negative correlation than that of all SNPs between the SNP effects on milk yield and daughter pregnancy rate. These results supported the hypothesis that hitchhiking of genetic selection for milk production by negative effects of fertility genes contributed to the unintended declines in fertility since 1964. The genome regions subjected to selection also contained 67 immunity genes, the bovine MHC region of Chr23 with significantly decreased heterozygosity in contemporary Holsteins, and large gene clusters including T-cell receptor and immunoglobulin genes. CONCLUSIONS: This study for the first time provided direct evidence that genetic selection for milk production affected fertility and immunity genes and that the hitchhiking of genetic selection for milk production by negative fertility effects contributed to the fertility declines since 1964, and identified a large number of candidate fertility and immunity genes affected by selection. The results provided novel understanding about genome changes due to artificial selection and their impact on fertility and immunity genes and could facilitate developing genetic methods to reverse the declines in fertility and immunity in Holstein cattle.
Asunto(s)
Cruzamiento , Bovinos/genética , Genómica , Animales , Bovinos/inmunología , Bovinos/metabolismo , Diacilglicerol O-Acetiltransferasa/genética , Fertilidad/genética , Frecuencia de los Genes , Haplotipos , Humanos , Inmunidad/genética , Leche/metabolismo , Polimorfismo de Nucleótido Simple , Factores de TiempoRESUMEN
BACKGROUND: The number of teats in pigs is related to a sow's ability to rear piglets to weaning age. Several studies have identified genes and genomic regions that affect teat number in swine but few common results were reported. The objective of this study was to identify genetic factors that affect teat number in pigs, evaluate the accuracy of genomic prediction, and evaluate the contribution of significant genes and genomic regions to genomic broad-sense heritability and prediction accuracy using 41,108 autosomal single nucleotide polymorphisms (SNPs) from genotyping-by-sequencing on 2936 Duroc boars. RESULTS: Narrow-sense heritability and dominance heritability of teat number estimated by genomic restricted maximum likelihood were 0.365 ± 0.030 and 0.035 ± 0.019, respectively. The accuracy of genomic predictions, calculated as the average correlation between the genomic best linear unbiased prediction and phenotype in a tenfold validation study, was 0.437 ± 0.064 for the model with additive and dominance effects and 0.435 ± 0.064 for the model with additive effects only. Genome-wide association studies (GWAS) using three methods of analysis identified 85 significant SNP effects for teat number on chromosomes 1, 6, 7, 10, 11, 12 and 14. The region between 102.9 and 106.0 Mb on chromosome 7, which was reported in several studies, had the most significant SNP effects in or near the PTGR2, FAM161B, LIN52, VRTN, FCF1, AREL1 and LRRC74A genes. This region accounted for 10.0% of the genomic additive heritability and 8.0% of the accuracy of prediction. The second most significant chromosome region not reported by previous GWAS was the region between 77.7 and 79.7 Mb on chromosome 11, where SNPs in the FGF14 gene had the most significant effect and accounted for 5.1% of the genomic additive heritability and 5.2% of the accuracy of prediction. The 85 significant SNPs accounted for 28.5 to 28.8% of the genomic additive heritability and 35.8 to 36.8% of the accuracy of prediction. CONCLUSIONS: The three methods used for the GWAS identified 85 significant SNPs with additive effects on teat number, including SNPs in a previously reported chromosomal region and SNPs in novel chromosomal regions. Most significant SNPs with larger estimated effects also had larger contributions to the total genomic heritability and accuracy of prediction than other SNPs.
Asunto(s)
Cruzamiento/métodos , Estudio de Asociación del Genoma Completo/métodos , Técnicas de Genotipaje/métodos , Glándulas Mamarias Animales/anatomía & histología , Porcinos/genética , Animales , Cruzamiento/normas , Femenino , Estudio de Asociación del Genoma Completo/normas , Técnicas de Genotipaje/normas , Polimorfismo de Nucleótido Simple , Porcinos/anatomía & histologíaRESUMEN
Inbreeding and relatedness in wild panda populations are important parameters for panda conservation. Habitat loss and fragmentation are expected to increase inbreeding but the actual inbreeding levels in natural panda habitats were unknown. Using 150,025 SNPs and 14,926 SNPs selected from published whole-genome sequences, we estimated genomic inbreeding coefficients and relatedness of 49 pandas including 34 wild pandas sampled from six habitats. Qinling and Liangshan pandas had the highest levels of inbreeding and relatedness measured by genomic inbreeding and coancestry coefficients, whereas the inbreeding levels in Qionglai and Minshan were 28-45% of those in Qinling and Liangshan. Genomic coancestry coefficients between pandas from different habitats showed that panda populations from the four largest habitats, Minshan, Qionglai, Qinling and Liangshan, were genetically unrelated. Pandas between these four habitats on average shared 66.0-69.1% common alleles and 45.6-48.6% common genotypes, whereas pandas within each habitat shared 71.8-77.0% common alleles and 51.7-60.4% common genotypes. Pandas in the smaller populations of Qinling and Liangshan were more similarly to each other than pandas in the larger populations of Qionglai and Minshan according to three genomic similarity measures. Panda genetic differentiation between these habitats was positively related to their geographical distances. Most pandas separated by 200 kilometers or more shared no common ancestral alleles. The results provided a genomic quantification of the actual levels of inbreeding and relatedness among pandas in their natural habitats, provided genomic confirmation of the relationship between genetic diversity and geographical distances, and provided genomic evidence to the urgency of habitat protection.
Asunto(s)
Endogamia , Ursidae/genética , Animales , China , Ecosistema , Variación Genética , Genética de Población , Polimorfismo de Nucleótido SimpleRESUMEN
BACKGROUND: Dominance effect may play an important role in genetic variation of complex traits. Full featured and easy-to-use computing tools for genomic prediction and variance component estimation of additive and dominance effects using genome-wide single nucleotide polymorphism (SNP) markers are necessary to understand dominance contribution to a complex trait and to utilize dominance for selecting individuals with favorable genetic potential. RESULTS: The GVCBLUP package is a shared memory parallel computing tool for genomic prediction and variance component estimation of additive and dominance effects using genome-wide SNP markers. This package currently has three main programs (GREML_CE, GREML_QM, and GCORRMX) and a graphical user interface (GUI) that integrates the three main programs with an existing program for the graphical viewing of SNP additive and dominance effects (GVCeasy). The GREML_CE and GREML_QM programs offer complementary computing advantages with identical results for genomic prediction of breeding values, dominance deviations and genotypic values, and for genomic estimation of additive and dominance variances and heritabilities using a combination of expectation-maximization (EM) algorithm and average information restricted maximum likelihood (AI-REML) algorithm. GREML_CE is designed for large numbers of SNP markers and GREML_QM for large numbers of individuals. Test results showed that GREML_CE could analyze 50,000 individuals with 400 K SNP markers and GREML_QM could analyze 100,000 individuals with 50K SNP markers. GCORRMX calculates genomic additive and dominance relationship matrices using SNP markers. GVCeasy is the GUI for GVCBLUP integrated with an existing software tool for the graphical viewing of SNP effects and a function for editing the parameter files for the three main programs. CONCLUSION: The GVCBLUP package is a powerful and versatile computing tool for assessing the type and magnitude of genetic effects affecting a phenotype by estimating whole-genome additive and dominance heritabilities, for genomic prediction of breeding values, dominance deviations and genotypic values, for calculating genomic relationships, and for research and education in genomic prediction and estimation.