RESUMO
Most deleterious variants are recessive and segregate at relatively low frequency. Therefore, high sample sizes are required to identify these variants. In this study we report a large-scale sequence based genome-wide association study (GWAS) in pigs, with a total of 120,000 Large White and 80,000 Synthetic breed animals imputed to sequence using a reference population of approximately 1,100 whole genome sequenced pigs. We imputed over 20 million variants with high accuracies (R2>0.9) even for low frequency variants (1-5% minor allele frequency). This sequence-based analysis revealed a total of 14 additive and 9 non-additive significant quantitative trait loci (QTLs) for growth rate and backfat thickness. With the non-additive (recessive) model, we identified a deleterious missense SNP in the CDHR2 gene reducing growth rate and backfat in homozygous Large White animals. For the Synthetic breed, we revealed a QTL on chromosome 15 with a frameshift variant in the OBSL1 gene. This QTL has a major impact on both growth rate and backfat, resembling human 3M-syndrome 2 which is related to the same gene. With the additive model, we confirmed known QTLs on chromosomes 1 and 5 for both breeds, including variants in the MC4R and CCND2 genes. On chromosome 1, we disentangled a complex QTL region with multiple variants affecting both traits, harboring 4 independent QTLs in the span of 5 Mb. Together we present a large scale sequence-based association study that provides a key resource to scan for novel variants at high resolution for breeding and to further reduce the frequency of deleterious alleles at an early stage in the breeding program.
Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Humanos , Animais , Suínos/genética , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Fenótipo , Frequência do Gene , Genótipo , Proteínas do Citoesqueleto/genéticaRESUMO
BACKGROUND: The genetic correlation between purebred (PB) and crossbred (CB) performances ([Formula: see text]) partially determines the response in CB when selection is on PB performance in the parental lines. An earlier study has derived expressions for an upper and lower bound of [Formula: see text], using the variance components of the parental purebred lines, including e.g. the additive genetic variance in the sire line for the trait expressed in one of the dam lines. How to estimate these variance components is not obvious, because animals from one parental line do not have phenotypes for the trait expressed in the other line. Thus, the aim of this study was to propose and compare three methods for approximating the required variance components. The first two methods are based on (co)variances of genomic estimated breeding values (GEBV) in the line of interest, either accounting for shrinkage (VCGEBV-S) or not (VCGEBV). The third method uses restricted maximum likelihood (REML) estimates directly from univariate and bivariate analyses (VCREML) by ignoring that the variance components should refer to the line of interest, rather than to the line in which the trait is expressed. We validated these methods by comparing the resulting predicted bounds of [Formula: see text] with the [Formula: see text] estimated from PB and CB data for five traits in a three-way cross in pigs. RESULTS: With both VCGEBV and VCREML, the estimated [Formula: see text] (plus or minus one standard error) was between the upper and lower bounds in 14 out of 15 cases. However, the range between the bounds was much smaller with VCREML (0.15-0.22) than with VCGEBV (0.44-0.57). With VCGEBV-S, the estimated [Formula: see text] was between the upper and lower bounds in only six out of 15 cases, with the bounds ranging from 0.21 to 0.44. CONCLUSIONS: We conclude that using REML estimates of variance components within and between parental lines to predict the bounds of [Formula: see text] resulted in better predictions than methods based on GEBV. Thus, we recommend that the studies that estimate [Formula: see text] with genotype data also report estimated genetic variance components within and between the parental lines.
Assuntos
Genoma , Modelos Genéticos , Suínos , Animais , Genótipo , Fenótipo , Genômica/métodosRESUMO
Lethal recessive alleles cause pre- or postnatal death in homozygous affected individuals, reducing fertility. Especially in small size domestic and wild populations, those alleles might be exposed by inbreeding, caused by matings between related parents that inherited the same recessive lethal allele from a common ancestor. In this study we report five relatively common (up to 13.4% carrier frequency) recessive lethal haplotypes in two commercial pig populations. The lethal haplotypes have a large effect on carrier-by-carrier matings, decreasing litter sizes by 15.1 to 21.6%. The causal mutations are of different type including two splice-site variants (affecting POLR1B and TADA2A genes), one frameshift (URB1), and one missense (PNKP) variant, resulting in a complete loss-of-function of these essential genes. The recessive lethal alleles affect up to 2.9% of the litters within a single population and are responsible for the death of 0.52% of the total population of embryos. Moreover, we provide compelling evidence that the identified embryonic lethal alleles contribute to the observed heterosis effect for fertility (i.e. larger litters in crossbred offspring). Together, this work marks specific recessive lethal variation describing its functional consequences at the molecular, phenotypic, and population level, providing a unique model to better understand fertility and heterosis in livestock.
Assuntos
Genes Letais , Mutação com Perda de Função , Sus scrofa/embriologia , Sus scrofa/genética , Sequência de Aminoácidos , Animais , Feminino , Fertilidade/genética , Genes Recessivos , Deriva Genética , Genética Populacional , Haplótipos , Vigor Híbrido/genética , Hibridização Genética/genética , Tamanho da Ninhada de Vivíparos/genética , Masculino , Gravidez , RNA Polimerase I/genética , Análise de Sequência de RNA , Sequenciamento Completo do GenomaRESUMO
The genotype-phenotype link is a major research topic in the life sciences but remains highly complex to disentangle. Part of the complexity arises from the number of genes contributing to the observed phenotype. Despite the vast increase of molecular data, pinpointing the causal variant underlying a phenotype of interest is still challenging. In this study, we present an approach to map causal variation and molecular pathways underlying important phenotypes in pigs. We prioritize variation by utilizing and integrating predicted variant impact scores (pCADD), functional genomic information, and associated phenotypes in other mammalian species. We demonstrate the efficacy of our approach by reporting known and novel causal variants, of which many affect non-coding sequences. Our approach allows the disentangling of the biology behind important phenotypes by accelerating the discovery of novel causal variants and molecular mechanisms affecting important phenotypes in pigs. This information on molecular mechanisms could be applicable in other mammalian species, including humans.
Assuntos
Variação Genética , Genômica , Animais , Genótipo , Mamíferos , Fenótipo , Suínos/genéticaRESUMO
Livestock populations can be used to study recessive defects caused by deleterious alleles. The frequency of deleterious alleles including recessive lethal alleles can stay at high or moderate frequency within a population, especially if recessive lethal alleles exhibit an advantage for favourable traits in heterozygotes. In this study, we report such a recessive lethal deletion of 212kb (del) within the BBS9 gene in a breeding population of pigs. The deletion produces a truncated BBS9 protein expected to cause a complete loss-of-function, and we find a reduction of approximately 20% on the total number of piglets born from carrier by carrier matings. Homozygous del/del animals die mid- to late-gestation, as observed from high increase in numbers of mummified piglets resulting from carrier-by-carrier crosses. The moderate 10.8% carrier frequency (5.4% allele frequency) in this pig population suggests an advantage on a favourable trait in heterozygotes. Indeed, heterozygous carriers exhibit increased growth rate, an important selection trait in pig breeding. Increased growth and appetite together with a lower birth weight for carriers of the BBS9 null allele in pigs is analogous to the phenotype described in human and mouse for (naturally occurring) BBS9 null-mutants. We show that fetal death, however, is induced by reduced expression of the downstream BMPER gene, an essential gene for normal foetal development. In conclusion, this study describes a lethal 212kb deletion with pleiotropic effects on two different genes, one resulting in fetal death in homozygous state (BMPER), and the other increasing growth (BBS9) in heterozygous state. We provide strong evidence for balancing selection resulting in an unexpected high frequency of a lethal allele in the population. This study shows that the large amounts of genomic and phenotypic data routinely generated in modern commercial breeding programs deliver a powerful tool to monitor and control lethal alleles much more efficiently.
Assuntos
Regulação da Expressão Gênica no Desenvolvimento , Frequência do Gene , Genes Letais/fisiologia , Endogamia , Sus scrofa/genética , Animais , Conjuntos de Dados como Assunto , Feminino , Fertilidade/genética , Genes Recessivos/fisiologia , Técnicas de Genotipagem , Heterozigoto , Homozigoto , Masculino , Modelos Animais , Sus scrofa/crescimento & desenvolvimentoRESUMO
Biological information regarding markers and gene association may be used to attribute different weights for single nucleotide polymorphism (SNP) in genome-wide selection. Therefore, we aimed to evaluate the predictive ability and the bias of genomic prediction using models that allow SNP weighting in the genomic relationship matrix (G) building, with and without incorporating biological information to obtain the weights. Firstly, we performed a genome-wide association studies (GWAS) in data set containing single- (SL) or a multi-line (ML) pig population for androstenone, skatole and indole levels. Secondly, 1%, 2%, 5%, 10%, 30% and 50% of the markers explaining the highest proportions of the genetic variance for each trait were selected to build gene networks through the association weight matrix (AWM) approach. The number of edges in the network was computed and used to derive weights for G (AWM-WssGBLUP). The single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) were used as standard scenarios. All scenarios presented predictive abilities different from zero; however, the great overlap in their confidences interval suggests no differences among scenarios. Most of scenarios of based on AWM provide overestimations for skatole in both SL and ML populations. On the other hand, the skatole and indole prediction were no biased in the ssGBLUP (S1) in both SL and ML populations. Most of scenarios based on AWM provide no biased predictions for indole in both SL and ML populations. In summary, using biological information through AWM matrix and gene networks to derive weights for genomic prediction resulted in no increase in predictive ability for boar taint compounds. In addition, this approach increased the number of analyses steps. Thus, we can conclude that ssGBLUP is most appropriate for the analysis of boar taint compounds in comparison with the weighted strategies used in the present work.
Assuntos
Suínos/genética , Animais , Genoma , Estudo de Associação Genômica Ampla/veterinária , Genômica , Masculino , Fenótipo , EscatolRESUMO
BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R2) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R2 of 0.39 (0.49). After quality control, 2.5 × 106 (3.5 × 106) SNPs had a Beagle R2 higher than 0.6, resulting in an average Beagle R2 of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping.
Assuntos
Estudo de Associação Genômica Ampla/métodos , Suínos/genética , Sequenciamento Completo do Genoma/métodos , Animais , Estudo de Associação Genômica Ampla/normas , Estudo de Associação Genômica Ampla/veterinária , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Sequenciamento Completo do Genoma/normas , Sequenciamento Completo do Genoma/veterináriaRESUMO
Significance testing for genome-wide association study (GWAS) with increasing SNP density up to whole-genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole-genome sequence data, fitting a genomic relationship matrix based on a leave-one-chromosome-out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole-genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini-Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave-one-chromosome-out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests.
Assuntos
Estudo de Associação Genômica Ampla , Genômica , Genótipo , Sequenciamento Completo do Genoma , Animais , Cruzamento , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas/genética , Suínos/genéticaRESUMO
BACKGROUND: In recent years, there has been increased interest in the study of the molecular processes that affect semen traits. In this study, our aim was to identify quantitative trait loci (QTL) regions associated with four semen traits (motility, progressive motility, number of sperm cells per ejaculate and total morphological defects) in two commercial pig lines (L1: Large White type and L2: Landrace type). Since the number of animals with both phenotypes and genotypes was relatively small in our dataset, we conducted a weighted single-step genome-wide association study, which also allows unequal variances for single nucleotide polymorphisms. In addition, our aim was also to identify candidate genes within QTL regions that explained the highest proportions of genetic variance. Subsequently, we performed gene network analyses to investigate the biological processes shared by genes that were identified for the same semen traits across lines. RESULTS: We identified QTL regions that explained up to 10.8% of the genetic variance of the semen traits on 12 chromosomes in L1 and 11 chromosomes in L2. Sixteen QTL regions in L1 and six QTL regions in L2 were associated with two or more traits within the population. Candidate genes SCN8A, PTGS2, PLA2G4A, DNAI2, IQCG and LOC102167830 were identified in L1 and NME5, AZIN2, SPATA7, METTL3 and HPGDS in L2. No regions overlapped between these two lines. However, the gene network analysis for progressive motility revealed two genes in L1 (PLA2G4A and PTGS2) and one gene in L2 (HPGDS) that were involved in two biological processes i.e. eicosanoid biosynthesis and arachidonic acid metabolism. PTGS2 and HPGDS were also involved in the cyclooxygenase pathway. CONCLUSIONS: We identified several QTL regions associated with semen traits in two pig lines, which confirms the assumption of a complex genetic determinism for these traits. A large part of the genetic variance of the semen traits under study was explained by different genes in the two evaluated lines. Nevertheless, the gene network analysis revealed candidate genes that are involved in shared biological pathways that occur in mammalian testes, in both lines.
Assuntos
Redes Reguladoras de Genes , Estudo de Associação Genômica Ampla/métodos , Locos de Características Quantitativas , Sus scrofa/genética , Animais , Cromossomos/genética , Bases de Dados Genéticas , Estudos de Associação Genética , Masculino , Polimorfismo de Nucleotídeo Único , Sêmen , SuínosRESUMO
BACKGROUND: Lethal recessive variation can cause prenatal death of homozygous offspring. Although usually present at low-frequency in populations, the impact on individual fitness can be substantial. Until recently, the presence of recessive embryonic lethal variation could only be measured indirectly through reduced fertility. In this study, we estimate the presence of genetic loci associated with both early and late termination of development during gestation in pigs from the wealth of genome data routinely generated by a commercial breeding company. RESULTS: We examined three commercial pig (Sus scrofa) populations for potentially deleterious genetic variation based on 80 K SNP-chip genotypes, and estimate the effects on reproductive traits. 24,000 pigs from three populations were analyzed for missing or depletion of homozygous haplotypes. We identified 145 haplotypes (ranging from 0.5-4 Mb in size) in the genome with complete absence or depletion of homozygous animals. Thirty-five haplotypes show a negative effect on at least one of the analysed reproductive traits (total number born, number of stillborn, and number of mummified piglets). One variant in particular appeared to result in relative late termination of development of fetuses, responsible for a significant fraction of observed stillborn piglets ('mummies'), as they die mid-gestation. Moreover, we identified the BMPER gene as a likely candidate underlying this phenomenon. CONCLUSIONS: Our study shows that although lethal recessive variation is present, the frequency of these alleles is invariably low in these highly managed populations. Nevertheless, due to cumulative effects of deleterious variants, large numbers of affected offspring are produced. Furthermore, our study demonstrates the use of a large-scale commercial genetic experiment to systematically screen for 'natural knockouts' that can increase understanding of gene function.
Assuntos
Genes Recessivos/genética , Polimorfismo de Nucleotídeo Único , Sus scrofa/genética , Animais , Haplótipos , Homozigoto , Inquéritos e QuestionáriosRESUMO
For reproductive traits such as total number born (TNB), variance due to different environments is highly relevant in animal breeding. In this study, we aimed to perform a gene-network analysis for TNB in pigs across different environments using genomic reaction norm models. Thus, based on relevant single-nucleotide polymorphisms and linkage disequilibrium blocks across environments obtained from GWAS, different sets of candidate genes having biological roles linked to TNB were identified. Network analysis across environment levels resulted in gene interactions consistent with known mammal's fertility biology, captured relevant transcription factors for TNB biology and pointing out different sets of candidate genes for TNB in different environments. These findings may have important implication for animal production, as optimal breeding may vary depending on later environments. Based on these results, genomic diversity was identified and inferred across environments highlighting differential genetic control in each scenario.
Assuntos
Meio Ambiente , Redes Reguladoras de Genes , Tamanho da Ninhada de Vivíparos/genética , Polimorfismo de Nucleotídeo Único/genética , Sus scrofa/genética , Fatores de Transcrição/genética , Animais , Cruzamento , Genótipo , Desequilíbrio de Ligação/genética , Masculino , Modelos Genéticos , Fenótipo , Análise de Sequência de DNARESUMO
BACKGROUND: Breed-specific effects are observed when the same allele of a given genetic marker has a different effect depending on its breed origin, which results in different allele substitution effects across breeds. In such a case, single-breed breeding values may not be the most accurate predictors of crossbred performance. Our aim was to estimate the contribution of alleles from each parental breed to the genetic variance of traits that are measured in crossbred offspring, and to compare the prediction accuracies of estimated direct genomic values (DGV) from a traditional genomic selection model (GS) that are trained on purebred or crossbred data, with accuracies of DGV from a model that accounts for breed-specific effects (BS), trained on purebred or crossbred data. The final dataset was composed of 924 Large White, 924 Landrace and 924 two-way cross (F1) genotyped and phenotyped animals. The traits evaluated were litter size (LS) and gestation length (GL) in pigs. RESULTS: The genetic correlation between purebred and crossbred performance was higher than 0.88 for both LS and GL. For both traits, the additive genetic variance was larger for alleles inherited from the Large White breed compared to alleles inherited from the Landrace breed (0.74 and 0.56 for LS, and 0.42 and 0.40 for GL, respectively). The highest prediction accuracies of crossbred performance were obtained when training was done on crossbred data. For LS, prediction accuracies were the same for GS and BS DGV (0.23), while for GL, prediction accuracy for BS DGV was similar to the accuracy of GS DGV (0.53 and 0.52, respectively). CONCLUSIONS: In this study, training on crossbred data resulted in higher prediction accuracy than training on purebred data and evidence of breed-specific effects for LS and GL was demonstrated. However, when training was done on crossbred data, both GS and BS models resulted in similar prediction accuracies. In future studies, traits with a lower genetic correlation between purebred and crossbred performance should be included to further assess the value of the BS model in genomic predictions.
Assuntos
Cruzamento , Genoma/genética , Modelos Genéticos , Alelos , Animais , Feminino , Genômica , Genótipo , Polimorfismo de Nucleotídeo Único , Gravidez , Reprodutibilidade dos Testes , Seleção Genética , SuínosRESUMO
BACKGROUND: Reproductive traits such as number of stillborn piglets (SB) and number of teats (NT) have been evaluated in many genome-wide association studies (GWAS). Most of these GWAS were performed under the assumption that these traits were normally distributed. However, both SB and NT are discrete (e.g. count) variables. Therefore, it is necessary to test for better fit of other appropriate statistical models based on discrete distributions. In addition, although many GWAS have been performed, the biological meaning of the identified candidate genes, as well as their functional relationships still need to be better understood. Here, we performed and tested a Bayesian treatment of a GWAS model assuming a Poisson distribution for SB and NT in a commercial pig line. To explore the biological role of the genes that underlie SB and NT and identify the most likely candidate genes, we used the most significant single nucleotide polymorphisms (SNPs), to collect related genes and generated gene-transcription factor (TF) networks. RESULTS: Comparisons of the Poisson and Gaussian distributions showed that the Poisson model was appropriate for SB, while the Gaussian was appropriate for NT. The fitted GWAS models indicated 18 and 65 significant SNPs with one and nine quantitative trait locus (QTL) regions within which 18 and 57 related genes were identified for SB and NT, respectively. Based on the related TF, we selected the most representative TF for each trait and constructed a gene-TF network of gene-gene interactions and identified new candidate genes. CONCLUSIONS: Our comparative analyses showed that the Poisson model presented the best fit for SB. Thus, to increase the accuracy of GWAS, counting models should be considered for this kind of trait. We identified multiple candidate genes (e.g. PTP4A2, NPHP1, and CYP24A1 for SB and YLPM1, SYNDIG1L, TGFB3, and VRTN for NT) and TF (e.g. NF-κB and KLF4 for SB and SOX9 and ELF5 for NT), which were consistent with known newborn survival traits (e.g. congenital heart disease in fetuses and kidney diseases and diabetes in the mother) and mammary gland biology (e.g. mammary gland development and body length).
Assuntos
Teorema de Bayes , Estudo de Associação Genômica Ampla , Reprodução/genética , Sus scrofa/genética , Animais , Feminino , Redes Reguladoras de Genes , Genótipo , Distribuição Normal , Fenótipo , Distribuição de Poisson , Polimorfismo de Nucleotídeo Único , Locos de Características QuantitativasRESUMO
Early pig farmers in Europe imported Asian pigs to cross with their local breeds in order to improve traits of commercial interest. Current genomics techniques enabled genome-wide identification of these Asian introgressed haplotypes in modern European pig breeds. We propose that the Asian variants are still present because they affect phenotypes that were important for ancient traditional, as well as recent, commercial pig breeding. Genome-wide introgression levels were only weakly correlated with gene content and recombination frequency. However, regions with an excess or absence of Asian haplotypes (AS) contained genes that were previously identified as phenotypically important such as FASN, ME1, and KIT. Therefore, the Asian alleles are thought to have an effect on phenotypes that were historically under selection. We aimed to estimate the effect of AS in introgressed regions in Large White pigs on the traits of backfat (BF) and litter size. The majority of regions we tested that retained Asian deoxyribonucleic acid (DNA) showed significantly increased BF from the Asian alleles. Our results suggest that the introgression in Large White pigs has been strongly determined by the selective pressure acting upon the introgressed AS. We therefore conclude that human-driven hybridization and selection contributed to the genomic architecture of these commercial pigs.
Assuntos
Sus scrofa/genética , Adiposidade/genética , Animais , Ásia , Cruzamento , Europa (Continente) , Haplótipos , Hibridização Genética , Tamanho da Ninhada de Vivíparos/genéticaRESUMO
BACKGROUND: Cryptorchidism and scrotal/inguinal hernia are the most frequent congenital defects in pigs. Identification of genomic regions that control these congenital defects is of great interest to breeding programs, both from an animal welfare point of view as well as for economic reasons. The aim of this genome-wide association study (GWAS) was to identify single nucleotide polymorphisms (SNPs) that are strongly associated with these congenital defects. Genotypes were available for 2570 Large White (LW) and 2272 Landrace (LR) pigs. Breeding values were estimated based on 1 359 765 purebred and crossbred male offspring, using a binary trait animal model. Estimated breeding values were deregressed (DEBV) and taken as the response variable in the GWAS. RESULTS: Heritability estimates were equal to 0.26 ± 0.02 for cryptorchidism and to 0.31 ± 0.01 for scrotal/inguinal hernia. Seven and 31 distinct QTL regions were associated with cryptorchidism in the LW and LR datasets, respectively. The top SNP per region explained between 0.96% and 1.10% and between 0.48% and 2.77% of the total variance of cryptorchidism incidence in the LW and LR populations, respectively. Five distinct QTL regions associated with scrotal/inguinal hernia were detected in both LW and LR datasets. The top SNP per region explained between 1.22% and 1.60% and between 1.15% and 1.46% of the total variance of scrotal/inguinal hernia incidence in the LW and LR populations, respectively. For each trait, we identified one overlapping region between the LW and LR datasets, i.e. a region on SSC8 (Sus scrofa chromosome) between 65 and 73 Mb for cryptorchidism and a region on SSC13 between 34 and 37 Mb for scrotal/inguinal hernia. CONCLUSIONS: The use of DEBV in combination with a binary trait model was a powerful approach to detect regions associated with difficult traits such as cryptorchidism and scrotal/inguinal hernia that have a low incidence and for which affected animals are generally not available for genotyping. Several novel QTL regions were detected for cryptorchidism and scrotal/inguinal hernia, and for several previously known QTL regions, the confidence interval was narrowed down.
Assuntos
Criptorquidismo/veterinária , Estudo de Associação Genômica Ampla/métodos , Hérnia Inguinal/veterinária , Polimorfismo de Nucleotídeo Único , Sus scrofa/genética , Animais , Cruzamento , Criptorquidismo/genética , Feminino , Genótipo , Haplótipos/genética , Hérnia Inguinal/genética , Masculino , Locos de Características Quantitativas , SuínosRESUMO
BACKGROUND: Genomic selection and genomic wide association studies are widely used methods that aim to exploit the linkage disequilibrium (LD) between markers and quantitative trait loci (QTL). Securing a sufficiently large set of genotypes and phenotypes can be a limiting factor that may be overcome by combining data from multiple breeds or using crossbred information. However, the estimated effect of a marker in one breed or a crossbred can only be useful for the selection of animals in another breed if there is a correspondence of the phase between the marker and the QTL across breeds. Using data of five pure pig (Sus scrofa) lines (SL1, SL2, SL3, DL1, DL2), one F1 cross (DLF1) and two commercial finishing crosses (TER1 and TER2), the objectives of this study were: (i) to compare the equality of LD decay curves of different pig populations; and (ii) to evaluate the persistence of the LD phase across lines or final crosses. RESULTS: Almost all of the lines presented different extents of LD, except for the SL2 and DL3, both of which exhibited the same extent of LD. Similar levels of LD over large distances were found in crossbred and pure lines. The crossbred animals (DLF1, TER1 and TER2) presented a high persistence of phase with their parental lines, suggesting that the available porcine single nucleotide polymorphism (SNP) chip should be dense enough to include markers that have the same LD phase with QTL across crossbred and parental pure lines. The persistence of phase across pure lines varied considerably between the different line comparisons; however, correlations were above 0.8 for all line comparisons when marker distances were smaller than 50 kb. CONCLUSIONS: This study showed that crossbred populations could be very useful as a reference for the selection of pure lines by means of the available SNP chip panel. Here, we also pinpoint pure lines that could be combined in a multiline training population. However, if multiline reference populations are used for genomic selection, the required density of SNP panels should be higher compared with a single breed reference population.
Assuntos
Desequilíbrio de Ligação , Sus scrofa/genética , Alelos , Animais , Frequência do Gene , Marcadores Genéticos , Hibridização GenéticaRESUMO
BACKGROUND: Traditional breeding programs consider an average pairwise kinship between sibs. Based on pedigree information, the relationship matrix is used for genetic evaluations disregarding variation due to Mendelian sampling. Therefore, inbreeding and kinship coefficients are either over or underestimated resulting in reduction of accuracy of genetic evaluations and genetic progress. Single nucleotide polymorphism (SNPs) can be used to estimate pairwise kinship and individual inbreeding more accurately. The aim of this study was to optimize the selection of markers and determine the required number of SNPs for estimation of kinship and inbreeding. RESULTS: A total of 1,565 animals from three commercial pig populations were analyzed for 28,740 SNPs from the PorcineSNP60 Beadchip. Mean genomic inbreeding was higher than pedigree-based estimates in lines 2 and 3, but lower in line 1. As expected, a larger variation of genomic kinship estimates was observed for half and full sibs than for pedigree-based kinship reflecting Mendelian sampling. Genomic kinship between father-offspring pairs was lower (0.23) than the estimate based on pedigree (0.26). Bootstrap analyses using six reduced SNP panels (n = 500, 1000, 1500, 2000, 2500 and 3000) showed that 2,000 SNPs were able to reproduce the results very close to those obtained using the full set of unlinked markers (n = 7,984-10,235) with high correlations (inbreeding r > 0.82 and kinship r > 0.96) and low variation between different sets with the same number of SNPs. CONCLUSIONS: Variation of kinship between sibs due to Mendelian sampling is better captured using genomic information than the pedigree-based method. Therefore, the reduced sets of SNPs could generate more accurate kinship coefficients between sibs than the pedigree-based method. Variation of genomic kinship of father-offspring pairs is recommended as a parameter to determine accuracy of the method rather than correlation with pedigree-based estimates. Inbreeding and kinship coefficients can be estimated with high accuracy using ≥2,000 unlinked SNPs within all three commercial pig lines evaluated. However, a larger number of SNPs might be necessary in other populations or across lines.
Assuntos
Genoma , Endogamia , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Suínos/genética , Animais , Genótipo , Desequilíbrio de Ligação , Linhagem , Seleção GenéticaRESUMO
Nearly 2000 SNPs associated with pig litter size traits have been reported based on genome-wide association studies (GWASs). The aims of this study were to gather and integrate previously reported associations between SNPs and five litter traits: total number born (TNB), number born alive (NBA), number of stillborn (SB), litter birth weight (LWT), and corpus luteum number (CLN), in order to evaluate their common genetic background and to perform a meta-analysis (MA) of GWASs for total number born (TNB) recorded for animals from five pig populations. In this study, the genes with the largest number of associations with evaluated litter traits were GABRG3, RBP7, PRKD1, and STXBP6. Only 21 genes out of 233 associated with the evaluated litter traits were reported in more than one population or for more than one trait. Based on this evaluation, the most interesting candidate gene is PRKD1, which has an association with SB and TNB traits. Based on GO term analysis, PRKD1 was shown to be involved in angiogenesis as well. As a result of the MA, two new genomic regions, which have not been previously reported, were found to be associated with the TNB trait. One SNP was located on Sus scrofa chromosome (SSC) 14 in the intron of the FAM13C gene. The second SNP was located on SSC9 within the intron of the AGMO gene. Functional analysis revealed a strong candidate causal gene underlying the QTL on SSC9. The third best hit and the most promising candidate gene for litter size was found within the SOSTDC1 gene, associated with lower male fertility in rats. We showed that litter traits studied across pig populations have only a few genomic regions in common based on candidate gene comparison. PRKD1 could be an interesting candidate gene with a wider association with fertility. The MA identified new genomic regions on SSC9 and SSC14 associated with TNB. Further functional analysis indicated the most promising gene was SOSTDC1, which was confirmed to affect male fertility in other mammals. This is an important finding, as litter traits are by default linked with females rather than males.
Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Masculino , Gravidez , Feminino , Ratos , Animais , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Tamanho da Ninhada de Vivíparos/genética , Fenótipo , Mamíferos/genética , Proteínas de Transporte Vesicular/genéticaRESUMO
Backfat is an important trait in pork production, and it has been included in the breeding objectives of genetic companies for decades. Although adipose tissue is a good energy storage, excessive fat results in reduced efficiency and economical losses. A large QTL for backfat thickness on chromosome 5 is still segregating in different commercial pig breeds. We fine mapped this QTL region using a genome-wide association analysis (GWAS) with 133,358 genotyped animals from five commercial populations (Landrace, Pietrain, Large White, Synthetic, and Duroc) imputed to the porcine 660K SNP chip. The lead SNP was located at 5:66103958 (G/A) within the third intron of the CCND2 gene, with the G allele associated with more backfat, while the A allele is associated with less backfat. We further phased the QTL region to discover a core haplotype of five SNPs associated with low backfat across three breeds. Linkage disequilibrium analysis using whole-genome sequence data revealed three candidate causal variants within intronic regions and downstream of the CCND2 gene, including the lead SNP. We evaluated the association of the lead SNP with the expression of the genes in the QTL region (including CCND2) in a large cohort of 100 crossbred samples, sequenced in four different tissues (lung, spleen, liver, muscle). Results show that the A allele increases the expression of CCND2 in an additive way in three out of four tissues. Our findings indicate that the causal variant for this QTL region is a regulatory variant within the third intron of the CCND2 gene affecting the expression of CCND2.
RESUMO
In pig breeding, selection commonly takes place in purebred (PB) pigs raised mainly in temperate climates (TEMP) under optimal environmental conditions in nucleus farms. However, pork production typically makes use of crossbred (CB) animals raised in nonstandardized commercial farms, which are located not only in TEMP regions but also in tropical and subtropical regions (TROP). Besides the differences in the genetic background of PB and CB, differences in climate conditions, and differences between nucleus and commercial farms can lower the genetic correlation between the performance of PB in the TEMP (PBTEMP) and CB in the TROP (CBTROP). Genetic correlations (rg) between the performance of PB and CB growing-finishing pigs in TROP and TEMP environments have not been reported yet, due to the scarcity of data in both CB and TROP. Therefore, the present study aimed 1) to verify the presence of genotype × environment interaction (G × E) and 2) to estimate the rg for carcass and growth performance traits when PB and 3-way CB pigs are raised in 2 different climatic environments (TROP and TEMP). Phenotypic records of 217,332 PB and 195,978 CB, representing 2 climatic environments: TROP (Brazil) and TEMP (Canada, France, and the Netherlands) were available for this study. The PB population consisted of 2 sire lines, and the CB population consisted of terminal 3-way cross progeny generated by crossing sires from one of the PB sire lines with commercially available 2-way maternal sow crosses. G × E appears to be present for average daily gain, protein deposition, and muscle depth given the rg estimates between PB in both environments (0.64 to 0.79). With the presence of G × E, phenotypes should be collected in TROP when the objective is to improve the performance of CB in the TROP. Also, based on the rg estimates between PBTEMP and CBTROP (0.22 to 0.25), and on the expected responses to selection, selecting based only on the performance of PBTEMP would give limited genetic progress in the CBTROP. The rg estimates between PBTROP and CBTROP are high (0.80 to 0.99), suggesting that combined crossbred-purebred selection schemes would probably not be necessary to increase genetic progress in CBTROP. However, the calculated responses to selection show that when the objective is the improvement of CBTROP, direct selection based on the performance of CBTROP has the potential to lead to the higher genetic progress compared with indirect selection on the performance of PBTROP.