Search | BVS España Search Portal

1.

Estimating genetic parameters of digital behavior traits and their relationship with production traits in purebred pigs.

Hollifield, Mary Kate; Chen, Ching-Yi; Psota, Eric; Holl, Justin; Lourenco, Daniela; Misztal, Ignacy.

Genet Sel Evol ; 56(1): 29, 2024 Apr 16.

Article in English | MEDLINE | ID: mdl-38627636

ABSTRACT

BACKGROUND: With the introduction of digital phenotyping and high-throughput data, traits that were previously difficult or impossible to measure directly have become easily accessible, offering the opportunity to enhance the efficiency and rate of genetic gain in animal production. It is of interest to assess how behavioral traits are indirectly related to the production traits during the performance testing period. The aim of this study was to assess the quality of behavior data extracted from day-wise video recordings and estimate the genetic parameters of behavior traits and their phenotypic and genetic correlations with production traits in pigs. Behavior was recorded for 70 days after on-test at about 10 weeks of age and ended at off-test for 2008 female purebred pigs, totaling 119,812 day-wise records. Behavior traits included time spent eating, drinking, laterally lying, sternally lying, sitting, standing, and meters of distance traveled. A quality control procedure was created for algorithm training and adjustment, standardizing recording hours, removing culled animals, and filtering unrealistic records. RESULTS: Production traits included average daily gain (ADG), back fat thickness (BF), and loin depth (LD). Single-trait linear models were used to estimate heritabilities of the behavior traits and two-trait linear models were used to estimate genetic correlations between behavior and production traits. The results indicated that all behavior traits are heritable, with heritability estimates ranging from 0.19 to 0.57, and showed low-to-moderate phenotypic and genetic correlations with production traits. Two-trait linear models were also used to compare traits at different intervals of the recording period. To analyze the redundancies in behavior data during the recording period, the averages of various recording time intervals for the behavior and production traits were compared. 
Overall, the average of the 55- to 68-day recording interval had the strongest phenotypic and genetic correlation estimates with the production traits. CONCLUSIONS: Digital phenotyping is a new and low-cost method to record behavior phenotypes, but thorough data cleaning procedures are needed. Evaluating behavioral traits at different time intervals offers a deeper insight into their changes throughout the growth periods and their relationship with production traits, which may be recorded on a less frequent basis.

Subject(s)

Feeding Behavior, Swine/genetics, Female, Animals, Phenotype, Linear Models

2.

Confidence intervals for validation statistics with data truncation in genomic prediction.

Bermann, Matias; Legarra, Andres; Munera, Alejandra Alvarez; Misztal, Ignacy; Lourenco, Daniela.

Genet Sel Evol ; 56(1): 18, 2024 Mar 08.

Article in English | MEDLINE | ID: mdl-38459504

ABSTRACT

BACKGROUND: Validation by data truncation is a common practice in genetic evaluations because of the interest in predicting the genetic merit of a set of young selection candidates. Two of the most used validation methods in genetic evaluations use a single data partition: predictivity or predictive ability (correlation between pre-adjusted phenotypes and estimated breeding values (EBV) divided by the square root of the heritability) and the linear regression (LR) method (comparison of "early" and "late" EBV). Both methods compare predictions with the whole dataset and a partial dataset that is obtained by removing the information related to a set of validation individuals. EBV obtained with the partial dataset are compared against adjusted phenotypes for the predictivity or EBV obtained with the whole dataset in the LR method. Confidence intervals for predictivity and the LR method can be obtained by replicating the validation for different samples (or folds), or bootstrapping. Analytical confidence intervals would be beneficial to avoid running several validations and to test the quality of the bootstrap intervals. However, analytical confidence intervals are unavailable for predictivity and the LR method. RESULTS: We derived standard errors and Wald confidence intervals for the predictivity and statistics included in the LR method (bias, dispersion, ratio of accuracies, and reliability). The confidence intervals for the bias, dispersion, and reliability depend on the relationships and prediction error variances and covariances across the individuals in the validation set. We developed approximations for large datasets that only need the reliabilities of the individuals in the validation set. The confidence intervals for the ratio of accuracies and predictivity were obtained through the Fisher transformation. 
We show the adequacy of both the analytical and approximated analytical confidence intervals and compare them versus bootstrap confidence intervals using two simulated examples. The analytical confidence intervals were closer to the simulated ones for both examples. Bootstrap confidence intervals tend to be narrower than the simulated ones. The approximated analytical confidence intervals were similar to those obtained by bootstrapping. CONCLUSIONS: Estimating the sampling variation of predictivity and the statistics in the LR method without replication or bootstrap is possible for any dataset with the formulas presented in this study.
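
The Fisher-transformation step mentioned for predictivity and the ratio of accuracies can be illustrated with the textbook interval for a correlation coefficient. The sketch below shows only that textbook construction, assuming n independent pairs and a heritability treated as known. It does not reproduce the standard errors derived in the paper, which account for relationships and reliabilities among validation animals. The function names `fisher_ci` and `predictivity_ci` are hypothetical.

```python
import math

def fisher_ci(r, n, z_crit=1.96):
    """Textbook Fisher-z confidence interval for a correlation r from n pairs."""
    z = math.atanh(r)                 # Fisher transformation
    se = 1.0 / math.sqrt(n - 3)       # approximate standard error on the z scale
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)

def predictivity_ci(r, n, h2, z_crit=1.96):
    """Scale the interval for cor(adjusted phenotype, EBV) by 1/sqrt(h2).

    Assumption: h2 is treated as known, so only the correlation
    carries sampling error.
    """
    lo, hi = fisher_ci(r, n, z_crit)
    scale = math.sqrt(h2)
    return lo / scale, hi / scale

if __name__ == "__main__":
    print(fisher_ci(0.5, 103))
    print(predictivity_ci(0.3, 1000, 0.3))
```

Because the interval is built on the z scale and then back-transformed, it is asymmetric around the point estimate and always stays within the range of a correlation.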

Subject(s)

Genomics, Genetic Models, Humans, Genotype, Reproducibility of Results, Confidence Intervals, Pedigree, Genomics/methods, Phenotype

3.

Estimation of heritability with genomic information by method R.

Hollifield, Mary Kate; Lourenco, Daniela; Misztal, Ignacy.

J Anim Breed Genet ; 2024 Mar 25.

Article in English | MEDLINE | ID: mdl-38523564

ABSTRACT

Estimating heritabilities with large genomic models by established methods such as restricted maximum likelihood (REML) or Bayesian via Gibbs sampling is computationally expensive. Alternatively, heritability can be estimated indirectly by method R and by maximum predictivity, referred to as MaxPred here, at a much lower computing cost. By method R, the heritability used for predictions with whole and partial data is considered the best estimate when the predictions based on partial data are unbiased relative to those with the complete data. By MaxPred, the heritability estimate is the one that maximizes predictivity. This study compared heritability estimation with genomic information using average information REML (AI-REML), method R and MaxPred. A simulated population was generated with ten generations of 5000 animals each and an effective population size of 80. Each animal had one record for a trait with a heritability of 0.3, a phenotypic variance of 10.0 and was genotyped at 50 k SNP. In method R, the heritability estimate is found when the expectation of a regression coefficient is equal to one. The regression is the EBV of selection candidates calculated with the whole dataset regressed on the EBV of candidates calculated from a partial dataset. In this study, we used the GBLUP framework and therefore, GEBV was calculated. The partial dataset was created by removing the last generation of phenotypes. Predictivity was defined as the correlation between the adjusted phenotypes of the selection candidates and their GEBV calculated from the partial data. We estimated the heritability for populations that included between three and 10 generations. In every scenario, predictivity increased as more data was used and was the highest at the simulated heritability. However, the predictivity for all data subsets and all heritabilities compared did not differ more than 0.01, suggesting MaxPred is not the best indication for heritability estimation. 
For the whole dataset, the heritability was estimated as 0.30 ± 0.01, 0.26 ± 0.01 and 0.30 ± 0.04 for AI-REML without genomics, AI-REML with genomics and method R with genomics, respectively. Heritability estimation with genomics by method R reduced timing by 83%, implying a reduction in computing time from 9.5 to 1.6 h, on average, compared to AI-REML with genomics. Method R has the potential to estimate heritabilities with large genomic information at a low cost when many generations of animals are present; however, the standard error can be high when only a few iterations are used.
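
The regression at the heart of method R, as described in the abstract, is the slope of whole-data (G)EBV on partial-data (G)EBV for the selection candidates. The sketch below computes only that slope on made-up numbers. The surrounding search over trial heritabilities, which would rerun the predictions until the slope is close to one, is not shown, and `regression_slope` is a hypothetical helper name.

```python
def regression_slope(y, x):
    """Least-squares slope of y regressed on x: cov(x, y) / var(x)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    return cov_xy / var_x

# Hypothetical (G)EBV of four selection candidates (illustrative values only)
gebv_whole = [1.0, 2.0, 3.0, 4.0]      # predicted with the complete dataset
gebv_partial = [0.5, 1.0, 1.5, 2.0]    # last generation of phenotypes removed

slope = regression_slope(gebv_whole, gebv_partial)
print(slope)  # method R would keep the trial heritability when this is close to 1
```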

4.

Dimensionality of genomic information and its impact on genome-wide associations and variant selection for genomic prediction: a simulation study.

Jang, Sungbong; Tsuruta, Shogo; Leite, Natalia Galoro; Misztal, Ignacy; Lourenco, Daniela.

Genet Sel Evol ; 55(1): 49, 2023 Jul 17.

Article in English | MEDLINE | ID: mdl-37460964

ABSTRACT

BACKGROUND: Identifying true positive variants in genome-wide associations (GWA) depends on several factors, including the number of genotyped individuals. The limited dimensionality of genomic information may give insights into the optimal number of individuals to be used in GWA. This study investigated different discovery set sizes based on the number of largest eigenvalues explaining a certain proportion of variance in the genomic relationship matrix (G). In addition, we investigated the impact on the prediction accuracy by adding variants, which were selected based on different set sizes, to the regular single nucleotide polymorphism (SNP) chips used for genomic prediction. METHODS: We simulated sequence data that included 500k SNPs with 200 or 2000 quantitative trait nucleotides (QTN). A regular 50k panel included one in every ten simulated SNPs. Effective population size (Ne) was set to 20 or 200. GWA were performed using a number of genotyped animals equivalent to the number of largest eigenvalues of G (EIG) explaining 50, 60, 70, 80, 90, 95, 98, and 99% of the variance. In addition, the largest discovery set consisted of 30k genotyped animals. Limited or extensive phenotypic information was mimicked by changing the trait heritability. Significant and large-effect size SNPs were added to the 50k panel and used for single-step genomic best linear unbiased prediction (ssGBLUP). RESULTS: Using a number of genotyped animals corresponding to at least EIG98 allowed the identification of QTN with the largest effect sizes when Ne was large. Populations with smaller Ne required more than EIG98. Furthermore, including genotyped animals with a higher reliability (i.e., a higher trait heritability) improved the identification of the most informative QTN. Prediction accuracy was highest when the significant or the large-effect SNPs representing twice the number of simulated QTN were added to the 50k panel. 
CONCLUSIONS: Accurately identifying causative variants from sequence data depends on the effective population size and, therefore, on the dimensionality of genomic information. This dimensionality can help identify the most suitable sample size for GWA and could be considered for variant selection, especially when resources are restricted. Even when variants are accurately identified, their inclusion in prediction models has limited benefits.
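
The discovery set sizes labelled EIG50 to EIG99 are counts of the largest eigenvalues of G that together explain a given share of its variance. A minimal sketch of that counting step follows, assuming the eigenvalues have already been obtained from an eigendecomposition of G. The function name and the toy eigenvalues are hypothetical.

```python
def n_eigenvalues_for(eigenvalues, proportion):
    """Smallest number of largest eigenvalues whose sum reaches `proportion` of the total."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    running = 0.0
    for count, value in enumerate(ordered, start=1):
        running += value
        if running >= proportion * total:
            return count
    return len(ordered)

# Toy eigenvalues (illustrative only; a real G has one per genotyped animal)
toy_eigenvalues = [6.0, 3.0, 0.5, 0.3, 0.2]
for p in (0.50, 0.80, 0.99):
    print(p, n_eigenvalues_for(toy_eigenvalues, p))
```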

Subject(s)

Genome-Wide Association Study, Genetic Models, Animals, Reproducibility of Results, Genome, Genomics, Genotype, Phenotype, Single Nucleotide Polymorphism

5.

Reliabilities of estimated breeding values in models with metafounders.

Bermann, Matias; Aguilar, Ignacio; Lourenco, Daniela; Misztal, Ignacy; Legarra, Andres.

Genet Sel Evol ; 55(1): 6, 2023 Jan 23.

Article in English | MEDLINE | ID: mdl-36690938

ABSTRACT

BACKGROUND: Reliabilities of best linear unbiased predictions (BLUP) of breeding values are defined as the squared correlation between true and estimated breeding values and are helpful in assessing risk and genetic gain. Reliabilities can be computed from the prediction error variances for models with a single base population but are undefined for models that include several base populations and when unknown parent groups are modeled as fixed effects. In such a case, the use of metafounders in principle enables reliabilities to be derived. METHODS: We propose to compute the reliability of the contrast of an individual's estimated breeding value with that of a metafounder based on the prediction error variances of the individual and the metafounder, their prediction error covariance, and their genetic relationship. Computation of the required terms demands only little extra work once the sparse inverse of the mixed model equations is obtained, or they can be approximated. This also allows the reliabilities of the metafounders to be obtained. We studied the reliabilities for both BLUP and single-step genomic BLUP (ssGBLUP), using several definitions of reliability in a large dataset with 1,961,687 dairy sheep and rams, most of which had phenotypes and among which 27,000 rams were genotyped with a 50K single nucleotide polymorphism (SNP) chip. There were 23 metafounders with progeny sizes between 100,000 and 2000 individuals. RESULTS: In models with metafounders, directly using the prediction error variance instead of the contrast with a metafounder leads to artificially low reliabilities because they refer to a population with maximum heterozygosity. When only one metafounder is fitted in the model, the reliability of the contrast is shown to be equivalent to the reliability of the individual in a model without metafounders. 
When there are several metafounders in the model, using a contrast with the oldest metafounder yields reliabilities that are on a meaningful scale and very close to reliabilities obtained from models without metafounders. The reliabilities using contrasts with ssGBLUP also resulted in meaningful values. CONCLUSIONS: This work provides a general method to obtain reliabilities for both BLUP and ssGBLUP when several base populations are included through metafounders.
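
For orientation, the usual single-base-population calculation that the abstract takes as its starting point can be sketched as follows. This is the standard formula for a non-inbred animal, reliability = 1 - PEV / additive genetic variance. It is not the contrast-with-metafounder reliability proposed in the paper, and the function name and numbers are hypothetical.

```python
import math

def reliability_from_pev(pev, sigma2_u):
    """Reliability of an EBV as 1 - PEV / additive genetic variance (non-inbred animal)."""
    return 1.0 - pev / sigma2_u

# Illustrative values only
rel = reliability_from_pev(pev=0.4, sigma2_u=1.0)
acc = math.sqrt(rel)  # accuracy is the square root of reliability
print(rel, acc)
```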

Subject(s)

Genome, Genetic Models, Animals, Male, Sheep, Reproducibility of Results, Genotype, Genomics/methods, Phenotype, Pedigree

6.

Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs.

Jang, Sungbong; Ros-Freixedes, Roger; Hickey, John M; Chen, Ching-Yi; Holl, Justin; Herring, William O; Misztal, Ignacy; Lourenco, Daniela.

Genet Sel Evol ; 55(1): 55, 2023 Jul 26.

Article in English | MEDLINE | ID: mdl-37495982

ABSTRACT

BACKGROUND: Whole-genome sequence (WGS) data harbor causative variants that may not be present in standard single nucleotide polymorphism (SNP) chip data. The objective of this study was to investigate the impact of using preselected variants from WGS for single-step genomic predictions in maternal and terminal pig lines with up to 1.8k sequenced and 104k sequence imputed animals per line. METHODS: Two maternal and four terminal lines were investigated for eight and seven traits, respectively. The number of sequenced animals ranged from 1365 to 1491 for the maternal lines and 381 to 1865 for the terminal lines. Imputation to sequence occurred within each line for 66k to 76k animals for the maternal lines and 29k to 104k animals for the terminal lines. Two preselected SNP sets were generated based on a genome-wide association study (GWAS). Top40k included the SNPs with the lowest p-value in each of the 40k genomic windows, and ChipPlusSign included significant variants integrated into the porcine SNP chip used for routine genotyping. We compared the performance of single-step genomic predictions between using preselected SNP sets assuming equal or different variances and the standard porcine SNP chip. RESULTS: In the maternal lines, ChipPlusSign and Top40k showed an average increase in accuracy of 0.6 and 4.9%, respectively, compared to the regular porcine SNP chip. The greatest increase was obtained with Top40k, particularly for fertility traits, for which the initial accuracy based on the standard SNP chip was low. However, in the terminal lines, Top40k resulted in an average loss of accuracy of 1%. ChipPlusSign provided a positive, although small, gain in accuracy (0.9%). Assigning different variances for the SNPs slightly improved accuracies when using variances obtained from BayesR. However, increases were inconsistent across the lines and traits. 
CONCLUSIONS: The benefit of using sequence data depends on the line, the size of the genotyped population, and how the WGS variants are preselected. When WGS data are available on hundreds of thousands of animals, using sequence data presents an advantage but this remains limited in pigs.

Subject(s)

Genome-Wide Association Study, Genome, Animals, Swine/genetics, Genome-Wide Association Study/methods, Genomics/methods, Genotype, Phenotype, Single Nucleotide Polymorphism

7.

Implication of the order of blending and tuning when computing the genomic relationship matrix in single-step GBLUP.

McWhorter, Taylor M; Bermann, Matias; Garcia, Andre L S; Legarra, Andrés; Aguilar, Ignacio; Misztal, Ignacy; Lourenco, Daniela.

J Anim Breed Genet ; 140(1): 60-78, 2023 Jan.

Article in English | MEDLINE | ID: mdl-35946919

ABSTRACT

Single-step genomic BLUP (ssGBLUP) relies on the combination of the genomic (G) and pedigree relationship matrices for all (A) and genotyped (A22) animals. The procedure ensures G and A22 are compatible so that both matrices refer to the same genetic base ('tuning'). Then G is combined with a proportion of A22 ('blending') to avoid singularity problems and to account for the polygenic component not accounted for by markers. This computational procedure has been implemented in the reverse order (blending before tuning) following the sequential research developments. However, blending before tuning may result in less optimal tuning because the blended matrix already contains a proportion of A22. In this study, the impact of 'tuning before blending' was compared with 'blending before tuning' on genomic estimated breeding values (GEBV), single nucleotide polymorphism (SNP) effects and indirect predictions (IP) from ssGBLUP using American Angus Association and Holstein Association USA, Inc. data. Two slightly different tuning methods were used; one that adjusts the mean diagonals and off-diagonals of G to be similar to those in A22 and another one that adjusts based on the average difference between all elements of G and A22. Over 6 million Angus growth records and 5.9 million Holstein udder depth records were available. Genomic information was available on 51,478 Angus and 105,116 Holstein animals. Average realized relationship estimates among groups of animals were similar across scenarios. Scatterplots show that GEBV, SNP effects and IP did not noticeably change for all animals in the evaluation regardless of the order of computations and when using blending parameter of 0.05. 
Formulas were derived to determine the blending parameter that maximizes changes in the genomic relationship matrix and GEBV when changing the order of blending and tuning. Algebraically, the change is maximized when the blending parameter is equal to 0.5. Overall, tuning G before blending, regardless of blending parameter used, had a negligible impact on genomic predictions and SNP effects in this study.
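
The two operations can be sketched on a toy 2 x 2 example. The code below is one reading of the abstract, not the authors' implementation: tuning is taken as adding the average difference between all elements of A22 and G to every element of G, and blending as (1 - beta) * G + beta * A22 with beta = 0.05. The matrices and function names are hypothetical.

```python
def mean_all(m):
    """Average of all elements of a square matrix stored as nested lists."""
    return sum(sum(row) for row in m) / (len(m) * len(m))

def tune_by_mean_difference(g, a22):
    """Shift every element of G by the average difference between A22 and G."""
    shift = mean_all(a22) - mean_all(g)
    return [[x + shift for x in row] for row in g]

def blend(g, a22, beta=0.05):
    """Blended matrix (1 - beta) * G + beta * A22."""
    return [[(1.0 - beta) * x + beta * y for x, y in zip(rg, ra)]
            for rg, ra in zip(g, a22)]

# Toy relationship matrices for two genotyped animals (illustrative only)
G = [[1.0, 0.2], [0.2, 1.0]]
A22 = [[1.0, 0.5], [0.5, 1.0]]

tuned_then_blended = blend(tune_by_mean_difference(G, A22), A22, beta=0.05)
print(tuned_then_blended)
```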

Subject(s)

Genomics, Animals

8.

A comprehensive study on size and definition of the core group in the proven and young algorithm for single-step GBLUP.

Abdollahi-Arpanahi, Rostam; Lourenco, Daniela; Misztal, Ignacy.

Genet Sel Evol ; 54(1): 34, 2022 May 20.

Article in English | MEDLINE | ID: mdl-35596130

ABSTRACT

BACKGROUND: The algorithm for proven and young (APY) has been suggested as a solution for recursively computing a sparse representation for the inverse of a large genomic relationship matrix (G). In APY, a subset of genotyped individuals is used as the core and the remaining genotyped individuals are used as noncore. Size and definition of the core are relevant research subjects for the application of APY, especially given the ever-increasing number of genotyped individuals. METHODS: The aim of this study was to investigate several core definitions, including the most popular animals (MPA) (i.e., animals with high contributions to the genetic pool), the least popular males (LPM), the least popular females (LPF), a random set (Rnd), animals evenly distributed across genealogical paths (Ped), unrelated individuals (Unrel), or based on within-family selection (Fam), or on decomposition of the gene content matrix (QR). Each definition was evaluated for six core sizes based on prediction accuracy of single-step genomic best linear unbiased prediction (ssGBLUP) with APY. Prediction accuracy of ssGBLUP with the full inverse of G was used as the baseline. The dataset consisted of 357k pedigreed Duroc pigs with 111k pigs with genotypes and ~ 220k phenotypic records. RESULTS: When the core size was equal to the number of largest eigenvalues explaining 50% of the variation of G (n = 160), MPA and Ped core definitions delivered the highest average prediction accuracies (~ 0.41-0.53). As the core size increased to the number of eigenvalues explaining 99% of the variation in G (n = 7320), prediction accuracy was nearly identical for all core types and correlations with genomic estimated breeding values (GEBV) from ssGBLUP with the full inversion of G were greater than 0.99 for all core definitions. Cores that represent all generations, such as Rnd, Ped, Fam, and Unrel, were grouped together in the hierarchical clustering of GEBV. 
CONCLUSIONS: For small core sizes, the definition of the core matters; however, as the size of the core reaches an optimal value equal to the number of largest eigenvalues explaining 99% of the variation of G, the definition of the core becomes arbitrary.

Subject(s)

Genome, Genetic Models, Algorithms, Animals, Female, Genomics, Genotype, Humans, Male, Pedigree, Phenotype, Swine

9.

On the equivalence between marker effect models and breeding value models and direct genomic values with the Algorithm for Proven and Young.

Bermann, Matias; Lourenco, Daniela; Forneris, Natalia S; Legarra, Andres; Misztal, Ignacy.

Genet Sel Evol ; 54(1): 52, 2022 Jul 16.

Article in English | MEDLINE | ID: mdl-35842585

ABSTRACT

BACKGROUND: Single-step genomic predictions obtained from a breeding value model require calculating the inverse of the genomic relationship matrix [Formula: see text]. The Algorithm for Proven and Young (APY) creates a sparse representation of [Formula: see text] with a low computational cost. APY consists of selecting a group of core animals and expressing the breeding values of the remaining animals as a linear combination of those from the core animals plus an error term. The objectives of this study were to: (1) extend APY to marker effects models; (2) derive equations for marker effect estimates when APY is used for breeding value models, and (3) show the implication of selecting a specific group of core animals in terms of a marker effects model. RESULTS: We derived a family of marker effects models called APY-SNP-BLUP. It differs from the classic marker effects model in that the row space of the genotype matrix is reduced and an error term is fitted for non-core animals. We derived formulas for marker effect estimates that take this error term into account. The prediction error variance (PEV) of the marker effect estimates depends on the PEV for core animals but not directly on the PEV of the non-core animals. We extended the APY-SNP-BLUP to include a residual polygenic effect and accommodate non-genotyped animals. We show that selecting a specific group of core animals is equivalent to selecting a subspace of the row space of the genotype matrix. As the number of core animals increases, subspaces corresponding to different sets of core animals tend to overlap, showing that random selection of core animals is algebraically justified. CONCLUSIONS: The APY-(ss)GBLUP models can be expressed in terms of marker effect models. When the number of core animals is equal to the rank of the genotype matrix, APY-SNP-BLUP is identical to the classic marker effects model. 
If the number of core animals is less than the rank of the genotype matrix, genotypes for non-core animals are imputed as a linear combination of the genotypes of the core animals. For estimating SNP effects, only relationships and estimated breeding values for core animals are needed.

Subject(s)

Genome, Genetic Models, Algorithms, Animals, Genomics, Genotype, Pedigree, Phenotype

10.

Theoretical accuracy for indirect predictions based on SNP effects from single-step GBLUP.

Garcia, Andre; Aguilar, Ignacio; Legarra, Andres; Tsuruta, Shogo; Misztal, Ignacy; Lourenco, Daniela.

Genet Sel Evol ; 54(1): 66, 2022 Sep 27.

Article in English | MEDLINE | ID: mdl-36162979

ABSTRACT

BACKGROUND: Although single-step GBLUP (ssGBLUP) is an animal model, SNP effects can be backsolved from genomic estimated breeding values (GEBV). Predicted SNP effects allow an indirect prediction (IP) to be computed per individual as the sum of the SNP effects multiplied by the individual's gene content, which is helpful when the number of genotyped animals is large, for genotyped animals not in the official evaluations, and when interim evaluations are needed. Typically, IP are obtained for new batches of genotyped individuals, all of them young and without phenotypes. Individual (theoretical) accuracies for IP are rarely reported, but they are nevertheless of interest. Our first objective was to present equations to compute individual accuracy of IP, based on prediction error covariance (PEC) of SNP effects, which, in turn, are obtained from PEC of GEBV in ssGBLUP. The second objective was to test the algorithm for proven and young (APY) in PEC computations. With large datasets, it is impossible to handle the full PEC matrix, thus the third objective was to examine the minimum number of genotyped animals needed in PEC computations to achieve IP accuracies that are equivalent to GEBV accuracies. RESULTS: Correlations between GEBV and IP for the validation animals using SNP effects from ssGBLUP evaluations were ≥ 0.99. When all available genotyped animals were used for PEC computations, correlations between GEBV and IP accuracy were ≥ 0.99. In addition, IP accuracies were compatible with GEBV accuracies either with direct inversion of the genomic relationship matrix (G) or using the algorithm for proven and young (APY) to obtain the inverse of G. As the number of genotyped animals included in the PEC computations decreased from around 55,000 to 15,000, correlations were still ≥ 0.96, but IP accuracies were biased downwards. 
CONCLUSIONS: Theoretical accuracy of indirect prediction can be successfully obtained by computing SNP PEC out of GEBV PEC from ssGBLUP equations using direct or APY G inverse. It is possible to reduce the number of genotyped animals in PEC computations, but accuracies may be underestimated. Further research is needed to approximate SNP PEC from ssGBLUP to limit the computational requirements with many genotyped animals.
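
The indirect prediction itself, as defined in the abstract, is a sum over SNPs of estimated effect times gene content. A minimal sketch follows, assuming uncentered gene content coded as 0, 1 or 2 copies and no additional mean term. Production implementations may center the genotypes or add a constant, and the accuracy equations from the paper are not shown. All names and values are hypothetical.

```python
def indirect_prediction(gene_content, snp_effects):
    """Sum over SNPs of gene content (0, 1 or 2 copies) times the estimated SNP effect."""
    if len(gene_content) != len(snp_effects):
        raise ValueError("one SNP effect is needed per genotype")
    return sum(z * a for z, a in zip(gene_content, snp_effects))

# Illustrative values only: three SNP effects backsolved from GEBV,
# and the genotypes of one young animal without phenotypes
snp_effects = [0.10, -0.20, 0.05]
young_animal = [0, 1, 2]

ip = indirect_prediction(young_animal, snp_effects)
print(ip)
```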

Subject(s)

Genome, Genetic Models, Animals, Genomics, Genotype, Pedigree, Phenotype

11.

Invited review: Unknown-parent groups and metafounders in single-step genomic BLUP.

Masuda, Yutaka; VanRaden, Paul M; Tsuruta, Shogo; Lourenco, Daniela A L; Misztal, Ignacy.

J Dairy Sci ; 105(2): 923-939, 2022 Feb.

Article in English | MEDLINE | ID: mdl-34799109

ABSTRACT

Single-step genomic BLUP (ssGBLUP) is a method for genomic prediction that integrates matrices of pedigree (A) and genomic (G) relationships into a single unified additive relationship matrix whose inverse is incorporated into a set of mixed model equations (MME) to compute genomic predictions. Pedigree information in dairy cattle is often incomplete. Missing pedigree potentially causes biases and inflation in genomic estimated breeding values (GEBV) obtained with ssGBLUP. Three major issues are associated with missing pedigree in ssGBLUP, namely biased predictions by selection, missing inbreeding in pedigree relationships, and incompatibility between G and A in level and scale. These issues can be solved using a proper model for unknown-parent groups (UPG). The theory behind the use of UPG is well established for pedigree BLUP, but not for ssGBLUP. This study reviews the development of the UPG model in pedigree BLUP, the properties of UPG models in ssGBLUP, and the effect of UPG on genetic trends and genomic predictions. Similarities and differences between UPG and metafounder (MF) models, a generalized UPG model, are also reviewed. A UPG model (QP) derived using a transformation of the MME has a good convergence behavior. However, with insufficient data, the QP model may yield biased genetic trends and may underestimate UPG. The QP model can be altered by removing the genomic relationships linking GEBV and UPG effects from MME. This altered QP model exhibits less bias in genetic trends and less inflation in genomic predictions than the QP model, especially with large data sets. Recently, a new model, which encapsulates the UPG equations into the pedigree relationships for genotyped animals, was proposed in simulated purebred populations. The MF model is a comprehensive solution to the missing pedigree issue. This model can be a choice for multibreed or crossbred evaluations if the data set allows the estimation of a reasonable relationship matrix for MF. 
Missing pedigree influences genetic trends, but its effect on the predictability of genetic merit for genotyped animals should be negligible when many proven bulls are genotyped. The SNP effects can be back-solved using GEBV from older genotyped animals, and these predicted SNP effects can be used to calculate GEBV for young-genotyped animals with missing parents.

Subject(s)

Genome, Genetic Models, Animals, Cattle/genetics, Genomics, Genotype, Male, Pedigree, Phenotype

12.

Dissecting genetic trends to understand breeding practices in livestock: a maternal pig line example.

Abdollahi-Arpanahi, Rostam; Lourenco, Daniela; Legarra, Andres; Misztal, Ignacy.

Genet Sel Evol ; 53(1): 89, 2021 Nov 27.

Article in English | MEDLINE | ID: mdl-34837954

ABSTRACT

BACKGROUND: Understanding whether genomic selection has been effective in livestock and when the results of genomic selection became visible are essential questions which we have addressed in this paper. Three criteria were used to identify practices of breeding programs over time: (1) the point of divergence of estimated genetic trends based on pedigree-based best linear unbiased prediction (BLUP) versus single-step genomic BLUP (ssGBLUP), (2) the point of divergence of realized Mendelian sampling (RMS) trends based on BLUP and ssGBLUP, and (3) the partition of genetic trends into that contributed by genotyped and non-genotyped individuals and by males and females. METHODS: We used data on 282,035 animals from a commercial maternal line of pigs, of which 32,856 were genotyped for 36,612 single nucleotide polymorphisms (SNPs) after quality control. Phenotypic data included 228,427, 101,225, and 11,444 records for birth weight, average daily gain in the nursery, and feed intake, respectively. Breeding values were predicted in a multiple-trait framework using BLUP and ssGBLUP. RESULTS: The points of divergence of the genetic and RMS trends estimated by BLUP and ssGBLUP indicated that genomic selection effectively started in 2019. Partitioning the overall genetic trends into that for genotyped and non-genotyped individuals revealed that the contribution of genotyped animals to the overall genetic trend increased rapidly from ~ 74% in 2016 to 90% in 2019. The contribution of the female pathway to the genetic trend also increased since genomic selection was implemented in this pig population, which reflects the changes in the genotyping strategy in recent years. CONCLUSIONS: Our results show that an assessment of breeding program practices can be done based on the point of divergence of genetic and RMS trends between BLUP and ssGBLUP and based on the partitioning of the genetic trend into contributions from different selection pathways. 
However, it should be noted that genetic trends can diverge before the onset of genomic selection if superior animals are genotyped retroactively. For the pig population example, the results showed that genomic selection was effective in this population.

Asunto(s)

Ganado , Modelos Genéticos , Animales , Femenino , Genoma , Genotipo , Ganado/genética , Masculino , Linaje , Fenotipo , Porcinos/genética

13.

Validation of single-step GBLUP genomic predictions from threshold models using the linear regression method: An application in chicken mortality.

Bermann, Matias; Legarra, Andres; Hollifield, Mary Kate; Masuda, Yutaka; Lourenco, Daniela; Misztal, Ignacy.

J Anim Breed Genet ; 138(1): 4-13, 2021 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-32985749

RESUMEN

The objective of this study was to determine whether the linear regression (LR) method could be used to validate genomic threshold models. Statistics for the LR method were computed from estimated breeding values (EBVs) using the whole and truncated data sets with variances from the reference and validation populations. The method was tested using simulated and real chicken data sets. The simulated data set included 10 generations of 4,500 birds each; genotypes were available for the last three generations. Each animal was assigned a continuous trait, which was converted to a binary score assuming an incidence of failure of 7%. The real data set included the survival status of 186,596 broilers (mortality rate equal to 7.2%) and genotypes of 18,047 birds. Both data sets were analysed using best linear unbiased predictor (BLUP) or single-step GBLUP (ssGBLUP). The whole data set included all phenotypes available, whereas in the partial data set, phenotypes of the most recent generation were removed. In the simulated data set, the accuracies based on the LR formulas were 0.45 for BLUP and 0.76 for ssGBLUP, whereas the correlations between true breeding values and EBVs (i.e. true accuracies) were 0.37 and 0.65, respectively. The gain in accuracy by adding genomic information was overestimated by 0.09 when using the LR method compared to the true increase in accuracy. However, when the estimated ratio between the additive variance computed based on pedigree only and on pedigree and genomic information was considered, the difference between true and estimated gain was <0.02. Accuracies of BLUP and ssGBLUP with the real data set were 0.41 and 0.47, respectively. This small improvement in accuracy when using ssGBLUP with the real data set was due to population structure and lower heritability. 
The LR method is a useful tool for estimating improvements in accuracy of EBVs due to the inclusion of genomic information when traditional validation methods, such as k-fold validation and predictive ability, are not applicable.
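
As a rough illustration of the simulation step described in this abstract (a continuous trait converted to a binary score at a 7% incidence), the conversion can be sketched as a cut-off on a liability scale. This is only a sketch under assumptions: the liabilities are taken to be standard normal, and the function names, seed, and sample size are hypothetical and not taken from the study.

```python
import random
from statistics import NormalDist


def liability_threshold(incidence: float) -> float:
    """Cut-off on a standard-normal liability scale (assumed here) such that
    a proportion `incidence` of animals falls above it."""
    return NormalDist().inv_cdf(1.0 - incidence)


def to_binary(liabilities, incidence):
    """Score 1 (failure) when the liability exceeds the cut-off, else 0."""
    cutoff = liability_threshold(incidence)
    return [1 if value > cutoff else 0 for value in liabilities]


# Hypothetical example: simulate standard-normal liabilities and score them.
rng = random.Random(1)
liabilities = [rng.gauss(0.0, 1.0) for _ in range(45000)]
scores = to_binary(liabilities, 0.07)
observed_incidence = sum(scores) / len(scores)
```

With enough simulated animals, `observed_incidence` should sit close to the assumed 7%.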

Asunto(s)

Pollos , Genoma , Animales , Genómica , Genotipo , Modelos Lineales , Modelos Genéticos , Linaje , Fenotipo

14.

Performances of Adaptive MultiBLUP, Bayesian regressions, and weighted-GBLUP approaches for genomic predictions in Belgian Blue beef cattle.

Gualdrón Duarte, José Luis; Gori, Ann-Stephan; Hubin, Xavier; Lourenco, Daniela; Charlier, Carole; Misztal, Ignacy; Druet, Tom.

BMC Genomics ; 21(1): 545, 2020 Aug 06.

Artículo en Inglés | MEDLINE | ID: mdl-32762654

RESUMEN

BACKGROUND: Genomic selection has been successfully implemented in many livestock and crop species. The genomic best linear unbiased predictor (GBLUP) approach, assigning equal variance to all SNP effects, is one of the reference methods. When large-effect variants contribute to complex traits, it has been shown that genomic prediction methods that assign a higher variance to subsets of SNP effects can achieve higher prediction accuracy. We herein compared the efficiency of several such approaches, including the Adaptive MultiBLUP (AM-BLUP) that uses local genomic relationship matrices (GRM) to automatically identify and weight genomic regions with large effects, to predict genetic merit in Belgian Blue beef cattle. RESULTS: We used a population of approximately 10,000 genotyped cows and their phenotypes for 14 traits, mostly related to muscular development and body dimensions. Depending on the trait, we found that 4 to 25% of the genetic variance could be associated with 2 to 12 genomic regions harbouring large-effect variants. Notably, three previously identified recessive deleterious variants presented heterozygote advantage and were among the most significant SNPs for several traits. The AM-BLUP resulted in increased reliability of genomic predictions compared to GBLUP (+2%), but Bayesian methods proved more efficient (+3%). Overall, the reliability gains thus remained limited, although higher gains were observed for skin thickness, a trait affected by two genomic regions having particularly large effects. Higher accuracies than those from the original AM-BLUP were achieved when applying the Bayesian Sparse Linear Mixed Model to pre-select groups of SNPs with large effects and subsequently use their estimated variance to build a weighted GRM. Finally, the single-step GBLUP performed best and could be further improved (+3% prediction accuracy) by using these weighted GRM. 
CONCLUSIONS: The AM-BLUP is an attractive method to automatically identify and weight genomic regions with large effects on complex traits. However, the method was less accurate than Bayesian methods. Overall, weighted methods achieved modest accuracy gains compared to GBLUP. Nevertheless, the computational efficiency of the AM-BLUP might be valuable at higher marker density, including with whole-genome sequencing data. Furthermore, weighted GRM are particularly useful to account for large variance loci in the single-step GBLUP.

Asunto(s)

Genoma , Genómica , Animales , Teorema de Bayes , Bélgica , Bovinos/genética , Femenino , Genotipo , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple , Reproducibilidad de los Resultados

15.

Genomic prediction of growth traits for pigs in the presence of genotype by environment interactions using single-step genomic reaction norm model.

Song, Hailiang; Zhang, Qin; Misztal, Ignacy; Ding, Xiangdong.

J Anim Breed Genet ; 137(6): 523-534, 2020 Nov.

Artículo en Inglés | MEDLINE | ID: mdl-32779853

RESUMEN

Economically important traits are usually complex traits influenced by genes, environment and genotype-by-environment (G × E) interactions. Ignoring G × E interaction could lead to bias in the estimation of breeding values and selection decisions. A total of 1,778 pigs were genotyped using the PorcineSNP80 BeadChip. The existence of G × E interactions was investigated using a single-step reaction norm model for growth traits of days to 100 kg (AGE) and backfat thickness adjusted to 100 kg (BFT), based on a pedigree-based relationship matrix (A) or a genomic-pedigree joint relationship matrix (H). In the reaction norm model, the herd-year-season effect was measured as the environmental variable (EV). Our results showed G × E interactions for BFT but not for AGE. For both AGE and BFT, the genomic reaction norm model (H) produced more accurate predictions than the conventional reaction norm model (A). For BFT, with EV ranging from 0.5 to 1, accuracies were greater with the reaction norm model than with the reduced model that did not exploit G × E interaction, increasing by 3.9% and 4.6% based on the A and H matrices, respectively. For EVs ranging from 0 to 0.4, however, the reaction norm model yielded approximately 8.4% and 7.9% lower accuracy based on the A and H matrices, respectively. In addition, for BFT, the highest accuracy was obtained in the BJLM6 farm for realizing directional selection. This study will help to apply G × E interactions to practical genomic selection.

Asunto(s)

Cruzamiento , Genoma/genética , Genómica/métodos , Porcinos/genética , Animales , Interacción Gen-Ambiente , Genotipo , Modelos Genéticos , Fenotipo , Porcinos/crecimiento & desarrollo

16.

Modeling honey yield, defensive and swarming behaviors of Italian honey bees (Apis mellifera ligustica) using linear-threshold approaches.

Andonov, Sreten; Costa, Cecilia; Uzunov, Aleksandar; Bergomi, Patrizia; Lourenco, Daniela; Misztal, Ignacy.

BMC Genet ; 20(1): 78, 2019 Oct 21.

Artículo en Inglés | MEDLINE | ID: mdl-31638899

RESUMEN

BACKGROUND: Genetic improvement of honey bees is more difficult compared to other livestock, due to the very different reproductive behavior. Estimation of breeding values requires specific adjustment and the use of sires in the pedigree is only possible when mating of queens and drones is strictly controlled. In the breeding program of the National Registry for Italian Queen Breeders and Bee Producers the paternal contribution is mostly unknown. As stronger modeling may compensate for the lack of pedigree information, we tested two models that differed in the way the direct and maternal effects were considered. The two models were tested using 4003 records for honey yield, defensive and swarming behaviors of Italian honey bee queens produced between 2002 and 2014. The first model accounted for the direct genetic effect of worker bees and the genetic maternal effect of the queen, whereas model 2 considered the direct genetic effect of the queen without maternal effect. The analyses were performed by linear (honey production) and threshold (defensive and swarming behavior) single-trait models; estimated genetic correlations among traits were obtained by a three-trait linear-threshold model. RESULTS: For all traits, the highest predictability (correlation between breeding values estimated with and without performance records) was obtained with model 2, where direct genetic effect of queens was considered. With this model, heritability estimates were 0.26 for honey yield, 0.36 for defensive behavior, and 0.34 for swarming behavior. Multi-trait estimation resulted in similar or higher heritability estimates for all traits. A low, positive genetic correlation (0.19) was found between honey yield and defensive behavior, whereas the genetic correlation between honey yield and swarming behavior was moderate (0.41). A strong, positive genetic correlation was found between defensive and swarming behaviors (0.62). 
Predictability for multi-trait evaluations was higher for honey yield (0.46) and defensive behavior (0.30) but almost identical for swarming behavior (0.45) compared to corresponding single-trait predictability. CONCLUSIONS: Multi-trait evaluation using a model that accounts for the direct genetic effect of queen was the best approach for breeding value estimation of Italian honey bees. The results suggest a new direction for selection of linear and categorical traits in breeding programs where drone origin is unknown.

Asunto(s)

Abejas/fisiología , Conducta Animal/fisiología , Sitios de Carácter Cuantitativo , Animales , Cruzamiento , Conducta Materna , Conducta Sexual Animal

17.

Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study.

Pocrnic, Ivan; Lourenco, Daniela A L; Masuda, Yutaka; Misztal, Ignacy.

Genet Sel Evol ; 51(1): 75, 2019 Dec 12.

Artículo en Inglés | MEDLINE | ID: mdl-31830899

RESUMEN

BACKGROUND: The dimensionality of genomic information is limited by the number of independent chromosome segments (Me), which is a function of the effective population size. This dimensionality can be determined approximately by singular value decomposition of the gene content matrix, by eigenvalue decomposition of the genomic relationship matrix (GRM), or by the number of core animals in the algorithm for proven and young (APY) that maximizes the accuracy of genomic prediction. In the latter, core animals act as proxies to linear combinations of Me. Field studies indicate that a moderate accuracy of genomic selection is achieved with a small dataset, but that further improvement of the accuracy requires much more data. When only one quarter of the optimal number of core animals are used in the APY algorithm, the accuracy of genomic selection is only slightly below the optimal value. This suggests that genomic selection works on clusters of Me. RESULTS: The simulation included datasets with different population sizes and amounts of phenotypic information. Computations were done by genomic best linear unbiased prediction (GBLUP) with selected eigenvalues and corresponding eigenvectors of the GRM set to zero. About four eigenvalues in the GRM explained 10% of the genomic variation, and less than 2% of the total eigenvalues explained 50% of the genomic variation. With limited phenotypic information, the accuracy of GBLUP was close to the peak where most of the smallest eigenvalues were set to zero. With a large amount of phenotypic information, accuracy increased as smaller eigenvalues were added. CONCLUSIONS: A small amount of phenotypic data is sufficient to estimate only the effects of the largest eigenvalues and the associated eigenvectors that contain a large fraction of the genomic information, and a very large amount of data is required to estimate the remaining eigenvalues that account for a limited amount of genomic information. 
Core animals in the APY algorithm act as proxies of almost the same number of eigenvalues. By using an eigenvalues-based approach, it was possible to explain why the moderate accuracy of genomic selection based on small datasets only increases slowly as more data are added.
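
The statement that a few of the largest eigenvalues explain a large share of the genomic variation can be illustrated with a small counting routine. This is a minimal sketch, assuming the eigenvalues of a GRM have already been computed elsewhere; the function name and the toy eigenvalues are hypothetical and are not results from the study.

```python
def n_eigenvalues_for_fraction(eigenvalues, fraction):
    """Return how many of the largest eigenvalues are needed so that their
    sum reaches `fraction` of the total (sum of all eigenvalues)."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    running = 0.0
    for count, value in enumerate(ordered, start=1):
        running += value
        if running >= fraction * total:
            return count
    return len(ordered)


# Hypothetical toy spectrum (sums to 100), not data from the paper.
toy_eigenvalues = [40.0, 20.0, 10.0, 10.0, 5.0, 5.0, 4.0, 3.0, 2.0, 1.0]
needed_for_half = n_eigenvalues_for_fraction(toy_eigenvalues, 0.5)
```

In this toy spectrum two eigenvalues already cover half of the total, which mirrors the kind of concentration the abstract reports, although the real proportions depend on the population.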

Asunto(s)

Genómica/métodos , Algoritmos , Animales , Simulación por Computador , Fenotipo , Densidad de Población

18.

Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle.

Aguilar, Ignacio; Legarra, Andres; Cardoso, Fernando; Masuda, Yutaka; Lourenco, Daniela; Misztal, Ignacy.

Genet Sel Evol ; 51(1): 28, 2019 Jun 20.

Artículo en Inglés | MEDLINE | ID: mdl-31221101

RESUMEN

BACKGROUND: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. METHODS: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. RESULTS: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10^(-5.9) were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. CONCLUSIONS: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped.
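
The idea of testing a single marker from its estimated effect and its prediction error variance can be sketched as a two-sided normal test. This is only an illustration under assumptions: the test statistic is taken to be the effect divided by the square root of its prediction error variance, the function names are hypothetical, and the marker count used for the Bonferroni example is a made-up value chosen only so that 0.05 divided by it lands near a threshold of 10^(-5.9). It is not the marker count from the study.

```python
import math


def snp_p_value(effect: float, pev: float) -> float:
    """Two-sided p-value for one SNP, assuming effect / sqrt(pev) is
    standard normal under the null hypothesis of no effect."""
    z = abs(effect) / math.sqrt(pev)
    return math.erfc(z / math.sqrt(2.0))


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance level after Bonferroni correction."""
    return alpha / n_tests


# Hypothetical number of tested markers (assumption, not from the paper).
hypothetical_n_markers = 39716
threshold = bonferroni_threshold(0.05, hypothetical_n_markers)
minus_log10_threshold = -math.log10(threshold)
```

A SNP would be declared significant in this sketch when `snp_p_value(effect, pev)` falls below `threshold`.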

Asunto(s)

Peso al Nacer/genética , Bovinos/genética , Marcadores Genéticos , Estudio de Asociación del Genoma Completo/veterinaria , Algoritmos , Animales , Conjuntos de Datos como Asunto , Femenino , Masculino , Modelos Genéticos , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo

19.

Bias in heritability estimates from genomic restricted maximum likelihood methods under different genotyping strategies.

Cesarani, Alberto; Pocrnic, Ivan; Macciotta, Nicolò P P; Fragomeni, Breno O; Misztal, Ignacy; Lourenco, Daniela A L.

J Anim Breed Genet ; 136(1): 40-50, 2019 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-30426582

RESUMEN

We investigated the effects of different strategies for genotyping populations on variance components and heritabilities estimated with an animal model under restricted maximum likelihood (REML), genomic REML (GREML), and single-step GREML (ssGREML). A population with 10 generations was simulated. Animals from the last one, two or three generations were genotyped with 45,116 SNP evenly distributed on 27 chromosomes. Animals to be genotyped were chosen randomly or based on EBV. Each scenario was replicated five times. A single trait was simulated with three heritability levels (low, moderate, high). Phenotypes were simulated for only females to mimic dairy sheep and also for both sexes to mimic meat sheep. Variance component estimates from genomic data and phenotypes for one or two generations were more biased than from three generations. Estimates in the scenario without selection were the most accurate across heritability levels and methods. When selection was present in the simulations, the best option was to use genotypes of randomly selected animals. For selective genotyping, heritabilities from GREML were more biased compared to those estimated by ssGREML, because ssGREML was less affected by selective or limited genotyping.

Asunto(s)

Genómica , Técnicas de Genotipaje/métodos , Animales , Funciones de Verosimilitud , Masculino , Modelos Genéticos , Linaje

20.

Correction: Theoretical accuracy for indirect predictions based on SNP effects from single-step GBLUP.

Garcia, Andre; Aguilar, Ignacio; Legarra, Andres; Tsuruta, Shogo; Misztal, Ignacy; Lourenco, Daniela.

Genet Sel Evol ; 55(1): 26, 2023 Apr 17.

Artículo en Inglés | MEDLINE | ID: mdl-37069505
