Pesquisa | BVS Doenças Infecciosas e Parasitárias

1.

Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds.

Aldridge, Michael; Vandenplas, Jeremie; Duenk, Pascal; Henshall, John; Hawken, Rachel; Calus, Mario.

Genet Sel Evol ; 55(1): 19, 2023 Mar 22.

Artigo em Inglês | MEDLINE | ID: mdl-36949392

RESUMO

BACKGROUND: In genomic prediction, it is common to centre the genotypes of single nucleotide polymorphisms based on the allele frequencies in the current population, rather than those in the base generation. The mean breeding value of non-genotyped animals is conditional on the mean performance of genotyped relatives, but can be corrected by fitting the mean performance of genotyped individuals as a fixed regression. The associated covariate vector has been referred to as a 'J-factor', which if fitted as a fixed effect can improve the accuracy and dispersion bias of sire genomic estimated breeding values (GEBV). To date, this has only been performed on populations with a single breed. Here, we investigated whether there was any benefit in fitting a separate J-factor for each breed in a three-way crossbred population, and in using pedigree-based expected or genome-based estimated breed fractions to define the J-factors. RESULTS: For body weight at 7 days, dispersion bias decreased when fitting multiple J-factors, but only with a low proportion of genotyped individuals with selective genotyping. On average, the mean regression coefficients of validation records on those of GEBV increased with one J-factor compared to none, and further increased with multiple J-factors. However, for body weight at 35 days this was not observed. The accuracy of GEBV remained unchanged regardless of the J-factor method used. Differences between the J-factor methods were limited with correlations approaching 1 for the estimated covariate vector, the estimated coefficients of the regression on the J-factors, and the GEBV. CONCLUSIONS: Based on our results and in the particular design analysed here, i.e. all the animals with phenotype are of the same type of crossbreds, fitting a single J-factor should be sufficient, to reduce dispersion bias. Fitting multiple J-factors may reduce dispersion bias further but this depends on the trait and genotyping rate. For the crossbred population analysed, fitting multiple J-factors has no adverse consequences and if this is done, it does not matter if the breed fractions used are based on the pedigree-expectation or the genomic estimates. Finally, when GEBV are estimated from crossbred data, any observed bias can potentially be reduced by including a straightforward regression on actual breed proportions.

Assuntos

Genoma , Modelos Genéticos , Animais , Genótipo , Genômica/métodos , Fenótipo , Polimorfismo de Nucleotídeo Único , Linhagem

2.

Integration of beef cattle international pedigree and genomic estimated breeding values into national evaluations, with an application to the Italian Limousin population.

Bonifazi, Renzo; Calus, Mario P L; Ten Napel, Jan; Veerkamp, Roel F; Biffani, Stefano; Cassandro, Martino; Savoia, Simone; Vandenplas, Jérémie.

Genet Sel Evol ; 55(1): 41, 2023 Jun 12.

Artigo em Inglês | MEDLINE | ID: mdl-37308814

RESUMO

BACKGROUND: International evaluations combine data from different countries allowing breeders to have access to larger panels of elite bulls and to increase the accuracy of estimated breeding values (EBV). However, international and national evaluations can use different sources of information to compute EBV (EBVINT and EBVNAT, respectively), leading to differences between them. Choosing one of these EBV results in losing the information that is contained only in the discarded EBV. Our objectives were to define and validate a procedure to integrate publishable sires' EBVINT and their associated reliabilities computed from pedigree-based or single-step international beef cattle evaluations into national evaluations to obtain "blended" EBV. The Italian (ITA) pedigree-based national evaluation was used as a case study to validate the integration procedure. METHODS: Publishable sires' international information, i.e. EBVINT and their associated reliabilities, was included in the national evaluation as pseudo-records. Data were available for 444,199 individual age-adjusted weaning weights of Limousin cattle from eight countries and 17,607 genotypes from four countries (ITA excluded). To mimic differences between international and national evaluations, international evaluations included phenotypes (and genotypes) of animals born prior to January 2019, while national evaluations included ITA phenotypes of animals born until April 2019. International evaluations using all available information were considered as reference scenarios. Publishable sires were divided into three groups: sires with ≥ 15, < 15 and no recorded offspring in ITA. RESULTS: Overall, for these three groups, integrating either pedigree-based or single-step international information into national pedigree-based evaluations improved the similarity of the blended EBV with the reference EBV compared to national evaluations without integration. For instance, the correlation with the reference EBV for direct (maternal) EBV went from 0.61 (0.79) for a national evaluation without integration to 0.97 (0.88) when integrating single-step international information, on average across all groups of publishable sires. CONCLUSIONS: Our proposed one-animal-at-a-time integration procedure yields blended EBV that are in close agreement with full international EBV for all groups of animals analysed. The procedure can be directly applied by countries since it does not rely on specific software and is computationally inexpensive, allowing straightforward integration of publishable sires' EBVINT from pedigree-based or single-step based international beef cattle evaluations into national evaluations.

Assuntos

Genômica , Bovinos , Animais , Masculino , Linhagem , Genótipo , Fenótipo , Valores de Referência

3.

Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates.

Vandenplas, Jeremie; Ten Napel, Jan; Darbaghshahi, Saeid Naderi; Evans, Ross; Calus, Mario P L; Veerkamp, Roel; Cromie, Andrew; Mäntysaari, Esa A; Strandén, Ismo.

Genet Sel Evol ; 55(1): 37, 2023 Jun 08.

Artigo em Inglês | MEDLINE | ID: mdl-37291510

RESUMO

BACKGROUND: Single-step genomic best linear unbiased prediction (ssGBLUP) models allow the combination of genomic, pedigree, and phenotypic data into a single model, which is computationally challenging for large genotyped populations. In practice, genotypes of animals without their own phenotype and progeny, so-called genotyped selection candidates, can become available after genomic breeding values have been estimated by ssGBLUP. In some breeding programmes, genomic estimated breeding values (GEBV) for these animals should be known shortly after obtaining genotype information but recomputing GEBV using the full ssGBLUP takes too much time. In this study, first we compare two equivalent formulations of ssGBLUP models, i.e. one that is based on the Woodbury matrix identity applied to the inverse of the genomic relationship matrix, and one that is based on marker equations. Second, we present computationally-fast approaches to indirectly compute GEBV for genotyped selection candidates, without the need to do the full ssGBLUP evaluation. RESULTS: The indirect approaches use information from the latest ssGBLUP evaluation and rely on the decomposition of GEBV into its components. The two equivalent ssGBLUP models and indirect approaches were tested on a six-trait calving difficulty model using Irish dairy and beef cattle data that include 2.6 million genotyped animals of which about 500,000 were considered as genotyped selection candidates. When using the same computational approaches, the solving phase of the two equivalent ssGBLUP models showed similar requirements for memory and time per iteration. The computational differences between them were due to the preprocessing phase of the genomic information. Regarding the indirect approaches, compared to GEBV obtained from single-step evaluations including all genotypes, indirect GEBV had correlations higher than 0.99 for all traits while showing little dispersion and level bias. CONCLUSIONS: In conclusion, ssGBLUP predictions for the genotyped selection candidates were accurately approximated using the presented indirect approaches, which are more memory efficient and computationally fast, compared to solving a full ssGBLUP evaluation. Thus, indirect approaches can be used even on a weekly basis to estimate GEBV for newly genotyped animals, while the full single-step evaluation is done only a few times within a year.

Assuntos

Genoma , Modelos Genéticos , Animais , Bovinos/genética , Genótipo , Genômica , Fenótipo , Linhagem

4.

Invited review: Reliability computation from the animal model era to the single-step genomic model era.

Ben Zaabza, Hafedh; Van Tassell, Curtis P; Vandenplas, Jeremie; VanRaden, Paul; Liu, Zengting; Eding, Herwin; McKay, Stephanie; Haugaard, Katrine; Lidauer, Martin H; Mäntysaari, Esa A; Strandén, Ismo.

J Dairy Sci ; 106(3): 1518-1532, 2023 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-36567247

RESUMO

The calculation of exact reliabilities involving the inversion of mixed model equations poses a heavy computational challenge when the system of equations is large. This has prompted the development of different approximation methods. We give an overview of the various methods and computational approaches in calculating reliability from the era before the animal model to the era of single-step genomic models. The different methods are discussed in terms of modeling, development, and applicability in large dairy cattle populations. The paper also describes the problems faced in reliability computation. Many details dispersed throughout the literature are presented in this paper. It is clear that a universal solution applicable to every model and input data may not be possible, but we point out several efficient and accurate algorithms developed recently for a variety of very large genomic evaluations.

Assuntos

Genoma , Genômica , Bovinos , Animais , Reprodutibilidade dos Testes , Genômica/métodos , Modelos Animais , Algoritmos , Genótipo , Modelos Genéticos , Fenótipo

5.

Impact of genomic preselection on subsequent ssGBLUP evaluation of preselected animals for scarcely recorded feed intake in pigs.

Jibrila, Ibrahim; Ten Napel, Jan; Vandenplas, Jeremie; Bergsma, Rob; Veerkamp, Roel F; Calus, Mario P L.

J Anim Breed Genet ; 140(3): 253-263, 2023 May.

Artigo em Inglês | MEDLINE | ID: mdl-36637041

RESUMO

We have previously shown that single-step genomic best linear unbiased prediction (ssGBLUP) estimates breeding values of genomically preselected animals without preselection bias for widely recorded traits, that is traits recorded for the majority of animals in the breeding population. This study investigated the impact of genomic preselection (GPS) on accuracy and bias in ssGBLUP evaluation of genomically preselected animals for a scarcely recorded trait, that is a trait recorded for only a small proportion of the animals, which generally has a lower prediction accuracy than widely recorded traits, mainly due to having a much smaller number of phenotypes available. We used data from a commercial pig breeding program, considering feed intake as a scarcely recorded target trait, being available for ~30% of the animals with phenotypes for any trait, and average daily gain, backfat thickness and loin depth as widely recorded predictor traits, being available for >95% of the animals with phenotypes for any trait. The data contained the routine GPS implemented by commercial animal breeding programs, and we retrospectively implemented two scenarios with additional layers of GPS by discarding pedigree, genotypes and phenotypes of animals without progeny. The ssGBLUP evaluation following GPS used records only from the target trait, only from the predictor traits, or both. Accuracy for feed intake did not differ statistically across GPS scenarios, although it tended to decrease with more intense GPS. The accuracy had average values of 0.37, 0.44, and 0.45 across all GPS scenarios when, respectively, records from only the target trait, only the predictor traits, or both were used in the ssGBLUP evaluation. Considerable deflation of the genomic breeding values for feed intake was observed in the most stringent GPS scenario, due to the variance components being underestimated as a result of the limited amount of strongly preselected data. As long as (co)variance components were unbiased, no or only marginal bias was observed. These results for accuracy and bias were observed whether records of the scarcely recorded target trait, of the predictor traits, or both were used in the ssGBLUP evaluation. Our results show that for the scarcely recorded feed intake in pigs, ssGBLUP is able to estimate breeding values of preselected animals without preselection bias, similarly as previously observed for widely recorded traits.

Assuntos

Genoma , Genômica , Animais , Suínos/genética , Estudos Retrospectivos , Genômica/métodos , Genótipo , Fenótipo , Ingestão de Alimentos/genética , Linhagem , Modelos Genéticos

6.

Estimation of dam line composition of 3-way crossbred animals using genomic information.

Calus, Mario P L; Henshall, John M; Hawken, Rachel; Vandenplas, Jérémie.

Genet Sel Evol ; 54(1): 44, 2022 Jun 15.

Artigo em Inglês | MEDLINE | ID: mdl-35705918

RESUMO

BACKGROUND: In genomic prediction including data of 3- or 4-way crossbred animals, line composition is usually fitted as a regression on expected line proportions, which are 0.5, 0.25 and 0.25, respectively, for 3-way crossbred animals. However, actual line proportions for the dam lines can vary between ~ 0.1 and 0.4, and ignoring this variation may affect the genomic estimated breeding values of purebred selection candidates. Our aim was to validate a proposed gold standard to evaluate different approaches for estimating line proportions using simulated data, and to subsequently use this in actual 3-way crossbred broiler data to evaluate several other methods. RESULTS: Analysis of simulated data confirmed that line proportions computed from assigned breed-origin-of-alleles (BOA) provide a very accurate gold standard, even if the parental lines are closely related. Alternative investigated methods were linear regression of genotypes on line-specific allele frequencies, maximum likelihood estimation using the program ADMIXTURE, and the genomic relationship of crossbred animals with their maternal grandparents. The results from the simulated data showed that the genomic relationship with the maternal grandparent was most accurate, and least affected by closer relationships between the dam lines. Linear regression and ADMIXTURE performed similarly for unrelated lines, but their accuracy dropped considerably when the dam lines were more closely related. In almost all cases, estimates improved after adjusting them to ensure that the sum of dam line contributions within animals was equal to 0.5, and within dam line and across animals the average was equal to 0.25. Results from the broiler data were much more similar between methods. In both cases, stringent linkage disequilibrium pruning of genotype data led to a relatively low accuracy of predicted line proportions, due to the loss of too many single nucleotide polymorphisms. CONCLUSIONS: With relatively unrelated parental lines as typical in crosses in pigs and poultry, linear regression of crossbred genotypes on line-specific allele frequencies and ADMIXTURE are very competitive methods. Thus, linear regression may be the method of choice, as it does not require genotypes of grandparents, is computationally very efficient, and easily implemented and adapted for considering the specific nature of the crossbred animals analysed.

Assuntos

Galinhas , Modelos Genéticos , Alelos , Animais , Galinhas/genética , Genômica , Genótipo , Hibridização Genética , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único , Suínos

7.

Impact of genomic preselection on subsequent genetic evaluations with ssGBLUP using real data from pigs.

Jibrila, Ibrahim; Vandenplas, Jeremie; Ten Napel, Jan; Bergsma, Rob; Veerkamp, Roel F; Calus, Mario P L.

Genet Sel Evol ; 54(1): 48, 2022 Jun 28.

Artigo em Inglês | MEDLINE | ID: mdl-35764921

RESUMO

BACKGROUND: Empirically assessing the impact of preselection on genetic evaluation of preselected animals requires comparing scenarios that take different approaches into account, including scenarios without preselection. However, preselection is almost always performed in animal breeding programs, so it is difficult to have a dataset without preselection. Hence, most studies on preselection have used simulated datasets, and have concluded that genomic estimated breeding values (GEBV) from subsequent single-step genomic best linear unbiased prediction (ssGBLUP) evaluations are unbiased. The aim of this study was to investigate the impact of genomic preselection (GPS) on accuracy and bias in subsequent ssGBLUP evaluations, using data from a commercial pig breeding program. METHODS: We used data on average daily gain during performance testing, average daily gain throughout life, backfat thickness, and loin depth from one sire line and one dam line of pigs. As these traits have different weights in the breeding goals of the two lines, we analyzed the lines separately. For each line, we implemented a reference GPS scenario that kept all available data, against which the next two scenarios were compared. We then implemented two other scenarios with additional layers of GPS by removing all animals without progeny either (i) only in the validation generation, or (ii) in all generations. We conducted subsequent ssGBLUP evaluations for each GPS scenario, using all the data remaining after implementing the GPS scenario. Accuracy and bias were computed by comparing GEBV against progeny yield deviations of validation animals. RESULTS: Results for all traits and in both lines showed a marginal loss in accuracy due to the additional layers of GPS. Average accuracies across all GPS scenarios in the two lines were 0.39, 0.47, 0.56, and 0.60, for average daily gain during performance testing and throughout life, backfat thickness, and loin depth, respectively. Biases were largely absent, and when present, did not differ greatly between the GPS scenarios. CONCLUSIONS: We conclude that the impact of preselection on accuracy and bias in subsequent ssGBLUP evaluations of selection candidates in pigs is generally minimal. We expect this conclusion to apply for other animal breeding programs as well, since preselection of any type or intensity generally has the same effect in animal breeding programs.

Assuntos

Genoma , Modelos Genéticos , Animais , Viés , Genômica/métodos , Fenótipo , Suínos/genética

8.

International single-step SNPBLUP beef cattle evaluations for Limousin weaning weight.

Bonifazi, Renzo; Calus, Mario P L; Ten Napel, Jan; Veerkamp, Roel F; Michenet, Alexis; Savoia, Simone; Cromie, Andrew; Vandenplas, Jérémie.

Genet Sel Evol ; 54(1): 57, 2022 Sep 04.

Artigo em Inglês | MEDLINE | ID: mdl-36057564

RESUMO

BACKGROUND: Compared to national evaluations, international collaboration projects further improve accuracies of estimated breeding values (EBV) by building larger reference populations or performing a joint evaluation using data (or proxy of them) from different countries. Genomic selection is increasingly adopted in beef cattle, but, to date, the benefits of including genomic information in international evaluations have not been explored. Our objective was to develop an international beef cattle single-step genomic evaluation and investigate its impact on the accuracy and bias of genomic evaluations compared to current pedigree-based evaluations. METHODS: Weaning weight records were available for 331,593 animals from seven European countries. The pedigree included 519,740 animals. After imputation and quality control, 17,607 genotypes at a density of 57,899 single nucleotide polymorphisms (SNPs) from four countries were available. We implemented two international scenarios where countries were modelled as different correlated traits: an international genomic single-step SNP best linear unbiased prediction (SNPBLUP) evaluation (ssSNPBLUPINT) and an international pedigree-based BLUP evaluation (PBLUPINT). Two national scenarios were implemented for pedigree and genomic evaluations using only nationally submitted phenotypes and genotypes. Accuracies, level and dispersion bias of EBV of animals born from 2014 onwards, and increases in population accuracies were estimated using the linear regression method. RESULTS: On average across countries, 39 and 17% of sires and maternal-grand-sires with recorded (grand-)offspring across two countries were genotyped. ssSNPBLUPINT showed the highest accuracies of EBV and, compared to PBLUPINT, led to increases in population accuracy of 13.7% for direct EBV, and 25.8% for maternal EBV, on average across countries. Increases in population accuracies when moving from national scenarios to ssSNPBLUPINT were observed for all countries. Overall, ssSNPBLUPINT level and dispersion bias remained similar or slightly reduced compared to PBLUPINT and national scenarios. CONCLUSIONS: International single-step SNPBLUP evaluations are feasible and lead to higher population accuracies for both large and small countries compared to current international pedigree-based evaluations and national evaluations. These results are likely related to the larger multi-country reference population and the inclusion of phenotypes from relatives recorded in other countries via single-step international evaluations. The proposed international single-step approach can be applied to other traits and breeds.

Assuntos

Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Animais , Bovinos/genética , Genoma , Genótipo , Linhagem , Fenótipo , Desmame

9.

Convergence behavior of single-step GBLUP and SNPBLUP for different termination criteria.

Vandenplas, Jeremie; Calus, Mario P L; Eding, Herwin; van Pelt, Mathijs; Bergsma, Rob; Vuik, Cornelis.

Genet Sel Evol ; 53(1): 34, 2021 Apr 09.

Artigo em Inglês | MEDLINE | ID: mdl-33836661

RESUMO

BACKGROUND: The preconditioned conjugate gradient (PCG) method is the current method of choice for iterative solving of genetic evaluations. The relative difference between two successive iterates and the relative residual of the system of equations are usually chosen as a termination criterion for the PCG method in animal breeding. However, our initial analyses showed that these two commonly used termination criteria may report that a PCG method applied to a single-step single nucleotide polymorphism best linear unbiased prediction (ssSNPBLUP) is not converged yet, whereas the solutions are accurate enough for practical use. Therefore, the aim of this study was to propose two termination criteria that have been (partly) developed in other fields, but are new in animal breeding, and to compare their behavior to that of the two termination criteria widely used in animal breeding for the PCG method applied to ssSNPBLUP. The convergence patterns of ssSNPBLUP were also compared to the convergence patterns of single-step genomic BLUP (ssGBLUP). RESULTS: Building upon previous work, we propose two termination criteria that take the properties of the system of equations into account. These two termination criteria are directly related to the relative error of the iterates with respect to the true solutions. Based on pig and dairy cattle datasets, we show that the preconditioned coefficient matrices of ssSNPBLUP and ssGBLUP have similar properties when using a second-level preconditioner for ssSNPBLUP. Therefore, the PCG method applied to ssSNPBLUP and ssGBLUP converged similarly based on the relative error of the iterates with respect to the true solutions. This similar convergence behavior between ssSNPBLUP and ssGBLUP was observed for both proposed termination criteria. This was, however, not the case for the termination criterion defined as the relative residual when applied to the dairy cattle evaluations. CONCLUSION: Our results showed that the PCG method can converge similarly when applied to ssSNPBLUP and to ssGBLUP. The two proposed termination criteria always depicted these similar convergence behaviors, and we recommend them for comparing convergence properties of different models and for routine evaluations.

Assuntos

Cruzamento/métodos , Estudo de Associação Genômica Ampla/métodos , Animais , Bovinos/genética , Estudo de Associação Genômica Ampla/veterinária , Modelos Genéticos , Polimorfismo de Nucleotídeo Único

10.

Technical note: Genetic groups in single-step single nucleotide polymorphism best linear unbiased predictor.

Vandenplas, Jeremie; Eding, Herwin; Calus, Mario P L.

J Dairy Sci ; 104(3): 3298-3303, 2021 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-33455759

RESUMO

Genetic groups, also called unknown or phantom parents groups, are often used in dairy cattle genetic evaluations to account for selection that cannot be accounted for by known genetic relationships. With the advent of genomic evaluations, the theory of genetic groups was extended to the so-called single-step genomic BLUP (ssGBLUP). In short, genetic groups can be fitted in ssGBLUP through regression effects, or by including them in the pedigree and computing the adequate combined pedigree and genomic relationship matrix. In this study, we applied the so-called Quaas and Pollak transformation to a system of equations for single-step SNP BLUP (ssSNPBLUP), such that genetic groups can thereafter be included in the pedigree. The example in this study showed that including the genetic groups in the pedigree for ssSNPBLUP allowed reduced memory burden and computational costs in comparison to genetic groups fitted as covariates.

Assuntos

Genoma , Polimorfismo de Nucleotídeo Único , Animais , Bovinos/genética , Ingestão de Alimentos , Genômica , Genótipo , Modelos Genéticos , Linhagem , Fenótipo , Polimorfismo de Nucleotídeo Único/genética

11.

Avoiding preselection bias in subsequent single-step genomic BLUP evaluations of genomically preselected animals.

Jibrila, Ibrahim; Vandenplas, Jeremie; Ten Napel, Jan; Veerkamp, Roel F; Calus, Mario P L.

J Anim Breed Genet ; 138(4): 432-441, 2021 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-33372707

RESUMO

In animal breeding, parents of the next generation are usually selected in multiple stages, and the initial stages of this selection are called preselection. Preselection reduces the information available for subsequent evaluation of preselected animals and this sometimes leads to bias. The objective of this study was to establish the minimum information required to subsequently evaluate genomically preselected animals without bias arising from preselection, with single-step genomic best linear unbiased prediction (ssGBLUP). We simulated a nucleus of a breeding program in which a recent population of 15 generations was produced. In each generation, parents of the next generation were selected in a single-stage selection based on pedigree BLUP. However, in generation 15, 10% of male and 15% of female offspring were preselected on their genomic estimated breeding values (GEBV). These GEBV were estimated using ssGBLUP, including the pedigree of all animals in generations 0-15, genotypes of all animals in generations 13-15 and phenotypes of all animals in generations 11-14. In subsequent ssGBLUP evaluation of these preselected animals, genotypes and phenotypes from various groups of animals were excluded one after another. We found that GEBV of the preselected animals were only estimated without preselection bias when genotypes and phenotypes of all animals in generations 13 and 14 and of the preselected animals were included in the subsequent evaluation. We also found that genotypes of the animals discarded at preselection only helped in reducing preselection bias in GEBV of their preselected sibs when genotypes of their parents were absent or excluded from the subsequent evaluation. We concluded that to prevent preselection bias in subsequent ssGBLUP evaluation of genomically preselected animals, information representative of the reference data used in the evaluation at preselection and genotypes and phenotypes of the preselected animals are needed in the subsequent evaluation.

Assuntos

Genoma , Animais , Feminino , Genômica , Genótipo , Masculino , Modelos Genéticos , Linhagem , Fenótipo

12.

Computational strategies for the preconditioned conjugate gradient method applied to ssSNPBLUP, with an application to a multivariate maternal model.

Vandenplas, Jeremie; Eding, Herwin; Bosmans, Maarten; Calus, Mario P L.

Genet Sel Evol ; 52(1): 24, 2020 May 13.

Artigo em Inglês | MEDLINE | ID: mdl-32404053

RESUMO

BACKGROUND: The single-step single nucleotide polymorphism best linear unbiased prediction (ssSNPBLUP) is one of the single-step evaluations that enable a simultaneous analysis of phenotypic and pedigree information of genotyped and non-genotyped animals with a large number of genotypes. The aim of this study was to develop and illustrate several computational strategies to efficiently solve different ssSNPBLUP systems with a large number of genotypes on current computers. RESULTS: The different developed strategies were based on simplified computations of some terms of the preconditioner, and on splitting the coefficient matrix of the different ssSNPBLUP systems into multiple parts to perform its multiplication by a vector more efficiently. Some matrices were computed explicitly and stored in memory (e.g. the inverse of the pedigree relationship matrix), or were stored using a compressed form (e.g. the Plink 1 binary form for the genotype matrix), to permit the use of efficient parallel procedures while limiting the required amount of memory. The developed strategies were tested on a bivariate genetic evaluation for livability of calves for the Netherlands and the Flemish region in Belgium. There were 29,885,286 animals in the pedigree, 25,184,654 calf records, and 131,189 genotyped animals. The ssSNPBLUP system required around 18 GB Random Access Memory and 12 h to be solved with the most performing implementation. CONCLUSIONS: Based on our proposed approaches and results, we showed that ssSNPBLUP provides a feasible approach in terms of memory and time requirements to estimate genomic breeding values using current computers.

Assuntos

Cruzamento , Bovinos/genética , Biologia Computacional , Genômica , Polimorfismo de Nucleotídeo Único/genética , Algoritmos , Animais , Feminino , Genótipo , Modelos Genéticos , Linhagem

13.

Investigating the impact of preselection on subsequent single-step genomic BLUP evaluation of preselected animals.

Jibrila, Ibrahim; Ten Napel, Jan; Vandenplas, Jeremie; Veerkamp, Roel F; Calus, Mario P L.

Genet Sel Evol ; 52(1): 42, 2020 Jul 29.

Artigo em Inglês | MEDLINE | ID: mdl-32727349

RESUMO

BACKGROUND: Preselection of candidates, hereafter referred to as preselection, is a common practice in breeding programs. Preselection can cause bias and accuracy loss in subsequent pedigree-based best linear unbiased prediction (PBLUP). However, the impact of preselection on subsequent single-step genomic BLUP (ssGBLUP) is not completely clear yet. Therefore, in this study, we investigated, across different heritabilities, the impact of intensity and type of preselection on subsequent ssGBLUP evaluation of preselected animals. METHODS: We simulated a nucleus of a breeding programme, in which a recent population of 15 generations was produced with PBLUP-based selection. In generation 15 of this recent population, the parents of the next generation were preselected using several preselection scenarios. These scenarios were combinations of three intensities of preselection (no, high or very high preselection) and three types of preselection (genomic, parental average or random), across three heritabilities (0.5, 0.3 or 0.1). Following each preselection scenario, a subsequent evaluation was performed using ssGBLUP by excluding all the information from the preculled animals, and these genetic evaluations were compared in terms of accuracy and bias for the preselected animals, and in terms of realized genetic gain. RESULTS: Type of preselection affected selection accuracy at both preselection and subsequent evaluation stages. While preselection accuracy decreased, accuracy in the subsequent ssGBLUP evaluation increased, from genomic to parent average to random preselection scenarios. Bias was always negligible. Genetic gain decreased from genomic to parent average to random preselection scenarios. Genetic gain also decreased with increasing intensity of preselection, but only by a maximum of 0.1 additive genetic standard deviation from no to very high genomic preselection scenarios. CONCLUSIONS: Using ssGBLUP in subsequent evaluations prevents preselection bias, irrespective of intensity and type of preselection, and heritability. With GPS, in addition to reducing the phenotyping effort considerably, the use of ssGBLUP in subsequent evaluations realizes only a slightly lower genetic gain than that realized without preselection. This is especially the case for traits that are expensive to measure (e.g. feed intake of individual broiler chickens), and traits for which phenotypes can only be measured at advanced stages of life (e.g. litter size in pigs).

Assuntos

Cruzamento/métodos , Gado/genética , Aves Domésticas/genética , Animais , Feminino , Masculino , Linhagem , Fenótipo , Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Seleção Genética

14.

Impact of sub-setting the data of the main Limousin beef cattle population on the estimates of across-country genetic correlations.

Bonifazi, Renzo; Vandenplas, Jeremie; Napel, Jan Ten; Matilainen, Kaarina; Veerkamp, Roel F; Calus, Mario P L.

Genet Sel Evol ; 52(1): 32, 2020 Jun 23.

Artigo em Inglês | MEDLINE | ID: mdl-32576143

RESUMO

BACKGROUND: Cattle international genetic evaluations allow the comparison of estimated breeding values (EBV) across different environments, i.e. countries. For international evaluations, across-country genetic correlations (rg) need to be estimated. However, lack of convergence of the estimated parameters and high standard errors of the rg are often experienced for beef cattle populations due to limited across-country genetic connections. Furthermore, using all available genetic connections to estimate rg is prohibitive due to computational constraints, thus sub-setting the data is necessary. Our objective was to investigate and compare the impact of strategies of data sub-setting on estimated across-country rg and their computational requirements. METHODS: Phenotype and pedigree information for age-adjusted weaning weight was available for ten European countries and 3,128,338 Limousin beef cattle males and females. Using a Monte Carlo based expectation-maximization restricted maximum likelihood (MC EM REML) methodology, we estimated across-country rg by using a multi-trait animal model where countries are modelled as different correlated traits. Values of rg were estimated using the full data and four different sub-setting strategies that aimed at selecting the most connected herds from the largest population. RESULTS: Using all available data, direct and maternal rg (standard errors in parentheses) were on average equal to 0.79 (0.14) and 0.71 (0.19), respectively. Direct-maternal within-country and between-country rg were on average equal to - 0.12 (0.09) and 0.00 (0.14), respectively. Data sub-setting scenarios gave similar results: on average, estimated rg were smaller compared to using all data for direct (0.02) and maternal (0.05) genetic effects. The largest differences were obtained for the direct-maternal within-country and between-country rg, which were, on average 0.13 and 0.12 smaller compared to values obtained by using all data. Standard errors always increased when reducing the data, by 0.02 to 0.06, on average. The proposed sub-setting strategies reduced the required computing time up to 22% compared to using all data. CONCLUSIONS: Estimating all 120 across-country rg that are required for beef cattle international evaluations, using a multi-trait MC EM REML approach, is feasible but involves long computing time. We propose four strategies to reduce computational requirements while keeping a multi-trait estimation approach. In all scenarios with data sub-setting, the estimated rg were consistently smaller (mainly for direct-maternal rg) and had larger standard errors.

Assuntos

Bovinos/genética , Técnicas de Genotipagem/métodos , Seleção Genética/genética , Algoritmos , Animais , Peso Corporal , Cruzamento , Europa (Continente) , Feminino , Genoma/genética , Genômica/métodos , Genótipo , Masculino , Modelos Genéticos , Método de Monte Carlo , Linhagem , Fenótipo , Carne Vermelha , Desmame

15.

Single-step genome-wide association studies (GWAS) and post-GWAS analyses to identify genomic regions and candidate genes for milk yield in Brazilian Girolando cattle.

Otto, Pamela I; Guimarães, Simone E F; Calus, Mario P L; Vandenplas, Jeremie; Machado, Marco A; Panetto, João Cláudio C; da Silva, Marcos Vinícius G B.

J Dairy Sci ; 103(11): 10347-10360, 2020 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-32896396

RESUMO

Milk production is economically important to the Brazilian agribusiness, and the majority of the country's milk production derives from Girolando (Gir × Holstein) cows. This study aimed to identify quantitative trait loci (QTL) and candidate genes associated with 305-d milk yield (305MY) in Girolando cattle. In addition, we investigated the SNP-specific variances for Holstein and Gir breeds of origin within the sequence of candidate genes. A single-step genomic BLUP procedure was used to identify QTL associated with 305MY, and the most likely candidate genes were identified through follow-up analyses. Genomic breeding values specific for Holstein and Gir were estimated in the Girolando animals using a model that uses breed-specific partial relationship matrices, which were converted to breed of origin SNP effects. Differences between breed of origin were evaluated by comparing estimated SNP variances between breeds. From 10 genome regions explaining most additive genetic variance for 305MY in Girolando cattle, 7 candidate genes were identified on chromosomes 1, 4, 6, and 26. Within the sequence of these 7 candidate genes, Gir breed of origin SNP alleles showed the highest genetic variance. These results indicated QTL regions that could be further explored in genomic selection panels and which may also help in understanding the gene mechanisms involved in milk production in the Girolando breed.

Assuntos

Bovinos/genética , Estudo de Associação Genômica Ampla/veterinária , Genômica , Leite/metabolismo , Locos de Características Quantitativas/genética , Alelos , Animais , Brasil , Cruzamento , Bovinos/fisiologia , Feminino , Fenótipo , Polimorfismo de Nucleotídeo Único/genética

16.

A second-level diagonal preconditioner for single-step SNPBLUP.

Vandenplas, Jeremie; Calus, Mario P L; Eding, Herwin; Vuik, Cornelis.

Genet Sel Evol ; 51(1): 30, 2019 Jun 25.

Artigo em Inglês | MEDLINE | ID: mdl-31238880

RESUMO

BACKGROUND: The preconditioned conjugate gradient (PCG) method is an iterative solver of linear equations systems commonly used in animal breeding. However, the PCG method has been shown to encounter convergence issues when applied to single-step single nucleotide polymorphism BLUP (ssSNPBLUP) models. Recently, we proposed a deflated PCG (DPCG) method for solving ssSNPBLUP efficiently. The DPCG method introduces a second-level preconditioner that annihilates the effect of the largest unfavourable eigenvalues of the ssSNPBLUP preconditioned coefficient matrix on the convergence of the iterative solver. While it solves the convergence issues of ssSNPBLUP, the DPCG method requires substantial additional computations, in comparison to the PCG method. Accordingly, the aim of this study was to develop a second-level preconditioner that decreases the largest eigenvalues of the ssSNPBLUP preconditioned coefficient matrix at a lower cost than the DPCG method, in addition to comparing its performance to the (D)PCG methods applied to two different ssSNPBLUP models. RESULTS: Based on the properties of the ssSNPBLUP preconditioned coefficient matrix, we proposed a second-level diagonal preconditioner that decreases the largest eigenvalues of the ssSNPBLUP preconditioned coefficient matrix under some conditions. This proposed second-level preconditioner is easy to implement in current software and does not result in additional computing costs as it can be combined with the commonly used (block-)diagonal preconditioner. Tested on two different datasets and with two different ssSNPBLUP models, the second-level diagonal preconditioner led to a decrease of the largest eigenvalues and the condition number of the preconditioned coefficient matrices. It resulted in an improvement of the convergence pattern of the iterative solver. For the largest dataset, the convergence of the PCG method with the proposed second-level diagonal preconditioner was slower than the DPCG method, but it performed better than the DPCG method in terms of total computing time. CONCLUSIONS: The proposed second-level diagonal preconditioner can improve the convergence of the (D)PCG methods applied to two ssSNPBLUP models. Based on our results, the PCG method combined with the proposed second-level diagonal preconditioner seems to be more efficient than the DPCG method in solving ssSNPBLUP. However, the optimal combination of ssSNPBLUP and solver will most likely be situation-dependent.

Assuntos

Bovinos/genética , Interpretação Estatística de Dados , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Software , Animais , Cruzamento , Conjuntos de Dados como Assunto

17.

Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies.

van den Berg, Sanne; Vandenplas, Jérémie; van Eeuwijk, Fred A; Bouwman, Aniek C; Lopes, Marcos S; Veerkamp, Roel F.

Genet Sel Evol ; 51(1): 2, 2019 Jan 24.

Artigo em Inglês | MEDLINE | ID: mdl-30678638

RESUMO

BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R2) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R2 of 0.39 (0.49). After quality control, 2.5 × 106 (3.5 × 106) SNPs had a Beagle R2 higher than 0.6, resulting in an average Beagle R2 of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping.

Assuntos

Estudo de Associação Genômica Ampla/métodos , Suínos/genética , Sequenciamento Completo do Genoma/métodos , Animais , Estudo de Associação Genômica Ampla/normas , Estudo de Associação Genômica Ampla/veterinária , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Sequenciamento Completo do Genoma/normas , Sequenciamento Completo do Genoma/veterinária

18.

Genome-wide association studies for heat stress response in Bos taurus × Bos indicus crossbred cattle.

Otto, Pamela I; Guimarães, Simone E F; Verardo, Lucas L; Azevedo, Ana Luísa S; Vandenplas, Jeremie; Sevillano, Claudia A; Marques, Daniele B D; Pires, Maria de Fatima A; de Freitas, Célio; Verneque, Rui S; Martins, Marta Fonseca; Panetto, João Cláudio C; Carvalho, Wanessa A; Gobo, Diego O R; da Silva, Marcos Vinícius G B; Machado, Marco A.

J Dairy Sci ; 102(9): 8148-8158, 2019 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-31279558

RESUMO

Heat stress is an important issue in the global dairy industry. In tropical areas, an alternative to overcome heat stress is the use of crossbred animals or synthetic breeds, such as the Girolando. In this study, we performed a genome-wide association study (GWAS) and post-GWAS analyses for heat stress in an experimental Gir × Holstein F2 population. Rectal temperature (RT) was measured in heat-stressed F2 animals, and the variation between 2 consecutive RT measurements (ΔRT) was used as the dependent variable. Illumina BovineSNP50v1 BeadChip (Illumina Inc., San Diego, CA) and single-SNP approach were used for GWAS. Post-GWAS analyses were performed by gene ontology terms enrichment and gene-transcription factor (TF) networks, generated from enriched TF. The breed origin of marker alleles in the F2 population was assigned using the breed of origin of alleles (BOA) approach. Heritability and repeatability estimates (± standard error) for ΔRT were 0.13 ± 0.08 and 0.29 ± 0.06, respectively. Association analysis revealed 6 SNP significantly associated with ΔRT. Genes involved with biological processes in response to heat stress effects (LIF, OSM, TXNRD2, and DGCR8) were identified as putative candidate genes. After performing the BOA approach, the 10% of F2 animals with the lowest breeding values for ΔRT were classified as low-ΔRT, and the 10% with the highest breeding values for ΔRT were classified as high-ΔRT. On average, 49.4% of low-ΔRT animals had 2 alleles from the Holstein breed (HH), and 39% had both alleles from the Gir breed (GG). In high-ΔRT animals, the average proportion of animals for HH and GG were 1.4 and 50.2%, respectively. This study allowed the identification of candidate genes for ΔRT in Gir × Holstein crossbred animals. According to the BOA approach, Holstein breed alleles could be associated with better response to heat stress effects, which could be explained by the fact that Holstein animals are more affected by heat stress than Gir animals and thus require a genetic architecture to defend the body from the deleterious effects of heat stress. Future studies can provide further knowledge to uncover the genetic architecture underlying heat stress in crossbred cattle.

Assuntos

Bovinos/genética , Redes Reguladoras de Genes , Estudo de Associação Genômica Ampla/veterinária , Resposta ao Choque Térmico/genética , Locos de Características Quantitativas/genética , Alelos , Animais , Cruzamento , Bovinos/fisiologia , Indústria de Laticínios , Feminino , Masculino

19.

Significance testing and genomic inflation factor using high-density genotypes or whole-genome sequence data.

van den Berg, Sanne; Vandenplas, Jérémie; van Eeuwijk, Fred A; Lopes, Marcos S; Veerkamp, Roel F.

J Anim Breed Genet ; 136(6): 418-429, 2019 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-31215703

RESUMO

Significance testing for genome-wide association study (GWAS) with increasing SNP density up to whole-genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole-genome sequence data, fitting a genomic relationship matrix based on a leave-one-chromosome-out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole-genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini-Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave-one-chromosome-out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests.

Assuntos

Estudo de Associação Genômica Ampla , Genômica , Genótipo , Sequenciamento Completo do Genoma , Animais , Cruzamento , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas/genética , Suínos/genética

20.

SNPrune: an efficient algorithm to prune large SNP array and sequence datasets based on high linkage disequilibrium.

Calus, Mario P L; Vandenplas, Jérémie.

Genet Sel Evol ; 50(1): 34, 2018 06 26.

Artigo em Inglês | MEDLINE | ID: mdl-29940846

RESUMO

BACKGROUND: High levels of pairwise linkage disequilibrium (LD) in single nucleotide polymorphism (SNP) array or whole-genome sequence data may affect both performance and efficiency of genomic prediction models. Thus, this warrants pruning of genotyping data for high LD. We developed an algorithm, named SNPrune, which enables the rapid detection of any pair of SNPs in complete or high LD throughout the genome. METHODS: LD, measured as the squared correlation between phased alleles (r2), can only reach a value of 1 when both loci have the same count of the minor allele. Sorting loci based on the minor allele count, followed by comparison of their alleles, enables rapid detection of loci in complete LD. Detection of loci in high LD can be optimized by computing the range of the minor allele count at another locus for each possible value of the minor allele count that can yield LD values higher than a predefined threshold. This efficiently reduces the number of pairs of loci for which LD needs to be computed, instead of considering all pairwise combinations of loci. The implemented algorithm SNPrune considered bi-allelic loci either using phased alleles or allele counts as input. SNPrune was validated against PLINK on two datasets, using an r2 threshold of 0.99. The first dataset contained 52k SNP genotypes on 3534 pigs and the second dataset contained simulated whole-genome sequence data with 10.8 million SNPs and 2500 animals. RESULTS: SNPrune removed a similar number of SNPs as PLINK from the pig data but SNPrune was almost 12 times faster than PLINK. From the simulated sequence data with 10.8 million SNPs, SNPrune removed 6.4 and 1.4 million SNPs due to complete and high LD. Results were very similar regardless of whether phased alleles or allele counts were used. Using allele counts and multi-threading with 10 threads, SNPrune completed the analysis in 21 min. Using a sliding window of up to 500,000 SNPs, PLINK removed ~ 43,000 less SNPs (0.6%) in the sequence data and SNPrune was 24 to 170 times faster, using one or ten threads, respectively. CONCLUSIONS: The SNPrune algorithm developed here is able to remove SNPs in high LD throughout the genome very efficiently in large datasets.

Assuntos

Algoritmos , Desequilíbrio de Ligação , Sequenciamento Completo do Genoma/veterinária , Animais , Bases de Dados Genéticas , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Suínos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA