Rechercher | Portail Régional BVS

1.

Comparison of genomic prediction accuracy using different models for egg production traits in Taiwan country chicken.

Tu, Tsung-Che; Lin, Chen-Jyuan; Liu, Ming-Che; Hsu, Zhi-Ting; Chen, Chih-Feng.

Poult Sci ; 103(10): 104063, 2024 Jul 06.

Article de Anglais | MEDLINE | ID: mdl-39098301

RÉSUMÉ

In local chickens targeted for niche markets, genotyping costs are relatively high due to the small population size and diverse breeding goals. The single-step genomic best linear unbiased prediction (ssGBLUP) model, which combines pedigree and genomic information, has been introduced to increase the accuracy of genomic estimated breeding value (GEBV). Therefore, this model may be more beneficial than the genomic BLUP (GBLUP) model for genomic selection in local chickens. Additionally, the single-step genome-wide association study (ssGWAS) can be used to extend the ssGBLUP model results to animals with available phenotypic information but without genotypic data. In this study, we compared the accuracy of (G)EBVs using the pedigree-based BLUP (PBLUP), GBLUP, and ssGBLUP models. Moreover, we conducted single-SNP GWAS (SNP-GWAS), GBLUP-GWAS, and ssGWAS methods to identify genes associated with egg production traits in the NCHU-G101 chicken to understand the feasibility of using genomic selection in a small population. The average prediction accuracy of (G)EBV for egg production traits using the PBLUP, GBLUP, and ssGBLUP models is 0.536, 0.531, and 0.555, respectively. In total, 22 suggestive- and 5% Bonferroni genome-wide significant-level SNPs for total egg number (EN), average laying rate (LR), average clutch length, and total clutch number are detected using 3 GWAS methods. These SNPs are mapped onto Gallus gallus chromosomes (GGA) 4, 6, 10, 18, and 25 in NCHU-G101 chicken. Furthermore, through SNP-GWAS and ssGWAS methods, we identify 2 genes on GGA4 associated with EN and LR: ENSGALG00000023172 and PPARGC1A. In conclusion, the ssGBLUP model demonstrates superior prediction accuracy, performing on average 3.41% than the PBLUP model. The implications of our gene results may guide future selection strategies for Taiwan Country chickens. Our results highlight the applicability of the ssGBLUP model for egg production traits selection in a small population, specifically NCHU-G101 chicken in Taiwan.

2.

Enhancing genomic prediction with Stacking Ensemble Learning in Arabica Coffee.

Nascimento, Moyses; Nascimento, Ana Carolina Campana; Azevedo, Camila Ferreira; de Oliveira, Antonio Carlos Baiao; Caixeta, Eveline Teixeira; Jarquin, Diego.

Front Plant Sci ; 15: 1373318, 2024.

Article de Anglais | MEDLINE | ID: mdl-39086911

RÉSUMÉ

Coffee Breeding programs have traditionally relied on observing plant characteristics over years, a slow and costly process. Genomic selection (GS) offers a DNA-based alternative for faster selection of superior cultivars. Stacking Ensemble Learning (SEL) combines multiple models for potentially even more accurate selection. This study explores SEL potential in coffee breeding, aiming to improve prediction accuracy for important traits [yield (YL), total number of the fruits (NF), leaf miner infestation (LM), and cercosporiosis incidence (Cer)] in Coffea Arabica. We analyzed data from 195 individuals genotyped for 21,211 single-nucleotide polymorphism (SNP) markers. To comprehensively assess model performance, we employed a cross-validation (CV) scheme. Genomic Best Linear Unbiased Prediction (GBLUP), multivariate adaptive regression splines (MARS), Quantile Random Forest (QRF), and Random Forest (RF) served as base learners. For the meta-learner within the SEL framework, various options were explored, including Ridge Regression, RF, GBLUP, and Single Average. The SEL method was able to predict the predictive ability (PA) of important traits in Coffea Arabica. SEL presented higher PA compared with those obtained for all base learner methods. The gains in PA in relation to GBLUP were 87.44% (the ratio between the PA obtained from best Stacking model and the GBLUP), 37.83%, 199.82%, and 14.59% for YL, NF, LM and Cer, respectively. Overall, SEL presents a promising approach for GS. By combining predictions from multiple models, SEL can potentially enhance the PA of GS for complex traits.

3.

To be or not to be tetraploid-the impact of marker ploidy on genomic prediction and GWAS of potato.

Aalborg, Trine; Nielsen, Kåre Lehmann.

Front Plant Sci ; 15: 1386837, 2024.

Article de Anglais | MEDLINE | ID: mdl-39139728

RÉSUMÉ

Cultivated potato, Solanum tuberosum L., is considered an autotetraploid with 12 chromosomes with four homologous phases. However, recent evidence found that, due to frequent large phase deletions in the genome, gene ploidy is not constant across the genome. The elite cultivar "Otava" was found to have an average gene copy number of 3.2 across all loci. Breeding programs for elite potato cultivars rely increasingly on genomic prediction tools for selection breeding and elucidation of quantitative trait loci underpinning trait genetic variance. These are typically based on anonymous single nucleotide polymorphism (SNP) markers, which are usually called from, for example, SNP array or sequencing data using a tetraploid model. In this study, we analyzed the impact of using whole genome markers genotyped as either tetraploid or observed allele frequencies from genotype-by-sequencing data on single-trait additive genomic best linear unbiased prediction (GBLUP) genomic prediction (GP) models and single-marker regression genome-wide association studies of potato to evaluate the implications of capturing varying ploidy on the statistical models employed in genomic breeding. A panel of 762 offspring of a diallel cross of 18 parents of elite breeding material was used for modeling. These were genotyped by sequencing and phenotyped for five key performance traits: chipping quality, length/width ratio, senescence, dry matter content, and yield. We also estimated the read coverage required to confidently discriminate between a heterozygous triploid and tetraploid state from simulated data. It was found that using a tetraploid model neither impaired nor improved genomic predictions compared to using the observed allele frequencies that account for true marker ploidy. In genome-wide associations studies (GWAS), very minor variations of both signal amplitude and number of SNPs supporting both minor and major quantitative trait loci (QTLs) were observed between the two data sets. However, all major QTLs were reproducible using both data sets.

4.

Single nucleotide polymorphism profile for quantitative trait nucleotide in populations with small effective size and its impact on mapping and genomic predictions.

Pocrnic, Ivan; Lourenco, Daniela; Misztal, Ignacy.

Genetics ; 227(4)2024 Aug 07.

Article de Anglais | MEDLINE | ID: mdl-38913695

RÉSUMÉ

Increasing SNP density by incorporating sequence information only marginally increases prediction accuracies of breeding values in livestock. To find out why, we used statistical models and simulations to investigate the shape of distribution of estimated SNP effects (a profile) around quantitative trait nucleotides (QTNs) in populations with a small effective population size (Ne). A QTN profile created by averaging SNP effects around each QTN was similar to the shape of expected pairwise linkage disequilibrium (PLD) based on Ne and genetic distance between SNP, with a distinct peak for the QTN. Populations with smaller Ne showed lower but wider QTN profiles. However, adding more genotyped individuals with phenotypes dragged the profile closer to the QTN. The QTN profile was higher and narrower for populations with larger compared to smaller Ne. Assuming the PLD curve for the QTN profile, 80% of the additive genetic variance explained by each QTN was contained in ± 1/Ne Morgan interval around the QTN, corresponding to 2 Mb in cattle and 5 Mb in pigs and chickens. With such large intervals, identifying QTN is difficult even if all of them are in the data and the assumed genetic architecture is simplistic. Additional complexity in QTN detection arises from confounding of QTN profiles with signals due to relationships, overlapping profiles with closely spaced QTN, and spurious signals. However, small Ne allows for accurate predictions with large data even without QTN identification because QTNs are accounted for by QTN profiles if SNP density is sufficient to saturate the segments.

Sujet(s)

Déséquilibre de liaison , Modèles génétiques , Polymorphisme de nucléotide simple , Locus de caractère quantitatif , Animaux , Bovins/génétique , Cartographie chromosomique/méthodes , Densité de population , Poulets/génétique , Suidae/génétique , Génomique/méthodes , Caractère quantitatif héréditaire

5.

Exploring unknown parent groups and metafounders in single-step genomic BLUP: Insights from a simulated cattle population.

Himmelbauer, Judith; Schwarzenbacher, Hermann; Fuerst, Christian; Fuerst-Waltl, Birgit.

J Dairy Sci ; 2024 Jun 20.

Article de Anglais | MEDLINE | ID: mdl-38908687

RÉSUMÉ

This study explores how the metafounder (MF) concept enhances genetic evaluations in dairy cattle populations using single-step genomic best linear unbiased prediction (ssGBLUP). By improving the consideration of relationships among founder populations, MF ensures accurate alignment of pedigree and genomic relationships. The research aims to propose a method for grouping MF based on genotypic information, assess different approaches for estimating the gamma matrix, and compare unknown parent groups (UPG) and MF methodologies across various scenarios, including those with low and high pedigree completeness based on a simulated dairy cattle population. In the scenario where unknown ancestors are rare, the impact of UPG or MF on breeding values is minimal but MF still performs slightly better compared with UPG. The scenario with lower genotyping rates and more unknown parents shows significant differences in evaluations with and without UPG and also compared with MF. The study shows that ssGBLUP evaluations where UPG are considered via Quaas-Pollak-transformation in the pedigree-based and genomic relationship matrix (UPG_fullQP) results in double counting and subsequently in a pronounced bias and overdispersion. Another focus is on the estimation of the gamma matrix, emphasizing the importance of crossbred genotypes for accuracy. Challenges emerge in classifying animals into subpopulations and further into MF or UPG, but the method used in this study, which is based on genotypes, results in predictions which are comparable to those obtained using the true subpopulations for the assignment. Estimated validation results using the linear regression method confirm the superior performance of MF evaluations, although differences compared with true validations are smaller. Notably, UPG_fullQP's extreme bias is less evident in routine validation statistics.

6.

Ability of Genomic Prediction to Bi-Parent-Derived Breeding Population Using Public Data for Soybean Oil and Protein Content.

Li, Chenhui; Yang, Qing; Liu, Bingqiang; Shi, Xiaolei; Liu, Zhi; Yang, Chunyan; Wang, Tao; Xiao, Fuming; Zhang, Mengchen; Shi, Ainong; Yan, Long.

Plants (Basel) ; 13(9)2024 Apr 30.

Article de Anglais | MEDLINE | ID: mdl-38732474

RÉSUMÉ

Genomic selection (GS) is a marker-based selection method used to improve the genetic gain of quantitative traits in plant breeding. A large number of breeding datasets are available in the soybean database, and the application of these public datasets in GS will improve breeding efficiency and reduce time and cost. However, the most important problem to be solved is how to improve the ability of across-population prediction. The objectives of this study were to perform genomic prediction (GP) and estimate the prediction ability (PA) for seed oil and protein contents in soybean using available public datasets to predict breeding populations in current, ongoing breeding programs. In this study, six public datasets of USDA GRIN soybean germplasm accessions with available phenotypic data of seed oil and protein contents from different experimental populations and their genotypic data of single-nucleotide polymorphisms (SNPs) were used to perform GP and to predict a bi-parent-derived breeding population in our experiment. The average PA was 0.55 and 0.50 for seed oil and protein contents within the bi-parents population according to the within-population prediction; and 0.45 for oil and 0.39 for protein content when the six USDA populations were combined and employed as training sets to predict the bi-parent-derived population. The results showed that four USDA-cultivated populations can be used as a training set individually or combined to predict oil and protein contents in GS when using 800 or more USDA germplasm accessions as a training set. The smaller the genetic distance between training population and testing population, the higher the PA. The PA increased as the population size increased. In across-population prediction, no significant difference was observed in PA for oil and protein content among different models. The PA increased as the SNP number increased until a marker set consisted of 10,000 SNPs. This study provides reasonable suggestions and methods for breeders to utilize public datasets for GS. It will aid breeders in developing GS-assisted breeding strategies to develop elite soybean cultivars with high oil and protein contents.

7.

Genomic prediction based on a joint reference population for the Xinjiang Brown cattle.

Zhang, Menghua; Xu, Lei; Lu, Haibo; Luo, Hanpeng; Zhou, Jinghang; Wang, Dan; Zhang, Xiaoxue; Huang, Xixia; Wang, Yachun.

Front Genet ; 15: 1394636, 2024.

Article de Anglais | MEDLINE | ID: mdl-38737126

RÉSUMÉ

Introduction: Xinjiang Brown cattle constitute the largest breed of cattle in Xinjiang. Therefore, it is crucial to establish a genomic evaluation system, especially for those with low levels of breed improvement. Methods: This study aimed to establish a cross breed joint reference population by analyzing the genetic structure of 485 Xinjiang Brown cattle and 2,633 Chinese Holstein cattle (Illumina GeneSeek GGP bovine 150 K chip). The Bayes method single-step genome-wide best linear unbiased prediction was used to conduct a genomic evaluation of the joint reference population for the milk traits of Xinjiang Brown cattle. The reference population of Chinese Holstein cattle was randomly divided into groups to construct the joint reference population. By comparing the prediction accuracy, estimation bias, and inflation coefficient of the validation population, the optimal number of joint reference populations was determined. Results and Discussion: The results indicated a distinct genetic structure difference between the two breeds of adult cows, and both breeds should be considered when constructing multi-breed joint reference and validation populations. The reliability range of genome prediction of milk traits in the joint reference population was 0.142-0.465. Initially, it was determined that the inclusion of 600 and 900 Chinese Holstein cattle in the joint reference population positively impacted the genomic prediction of Xinjiang Brown cattle to certain extent. It was feasible to incorporate the Chinese Holstein into Xinjiang Brown cattle population to form a joint reference population for multi-breed genomic evaluation. However, for different Xinjiang Brown cattle populations, a fixed number of Chinese Holstein cattle cannot be directly added during multi-breed genomic selection. Pre-evaluation analysis based on the genetic structure, kinship, and other factors of the current population is required to ensure the authenticity and reliability of genomic predictions and improve estimation accuracy.

8.

Genomic prediction for sugarcane diseases including hybrid Bayesian-machine learning approaches.

Chen, Chensong; Bhuiyan, Shamsul A; Ross, Elizabeth; Powell, Owen; Dinglasan, Eric; Wei, Xianming; Atkin, Felicity; Deomano, Emily; Hayes, Ben.

Front Plant Sci ; 15: 1398903, 2024.

Article de Anglais | MEDLINE | ID: mdl-38751840

RÉSUMÉ

Sugarcane smut and Pachymetra root rots are two serious diseases of sugarcane, with susceptible infected crops losing over 30% of yield. A heritable component to both diseases has been demonstrated, suggesting selection could improve disease resistance. Genomic selection could accelerate gains even further, enabling early selection of resistant seedlings for breeding and clonal propagation. In this study we evaluated four types of algorithms for genomic predictions of clonal performance for disease resistance. These algorithms were: Genomic best linear unbiased prediction (GBLUP), including extensions to model dominance and epistasis, Bayesian methods including BayesC and BayesR, Machine learning methods including random forest, multilayer perceptron (MLP), modified convolutional neural network (CNN) and attention networks designed to capture epistasis across the genome-wide markers. Simple hybrid methods, that first used BayesR/GWAS to identify a subset of 1000 markers with moderate to large marginal additive effects, then used attention networks to derive predictions from these effects and their interactions, were also developed and evaluated. The hypothesis for this approach was that using a subset of markers more likely to have an effect would enable better estimation of interaction effects than when there were an extremely large number of possible interactions, especially with our limited data set size. To evaluate the methods, we applied both random five-fold cross-validation and a structured PCA based cross-validation that separated 4702 sugarcane clones (that had disease phenotypes and genotyped for 26k genome wide SNP markers) by genomic relationship. The Bayesian methods (BayesR and BayesC) gave the highest accuracy of prediction, followed closely by hybrid methods with attention networks. The hybrid methods with attention networks gave the lowest variation in accuracy of prediction across validation folds (and lowest MSE), which may be a criteria worth considering in practical breeding programs. This suggests that hybrid methods incorporating the attention mechanism could be useful for genomic prediction of clonal performance, particularly where non-additive effects may be important.

9.

Comparison of ddRADseq and EUChip60K SNP genotyping systems for population genetics and genomic selection in Eucalyptus dunnii (Maiden).

Aguirre, Natalia Cristina; Villalba, Pamela Victoria; García, Martín Nahuel; Filippi, Carla Valeria; Rivas, Juan Gabriel; Martínez, María Carolina; Acuña, Cintia Vanesa; López, Augusto J; López, Juan Adolfo; Pathauer, Pablo; Palazzini, Dino; Harrand, Leonel; Oberschelp, Javier; Marcó, Martín Alberto; Cisneros, Esteban Felipe; Carreras, Rocío; Martins Alves, Ana Maria; Rodrigues, José Carlos; Hopp, H Esteban; Grattapaglia, Dario; Cappa, Eduardo Pablo; Paniego, Norma Beatriz; Marcucci Poltri, Susana Noemí.

Front Genet ; 15: 1361418, 2024.

Article de Anglais | MEDLINE | ID: mdl-38606359

RÉSUMÉ

Eucalyptus dunnii is one of the most important Eucalyptus species for short-fiber pulp production in regions where other species of the genus are affected by poor soil and climatic conditions. In this context, E. dunnii holds promise as a resource to address and adapt to the challenges of climate change. Despite its rapid growth and favorable wood properties for solid wood products, the advancement of its improvement remains in its early stages. In this work, we evaluated the performance of two single nucleotide polymorphism, (SNP), genotyping methods for population genetics analysis and Genomic Selection in E. dunnii. Double digest restriction-site associated DNA sequencing (ddRADseq) was compared with the EUChip60K array in 308 individuals from a provenance-progeny trial. The compared SNP set included 8,011 and 19,008 informative SNPs distributed along the 11 chromosomes, respectively. Although the two datasets differed in the percentage of missing data, genome coverage, minor allele frequency and estimated genetic diversity parameters, they revealed a similar genetic structure, showing two subpopulations with little differentiation between them, and low linkage disequilibrium. GS analyses were performed for eleven traits using Genomic Best Linear Unbiased Prediction (GBLUP) and a conventional pedigree-based model (ABLUP). Regardless of the SNP dataset, the predictive ability (PA) of GBLUP was better than that of ABLUP for six traits (Cellulose content, Total and Ethanolic extractives, Total and Klason lignin content and Syringyl and Guaiacyl lignin monomer ratio). When contrasting the SNP datasets used to estimate PAs, the GBLUP-EUChip60K model gave higher and significant PA values for six traits, meanwhile, the values estimated using ddRADseq gave higher values for three other traits. The PAs correlated positively with narrow sense heritabilities, with the highest correlations shown by the ABLUP and GBLUP-EUChip60K. The two genotyping methods, ddRADseq and EUChip60K, are generally comparable for population genetics and genomic prediction, demonstrating the utility of the former when subjected to rigorous SNP filtering. The results of this study provide a basis for future whole-genome studies using ddRADseq in non-model forest species for which SNP arrays have not yet been developed.

10.

Marker Density and Models to Improve the Accuracy of Genomic Selection for Growth and Slaughter Traits in Meat Rabbits.

Li, Wenjie; Li, Wenqiang; Song, Zichen; Gao, Zihao; Xie, Kerui; Wang, Yubing; Wang, Bo; Hu, Jiaqing; Zhang, Qin; Ning, Chao; Wang, Dan; Fan, Xinzhong.

Genes (Basel) ; 15(4)2024 04 03.

Article de Anglais | MEDLINE | ID: mdl-38674388

RÉSUMÉ

The selection and breeding of good meat rabbit breeds are fundamental to their industrial development, and genomic selection (GS) can employ genomic information to make up for the shortcomings of traditional phenotype-based breeding methods. For the practical implementation of GS in meat rabbit breeding, it is necessary to assess different marker densities and GS models. Here, we obtained low-coverage whole-genome sequencing (lcWGS) data from 1515 meat rabbits (including parent herd and half-sibling offspring). The specific objectives were (1) to derive a baseline for heritability estimates and genomic predictions based on randomly selected marker densities and (2) to assess the accuracy of genomic predictions for single- and multiple-trait linear mixed models. We found that a marker density of 50 K can be used as a baseline for heritability estimation and genomic prediction. For GS, the multi-trait genomic best linear unbiased prediction (GBLUP) model results in more accurate predictions for virtually all traits compared to the single-trait model, with improvements greater than 15% for all of them, which may be attributed to the use of information on genetically related traits. In addition, we discovered a positive correlation between the performance of the multi-trait GBLUP and the genetic correlation between the traits. We anticipate that this approach will provide solutions for GS, as well as optimize breeding programs, in meat rabbits.

Sujet(s)

Génomique , Viande , Animaux , Lapins/génétique , Génomique/méthodes , Sélection génétique , Phénotype , Sélection/méthodes , Modèles génétiques , Caractère quantitatif héréditaire , Marqueurs génétiques , Séquençage du génome entier/méthodes , Locus de caractère quantitatif , Génome/génétique

11.

The effect of marker types and density on genomic prediction and GWAS of key performance traits in tetraploid potato.

Aalborg, Trine; Sverrisdóttir, Elsa; Kristensen, Heidi Thorgaard; Nielsen, Kåre Lehmann.

Front Plant Sci ; 15: 1340189, 2024.

Article de Anglais | MEDLINE | ID: mdl-38525152

RÉSUMÉ

Genomic prediction and genome-wide association studies are becoming widely employed in potato key performance trait QTL identifications and to support potato breeding using genomic selection. Elite cultivars are tetraploid and highly heterozygous but also share many common ancestors and generation-spanning inbreeding events, resulting from the clonal propagation of potatoes through seed potatoes. Consequentially, many SNP markers are not in a 1:1 relationship with a single allele variant but shared over several alleles that might exert varying effects on a given trait. The impact of such redundant "diluted" predictors on the statistical models underpinning genome-wide association studies (GWAS) and genomic prediction has scarcely been evaluated despite the potential impact on model accuracy and performance. We evaluated the impact of marker location, marker type, and marker density on the genomic prediction and GWAS of five key performance traits in tetraploid potato (chipping quality, dry matter content, length/width ratio, senescence, and yield). A 762-offspring panel of a diallel cross of 18 elite cultivars was genotyped by sequencing, and markers were annotated according to a reference genome. Genomic prediction models (GBLUP) were trained on four marker subsets [non-synonymous (29,553 SNPs), synonymous (31,229), non-coding (32,388), and a combination], and robustness to marker reduction was investigated. Single-marker regression GWAS was performed for each trait and marker subset. The best cross-validated prediction correlation coefficients of 0.54, 0.75, 0.49, 0.35, and 0.28 were obtained for chipping quality, dry matter content, length/width ratio, senescence, and yield, respectively. The trait prediction abilities were similar across all marker types, with only non-synonymous variants improving yield predictive ability by 16%. Marker reduction response did not depend on marker type but rather on trait. Traits with high predictive abilities, e.g., dry matter content, reached a plateau using fewer markers than traits with intermediate-low correlations, such as yield. The predictions were unbiased across all traits, marker types, and all marker densities >100 SNPs. Our results suggest that using non-synonymous variants does not enhance the performance of genomic prediction of most traits. The major known QTLs were identified by GWAS and were reproducible across exonic and whole-genome variant sets for dry matter content, length/width ratio, and senescence. In contrast, minor QTL detection was marker type dependent.

12.

Deep learning methods improve genomic prediction of wheat breeding.

Montesinos-López, Abelardo; Crespo-Herrera, Leonardo; Dreisigacker, Susanna; Gerard, Guillermo; Vitale, Paolo; Saint Pierre, Carolina; Govindan, Velu; Tarekegn, Zerihun Tadesse; Flores, Moisés Chavira; Pérez-Rodríguez, Paulino; Ramos-Pulido, Sofía; Lillemo, Morten; Li, Huihui; Montesinos-López, Osval A; Crossa, Jose.

Front Plant Sci ; 15: 1324090, 2024.

Article de Anglais | MEDLINE | ID: mdl-38504889

RÉSUMÉ

In the field of plant breeding, various machine learning models have been developed and studied to evaluate the genomic prediction (GP) accuracy of unseen phenotypes. Deep learning has shown promise. However, most studies on deep learning in plant breeding have been limited to small datasets, and only a few have explored its application in moderate-sized datasets. In this study, we aimed to address this limitation by utilizing a moderately large dataset. We examined the performance of a deep learning (DL) model and compared it with the widely used and powerful best linear unbiased prediction (GBLUP) model. The goal was to assess the GP accuracy in the context of a five-fold cross-validation strategy and when predicting complete environments using the DL model. The results revealed the DL model outperformed the GBLUP model in terms of GP accuracy for two out of the five included traits in the five-fold cross-validation strategy, with similar results in the other traits. This indicates the superiority of the DL model in predicting these specific traits. Furthermore, when predicting complete environments using the leave-one-environment-out (LOEO) approach, the DL model demonstrated competitive performance. It is worth noting that the DL model employed in this study extends a previously proposed multi-modal DL model, which had been primarily applied to image data but with small datasets. By utilizing a moderately large dataset, we were able to evaluate the performance and potential of the DL model in a context with more information and challenging scenario in plant breeding.

13.

Impacts of reference population size and methods on the accuracy of genomic prediction for fleece traits in Inner Mongolia Cashmere Goats.

Yan, Xiaochun; Zhang, Jiaxin; Li, Jinquan; Wang, Na; Su, Rui; Wang, Zhiying.

Front Vet Sci ; 11: 1325831, 2024.

Article de Anglais | MEDLINE | ID: mdl-38374988

RÉSUMÉ

Introduction: Inner Mongolia Cashmere Goats (IMCGs) are famous for its cashmere quality and it's a unique genetic resource in China. Therefore, it is necessary to use genomic selection to improve the accuracy of selection for fleece traits in Inner Mongolia cashmere goats. The aim of this study was to determine the effect of methods (GBLUP, BayesA, BayesB, Bayesian LASSO, Bayesian Ridge Region) and the reference population size on accuracy of genomic selection in IMCGs. Methods: This study fully utilizes the pedigree and phenotype records of fleece traits in 2255 individuals, genotype of 50794 SNPs after quality control, and environmental data to perform genomic selection of fleece traits. Then GBLUP and Bayes series methods (BayesA, BayesB, Bayesian LASSO, Bayesian Ridge Region) were used to perform estimates of genetic parameter and genomic breeding value. And the accuracy of genomic estimated breeding value (GEBV) is evaluated using the five-fold cross validation method. And the analysis of variance and multiple comparison methods were used to determine the best method for genomic selection in fleece traits of IMCGs. Further the different reference population sizes (500, 1000, 1500, and 2000) was set. Then the best method was applied to estimate genome breeding values, and evaluate the impact of reference population sizes on the accuracy of genome selection for fleece traits in IMCGs. Results: It was found that the genomic prediction accuracy for each fleece trait in IMCGs by GBLUP method is highest, and it is significantly higher than that obtained by Bayesian method. The accuracy of breeding value estimation is 58.52% -68.49%. Also, it was found that the size of the reference population has a significant impact on the accuracy of genome prediction of fleece traits. When the reference population size is 2000, the accuracy of genomic prediction for each fleece trait is significantly higher than other levels, with accuracy of 55.47% -67.87%. This provides a theoretical basis for design a reasonable genome selection plan for Inner Mongolia cashmere goats in the later stag.

14.

Short Communication: Reduced GBLUP equations to core animals in the algorithm for proven and young (APY).

Nilforooshan, Mohammad Ali.

Vet Anim Sci ; 23: 100334, 2024 Mar.

Article de Anglais | MEDLINE | ID: mdl-38283332

RÉSUMÉ

The number of animal genotypes is rapidly increasing, and a major challenge for animal models is inverting the genomic relationship matrix (G). Matrix G has a limited dimensionality, and the algorithm for proven and young (APY) makes inverting a large G possible via the inverse of a block diagonal of G with a size equivalent to the dimensionality of G. APY divides genotyped animals into core and non-core groups, and breeding values of non-core animals are conditioned on the breeding values of core animals. Therefore, there is the possibility of opting out equations for non-core animals from the model. A methodology was presented for a reduced APY genomic BLUP (GBLUP) to equations for core animals. Using a small example dataset, the method was validated by the equality of the full and the reduced model analysis results. Absorption of fixed effect equations into random effect equations was successful in reducing the number of equations to solve and producing the same random effect solutions. Extending the method to APY single-step GBLUP (ssGBLUP) was not computationally justifiable. Other reduction techniques exist for ssGBLUP (regardless of APY or non-APY) that work by reducing the number of equations for non-genotyped animals. The number of equations can further be reduced by data pruning.

15.

Genomic dissection of additive and non-additive genetic effects and genomic prediction in an open-pollinated family test of Japanese larch.

Dong, Leiming; Xie, Yunhui; Zhang, Yalin; Wang, Ruizhen; Sun, Xiaomei.

BMC Genomics ; 25(1): 11, 2024 Jan 02.

Article de Anglais | MEDLINE | ID: mdl-38166605

RÉSUMÉ

Genomic dissection of genetic effects on desirable traits and the subsequent use of genomic selection hold great promise for accelerating the rate of genetic improvement of forest tree species. In this study, a total of 661 offspring trees from 66 open-pollinated families of Japanese larch (Larix kaempferi (Lam.) Carrière) were sampled at a test site. The contributions of additive and non-additive effects (dominance, imprinting and epistasis) were evaluated for nine valuable traits related to growth, wood physical and chemical properties, and competitive ability using three pedigree-based and four Genomics-based Best Linear Unbiased Predictions (GBLUP) models and used to determine the genetic model. The predictive ability (PA) of two genomic prediction methods, GBLUP and Reproducing Kernel Hilbert Spaces (RKHS), was compared. The traits could be classified into two types based on different quantitative genetic architectures: for type I, including wood chemical properties and Pilodyn penetration, additive effect is the main source of variation (38.20-67.46%); for type II, including growth, competitive ability and acoustic velocity, epistasis plays a significant role (50.76-91.26%). Dominance and imprinting showed low to moderate contributions (< 36.26%). GBLUP was more suitable for traits of type I (PAs = 0.37-0.39 vs. 0.14-0.25), and RKHS was more suitable for traits of type II (PAs = 0.23-0.37 vs. 0.07-0.23). Non-additive effects make no meaningful contribution to the enhancement of PA of GBLUP method for all traits. These findings enhance our current understanding of the architecture of quantitative traits and lay the foundation for the development of genomic selection strategies in Japanese larch.

Sujet(s)

Larix , Larix/génétique , Génotype , Japon , Génome , Génomique/méthodes , Phénotype , Modèles génétiques , Polymorphisme de nucléotide simple

16.

Genomic evaluation for breeding and genetic management in Cordia africana, a multipurpose tropical tree species.

Ousmael, Kedra M; Cappa, Eduardo P; Hansen, Jon K; Hendre, Prasad; Hansen, Ole K.

BMC Genomics ; 25(1): 9, 2024 Jan 02.

Article de Anglais | MEDLINE | ID: mdl-38166623

RÉSUMÉ

BACKGROUND: Planting tested forest reproductive material is crucial to ensure the increased resilience of intensively managed productive stands for timber and wood product markets under climate change scenarios. Single-step Genomic Best Linear Unbiased Prediction (ssGBLUP) analysis is a cost-effective option for using genomic tools to enhance the accuracy of predicted breeding values and genetic parameter estimation in forest tree species. Here, we tested the efficiency of ssGBLUP in a tropical multipurpose tree species, Cordia africana, by partial population genotyping. A total of 8070 trees from three breeding seedling orchards (BSOs) were phenotyped for height. We genotyped 6.1% of the phenotyped individuals with 4373 single nucleotide polymorphisms. The results of ssGBLUP were compared with pedigree-based best linear unbiased prediction (ABLUP) and genomic best linear unbiased prediction (GBLUP), based on genetic parameters, theoretical accuracy of breeding values, selection candidate ranking, genetic gain, and predictive accuracy and prediction bias. RESULTS: Genotyping a subset of the study population provided insights into the level of relatedness in BSOs, allowing better genetic management. Due to the inbreeding detected within the genotyped provenances, we estimated genetic parameters both with and without accounting for inbreeding. The ssGBLUP model showed improved performance in terms of additive genetic variance and theoretical breeding value accuracy. Similarly, ssGBLUP showed improved predictive accuracy and lower bias than the pedigree-based relationship matrix (ABLUP). CONCLUSIONS: This study of C. africana, a species in decline due to deforestation and selective logging, revealed inbreeding depression. The provenance exhibiting the highest level of inbreeding had the poorest overall performance. The use of different relationship matrices and accounting for inbreeding did not substantially affect the ranking of candidate individuals. This is the first study of this approach in a tropical multipurpose tree species, and the analysed BSOs represent the primary effort to breed C. africana.

Sujet(s)

Cordia , Arbres , Humains , Arbres/génétique , Amélioration des plantes , Génome , Génomique/méthodes , Génotype , Phénotype , Modèles génétiques

17.

Realized genomic selection across generations in a reciprocal recurrent selection breeding program of Eucalyptus hybrids.

Simiqueli, Guilherme Ferreira; Resende, Rafael Tassinari; Takahashi, Elizabete Keiko; de Sousa, João Edesio; Grattapaglia, Dario.

Front Plant Sci ; 14: 1252504, 2023.

Article de Anglais | MEDLINE | ID: mdl-37965018

RÉSUMÉ

Introduction: Genomic selection (GS) experiments in forest trees have largely reported estimates of predictive abilities from cross-validation among individuals in the same breeding generation. In such conditions, no effects of recombination, selection, drift, and environmental changes are accounted for. Here, we assessed the effectively realized predictive ability (RPA) for volume growth at harvest age by GS across generations in an operational reciprocal recurrent selection (RRS) program of hybrid Eucalyptus. Methods: Genomic best linear unbiased prediction with additive (GBLUP_G), additive plus dominance (GBLUP_G+D), and additive single-step (HBLUP) models were trained with different combinations of growth data of hybrids and pure species individuals (N = 17,462) of the G1 generation, 1,944 of which were genotyped with ~16,000 SNPs from SNP arrays. The hybrid G2 progeny trial (HPT267) was the GS target, with 1,400 selection candidates, 197 of which were genotyped still at the seedling stage, and genomically predicted for their breeding and genotypic values at the operational harvest age (6 years). Seedlings were then grown to harvest and measured, and their pedigree-based breeding and genotypic values were compared to their originally predicted genomic counterparts. Results: Genomic RPAs ≥0.80 were obtained as the genetic relatedness between G1 and G2 increased, especially when the direct parents of selection candidates were used in training. GBLUP_G+D reached RPAs ≥0.70 only when hybrid or pure species data of G1 were included in training. HBLUP was only marginally better than GBLUP. Correlations ≥0.80 were obtained between pedigree and genomic individual ranks. Rank coincidence of the top 2.5% selections was the highest for GBLUP_G (45% to 60%) compared to GBLUP_G+D. To advance the pure species RRS populations, GS models were best when trained on pure species than hybrid data, and HBLUP yielded ~20% higher predictive abilities than GBLUP, but was not better than ABLUP for ungenotyped trees. Discussion: We demonstrate that genomic data effectively enable accurate ranking of eucalypt hybrid seedlings for their yet-to-be observed volume growth at harvest age. Our results support a two-stage GS approach involving family selection by average genomic breeding value, followed by within-top-families individual GS, significantly increasing selection intensity, optimizing genotyping costs, and accelerating RRS breeding.

18.

Multi-Trait Single-Step Genomic Prediction for Milk Yield and Milk Components for Polish Holstein Population.

Önder, Hasan; Sitskowska, Beata; Kurnaz, Burcu; Piwczynski, Dariusz; Kolenda, Magdalena; Sen, Ugur; Tirink, Cem; Çanga Boga, Demet.

Animals (Basel) ; 13(19)2023 Sep 29.

Article de Anglais | MEDLINE | ID: mdl-37835676

RÉSUMÉ

The objective of our study was to evaluate the predictive ability of a multi-trait genomic prediction model that accounts for interactions between marker effects to estimate heritability and genetic correlations of traits including 305-day milk yield, milk fat percentage, milk protein percentage, milk lactose percentage, and milk dry matter percentage in the Polish Holstein Friesian cow population. For this aim, 14,742 SNP genotype records for 586 Polish Holstein Friesian dairy cows from Poland were used. Single-Trait-ssGBLUP (ST) and Multi-Trait-ssGBLUP (MT) methods were used for estimation. We examined 305-day milk yield (MY, kg), milk fat percentage (MF, %), milk protein percentage (MP, %), milk lactose percentage (ML, %), and milk dry matter percentage (MDM, %). The results showed that the highest marker effect rank correlation was found between milk fat percentage and milk dry matter. The weakest marker effect rank correlation was found between ML and all other traits. Obtained accuracies of this study were between 0.770 and 0.882, and 0.773 and 0.876 for MT and ST, respectively, which were acceptable values. All estimated bias values were positive, which is proof of underestimation. The highest heritability value was obtained for MP (0.3029) and the lowest heritability value was calculated for ML (0.2171). Estimated heritability values were low for milk yield and milk composition as expected. The strongest genetic correlation was estimated between MDM and MF (0.4990) and the weakest genetic correlation was estimated between MY and ML (0.001). The genetic relations with milk yield were negative and can be ignored as they were not significant. In conclusion, multi-trait genomic prediction can be more beneficial than single-trait genomic prediction.

19.

A dimensionality-reduction genomic prediction method without direct inverse of the genomic relationship matrix for large genomic data.

Liu, Hailan; Yu, Shizhou.

Plant Cell Rep ; 42(11): 1825-1832, 2023 Nov.

Article de Anglais | MEDLINE | ID: mdl-37750948

RÉSUMÉ

KEY MESSAGE: A new genomic prediction method (RHPP) was developed via combining randomized Haseman-Elston regression (RHE-reg), PCR based on genomic information of core population, and preconditioned conjugate gradient (PCG) algorithm. Computational efficiency is becoming a hot issue in the practical application of genomic prediction due to the large number of data generated by the high-throughput genotyping technology. In this study, we developed a fast genomic prediction method RHPP via combining randomized Haseman-Elston regression (RHE-reg), PCR based on genomic information of core population, and preconditioned conjugate gradient (PCG) algorithm. The simulation results demonstrated similar prediction accuracy between RHPP and GBLUP, and significantly higher computational efficiency of the former with the increase of individuals. The results of real datasets of both bread wheat and loblolly pine demonstrated that RHPP had a similar or better predictive accuracy in most cases compared with GBLUP. In the future, RHPP may be an attractive choice for analyzing large-scale and high-dimensional data.

20.

Genomic Prediction and Genome-Wide Association Study for Boar Taint Compounds.

Faggion, Sara; Boschi, Elena; Veroneze, Renata; Carnier, Paolo; Bonfatti, Valentina.

Animals (Basel) ; 13(15)2023 Jul 28.

Article de Anglais | MEDLINE | ID: mdl-37570259

RÉSUMÉ

With a perspective future ban on surgical castration in Europe, selecting pigs with reduced ability to accumulate boar taint (BT) compounds (androstenone, indole, skatole) in their tissues seems a promising strategy. BT compound concentrations were quantified in the adipose tissue of 1075 boars genotyped at 29,844 SNPs. Traditional and SNP-based breeding values were estimated using pedigree-based BLUP (PBLUP) and genomic BLUP (GBLUP), respectively. Heritabilities for BT compounds were moderate (0.30-0.52). The accuracies of GBLUP and PBLUP were significantly different for androstenone (0.58 and 0.36, respectively), but comparable for indole and skatole (~0.43 and ~0.47, respectively). Several SNP windows, each explaining a small percentage of the variance of BT compound concentrations, were identified in a genome-wide association study (GWAS). A total of 18 candidate genes previously associated with BT (MX1), reproduction traits (TCF21, NME5, PTGFR, KCNQ1, UMODL1), and fat metabolism (CTSD, SYT8, TNNI2, CD81, EGR1, GIPC2, MIGA1, NEGR1, CCSER1, MTMR2, LPL, ERFE) were identified in the post-GWAS analysis. The large number of genes related to fat metabolism might be explained by the relationship between sexual steroid levels and fat deposition and be partially ascribed to the pig line investigated, which is selected for ham quality and not for lean growth.

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

Sujet(s)

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

Sujet(s)

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

Sujet(s)

RÉSUMÉ

Sujet(s)

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

RÉSUMÉ

ENVOYER À:

SÉLECTION CITATIONS

DÉTAIL DE RECHERCHE