Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 34
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Genet Sel Evol ; 55(1): 19, 2023 Mar 22.
Artigo em Inglês | MEDLINE | ID: mdl-36949392

RESUMO

BACKGROUND: In genomic prediction, it is common to centre the genotypes of single nucleotide polymorphisms based on the allele frequencies in the current population, rather than those in the base generation. The mean breeding value of non-genotyped animals is conditional on the mean performance of genotyped relatives, but can be corrected by fitting the mean performance of genotyped individuals as a fixed regression. The associated covariate vector has been referred to as a 'J-factor', which if fitted as a fixed effect can improve the accuracy and dispersion bias of sire genomic estimated breeding values (GEBV). To date, this has only been performed on populations with a single breed. Here, we investigated whether there was any benefit in fitting a separate J-factor for each breed in a three-way crossbred population, and in using pedigree-based expected or genome-based estimated breed fractions to define the J-factors. RESULTS: For body weight at 7 days, dispersion bias decreased when fitting multiple J-factors, but only with a low proportion of genotyped individuals with selective genotyping. On average, the mean regression coefficients of validation records on those of GEBV increased with one J-factor compared to none, and further increased with multiple J-factors. However, for body weight at 35 days this was not observed. The accuracy of GEBV remained unchanged regardless of the J-factor method used. Differences between the J-factor methods were limited with correlations approaching 1 for the estimated covariate vector, the estimated coefficients of the regression on the J-factors, and the GEBV. CONCLUSIONS: Based on our results and in the particular design analysed here, i.e. all the animals with phenotype are of the same type of crossbreds, fitting a single J-factor should be sufficient, to reduce dispersion bias. Fitting multiple J-factors may reduce dispersion bias further but this depends on the trait and genotyping rate. For the crossbred population analysed, fitting multiple J-factors has no adverse consequences and if this is done, it does not matter if the breed fractions used are based on the pedigree-expectation or the genomic estimates. Finally, when GEBV are estimated from crossbred data, any observed bias can potentially be reduced by including a straightforward regression on actual breed proportions.


Assuntos
Genoma , Modelos Genéticos , Animais , Genótipo , Genômica/métodos , Fenótipo , Polimorfismo de Nucleotídeo Único , Linhagem
2.
Genet Sel Evol ; 55(1): 14, 2023 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-36882689

RESUMO

BACKGROUND: In broiler breeding, genotype-by-environment interaction is known to result in a genetic correlation between body weight measured in bio-secure and commercial environments that is substantially less than 1. Thus, measuring body weights on sibs of selection candidates in a commercial environment and genotyping them could increase genetic progress. Using real data, the aim of this study was to evaluate which genotyping strategy and which proportion of sibs placed in the commercial environment should be genotyped to optimize a sib-testing breeding program in broilers. Phenotypic body weight and genomic information were collected on all sibs raised in a commercial environment, which allowed to retrospectively analyze different sampling strategies and genotyping proportions. RESULTS: Accuracies of genomic estimated breeding values (GEBV) obtained with the different genotyping strategies were assessed by computing their correlation with GEBV obtained when all sibs in the commercial environment were genotyped. Results showed that, compared to random sampling (RND), genotyping sibs with extreme phenotypes (EXT) resulted in higher GEBV accuracy across all genotyping proportions, especially for genotyping proportions of 12.5% or 25%, which resulted in correlations of 0.91 vs 0.88 for 12.5% and 0.94 vs 0.91 for 25% genotyped. Including pedigree on birds with phenotype in the commercial environment that were not genotyped increased accuracy at lower genotyping proportions, especially for the RND strategy (correlations of 0.88 vs 0.65 at 12.5% and 0.91 vs 0.80 at 25%), and a smaller but still substantial increase in accuracy for the EXT strategy (0.91 vs 0.79 for 12.5% and 0.94 vs 0.88 for 25% genotyped). Dispersion bias was virtually absent for RND if 25% or more birds were genotyped. However, GEBV were considerably inflated for EXT, especially when the proportion genotyped was low, which was further exacerbated if the pedigree of non-genotyped sibs was excluded. CONCLUSIONS: When less than 75% of all animals placed in a commercial environment are genotyped, it is recommended to use the EXT strategy, because it yields the highest accuracy. However, caution should be taken when interpreting the resulting GEBV because they will be over-dispersed. When 75% or more of the animals are genotyped, random sampling is recommended because it yields virtually no bias of GEBV and results in similar accuracies as the EXT strategy.


Assuntos
Galinhas , Genômica , Animais , Genótipo , Galinhas/genética , Estudos Retrospectivos , Peso Corporal/genética
3.
J Anim Breed Genet ; 140(3): 304-315, 2023 May.
Artigo em Inglês | MEDLINE | ID: mdl-36806175

RESUMO

Aneuploidy is the loss or gain of one or more chromosomes. Although it is a rare phenomenon in liveborn individuals, it is observed in livestock breeding populations. These breeding populations are often routinely genotyped and the genotype intensity data from single nucleotide polymorphism (SNP) arrays can be exploited to identify aneuploidy cases. This identification is a time-consuming and costly task, because it is often performed by visual inspection of the data per chromosome, usually done in plots of the intensity data by an expert. Therefore, we wanted to explore the feasibility of automated image classification to replace (part of) the visual detection procedure for any diploid species. The aim of this study was to develop a deep learning Convolutional Neural Network (CNN) classification model based on chromosome level plots of SNP array intensity data that can classify the images into disomic, monosomic and trisomic cases. A multispecies dataset enriched for aneuploidy cases was collected containing genotype intensity data of 3321 disomic, 1759 monosomic and 164 trisomic chromosomes. The final CNN model had an accuracy of 99.9%, overall precision was 1, recall was 0.98 and the F1 score was 0.99 for classifying images from intensity data. The high precision assures that cases detected are most likely true cases, however, some trisomy cases may be missed (the recall of the class trisomic was 0.94). This supervised CNN model performed much better than an unsupervised k-means clustering, which reached an accuracy of 0.73 and had especially difficult to classify trisomic cases correctly. The developed CNN classification model provides high accuracy to classify aneuploidy cases based on images of plotted X and Y genotype intensity values. The classification model can be used as a tool for routine screening in large diploid populations that are genotyped to get a better understanding of the incidence and inheritance, and in addition, avoid anomalies in breeding candidates.


Assuntos
Aprendizado Profundo , Animais , Aneuploidia , Redes Neurais de Computação , Genótipo
4.
Genet Sel Evol ; 54(1): 44, 2022 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-35705918

RESUMO

BACKGROUND: In genomic prediction including data of 3- or 4-way crossbred animals, line composition is usually fitted as a regression on expected line proportions, which are 0.5, 0.25 and 0.25, respectively, for 3-way crossbred animals. However, actual line proportions for the dam lines can vary between ~ 0.1 and 0.4, and ignoring this variation may affect the genomic estimated breeding values of purebred selection candidates. Our aim was to validate a proposed gold standard to evaluate different approaches for estimating line proportions using simulated data, and to subsequently use this in actual 3-way crossbred broiler data to evaluate several other methods. RESULTS: Analysis of simulated data confirmed that line proportions computed from assigned breed-origin-of-alleles (BOA) provide a very accurate gold standard, even if the parental lines are closely related. Alternative investigated methods were linear regression of genotypes on line-specific allele frequencies, maximum likelihood estimation using the program ADMIXTURE, and the genomic relationship of crossbred animals with their maternal grandparents. The results from the simulated data showed that the genomic relationship with the maternal grandparent was most accurate, and least affected by closer relationships between the dam lines. Linear regression and ADMIXTURE performed similarly for unrelated lines, but their accuracy dropped considerably when the dam lines were more closely related. In almost all cases, estimates improved after adjusting them to ensure that the sum of dam line contributions within animals was equal to 0.5, and within dam line and across animals the average was equal to 0.25. Results from the broiler data were much more similar between methods. In both cases, stringent linkage disequilibrium pruning of genotype data led to a relatively low accuracy of predicted line proportions, due to the loss of too many single nucleotide polymorphisms. CONCLUSIONS: With relatively unrelated parental lines as typical in crosses in pigs and poultry, linear regression of crossbred genotypes on line-specific allele frequencies and ADMIXTURE are very competitive methods. Thus, linear regression may be the method of choice, as it does not require genotypes of grandparents, is computationally very efficient, and easily implemented and adapted for considering the specific nature of the crossbred animals analysed.


Assuntos
Galinhas , Modelos Genéticos , Alelos , Animais , Galinhas/genética , Genômica , Genótipo , Hibridização Genética , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único , Suínos
5.
Genet Sel Evol ; 52(1): 31, 2020 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-32527317

RESUMO

BACKGROUND: The traditional way to estimate variance components (VC) is based on the animal model using a pedigree-based relationship matrix (A) (A-AM). After genomic selection was introduced into breeding programs, it was anticipated that VC estimates from A-AM would be biased because the effect of selection based on genomic information is not captured. The single-step method (H-AM), which uses an H matrix as (co)variance matrix, can be used as an alternative to estimate VC. Here, we compared VC estimates from A-AM and H-AM and investigated the effect of genomic selection, genotyping strategy and genotyping proportion on the estimation of VC from the two methods, by analyzing a dataset from a commercial broiler line and a simulated dataset that mimicked the broiler population. RESULTS: VC estimates from H-AM were severely overestimated with a high proportion of selective genotyping, and overestimation increased as proportion of genotyping increased in the analysis of both commercial and simulated data. This bias in H-AM estimates arises when selective genotyping is used to construct the H-matrix, regardless of whether selective genotyping is applied or not in the selection process. For simulated populations under genomic selection, estimates of genetic variance from A-AM were also significantly overestimated when the effect of genomic selection was strong. Our results suggest that VC estimates from H-AM under random genotyping have the expected values. Predicted breeding values from H-AM were inflated when VC estimates were biased, and inflation differed between genotyped and ungenotyped animals, which can lead to suboptimal selection decisions. CONCLUSIONS: We conclude that VC estimates from H-AM are biased with selective genotyping, but are close to expected values with random genotyping.VC estimates from A-AM in populations under genomic selection are also biased but to a much lesser degree. Therefore, we recommend the use of H-AM with random genotyping to estimate VC for populations under genomic selection. Our results indicate that it is still possible to use selective genotyping in selection, but then VC estimation should avoid the use of genotypes from one side only of the distribution of phenotypes. Hence, a dual genotyping strategy may be needed to address both selection and VC estimation.


Assuntos
Cruzamento/métodos , Técnicas de Genotipagem/métodos , Seleção Genética/genética , Análise de Variância , Animais , Galinhas/genética , Simulação por Computador , Genoma/genética , Genômica/métodos , Genótipo , Modelos Animais , Modelos Genéticos , Linhagem , Fenótipo
6.
J Anim Breed Genet ; 137(2): 245-259, 2020 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-31621116

RESUMO

A multivariate model was developed and used to estimate genetic parameters of body weight (BW) at 1-6 weeks of age of broilers raised in a commercial environment. The development of model was based on the predictive ability of breeding values evaluated from a cross-validation procedure that relied on half-sib correlation. The multivariate model accounted for heterogeneous variances between sexes through standardization applied to male and female BWs differently. It was found that the direct additive genetic, permanent environmental maternal and residual variances for BW increased drastically as broilers aged. The drastic increase in variances over weeks of age was mainly due to scaling effects. The ratio of the permanent environmental maternal variance to phenotypic variance decreased gradually with increasing age. Heritability of BW traits ranged from 0.28 to 0.33 at different weeks of age. The direct genetic effects on consecutive weekly BWs had high genetic correlations (0.85-0.99), but the genetic correlations between early and late BWs were low (0.32-0.57). The difference in variance components between sexes increased with increasing age. In conclusion, the permanent environmental maternal effect on broiler chicken BW decreased with increasing age from weeks 1 to 6. Potential bias of the model that considered identical variances for sexes could be reduced when heterogeneous variances between sexes are accounted for in the model.


Assuntos
Peso Corporal/genética , Galinhas/crescimento & desenvolvimento , Galinhas/genética , Animais , Cruzamento , Feminino , Variação Genética , Masculino , Herança Materna , Modelos Genéticos , Modelos Estatísticos , Herança Multifatorial , Fenótipo , Característica Quantitativa Herdável
7.
Genet Sel Evol ; 51(1): 63, 2019 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-31711413

RESUMO

Following publication of original article [1], we noticed that there was an error: Eq. (3) on page 5 is the genomic relationship matrix that.

8.
Genet Sel Evol ; 51(1): 68, 2019 Nov 21.
Artigo em Inglês | MEDLINE | ID: mdl-31752665

RESUMO

After publication of this work [1], we noticed that there was an error: the formula to calculate the standard error of the estimated correlation.

9.
Genet Sel Evol ; 51(1): 53, 2019 Sep 23.
Artigo em Inglês | MEDLINE | ID: mdl-31547801

RESUMO

BACKGROUND: The objectives of this study were to (1) simultaneously estimate genetic parameters for BW, feed intake (FI), and body weight gain (Gain) during a FI test in broiler chickens using multi-trait Bayesian analysis; (2) derive phenotypic and genetic residual feed intake (RFI) and estimate genetic parameters of the resulting traits; and (3) compute a Bayesian measure of direct and correlated superiority of a group selected on phenotypic or genetic residual feed intake. A total of 56,649 male and female broiler chickens were measured at one of two ages ([Formula: see text] or [Formula: see text] days). BW, FI, and Gain of males and females at the two ages were considered as separate traits, resulting in a 12-trait model. Phenotypic RFI ([Formula: see text]) and genetic RFI ([Formula: see text]) were estimated from a conditional distribution of FI given BW and Gain using partial phenotypic and partial genetic regression coefficients, respectively. RESULTS: Posterior means of heritability for BW, FI and Gain were moderately high and estimates were significantly different between males and females at the same age for all traits. In addition, the genetic correlations between male and female traits at the same age were significantly different from 1, which suggests a sex-by-genotype interaction. Genetic correlations between [Formula: see text] and [Formula: see text] were significantly different from 1 at an older age but not at a younger age. CONCLUSIONS: The results of the multivariate Bayesian analyses in this study showed that genetic evaluation for production and feed efficiency traits should take sex and age differences into account to increase accuracy of selection and genetic gain. Moreover, for communicating with stakeholders, it is easier to explain results from selection on [Formula: see text] than selection on [Formula: see text], since [Formula: see text] is genetically independent of production traits and it explains the efficiency of birds in nutrient utilization independently of energy requirements for production and maintenance.


Assuntos
Peso Corporal/genética , Galinhas/genética , Ração Animal , Animais , Teorema de Bayes , Galinhas/crescimento & desenvolvimento , Ingestão de Alimentos , Feminino , Masculino
10.
Genet Sel Evol ; 51(1): 50, 2019 Sep 18.
Artigo em Inglês | MEDLINE | ID: mdl-31533614

RESUMO

BACKGROUND: The increase in accuracy of prediction by using genomic information has been well-documented. However, benefits of the use of genomic information and methodology for genetic evaluations are missing when genotype-by-environment interactions (G × E) exist between bio-secure breeding (B) environments and commercial production (C) environments. In this study, we explored (1) G × E interactions for broiler body weight (BW) at weeks 5 and 6, and (2) the benefits of using genomic information for prediction of BW traits when selection candidates were raised and tested in a B environment and close relatives were tested in a C environment. METHODS: A pedigree-based best linear unbiased prediction (BLUP) multivariate model was used to estimate variance components and predict breeding values (EBV) of BW traits at weeks 5 and 6 measured in B and C environments. A single-step genomic BLUP (ssGBLUP) model that combined pedigree and genomic information was used to predict EBV. Cross-validations were based on correlation, mean difference and regression slope statistics for EBV that were estimated from full and reduced datasets. These statistics are indicators of population accuracy, bias and dispersion of prediction for EBV of traits measured in B and C environments. Validation animals were genotyped and non-genotyped birds in the B environment only. RESULTS: Several indications of G × E interactions due to environmental differences were found for BW traits including significant re-ranking, heterogeneous variances and different heritabilities for BW measured in environments B and C. The genetic correlations between BW traits measured in environments B and C ranged from 0.48 to 0.54. The use of combined pedigree and genomic information increased population accuracy of EBV, and reduced bias of EBV prediction for genotyped birds compared to the use of pedigree information only. A slight increase in accuracy of EBV was also observed for non-genotyped birds, but the bias of EBV prediction increased for non-genotyped birds. CONCLUSIONS: The G × E interaction was strong for BW traits of broilers measured in environments B and C. The use of combined pedigree and genomic information increased population accuracy of EBV substantially for genotyped birds in the B environment compared to the use of pedigree information only.


Assuntos
Peso Corporal/genética , Galinhas/genética , Interação Gene-Ambiente , Modelos Genéticos , Animais , Cruzamento , Galinhas/crescimento & desenvolvimento , Feminino , Genômica , Masculino , Modelos Estatísticos
11.
Genet Sel Evol ; 51(1): 38, 2019 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-31286857

RESUMO

BACKGROUND: Pig and poultry breeding programs aim at improving crossbred (CB) performance. Selection response may be suboptimal if only purebred (PB) performance is used to compute genomic estimated breeding values (GEBV) because the genetic correlation between PB and CB performance ([Formula: see text]) is often lower than 1. Thus, it may be beneficial to use information on both PB and CB performance. In addition, the accuracy of GEBV of PB animals for CB performance may improve when the breed-of-origin of alleles (BOA) is considered in the genomic relationship matrix (GRM). Thus, our aim was to compare scenarios where GEBV are computed and validated by using (1) either CB offspring averages or individual CB records for validation, (2) either a PB or CB reference population, and (3) a GRM that either accounts for or ignores BOA in the CB individuals. For this purpose, we used data on body weight measured at around 7 (BW7) or 35 (BW35) days in PB and CB broiler chickens and evaluated the accuracy of GEBV based on the correlation GEBV with phenotypes in the validation population (validation correlation). RESULTS: With validation on CB offspring averages, the validation correlation of GEBV of PB animals for CB performance was lower with a CB reference population than with a PB reference population for BW35 ([Formula: see text] = 0.96), and about equal for BW7 ([Formula: see text] = 0.80) when BOA was ignored. However, with validation on individual CB records, the validation correlation was higher with a CB reference population for both traits. The use of a GRM that took BOA into account increased the validation correlation for BW7 but reduced it for BW35. CONCLUSIONS: We argue that the benefit of using a CB reference population for genomic prediction of PB animals for CB performance should be assessed either by validation on CB offspring averages, or by validation on individual CB records while using a GRM that accounts for BOA in the CB individuals. With this recommendation in mind, our results show that the accuracy of GEBV of PB animals for CB performance was equal to or higher with a CB reference population than with a PB reference population for a trait with an [Formula: see text] of 0.8, but lower for a trait with an [Formula: see text] of 0.96. In addition, taking BOA into account was beneficial for a trait with an [Formula: see text] of 0.8 but not for a trait with an [Formula: see text] of 0.96.


Assuntos
Peso Corporal/genética , Cruzamento , Galinhas/genética , Genômica/métodos , Alelos , Animais , Feminino , Genótipo , Masculino , Fenótipo , Valores de Referência
12.
Genet Sel Evol ; 51(1): 6, 2019 Feb 19.
Artigo em Inglês | MEDLINE | ID: mdl-30782121

RESUMO

BACKGROUND: In pig and poultry breeding programs, the breeding goal is to improve crossbred (CB) performance, whereas selection in the purebred (PB) lines is often based on PB performance. Thus, response to selection may be suboptimal, because the genetic correlation between PB and CB performance ([Formula: see text]) is generally lower than 1. Accurate estimates of the [Formula: see text] are needed, so that breeders can decide if they should collect data from CB animals. [Formula: see text] can be estimated either from pedigree or genomic relationships, which may produce different results. With genomic relationships, the [Formula: see text] estimate could be improved when relationships between purebred and crossbred animals are based only on the alleles that originate from the PB line of interest. This work presents the first comparison of estimated [Formula: see text] and variance components of body weight in broilers, using pedigree-based or genotype-based models, where the breed-of-origin of alleles was either ignored or considered. We used genotypes and body weight measurements of PB and CB animals that have a common sire line. RESULTS: Our results showed that the [Formula: see text] estimates depended on the relationship matrix used. Estimates were 5 to 25% larger with genotype-based models than with pedigree-based models. Moreover, [Formula: see text] estimates were similar (max. 7% difference) regardless of whether the model considered breed-of-origin of alleles or not. Standard errors of [Formula: see text] estimates were smaller with genotype-based than with pedigree-based methods, and smaller with models that ignored breed-of-origin than with models that considered breed-of-origin. CONCLUSIONS: We conclude that genotype-based models can be useful for estimating [Formula: see text], even when the PB and CB animals that have phenotypes are closely related. Considering breed-of-origin of alleles did not yield different estimates of [Formula: see text], probably because the parental breeds of the CB animals were distantly related.


Assuntos
Peso Corporal/genética , Cruzamento/métodos , Galinhas/genética , Genótipo , Linhagem , Animais , Galinhas/crescimento & desenvolvimento , Feminino , Masculino , Modelos Genéticos , Fenótipo
13.
Genet Sel Evol ; 50(1): 52, 2018 Nov 03.
Artigo em Inglês | MEDLINE | ID: mdl-30390619

RESUMO

BACKGROUND: A breeding program for commercial broiler chicken that is carried out under strict biosecure conditions can show reduced genetic gain due to genotype by environment interactions (G × E) between bio-secure (B) and commercial production (C) environments. Accuracy of phenotype-based best linear unbiased prediction of breeding values of selection candidates using sib-testing in C is low. Genomic prediction based on dense genetic markers may improve accuracy of selection. Stochastic simulation was used to explore the benefits of genomic selection in breeding schemes for broiler chicken that include birds in both B and C for assessment of phenotype. RESULTS: When genetic correlations ([Formula: see text]) between traits measured in B and C were equal to 0.5 and 0.7, breeding schemes with 15, 30 and 45% of birds assessed in C resulted in higher genetic gain for performance in C compared to those without birds in C. The optimal proportion of birds phenotyped in C for genetic gain was 30%. When the proportion of birds in C was optimal and genotyping effort was limited, allocating 30% of the genotyping effort to birds in C was also the optimal genotyping strategy for genetic gain. When [Formula: see text] was equal to 0.9, genetic gain for performance in C was not improved with birds in C compared to schemes without birds in C. Increasing the heritability of traits assessed in C increased genetic gain significantly. Rates of inbreeding decreased when the proportion of birds in C increased because of a lower selection intensity among birds retained in B and a reduction in the probability of co-selecting close relatives. CONCLUSIONS: If G × E interactions ([Formula: see text] of 0.5 and 0.7) are strong, a genomic selection scheme in which 30% of the birds hatched are phenotyped in C has larger genetic gain for performance in C compared to phenotyping all birds in B. Rates of inbreeding decreased as the proportion of birds moved to C increased from 15 to 45%.


Assuntos
Cruzamento/métodos , Galinhas/genética , Interação Gene-Ambiente , Seleção Genética , Criação de Animais Domésticos/métodos , Animais , Cruzamento/normas , Modelos Genéticos , Característica Quantitativa Herdável
14.
Genet Sel Evol ; 49(1): 28, 2017 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-28245804

RESUMO

BACKGROUND: DNA-based predictions for hard-to-measure production traits hold great promise for selective breeding programs. DNA pooling might provide a cheap genomic approach to use phenotype data from commercial flocks which are commonly group-mated with parentage unknown. This study on sheep explores if genomic breeding values for stud sires can be estimated from genomic relationships that were obtained from pooled DNA in combination with phenotypes from commercial progeny. METHODS: Phenotypes used in this study were categorical data. Blood was pooled strategically aiming at even pool sizes and within sex and phenotype category. A hybrid genomic relationship matrix was constructed relating pools to sires. This matrix was used to determine the contribution of sires to each of the pools and therefore phenotype category by using a simple regression approach. Genomic breeding values were also estimated using the hybrid genomic relationship matrix. RESULTS: We demonstrated that, using pooled DNA, the genetic performance of sires can be illustrated as their contribution to phenotype categories and can be expressed as a regression coefficient. Genomic estimated breeding values for sires were equivalent to the regression coefficients and are a commonly used industry tool. CONCLUSIONS: Genotyping of DNA from pooled biological samples offers a cheap method to link phenotypic information from commercial production animals to the breeding population and can be turned into information on the genetic value of stud sires for traits that cannot be measured in the stud environment.


Assuntos
Cruzamento/métodos , Técnicas de Genotipagem/métodos , Linhagem , Ovinos/genética , Animais , Cruzamento/normas , DNA/sangue , DNA/genética , Aptidão Genética , Técnicas de Genotipagem/normas , Masculino , Fenótipo
15.
Genet Sel Evol ; 48(1): 75, 2016 Sep 29.
Artigo em Inglês | MEDLINE | ID: mdl-27687320

RESUMO

BACKGROUND: As genomic data becomes more abundant, genomic prediction is more routinely used to estimate breeding values. In genomic prediction, the relationship matrix ([Formula: see text]), which is traditionally used in genetic evaluations is replaced by the genomic relationship matrix ([Formula: see text]). This paper considers alternative ways of building relationship matrices either using single markers or haplotypes of different lengths. We compared the prediction accuracies and log-likelihoods when using these alternative relationship matrices and the traditional [Formula: see text] matrix, for real and simulated data. METHODS: For real data, we built relationship matrices using 50k genotype data for a population of Brahman cattle to analyze three traits: scrotal circumference (SC), age at puberty (AGECL) and weight at first corpus luteum (WTCL). Haplotypes were phased with hsphase and imputed with BEAGLE. The relationship matrices were built using three methods based on haplotypes of different lengths. The log-likelihood was considered to define the optimum haplotype lengths for each trait and each haplotype-based relationship matrix. RESULTS: Based on simulated data, we showed that the inverse of [Formula: see text] matrix and the inverse of the haplotype relationship matrices for methods using one-single nucleotide polymorphism (SNP) phased haplotypes provided coefficients of determination (R2) close to 1, although the estimated genetic variances differed across methods. Using real data and multiple SNPs in the haplotype segments to build the relationship matrices provided better results than the [Formula: see text] matrix based on one-SNP haplotypes. However, the optimal haplotype length to achieve the highest log-likelihood depended on the method used and the trait. The optimal haplotype length (7 to 8 SNPs) was similar for SC and AGECL. One of the haplotype-based methods achieved the largest increase in log-likelihood for SC, i.e. from -1330 when using [Formula: see text] to -1325 when using haplotypes with eight SNPs. CONCLUSIONS: Building the relationship matrix by using haplotypes that comprise multiple SNPs will increase the accuracy of estimated breeding values. However, the optimum haplotype length that shows the correct relationship among individuals for each trait can be derived from the data.

16.
BMC Bioinformatics ; 16: 214, 2015 Jul 09.
Artigo em Inglês | MEDLINE | ID: mdl-26156142

RESUMO

BACKGROUND: Despite ongoing reduction in genotyping costs, genomic studies involving large numbers of species with low economic value (such as Black Tiger prawns) remain cost prohibitive. In this scenario DNA pooling is an attractive option to reduce genotyping costs. However, genotyping of pooled samples comprising DNA from many individuals is challenging due to the presence of errors that exceed the allele frequency quantisation size and therefore cannot be simply corrected by clustering techniques. The solution to the calibration problem is a correction to the allele frequency to mitigate errors incurred in the measurement process. We highlight the limitations of the existing calibration solutions such as the fact they impose assumptions on the variation between allele frequencies 0, 0.5, and 1.0, and address a limited set of error types. We propose a novel machine learning method to address the limitations identified. RESULTS: The approach is tested on SNPs genotyped with the Sequenom iPLEX platform and compared to existing state of the art calibration methods. The new method is capable of reducing the mean square error in allele frequency to half that achievable with existing approaches. Furthermore for the first time we demonstrate the importance of carefully considering the choice of training data when using calibration approaches built from pooled data. CONCLUSION: This paper demonstrates that improvements in pooled allele frequency estimates result if the genotyping platform is characterised at allele frequencies other than the homozygous and heterozygous cases. Techniques capable of incorporating such information are described along with aspects of implementation.


Assuntos
DNA/análise , DNA/genética , Genômica , Aprendizado de Máquina , Polimorfismo de Nucleotídeo Único/genética , Calibragem , Análise por Conglomerados , Frequência do Gene , Genótipo , Humanos
17.
Genet Sel Evol ; 47: 84, 2015 Nov 02.
Artigo em Inglês | MEDLINE | ID: mdl-26525050

RESUMO

BACKGROUND: The success of genomic selection in animal breeding hinges on the availability of a large reference population on which genomic-based predictions of additive genetic or breeding values are built. Here, we explore the benefit of combining two unrelated populations into a single reference population. METHODS: The datasets consisted of 1829 Brahman and 1973 Tropical Composite cattle with measurements on five phenotypes relevant to tropical adaptation and genotypes for 71,726 genome-wide single nucleotide polymorphisms (SNPs). The underlying genomic correlation for the same phenotype across the two breeds was explored on the basis of consistent linkage disequilibrium (LD) phase and marker effects in both breeds. RESULTS: The proportion of genetic variance explained by the entire set of SNPs ranged from 37.5 to 57.6 %. Estimated genomic correlations were drastically affected by the process used to select SNPs and went from near 0 to more than 0.80 for most traits when using the set of SNPs with significant effects and the same LD phase in the two breeds. We found that, by carefully selecting the subset of SNPs, the missing heritability can be largely recovered and accuracies in genomic predictions can be improved six-fold. However, the increases in accuracy might come at the expense of large biases. CONCLUSIONS: Our results offer hope for the effective implementation of genomic selection schemes in situations where the number of breeds is large, the sample size within any single breed is small and the breeding objective includes many phenotypes.


Assuntos
Cruzamentos Genéticos , Genoma , Genômica/métodos , Modelos Genéticos , Seleção Genética , Algoritmos , Animais , Bovinos , Conjuntos de Dados como Assunto , Evolução Molecular , Genética Populacional , Estudo de Associação Genômica Ampla , Genótipo , Desequilíbrio de Ligação , Linhagem , Fenótipo , Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Reprodutibilidade dos Testes
18.
Genet Sel Evol ; 46: 51, 2014 Sep 02.
Artigo em Inglês | MEDLINE | ID: mdl-25183297

RESUMO

BACKGROUND: While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. RESULTS: For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. CONCLUSIONS: Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms.


Assuntos
Técnicas de Genotipagem/métodos , Penaeidae/genética , Polimorfismo de Nucleotídeo Único , Animais , Cruzamento , DNA/química , Feminino , Frequência do Gene , Masculino , Linhagem , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz
19.
Genet Res (Camb) ; 94(4): 223-34, 2012 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-22950902

RESUMO

Recently, a Haley-Knott-type regression method using combined linkage disequilibrium and linkage analyses (LDLA) was proposed to map quantitative trait loci (QTLs). Chromosome of 5 and 25 cM with 0·25 and 0·05 cM, respectively, between markers were simulated. The differences between the LDLA approaches with regard to QTL position accuracy were very limited, with a significantly better mean square error (MSE) with the LDLA regression (LDLA_reg) in sparse map cases; the contrary was observed, but not significantly, in dense map situations. The computing time required for the LDLA variance components (LDLA_vc) model was much higher than the LDLA_reg model. The precision of QTL position estimation was compared for four numbers of half-sib families, four different family sizes and two experimental designs (half-sibs, and full- and half-sibs). Regarding the number of families, MSE values were lowest for 15 or 50 half-sib families, differences not being significant. We observed that the greater the number of progenies per sire, the more accurate the QTL position. However, for a fixed population size, reducing the number of families (e.g. using a small number of large full-sib families) could lead to less accuracy of estimated QTL position.


Assuntos
Mapeamento Cromossômico/métodos , Ligação Genética , Desequilíbrio de Ligação , Locos de Características Quantitativas/genética , Simulação por Computador , Família , Humanos , Modelos Genéticos , Densidade Demográfica
20.
Genet Sel Evol ; 44: 12, 2012 Apr 17.
Artigo em Inglês | MEDLINE | ID: mdl-22507187

RESUMO

BACKGROUND: Studies to detect associations between DNA markers and traits of interest in humans and livestock benefit from increasing the number of individuals genotyped. Performing association studies on pooled DNA samples can provide greater power for a given cost. For quantitative traits, the effect of an SNP is measured in the units of the trait and here we propose and demonstrate a method to estimate SNP effects on quantitative traits from pooled DNA data. METHODS: To obtain estimates of SNP effects from pooled DNA samples, we used logistic regression of estimated allele frequencies in pools on phenotype. The method was tested on a simulated dataset, and a beef cattle dataset using a model that included principal components from a genomic correlation matrix derived from the allele frequencies estimated from the pooled samples. The performance of the obtained estimates was evaluated by comparison with estimates obtained using regression of phenotype on genotype from individual samples of DNA. RESULTS: For the simulated data, the estimates of SNP effects from pooled DNA are similar but asymptotically different to those from individual DNA data. Error in estimating allele frequencies had a large effect on the accuracy of estimated SNP effects. For the beef cattle dataset, the principal components of the genomic correlation matrix from pooled DNA were consistent with known breed groups, and could be used to account for population stratification. Correctly modeling the contemporary group structure was essential to achieve estimates similar to those from individual DNA data, and pooling DNA from individuals within groups was superior to pooling DNA across groups. For a fixed number of assays, pooled DNA samples produced results that were more correlated with results from individual genotyping data than were results from one random individual assayed from each pool. CONCLUSIONS: Use of logistic regression of allele frequency on phenotype makes it possible to estimate SNP effects on quantitative traits from pooled DNA samples. With pooled DNA samples, genotyping costs are reduced, and in cases where trait records are abundant this approach is promising to obtain SNP associations for marker-assisted selection.


Assuntos
DNA/genética , Genótipo , Polimorfismo de Nucleotídeo Único , Algoritmos , Animais , Biometria , Bovinos/anatomia & histologia , Bovinos/genética , Simulação por Computador , Feminino , Frequência do Gene , Humanos , Modelos Logísticos , Modelos Genéticos , Análise de Componente Principal , Locos de Características Quantitativas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA