Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 23
Filtrar
1.
J Anim Breed Genet ; 140(3): 304-315, 2023 May.
Artigo em Inglês | MEDLINE | ID: mdl-36806175

RESUMO

Aneuploidy is the loss or gain of one or more chromosomes. Although it is a rare phenomenon in liveborn individuals, it is observed in livestock breeding populations. These breeding populations are often routinely genotyped and the genotype intensity data from single nucleotide polymorphism (SNP) arrays can be exploited to identify aneuploidy cases. This identification is a time-consuming and costly task, because it is often performed by visual inspection of the data per chromosome, usually done in plots of the intensity data by an expert. Therefore, we wanted to explore the feasibility of automated image classification to replace (part of) the visual detection procedure for any diploid species. The aim of this study was to develop a deep learning Convolutional Neural Network (CNN) classification model based on chromosome level plots of SNP array intensity data that can classify the images into disomic, monosomic and trisomic cases. A multispecies dataset enriched for aneuploidy cases was collected containing genotype intensity data of 3321 disomic, 1759 monosomic and 164 trisomic chromosomes. The final CNN model had an accuracy of 99.9%, overall precision was 1, recall was 0.98 and the F1 score was 0.99 for classifying images from intensity data. The high precision assures that cases detected are most likely true cases, however, some trisomy cases may be missed (the recall of the class trisomic was 0.94). This supervised CNN model performed much better than an unsupervised k-means clustering, which reached an accuracy of 0.73 and had especially difficult to classify trisomic cases correctly. The developed CNN classification model provides high accuracy to classify aneuploidy cases based on images of plotted X and Y genotype intensity values. The classification model can be used as a tool for routine screening in large diploid populations that are genotyped to get a better understanding of the incidence and inheritance, and in addition, avoid anomalies in breeding candidates.


Assuntos
Aprendizado Profundo , Animais , Aneuploidia , Redes Neurais de Computação , Genótipo
2.
Genet Sel Evol ; 54(1): 44, 2022 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-35705918

RESUMO

BACKGROUND: In genomic prediction including data of 3- or 4-way crossbred animals, line composition is usually fitted as a regression on expected line proportions, which are 0.5, 0.25 and 0.25, respectively, for 3-way crossbred animals. However, actual line proportions for the dam lines can vary between ~ 0.1 and 0.4, and ignoring this variation may affect the genomic estimated breeding values of purebred selection candidates. Our aim was to validate a proposed gold standard to evaluate different approaches for estimating line proportions using simulated data, and to subsequently use this in actual 3-way crossbred broiler data to evaluate several other methods. RESULTS: Analysis of simulated data confirmed that line proportions computed from assigned breed-origin-of-alleles (BOA) provide a very accurate gold standard, even if the parental lines are closely related. Alternative investigated methods were linear regression of genotypes on line-specific allele frequencies, maximum likelihood estimation using the program ADMIXTURE, and the genomic relationship of crossbred animals with their maternal grandparents. The results from the simulated data showed that the genomic relationship with the maternal grandparent was most accurate, and least affected by closer relationships between the dam lines. Linear regression and ADMIXTURE performed similarly for unrelated lines, but their accuracy dropped considerably when the dam lines were more closely related. In almost all cases, estimates improved after adjusting them to ensure that the sum of dam line contributions within animals was equal to 0.5, and within dam line and across animals the average was equal to 0.25. Results from the broiler data were much more similar between methods. In both cases, stringent linkage disequilibrium pruning of genotype data led to a relatively low accuracy of predicted line proportions, due to the loss of too many single nucleotide polymorphisms. CONCLUSIONS: With relatively unrelated parental lines as typical in crosses in pigs and poultry, linear regression of crossbred genotypes on line-specific allele frequencies and ADMIXTURE are very competitive methods. Thus, linear regression may be the method of choice, as it does not require genotypes of grandparents, is computationally very efficient, and easily implemented and adapted for considering the specific nature of the crossbred animals analysed.


Assuntos
Galinhas , Modelos Genéticos , Alelos , Animais , Galinhas/genética , Genômica , Genótipo , Hibridização Genética , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único , Suínos
3.
Genet Sel Evol ; 51(1): 63, 2019 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-31711413

RESUMO

Following publication of original article [1], we noticed that there was an error: Eq. (3) on page 5 is the genomic relationship matrix that.

4.
Genet Sel Evol ; 51(1): 38, 2019 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-31286857

RESUMO

BACKGROUND: Pig and poultry breeding programs aim at improving crossbred (CB) performance. Selection response may be suboptimal if only purebred (PB) performance is used to compute genomic estimated breeding values (GEBV) because the genetic correlation between PB and CB performance ([Formula: see text]) is often lower than 1. Thus, it may be beneficial to use information on both PB and CB performance. In addition, the accuracy of GEBV of PB animals for CB performance may improve when the breed-of-origin of alleles (BOA) is considered in the genomic relationship matrix (GRM). Thus, our aim was to compare scenarios where GEBV are computed and validated by using (1) either CB offspring averages or individual CB records for validation, (2) either a PB or CB reference population, and (3) a GRM that either accounts for or ignores BOA in the CB individuals. For this purpose, we used data on body weight measured at around 7 (BW7) or 35 (BW35) days in PB and CB broiler chickens and evaluated the accuracy of GEBV based on the correlation GEBV with phenotypes in the validation population (validation correlation). RESULTS: With validation on CB offspring averages, the validation correlation of GEBV of PB animals for CB performance was lower with a CB reference population than with a PB reference population for BW35 ([Formula: see text] = 0.96), and about equal for BW7 ([Formula: see text] = 0.80) when BOA was ignored. However, with validation on individual CB records, the validation correlation was higher with a CB reference population for both traits. The use of a GRM that took BOA into account increased the validation correlation for BW7 but reduced it for BW35. CONCLUSIONS: We argue that the benefit of using a CB reference population for genomic prediction of PB animals for CB performance should be assessed either by validation on CB offspring averages, or by validation on individual CB records while using a GRM that accounts for BOA in the CB individuals. With this recommendation in mind, our results show that the accuracy of GEBV of PB animals for CB performance was equal to or higher with a CB reference population than with a PB reference population for a trait with an [Formula: see text] of 0.8, but lower for a trait with an [Formula: see text] of 0.96. In addition, taking BOA into account was beneficial for a trait with an [Formula: see text] of 0.8 but not for a trait with an [Formula: see text] of 0.96.


Assuntos
Peso Corporal/genética , Cruzamento , Galinhas/genética , Genômica/métodos , Alelos , Animais , Feminino , Genótipo , Masculino , Fenótipo , Valores de Referência
5.
Genet Sel Evol ; 51(1): 6, 2019 Feb 19.
Artigo em Inglês | MEDLINE | ID: mdl-30782121

RESUMO

BACKGROUND: In pig and poultry breeding programs, the breeding goal is to improve crossbred (CB) performance, whereas selection in the purebred (PB) lines is often based on PB performance. Thus, response to selection may be suboptimal, because the genetic correlation between PB and CB performance ([Formula: see text]) is generally lower than 1. Accurate estimates of the [Formula: see text] are needed, so that breeders can decide if they should collect data from CB animals. [Formula: see text] can be estimated either from pedigree or genomic relationships, which may produce different results. With genomic relationships, the [Formula: see text] estimate could be improved when relationships between purebred and crossbred animals are based only on the alleles that originate from the PB line of interest. This work presents the first comparison of estimated [Formula: see text] and variance components of body weight in broilers, using pedigree-based or genotype-based models, where the breed-of-origin of alleles was either ignored or considered. We used genotypes and body weight measurements of PB and CB animals that have a common sire line. RESULTS: Our results showed that the [Formula: see text] estimates depended on the relationship matrix used. Estimates were 5 to 25% larger with genotype-based models than with pedigree-based models. Moreover, [Formula: see text] estimates were similar (max. 7% difference) regardless of whether the model considered breed-of-origin of alleles or not. Standard errors of [Formula: see text] estimates were smaller with genotype-based than with pedigree-based methods, and smaller with models that ignored breed-of-origin than with models that considered breed-of-origin. CONCLUSIONS: We conclude that genotype-based models can be useful for estimating [Formula: see text], even when the PB and CB animals that have phenotypes are closely related. Considering breed-of-origin of alleles did not yield different estimates of [Formula: see text], probably because the parental breeds of the CB animals were distantly related.


Assuntos
Peso Corporal/genética , Cruzamento/métodos , Galinhas/genética , Genótipo , Linhagem , Animais , Galinhas/crescimento & desenvolvimento , Feminino , Masculino , Modelos Genéticos , Fenótipo
6.
Genet Sel Evol ; 49(1): 28, 2017 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-28245804

RESUMO

BACKGROUND: DNA-based predictions for hard-to-measure production traits hold great promise for selective breeding programs. DNA pooling might provide a cheap genomic approach to use phenotype data from commercial flocks which are commonly group-mated with parentage unknown. This study on sheep explores if genomic breeding values for stud sires can be estimated from genomic relationships that were obtained from pooled DNA in combination with phenotypes from commercial progeny. METHODS: Phenotypes used in this study were categorical data. Blood was pooled strategically aiming at even pool sizes and within sex and phenotype category. A hybrid genomic relationship matrix was constructed relating pools to sires. This matrix was used to determine the contribution of sires to each of the pools and therefore phenotype category by using a simple regression approach. Genomic breeding values were also estimated using the hybrid genomic relationship matrix. RESULTS: We demonstrated that, using pooled DNA, the genetic performance of sires can be illustrated as their contribution to phenotype categories and can be expressed as a regression coefficient. Genomic estimated breeding values for sires were equivalent to the regression coefficients and are a commonly used industry tool. CONCLUSIONS: Genotyping of DNA from pooled biological samples offers a cheap method to link phenotypic information from commercial production animals to the breeding population and can be turned into information on the genetic value of stud sires for traits that cannot be measured in the stud environment.


Assuntos
Cruzamento/métodos , Técnicas de Genotipagem/métodos , Linhagem , Ovinos/genética , Animais , Cruzamento/normas , DNA/sangue , DNA/genética , Aptidão Genética , Técnicas de Genotipagem/normas , Masculino , Fenótipo
7.
BMC Bioinformatics ; 16: 214, 2015 Jul 09.
Artigo em Inglês | MEDLINE | ID: mdl-26156142

RESUMO

BACKGROUND: Despite ongoing reduction in genotyping costs, genomic studies involving large numbers of species with low economic value (such as Black Tiger prawns) remain cost prohibitive. In this scenario DNA pooling is an attractive option to reduce genotyping costs. However, genotyping of pooled samples comprising DNA from many individuals is challenging due to the presence of errors that exceed the allele frequency quantisation size and therefore cannot be simply corrected by clustering techniques. The solution to the calibration problem is a correction to the allele frequency to mitigate errors incurred in the measurement process. We highlight the limitations of the existing calibration solutions such as the fact they impose assumptions on the variation between allele frequencies 0, 0.5, and 1.0, and address a limited set of error types. We propose a novel machine learning method to address the limitations identified. RESULTS: The approach is tested on SNPs genotyped with the Sequenom iPLEX platform and compared to existing state of the art calibration methods. The new method is capable of reducing the mean square error in allele frequency to half that achievable with existing approaches. Furthermore for the first time we demonstrate the importance of carefully considering the choice of training data when using calibration approaches built from pooled data. CONCLUSION: This paper demonstrates that improvements in pooled allele frequency estimates result if the genotyping platform is characterised at allele frequencies other than the homozygous and heterozygous cases. Techniques capable of incorporating such information are described along with aspects of implementation.


Assuntos
DNA/análise , DNA/genética , Genômica , Aprendizado de Máquina , Polimorfismo de Nucleotídeo Único/genética , Calibragem , Análise por Conglomerados , Frequência do Gene , Genótipo , Humanos
8.
Genet Sel Evol ; 47: 84, 2015 Nov 02.
Artigo em Inglês | MEDLINE | ID: mdl-26525050

RESUMO

BACKGROUND: The success of genomic selection in animal breeding hinges on the availability of a large reference population on which genomic-based predictions of additive genetic or breeding values are built. Here, we explore the benefit of combining two unrelated populations into a single reference population. METHODS: The datasets consisted of 1829 Brahman and 1973 Tropical Composite cattle with measurements on five phenotypes relevant to tropical adaptation and genotypes for 71,726 genome-wide single nucleotide polymorphisms (SNPs). The underlying genomic correlation for the same phenotype across the two breeds was explored on the basis of consistent linkage disequilibrium (LD) phase and marker effects in both breeds. RESULTS: The proportion of genetic variance explained by the entire set of SNPs ranged from 37.5 to 57.6 %. Estimated genomic correlations were drastically affected by the process used to select SNPs and went from near 0 to more than 0.80 for most traits when using the set of SNPs with significant effects and the same LD phase in the two breeds. We found that, by carefully selecting the subset of SNPs, the missing heritability can be largely recovered and accuracies in genomic predictions can be improved six-fold. However, the increases in accuracy might come at the expense of large biases. CONCLUSIONS: Our results offer hope for the effective implementation of genomic selection schemes in situations where the number of breeds is large, the sample size within any single breed is small and the breeding objective includes many phenotypes.


Assuntos
Cruzamentos Genéticos , Genoma , Genômica/métodos , Modelos Genéticos , Seleção Genética , Algoritmos , Animais , Bovinos , Conjuntos de Dados como Assunto , Evolução Molecular , Genética Populacional , Estudo de Associação Genômica Ampla , Genótipo , Desequilíbrio de Ligação , Linhagem , Fenótipo , Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Reprodutibilidade dos Testes
9.
Genet Sel Evol ; 46: 51, 2014 Sep 02.
Artigo em Inglês | MEDLINE | ID: mdl-25183297

RESUMO

BACKGROUND: While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. RESULTS: For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. CONCLUSIONS: Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms.


Assuntos
Técnicas de Genotipagem/métodos , Penaeidae/genética , Polimorfismo de Nucleotídeo Único , Animais , Cruzamento , DNA/química , Feminino , Frequência do Gene , Masculino , Linhagem , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz
10.
Genet Res (Camb) ; 94(4): 223-34, 2012 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-22950902

RESUMO

Recently, a Haley-Knott-type regression method using combined linkage disequilibrium and linkage analyses (LDLA) was proposed to map quantitative trait loci (QTLs). Chromosome of 5 and 25 cM with 0·25 and 0·05 cM, respectively, between markers were simulated. The differences between the LDLA approaches with regard to QTL position accuracy were very limited, with a significantly better mean square error (MSE) with the LDLA regression (LDLA_reg) in sparse map cases; the contrary was observed, but not significantly, in dense map situations. The computing time required for the LDLA variance components (LDLA_vc) model was much higher than the LDLA_reg model. The precision of QTL position estimation was compared for four numbers of half-sib families, four different family sizes and two experimental designs (half-sibs, and full- and half-sibs). Regarding the number of families, MSE values were lowest for 15 or 50 half-sib families, differences not being significant. We observed that the greater the number of progenies per sire, the more accurate the QTL position. However, for a fixed population size, reducing the number of families (e.g. using a small number of large full-sib families) could lead to less accuracy of estimated QTL position.


Assuntos
Mapeamento Cromossômico/métodos , Ligação Genética , Desequilíbrio de Ligação , Locos de Características Quantitativas/genética , Simulação por Computador , Família , Humanos , Modelos Genéticos , Densidade Demográfica
11.
Genet Sel Evol ; 44: 12, 2012 Apr 17.
Artigo em Inglês | MEDLINE | ID: mdl-22507187

RESUMO

BACKGROUND: Studies to detect associations between DNA markers and traits of interest in humans and livestock benefit from increasing the number of individuals genotyped. Performing association studies on pooled DNA samples can provide greater power for a given cost. For quantitative traits, the effect of an SNP is measured in the units of the trait and here we propose and demonstrate a method to estimate SNP effects on quantitative traits from pooled DNA data. METHODS: To obtain estimates of SNP effects from pooled DNA samples, we used logistic regression of estimated allele frequencies in pools on phenotype. The method was tested on a simulated dataset, and a beef cattle dataset using a model that included principal components from a genomic correlation matrix derived from the allele frequencies estimated from the pooled samples. The performance of the obtained estimates was evaluated by comparison with estimates obtained using regression of phenotype on genotype from individual samples of DNA. RESULTS: For the simulated data, the estimates of SNP effects from pooled DNA are similar but asymptotically different to those from individual DNA data. Error in estimating allele frequencies had a large effect on the accuracy of estimated SNP effects. For the beef cattle dataset, the principal components of the genomic correlation matrix from pooled DNA were consistent with known breed groups, and could be used to account for population stratification. Correctly modeling the contemporary group structure was essential to achieve estimates similar to those from individual DNA data, and pooling DNA from individuals within groups was superior to pooling DNA across groups. For a fixed number of assays, pooled DNA samples produced results that were more correlated with results from individual genotyping data than were results from one random individual assayed from each pool. CONCLUSIONS: Use of logistic regression of allele frequency on phenotype makes it possible to estimate SNP effects on quantitative traits from pooled DNA samples. With pooled DNA samples, genotyping costs are reduced, and in cases where trait records are abundant this approach is promising to obtain SNP associations for marker-assisted selection.


Assuntos
DNA/genética , Genótipo , Polimorfismo de Nucleotídeo Único , Algoritmos , Animais , Biometria , Bovinos/anatomia & histologia , Bovinos/genética , Simulação por Computador , Feminino , Frequência do Gene , Humanos , Modelos Logísticos , Modelos Genéticos , Análise de Componente Principal , Locos de Características Quantitativas
12.
Anim Genet ; 43(6): 683-8, 2012 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-22497221

RESUMO

The POLL locus has been mapped to the centromeric region of bovine chromosome 1 (BTA1) in both taurine breeds and taurine-indicine crosses in an interval of approximately 1 Mb. It has not yet been mapped in pure-bred zebu cattle. Despite several efforts, neither causative mutations in candidate genes nor a singular diagnostic DNA marker has been identified. In this study, we genotyped a total of 68 Brahman cattle and 20 Hereford cattle informative for the POLL locus for 33 DNA microsatellites, 16 of which we identified de novo from the bovine genome sequence, mapping the POLL locus to the region of the genes IFNAR2 and SYNJ1. The 303-bp allele of the new microsatellite, CSAFG29, showed strong association with the POLL allele. We then genotyped 855 Brahman cattle for CSAFG29 and confirmed the association between the 303-bp allele and POLL. To determine whether the same association was found in taurine breeds, we genotyped 334 animals of the Angus, Hereford and Limousin breeds and 376 animals of the Brangus, Droughtmaster and Santa Gertrudis composite taurine-zebu breeds. The association between the 303-bp allele and POLL was confirmed in these breeds; however, an additional allele (305 bp) was also associated but not fully predictive of POLL. Across the data, CSAFG29 was in sufficient linkage disequilibrium to the POLL allele in Australian Brahman cattle that it could potentially be used as a diagnostic marker in that breed, but this may not be the case in other breeds. Further, we provide confirmatory evidence that the scur phenotype generally occurs in animals that are heterozygous for the POLL allele.


Assuntos
Bovinos/genética , Mapeamento Cromossômico/veterinária , Cromossomos de Mamíferos/genética , Loci Gênicos , Repetições de Microssatélites/genética , Animais , Bovinos/anatomia & histologia , Marcadores Genéticos , Genótipo , Desequilíbrio de Ligação
13.
Genet Res (Camb) ; 93(3): 203-19, 2011 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-24725775

RESUMO

SummaryGenetic resistance to gastrointestinal worms is a complex trait of great importance in both livestock and humans. In order to gain insights into the genetic architecture of this trait, a mixed breed population of sheep was artificially infected with Trichostrongylus colubriformis (n=3326) and then Haemonchus contortus (n=2669) to measure faecal worm egg count (WEC). The population was genotyped with the Illumina OvineSNP50 BeadChip and 48 640 single nucleotide polymorphism (SNP) markers passed the quality controls. An independent population of 316 sires of mixed breeds with accurate estimated breeding values for WEC were genotyped for the same SNP to assess the results obtained from the first population. We used principal components from the genomic relationship matrix among genotyped individuals to account for population stratification, and a novel approach to directly account for the sampling error associated with each SNP marker regression. The largest marker effects were estimated to explain an average of 0·48% (T. colubriformis) or 0·08% (H. contortus) of the phenotypic variance in WEC. These effects are small but consistent with results from other complex traits. We also demonstrated that methods which use all markers simultaneously can successfully predict genetic merit for resistance to worms, despite the small effects of individual markers. Correlations of genomic predictions with breeding values of the industry sires reached a maximum of 0·32. We estimate that effective across-breed predictions of genetic merit with multi-breed populations will require an average marker spacing of approximately 10 kbp.


Assuntos
Resistência à Doença/genética , Fezes/parasitologia , Marcadores Genéticos , Infecções por Nematoides/veterinária , Contagem de Ovos de Parasitas/veterinária , Polimorfismo de Nucleotídeo Único/genética , Doenças dos Ovinos/parasitologia , Ovinos/genética , Animais , Imunidade Inata/genética , Infecções por Nematoides/genética , Infecções por Nematoides/parasitologia , Ovinos/parasitologia
14.
Front Cell Dev Biol ; 9: 726827, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34660583

RESUMO

In birds, males are the homogametic sex (ZZ) and females are the heterogametic sex (ZW). Here, we investigate the role of chromosomal sex and germ cell competition on avian germ cell differentiation. We recently developed genetically sterile layer cockerels and hens for use as surrogate hosts for primordial germ cell (PGC) transplantation. Using in vitro propagated and cryopreserved PGCs from a pedigree Silkie broiler breed, we now demonstrate that sterile surrogate layer hosts injected with same sex PGCs have normal fertility and produced pure breed Silkie broiler offspring when directly mated to each other in Sire Dam Surrogate mating. We found that female sterile hosts carrying chromosomally male (ZZ) PGCs formed functional oocytes and eggs, which gave rise to 100% male offspring after fertilization. Unexpectedly, we also observed that chromosomally female (ZW) PGCs carried by male sterile hosts formed functional spermatozoa and produced viable offspring. These findings demonstrate that avian PGCs are not sexually restricted for functional gamete formation and provide new insights for the cryopreservation of poultry and other bird species using diploid stage germ cells.

15.
Genet Sel Evol ; 42: 34, 2010 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-20701809

RESUMO

BACKGROUND: Repeated blocks of genome sequence have been shown to be associated with genetic diversity and disease risk in humans, and with phenotypic diversity in model organisms and domestic animals. Reliable tests are desirable to determine whether individuals are carriers of copy number variants associated with disease risk in humans and livestock, or associated with economically important traits in livestock. In some cases, copy number variants affect the phenotype through a dosage effect but in other cases, allele combinations have non-additive effects. In the latter cases, it has been difficult to develop tests because assays typically return an estimate of the sum of the copy number counts on the maternally and paternally inherited chromosome segments, and this sum does not uniquely determine the allele configuration. In this study, we show that there is an old solution to this new problem: segregation analysis, which has been used for many years to infer alleles in pedigreed populations. METHODS: Segregation analysis was used to estimate copy number alleles from assay data on simulated half-sib sheep populations. Copy number variation at the Agouti locus, known to be responsible for the recessive self-colour black phenotype, was used as a model for the simulation and an appropriate penetrance function was derived. The precision with which carriers and non-carriers of the undesirable single copy allele could be identified, was used to evaluate the method for various family sizes, assay strategies and assay accuracies. RESULTS: Using relationship data and segregation analysis, the probabilities of carrying the copy number alleles responsible for black or white fleece were estimated with much greater precision than by analyzing assay results for animals individually. The proportion of lambs correctly identified as non-carriers of the undesirable allele increased from 7% when the lambs were analysed alone to 80% when the lambs were analysed in half-sib families. CONCLUSIONS: When a quantitative assay is used to estimate copy number alleles, segregation analysis of related individuals can greatly improve the precision of the estimates. Existing software for segregation analysis would require little if any change to accommodate the penetrance function for copy number assay data.


Assuntos
Segregação de Cromossomos/genética , Variações do Número de Cópias de DNA/genética , Técnicas Genéticas , Linhagem , Ovinos/genética , Alelos , Animais , Simulação por Computador , Feminino , Dosagem de Genes/genética , Frequência do Gene/genética , Genótipo , Heterozigoto , Masculino , Penetrância
16.
Poult Sci ; 98(12): 6270-6280, 2019 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-31393589

RESUMO

Broiler breeding programs rely on crossbreeding. With genomic selection, widespread use of crossbred performance in breeding programs comes within reach. Commercial crossbreds, however, may have unknown pedigrees and their genomes may include DNA from 2 to 4 different breeds. Our aim was, for a broiler dataset with a limited number of sires having both purebred and crossbred offspring generated using natural mating, to rapidly derive parentage, assess the distribution of the sire contribution to the offspring generation, and to assess breed-of-origin of alleles in crossbreds. The dataset contained genotypes for 56,075 SNPs for 5,882 purebred and 10,943 3-way crossbred offspring generated by natural mating of 164 purebred sires to 1,016 purebred and 1,386 F1 crossbred hens. Using our algorithm FindParents, joint parentage derivation for the offspring and parent generations required only 1 m 29 s to retrieve parentage for 20,253 animals considering 4,504 possible parents. FindParents was similarly accurate as a maximum likelihood based method, apart from situations where settings of FindParents did not match the genotyping error rate in the data. Numbers of offspring per sire had a very skewed distribution, ranging from 1 to 270 crossbreds and 1 to 154 purebreds. Derivation of breed-of-origin of alleles relied on phasing all genotypes, including 8,205, 372, and 720 animals from the 3 pure lines involved, and allocating haplotypes in the crossbreds to purebred lines based on observed frequencies in the purebred lines. Breed-of-origin could be derived for 96.94% of the alleles of the 1,386 F1 crossbred hens and for 91.88% of the alleles of the 10,943 3-way crossbred offspring, of which 49.49% to the sire line. The achieved percentage of assignment to the sire line was sufficient to proceed with subsequent analyses requiring only the breed-of-origin of the paternal alleles to be known. Although required number of animals may be population dependent, to increase the total percentage of assigned alleles, it seems advisable to use at least approx. 1,000 genotyped purebred animals for each of the lines involved.


Assuntos
Alelos , Cruzamento/métodos , Galinhas/genética , Linhagem , Criação de Animais Domésticos , Animais , Feminino , Genótipo , Masculino , Polimorfismo de Nucleotídeo Único
17.
Comput Biol Med ; 61: 48-55, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-25863000

RESUMO

BACKGROUND: The costs associated with developing high density microarray technologies are prohibitive for genotyping animals when there is low economic value associated with a single animal (e.g. prawns). DNA pooling is an attempt to address this issue by combining multiple DNA samples prior to genotyping. Instead of genotyping the DNA samples of the individuals, a mixture of DNA samples (i.e. the pool) from the individuals is genotyped only once. This greatly reduces the cost of genotyping. Pooled samples are subject to greater genotyping inaccuracies than individual samples. Wrong genotyping will lead to wrong biological conclusions. It is thus required to calibrate the resulting genotypes (allele frequencies). METHODS: We present a regression based approach to translate raw array output to allele frequency. During training, few pools and the individuals that constitute the pools are genotyped. Given the genotypes of individuals that constitute the pool, we compute the true allele frequency. We then train a regression algorithm to produce a mapping between the raw array outputs to the true allele frequency. We test the algorithm using pool samples withheld from the training set. During prediction, we use this map to genotype pools with no prior knowledge of the individuals constituting the pools. RESULTS AND DISCUSSION: After data quality control we have available a dataset comprised of 912 pools. We estimate allele frequency using three approaches: the raw data, a commonly used piecewise linear transformation, and the proposed local-global learner fusion method. The resulting RMS errors for the three approaches are 0.135, 0.120, and 0.080 respectively.


Assuntos
Alelos , Frequência do Gene , Técnicas de Genotipagem/normas , Análise de Sequência com Séries de Oligonucleotídeos/normas , Polimorfismo de Nucleotídeo Único , Animais , Calibragem , DNA , Bases de Dados Genéticas , Humanos
18.
Vet Parasitol ; 205(3-4): 595-605, 2014 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-25200384

RESUMO

Gastrointestinal nematodes remain a major limitation to the productivity of livestock systems. Selective breeding to produce populations that have an enhanced ability to resist infection is a viable and ongoing option to reduce this impact. The development of new phenotypes that facilitate this process is therefore of great interest. For this reason we explored relationships between haematological parameters and the ability of sheep to resist nematode infection. A multivariate analytical approach was used to define algorithms based on the blood parameters that can be used to rank the ability of sheep to resist nematode infection in a single blood sample and can be applied independent of infection status. The algorithms were shown to classify susceptible sheep with a 100% accuracy and resistant sheep with 80% accuracy. Further development of this platform approach may be an important advance for small ruminant production systems worldwide and might also be applied to other diseases of livestock or even environmental stressors such as heat.


Assuntos
Hemoncose/veterinária , Haemonchus/fisiologia , Doenças dos Ovinos/imunologia , Ovinos/imunologia , Algoritmos , Animais , Resistência à Doença , Suscetibilidade a Doenças , Fezes/parasitologia , Hemoncose/imunologia , Hemoncose/parasitologia , Masculino , Modelos Teóricos , Análise Multivariada , Contagem de Ovos de Parasitas/veterinária , Fenótipo , Distribuição Aleatória , Reprodutibilidade dos Testes , Ovinos/sangue , Ovinos/classificação , Doenças dos Ovinos/parasitologia
19.
PLoS One ; 9(11): e113284, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25419663

RESUMO

Adaptation of global food systems to climate change is essential to feed the world. Tropical cattle production, a mainstay of profitability for farmers in the developing world, is dominated by heat, lack of water, poor quality feedstuffs, parasites, and tropical diseases. In these systems European cattle suffer significant stock loss, and the cross breeding of taurine x indicine cattle is unpredictable due to the dilution of adaptation to heat and tropical diseases. We explored the genetic architecture of ten traits of tropical cattle production using genome wide association studies of 4,662 animals varying from 0% to 100% indicine. We show that nine of the ten have genetic architectures that include genes of major effect, and in one case, a single location that accounted for more than 71% of the genetic variation. One genetic region in particular had effects on parasite resistance, yearling weight, body condition score, coat colour and penile sheath score. This region, extending 20 Mb on BTA5, appeared to be under genetic selection possibly through maintenance of haplotypes by breeders. We found that the amount of genetic variation and the genetic correlations between traits did not depend upon the degree of indicine content in the animals. Climate change is expected to expand some conditions of the tropics to more temperate environments, which may impact negatively on global livestock health and production. Our results point to several important genes that have large effects on adaptation that could be introduced into more temperate cattle without detrimental effects on productivity.


Assuntos
Adaptação Fisiológica/genética , Bovinos/genética , Mudança Climática , Clima Tropical , Algoritmos , Animais , Cruzamento/métodos , Meio Ambiente , Feminino , Expressão Gênica , Frequência do Gene , Variação Genética , Genoma/genética , Genótipo , Haplótipos , Desequilíbrio de Ligação , Masculino , Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Seleção Genética
20.
Methods Mol Biol ; 1019: 411-21, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23756902

RESUMO

Validation of the results of genome-wide association studies or genomic selection studies is an essential component of the experimental program. Validation allows users to quantify the benefit of applying gene tests or genomic prediction, relative to the costs of implementing the program. Further, if implemented, an appropriate weight in a selection index can only be derived if estimates of the accuracy of genomic predictions are available. In this chapter the reasons for validation are explored, and a range of commonly encountered scenarios described. General principles are stated, and options for performing validation discussed. Designs for validation are heavily dependent on the availability of phenotyped animals, and also on the pedigree structures that characterize the breeding program. Consequently, there is no single plan that is always applicable, and a custom plan often must be developed.


Assuntos
Cruzamento , Estudo de Associação Genômica Ampla , Genoma , Linhagem , Estudos de Validação como Assunto , Animais , Genótipo , Modelos Genéticos , Fenótipo , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Seleção Genética
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa