RESUMEN
The objective of this study was to analyze the relevance of relationship information on the identification of low heritability quantitative trait loci (QTLs) from a genome-wide association study (GWAS) and on the genomic prediction of complex traits in human, animal and cross-pollinating populations. The simulation-based data sets included 50 samples of 1000 individuals of seven populations derived from a common population with linkage disequilibrium. The populations had non-inbred and inbred progeny structure (50 to 200) with varying number of members (5 to 20). The individuals were genotyped for 10,000 single nucleotide polymorphisms (SNPs) and phenotyped for a quantitative trait controlled by 10 QTLs and 90 minor genes showing dominance. The SNP density was 0.1 cM and the narrow sense heritability was 25%. The QTL heritabilities ranged from 1.1 to 2.9%. We applied mixed model approaches for both GWAS and genomic prediction using pedigree-based and genomic relationship matrices. For GWAS, the observed false discovery rate was kept below the significance level of 5%, the power of detection for the low heritability QTLs ranged from 14 to 50%, and the average bias between significant SNPs and a QTL ranged from less than 0.01 to 0.23 cM. The QTL detection power was consistently higher using genomic relationship matrix. Regardless of population and training set size, genomic prediction provided higher prediction accuracy of complex trait when compared to pedigree-based prediction. The accuracy of genomic prediction when there is relatedness between individuals in the training set and the reference population is much higher than the value for unrelated individuals.
Asunto(s)
Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Carácter Cuantitativo Heredable , Animales , Simulación por Computador , Genética de Población , Genotipo , Humanos , Desequilibrio de Ligamiento , Modelos Genéticos , Linaje , Plantas , Polimorfismo de Nucleótido SimpleRESUMEN
The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis.
RESUMEN
The objective was to assess by simulation the efficacy of population structure analysis in plant breeding. Twelve populations and 300 inbred lines were simulated and genotyped using 100 microsatellite loci. The experimental material included populations with and without admixture, ancestry relationship and linkage disequilibrium, and with distinct levels of genetic differentiation and effective sizes. The analyses were performed using Structure software and employed all available models. For all the group number (K) tested, for both populations and inbred lines, the admixture model with correlated allelic frequencies provided the highest value for the logarithm of the marginal likelihood. Fitting appropriate model and using adequate sample size for individuals and markers, Structure was effective in identifying the correct population structure, migrants and individuals with genome from distinct populations. The linkage model did not result in an improvement in clustering relative to the admixture model with correlated allelic frequencies. The inclusion of prior information did not change the results; for some K values the analyses showed slight higher values of the marginal likelihood. The reduction in the number of individuals and markers negatively affected the results. There was a high variation in the most probable K value between the evaluated methods.