RESUMEN
Genome-wide association studies (GWAS) in plants typically suffer from limited statistical power. An alternative to the logistical and cost challenge of increasing sample sizes is to gain power by meta-analysis using information from independent studies. We carried out GWAS for growth traits with six single-marker models and regional heritability mapping (RHM) in four Eucalyptus breeding populations independently and by Joint-GWAS, using gene and segment-based models, with data for 3373 individuals genotyped with a communal EUChip60KSNP platform. While single-single nucleotide polymorphism (SNP) GWAS hardly detected significant associations at high-stringency in each population, gene-based Joint-GWAS revealed nine genes significantly associated with tree height. Associations detected using single-SNP GWAS, RHM and Joint-GWAS set-based models explained on average 3-20% of the phenotypic variance. Whole-genome regression, conversely, captured 64-89% of the pedigree-based heritability in all populations. Several associations independently detected for the same SNPs in different populations provided unprecedented GWAS validation results in forest trees. Rare and common associations were discovered in eight genes involved in cell wall biosynthesis and lignification. With the increasing adoption of genomic prediction of complex phenotypes using shared SNPs and much larger tree breeding populations, Joint-GWAS approaches should provide increasing power to pinpoint discrete associations potentially useful toward tree breeding and molecular applications.
Asunto(s)
Eucalyptus/genética , Genoma de Planta , Estudio de Asociación del Genoma Completo , Fitomejoramiento , Carácter Cuantitativo Heredable , Patrón de Herencia/genética , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Análisis de Componente PrincipalRESUMEN
Although genome-wide association studies (GWAS) have provided valuable insights into the decoding of the relationships between sequence variation and complex phenotypes, they have explained little heritability. Regional heritability mapping (RHM) provides heritability estimates for genomic segments containing both common and rare allelic effects that individually contribute too little variance to be detected by GWAS. We carried out GWAS and RHM for seven growth, wood and disease resistance traits in a breeding population of 768 Eucalyptus hybrid trees using EuCHIP60K. Total genomic heritabilities accounted for large proportions (64-89%) of pedigree-based trait heritabilities, providing additional evidence that complex traits in eucalypts are controlled by many sequence variants across the frequency spectrum, each with small contributions to the phenotypic variance. RHM detected 26 quantitative trait loci (QTLs) encompassing 2191 single nucleotide polymorphisms (SNPs), whereas GWAS detected 13 single SNP-trait associations. RHM and GWAS QTLs individually explained 5-15% and 4-6% of the genomic heritability, respectively. RHM was superior to GWAS in capturing larger proportions of genomic heritability. Equated to previously mapped QTLs, our results highlighted genomic regions for further examination towards gene discovery. RHM-QTLs bearing a combination of common and rare variants could be useful enhancements to incorporate prior knowledge of the underlying genetic architecture in genomic prediction models.
Asunto(s)
Resistencia a la Enfermedad/genética , Eucalyptus/genética , Estudio de Asociación del Genoma Completo , Patrón de Herencia/genética , Sitios de Carácter Cuantitativo/genética , Carácter Cuantitativo Heredable , Madera/genética , Cruzamientos Genéticos , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
Introduction: Genomic selection (GS) experiments in forest trees have largely reported estimates of predictive abilities from cross-validation among individuals in the same breeding generation. In such conditions, no effects of recombination, selection, drift, and environmental changes are accounted for. Here, we assessed the effectively realized predictive ability (RPA) for volume growth at harvest age by GS across generations in an operational reciprocal recurrent selection (RRS) program of hybrid Eucalyptus. Methods: Genomic best linear unbiased prediction with additive (GBLUP_G), additive plus dominance (GBLUP_G+D), and additive single-step (HBLUP) models were trained with different combinations of growth data of hybrids and pure species individuals (N = 17,462) of the G1 generation, 1,944 of which were genotyped with ~16,000 SNPs from SNP arrays. The hybrid G2 progeny trial (HPT267) was the GS target, with 1,400 selection candidates, 197 of which were genotyped still at the seedling stage, and genomically predicted for their breeding and genotypic values at the operational harvest age (6 years). Seedlings were then grown to harvest and measured, and their pedigree-based breeding and genotypic values were compared to their originally predicted genomic counterparts. Results: Genomic RPAs ≥0.80 were obtained as the genetic relatedness between G1 and G2 increased, especially when the direct parents of selection candidates were used in training. GBLUP_G+D reached RPAs ≥0.70 only when hybrid or pure species data of G1 were included in training. HBLUP was only marginally better than GBLUP. Correlations ≥0.80 were obtained between pedigree and genomic individual ranks. Rank coincidence of the top 2.5% selections was the highest for GBLUP_G (45% to 60%) compared to GBLUP_G+D. To advance the pure species RRS populations, GS models were best when trained on pure species than hybrid data, and HBLUP yielded ~20% higher predictive abilities than GBLUP, but was not better than ABLUP for ungenotyped trees. Discussion: We demonstrate that genomic data effectively enable accurate ranking of eucalypt hybrid seedlings for their yet-to-be observed volume growth at harvest age. Our results support a two-stage GS approach involving family selection by average genomic breeding value, followed by within-top-families individual GS, significantly increasing selection intensity, optimizing genotyping costs, and accelerating RRS breeding.
RESUMEN
⢠Genomic selection (GS) is expected to cause a paradigm shift in tree breeding by improving its speed and efficiency. By fitting all the genome-wide markers concurrently, GS can capture most of the 'missing heritability' of complex traits that quantitative trait locus (QTL) and association mapping classically fail to explain. Experimental support of GS is now required. ⢠The effectiveness of GS was assessed in two unrelated Eucalyptus breeding populations with contrasting effective population sizes (N(e) = 11 and 51) genotyped with > 3000 DArT markers. Prediction models were developed for tree circumference and height growth, wood specific gravity and pulp yield using random regression best linear unbiased predictor (BLUP). ⢠Accuracies of GS varied between 0.55 and 0.88, matching the accuracies achieved by conventional phenotypic selection. Substantial proportions (74-97%) of trait heritability were captured by fitting all genome-wide markers simultaneously. Genomic regions explaining trait variation largely coincided between populations, although GS models predicted poorly across populations, likely as a result of variable patterns of linkage disequilibrium, inconsistent allelic effects and genotype × environment interaction. ⢠GS brings a new perspective to the understanding of quantitative trait variation in forest trees and provides a revolutionary tool for applied tree improvement. Nevertheless population-specific predictive models will likely drive the initial applications of GS in forest tree breeding.