RESUMEN
Genetic progress of crop plants is required to face human population growth and guarantee production stability in increasingly unstable environmental conditions. Breeding is accompanied by a loss in genetic diversity, which hinders sustainable genetic gain. Methodologies based on molecular marker information have been developed to manage diversity and proved effective in increasing long-term genetic gain. However, with realistic plant breeding population sizes, diversity depletion in closed programs appears ineluctable, calling for the introduction of relevant diversity donors. Although maintained with significant efforts, genetic resource collections remain underutilized, due to a large performance gap with elite germplasm. Bridging populations created by crossing genetic resources to elite lines prior to introduction into elite programs can manage this gap efficiently. To improve this strategy, we explored with simulations different genomic prediction and genetic diversity management options for a global program involving a bridging and an elite component. We analyzed the dynamics of quantitative trait loci fixation and followed the fate of allele donors after their introduction into the breeding program. Allocating 25% of total experimental resources to create a bridging component appears highly beneficial. We showed that potential diversity donors should be selected based on their phenotype rather than genomic predictions calibrated with the ongoing breeding program. We recommend incorporating improved donors into the elite program using a global calibration of the genomic prediction model and optimal cross selection maintaining a constant diversity. These approaches use efficiently genetic resources to sustain genetic gain and maintain neutral diversity, improving the flexibility to address future breeding objectives.
Asunto(s)
Sitios de Carácter Cuantitativo , Selección Genética , Humanos , Fenotipo , Sitios de Carácter Cuantitativo/genética , Genómica , Alelos , Fitomejoramiento , Variación Genética , Modelos GenéticosRESUMEN
Plant aquaporins are involved in numerous physiological processes, such as cellular homeostasis, tissue hydraulics, transpiration, and nutrient supply, and are key players of the response to environmental cues. While varying expression patterns of aquaporin genes have been described across organs, developmental stages, and stress conditions, the underlying regulation mechanisms remain elusive. Hence, this work aimed to shed light on the expression variability of 4 plasma membrane intrinsic protein (PIP) genes in maize (Zea mays) leaves, and its genetic causes, through expression quantitative trait locus (eQTL) mapping across a 252-hybrid diversity panel. Significant genetic variability in PIP transcript abundance was observed to different extents depending on the isoforms. The genome-wide association study mapped numerous eQTLs, both local and distant, thus emphasizing the existing natural diversity of PIP gene expression across the studied panel and the potential to reveal regulatory actors and mechanisms. One eQTL associated with PIP2;5 expression variation was characterized. Genomic sequence comparison and in vivo reporter assay attributed, at least partly, the local eQTL to a transposon-containing polymorphism in the PIP2;5 promoter. This work paves the way to the molecular understanding of PIP gene regulation and its possible integration into larger networks regulating physiological and stress adaptation processes.
Asunto(s)
Acuaporinas , Regulación de la Expresión Génica de las Plantas , Variación Genética , Sitios de Carácter Cuantitativo , Zea mays , Zea mays/genética , Zea mays/metabolismo , Sitios de Carácter Cuantitativo/genética , Acuaporinas/genética , Acuaporinas/metabolismo , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Estudio de Asociación del Genoma Completo , AnimalesRESUMEN
KEY MESSAGE: We validated the efficiency of genomic predictions calibrated on sparse factorial training sets to predict the next generation of hybrids and tested different strategies for updating predictions along generations. Genomic selection offers new prospects for revisiting hybrid breeding schemes by replacing extensive phenotyping of individuals with genomic predictions. Finding the ideal design for training genomic prediction models is still an open question. Previous studies have shown promising predictive abilities using sparse factorial instead of tester-based training sets to predict single-cross hybrids from the same generation. This study aims to further investigate the use of factorials and their optimization to predict line general combining abilities (GCAs) and hybrid values across breeding cycles. It relies on two breeding cycles of a maize reciprocal genomic selection scheme involving multiparental connected reciprocal populations from flint and dent complementary heterotic groups selected for silage performances. Selection based on genomic predictions trained on a factorial design resulted in a significant genetic gain for dry matter yield in the new generation. Results confirmed the efficiency of sparse factorial training sets to predict candidate line GCAs and hybrid values across breeding cycles. Compared to a previous study based on the first generation, the advantage of factorial over tester training sets appeared lower across generations. Updating factorial training sets by adding single-cross hybrids between selected lines from the previous generation or a random subset of hybrids from the new generation both improved predictive abilities. The CDmean criterion helped determine the set of single-crosses to phenotype to update the training set efficiently. Our results validated the efficiency of sparse factorial designs for calibrating hybrid genomic prediction experimentally and showed the benefit of updating it along generations.
Asunto(s)
Hibridación Genética , Zea mays , Genómica/métodos , Fitomejoramiento , Ensilaje , Zea mays/genéticaRESUMEN
KEY MESSAGE: Implementing a collaborative pre-breeding multi-parental population efficiently identifies promising donor x elite pairs to enrich the flint maize elite germplasm. Genetic diversity is crucial for maintaining genetic gains and ensuring breeding programs' long-term success. In a closed breeding program, selection inevitably leads to a loss of genetic diversity. While managing diversity can delay this loss, introducing external sources of diversity is necessary to bring back favorable genetic variation. Genetic resources exhibit greater diversity than elite materials, but their lower performance levels hinder their use. This is the case for European flint maize, for which elite germplasm has incorporated only a limited portion of the diversity available in landraces. To enrich the diversity of this elite genetic pool, we established an original cooperative maize bridging population that involves crosses between private elite materials and diversity donors to create improved genotypes that will facilitate the incorporation of original favorable variations. Twenty donor × elite BC1S2 families were created and phenotyped for hybrid value for yield related traits. Crosses showed contrasted means and variances and therefore contrasted potential in terms of selection as measured by their usefulness criterion (UC). Average expected mean performance gain over the initial elite material was 5%. The most promising donor for each elite line was identified. Results also suggest that one more generation, i.e., 3 in total, of crossing to the elite is required to fully exploit the potential of a donor. Altogether, our results support the usefulness of incorporating genetic resources into elite flint maize. They call for further effort to create fixed diversity donors and identify those most suitable for each elite program.
Asunto(s)
Fitomejoramiento , Zea mays , Humanos , Zea mays/genética , Fenotipo , Genotipo , Variación GenéticaRESUMEN
KEY MESSAGE: Transcriptomics and proteomics information collected on a platform can predict additive and non-additive effects for platform traits and additive effects for field traits. The effects of climate change in the form of drought, heat stress, and irregular seasonal changes threaten global crop production. The ability of multi-omics data, such as transcripts and proteins, to reflect a plant's response to such climatic factors can be capitalized in prediction models to maximize crop improvement. Implementing multi-omics characterization in field evaluations is challenging due to high costs. It is, however, possible to do it on reference genotypes in controlled conditions. Using omics measured on a platform, we tested different multi-omics-based prediction approaches, using a high dimensional linear mixed model (MegaLMM) to predict genotypes for platform traits and agronomic field traits in a panel of 244 maize hybrids. We considered two prediction scenarios: in the first one, new hybrids are predicted (CV-NH), and in the second one, partially observed hybrids are predicted (CV-POH). For both scenarios, all hybrids were characterized for omics on the platform. We observed that omics can predict both additive and non-additive genetic effects for the platform traits, resulting in much higher predictive abilities than GBLUP. It highlights their efficiency in capturing regulatory processes in relation to growth conditions. For the field traits, we observed that the additive components of omics only slightly improved predictive abilities for predicting new hybrids (CV-NH, model MegaGAO) and for predicting partially observed hybrids (CV-POH, model GAOxW-BLUP) in comparison to GBLUP. We conclude that measuring the omics in the fields would be of considerable interest in predicting productivity if the costs of omics drop significantly.
Asunto(s)
Genotipo , Fenotipo , Proteómica , Zea mays , Zea mays/genética , Zea mays/crecimiento & desarrollo , Proteómica/métodos , Fitomejoramiento/métodos , Modelos Genéticos , Genómica/métodos , Transcriptoma , Modelos Lineales , MultiómicaRESUMEN
The effect of drought on maize yield is of particular concern in the context of climate change and human population growth. However, the complexity of drought-response mechanisms makes the design of new drought-tolerant varieties a difficult task that would greatly benefit from a better understanding of the genotype-phenotype relationship. To provide novel insight into this relationship, we applied a systems genetics approach integrating high-throughput phenotypic, proteomic, and genomic data acquired from 254 maize hybrids grown under two watering conditions. Using association genetics and protein coexpression analysis, we detected more than 22,000 pQTLs across the two conditions and confidently identified 15 loci with potential pleiotropic effects on the proteome. We showed that even mild water deficit induced a profound remodeling of the proteome, which affected the structure of the protein coexpression network, and a reprogramming of the genetic control of the abundance of many proteins, including those involved in stress response. Colocalizations between pQTLs and QTLs for ecophysiological traits, found mostly in the water deficit condition, indicated that this reprogramming may also affect the phenotypic level. Finally, we identified several candidate genes that are potentially responsible for both the coexpression of stress response proteins and the variations of ecophysiological traits under water deficit. Taken together, our findings provide novel insights into the molecular mechanisms of drought tolerance and suggest some pathways for further research and breeding.
Asunto(s)
Proteínas de Plantas/genética , Polimorfismo de Nucleótido Simple , Proteoma/genética , Zea mays/genética , Sequías , Ambiente , Genes de Plantas , Genoma de Planta , Estudio de Asociación del Genoma Completo , Proteínas de Plantas/metabolismo , Proteoma/metabolismo , Sitios de Carácter Cuantitativo , Zea mays/metabolismoRESUMEN
Southeast Europe (SEE) is a very important maize-growing region, comparable to the Corn belt region of the United States, with similar dent germplasm (dent by dent hybrids). Historically, this region has undergone several genetic material swaps, following the trends in the US, with one of the most significant swaps related to US aid programs after WWII. The imported accessions used to make double-cross hybrids were also mixed with previously adapted germplasm originating from several more distant OPVs, supporting the transition to single cross-breeding. Many of these materials were deposited at the Maize Gene Bank of the Maize Research Institute Zemun Polje (MRIZP) between the 1960s and 1980s. A part of this Gene Bank (572 inbreds) was genotyped with Affymetrix Axiom Maize Genotyping Array with 616,201 polymorphic variants. Data were merged with two other genotyping datasets with mostly European flint (TUM dataset) and dent (DROPS dataset) germplasm. The final pan-European dataset consisted of 974 inbreds and 460,243 markers. Admixture analysis showed seven ancestral populations representing European flint, B73/B14, Lancaster, B37, Wf9/Oh07, A374, and Iodent pools. Subpanel of inbreds with SEE origin showed a lack of Iodent germplasm, marking its historical context. Several signatures of selection were identified at chromosomes 1, 3, 6, 7, 8, 9, and 10. The regions under selection were mined for protein-coding genes and were used for gene ontology (GO) analysis, showing a highly significant overrepresentation of genes involved in response to stress. Our results suggest the accumulation of favorable allelic diversity, especially in the context of changing climate in the genetic resources of SEE.
Asunto(s)
Variación Genética , Fitomejoramiento , Zea mays , Alelos , Europa (Continente) , Zea mays/genéticaRESUMEN
Landraces, that is, traditional varieties, have a large diversity that is underexploited in modern breeding. A novel DNA pooling strategy was implemented to identify promising landraces and genomic regions to enlarge the genetic diversity of modern varieties. As proof of concept, DNA pools from 156 American and European maize landraces representing 2340 individuals were genotyped with an SNP array to assess their genome-wide diversity. They were compared to elite cultivars produced across the 20th century, represented by 327 inbred lines. Detection of selective footprints between landraces of different geographic origin identified genes involved in environmental adaptation (flowering times, growth) and tolerance to abiotic and biotic stress (drought, cold, salinity). Promising landraces were identified by developing two novel indicators that estimate their contribution to the genome of inbred lines: (i) a modified Roger's distance standardized by gene diversity and (ii) the assignation of lines to landraces using supervised analysis. It showed that most landraces do not have closely related lines and that only 10 landraces, including famous landraces as Reid's Yellow Dent, Lancaster Surecrop and Lacaune, cumulated half of the total contribution to inbred lines. Comparison of ancestral lines directly derived from landraces with lines from more advanced breeding cycles showed a decrease in the number of landraces with a large contribution. New inbred lines derived from landraces with limited contributions enriched more the haplotype diversity of reference inbred lines than those with a high contribution. Our approach opens an avenue for the identification of promising landraces for pre-breeding.
Asunto(s)
Genómica , Fitomejoramiento , Genotipo , Genoma de Planta/genética , ADN , Variación Genética/genética , Zea mays/genéticaRESUMEN
KEY MESSAGE: An original GWAS model integrating the ancestry of alleles was proposed and allowed the detection of background specific additive and dominance QTLs involved in heterotic group complementarity and hybrid performance. Maize genetic diversity is structured into genetic groups selected and improved relative to each other. This process increases group complementarity and differentiation over time and ensures that the hybrids produced from inter-group crosses exhibit high performances and heterosis. To identify loci involved in hybrid performance and heterotic group complementarity, we introduced an original association study model that disentangles allelic effects from the heterotic group origin of the alleles and compared it with a conventional additive/dominance model. This new model was applied on a factorial between Dent and Flint lines and a diallel between Dent-Flint admixed lines with two different layers of analysis: within each environment and in a multiple-environment context. We identified several strong additive QTLs for all traits, including some well-known additive QTLs for flowering time (in the region of Vgt1/2 on chromosome 8). Yield trait displayed significant non-additive effects in the diallel panel. Most of the detected Yield QTLs exhibited overdominance or, more likely, pseudo-overdominance effects. Apparent overdominance at these QTLs contributed to a part of the genetic group complementarity. The comparison between environments revealed a higher stability of additive QTL effects than non-additive ones. Several QTLs showed variations of effects according to the local heterotic group origin. We also revealed large chromosomic regions that display genetic group origin effects. Altogether, our results illustrate how admixed panels combined with dedicated GWAS modeling allow the identification of new QTLs that could not be revealed by a classical hybrid panel analyzed with traditional modeling.
Asunto(s)
Vigor Híbrido , Zea mays , Mapeo Cromosómico/métodos , Zea mays/genética , Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , FenotipoRESUMEN
Since their introduction in the 50's, variance component mixed models have been widely used in many application fields. In this context, ReML estimation is by far the most popular procedure to infer the variance components of the model. Although many implementations of the ReML procedure are readily available, there is still need for computational improvements due to the ever-increasing size of the datasets to be handled, and to the complexity of the models to be adjusted. In this paper, we present a Min-Max (MM) algorithm for ReML inference and combine it with several speed-up procedures. The ReML MM algorithm we present is compared to 5 state-of-the-art publicly available algorithms used in statistical genetics. The computational performance of the different algorithms are evaluated on several datasets representing different plant breeding experimental designs. The MM algorithm ranks among the top 2 methods in almost all settings and is more versatile than many of its competitors. The MM algorithm is a promising alternative to the classical AI-ReML algorithm in the context of variance component mixed models. It is available in the MM4LMM R-package.
Asunto(s)
Algoritmos , Biología Computacional/métodos , Modelos Genéticos , Modelos EstadísticosRESUMEN
When handling a structured population in association mapping, group-specific allele effects may be observed at quantitative trait loci (QTLs) for several reasons: (i) a different linkage disequilibrium (LD) between SNPs and QTLs across groups, (ii) group-specific genetic mutations in QTL regions, and/or (iii) epistatic interactions between QTLs and other loci that have differentiated allele frequencies between groups. We present here a new genome-wide association (GWAS) approach to identify QTLs exhibiting such group-specific allele effects. We developed genetic materials including admixed progeny from different genetic groups with known genome-wide ancestries (local admixture). A dedicated statistical methodology was developed to analyze pure and admixed individuals jointly, allowing one to disentangle the factors causing the heterogeneity of allele effects across groups. This approach was applied to maize by developing an inbred "Flint-Dent" panel including admixed individuals that was evaluated for flowering time. Several associations were detected revealing a wide range of configurations of allele effects, both at known flowering QTLs (Vgt1, Vgt2 and Vgt3) and new loci. We found several QTLs whose effect depended on the group ancestry of alleles while others interacted with the genetic background. Our GWAS approach provides useful information on the stability of QTL effects across genetic groups and can be applied to a wide range of species.
Asunto(s)
Epistasis Genética/genética , Flores/genética , Sitios de Carácter Cuantitativo/genética , Zea mays/genética , Alelos , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Frecuencia de los Genes/genética , Antecedentes Genéticos , Genoma de Planta/genética , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Desequilibrio de Ligamiento/genética , Fenotipo , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
KEY MESSAGE: Calibrating a genomic selection model on a sparse factorial design rather than on tester designs is advantageous for some traits, and equivalent for others. In maize breeding, the selection of the candidate inbred lines is based on topcross evaluations using a limited number of testers. Then, a subset of single-crosses between these selected lines is evaluated to identify the best hybrid combinations. Genomic selection enables the prediction of all possible single-crosses between candidate lines but raises the question of defining the best training set design. Previous simulation results have shown the potential of using a sparse factorial design instead of tester designs as the training set. To validate this result, a 363 hybrid factorial design was obtained by crossing 90 dent and flint inbred lines from six segregating families. Two tester designs were also obtained by crossing the same inbred lines to two testers of the opposite group. These designs were evaluated for silage in eight environments and used to predict independent performances of a 951 hybrid factorial design. At a same number of hybrids and lines, the factorial design was as efficient as the tester designs, and, for some traits, outperformed them. All available designs were used as both training and validation set to evaluate their efficiency. When the objective was to predict single-crosses between untested lines, we showed an advantage of increasing the number of lines involved in the training set, by (1) allocating each of them to a different tester for the tester design, or (2) reducing the number of hybrids per line for the factorial design. Our results confirm the potential of sparse factorial designs for genomic hybrid breeding.
Asunto(s)
Fitomejoramiento , Zea mays , Genómica/métodos , Humanos , Hibridación Genética , Ensilaje , Zea mays/genéticaRESUMEN
BACKGROUND: The narrow genetic base of elite germplasm compromises long-term genetic gain and increases the vulnerability to biotic and abiotic stresses in unpredictable environmental conditions. Therefore, an efficient strategy is required to broaden the genetic base of commercial breeding programs while not compromising short-term variety release. Optimal cross selection aims at identifying the optimal set of crosses that balances the expected genetic value and diversity. We propose to consider genomic selection and optimal cross selection to recurrently improve genetic resources (i.e. pre-breeding), to bridge the improved genetic resources with elites (i.e. bridging), and to manage introductions into the elite breeding population. Optimal cross selection is particularly adapted to jointly identify bridging, introduction and elite crosses to ensure an overall consistency of the genetic base broadening strategy. RESULTS: We compared simulated breeding programs introducing donors with different performance levels, directly or indirectly after bridging. We also evaluated the effect of the training set composition on the success of introductions. We observed that with recurrent introductions of improved donors, it is possible to maintain the genetic diversity and increase mid- and long-term performances with only limited penalty at short-term. Considering a bridging step yielded significantly higher mid- and long-term genetic gain when introducing low performing donors. The results also suggested to consider marker effects estimated with a broad training population including donor by elite and elite by elite progeny to identify bridging, introduction and elite crosses. CONCLUSION: Results of this study provide guidelines on how to harness polygenic variation present in genetic resources to broaden elite germplasm.
Asunto(s)
Modelos Genéticos , Cruzamiento , Variación Genética , Análisis de Componente Principal , Selección GenéticaRESUMEN
KEY MESSAGE: Collaborative diversity panels and genomic prediction seem relevant to identify and harness genetic resources for polygenic trait-specific enrichment of elite germplasms. In plant breeding, genetic diversity is important to maintain the pace of genetic gain and the ability to respond to new challenges in a context of climatic and social expectation changes. Many genetic resources are accessible to breeders but cannot all be considered for broadening the genetic diversity of elite germplasm. This study presents the use of genomic predictions trained on a collaborative diversity panel, which assembles genetic resources and elite lines, to identify resources to enrich an elite germplasm. A maize collaborative panel (386 lines) was considered to estimate genome-wide marker effects. Relevant predictive abilities (0.40-0.55) were observed on a large population of private elite materials, which supported the interest of such a collaborative panel for diversity management perspectives. Grain-yield estimated marker effects were used to select a donor that best complements an elite recipient at individual loci or haplotype segments, or that is expected to give the best-performing progeny with the elite. Among existing and new criteria that were compared, some gave more weight to the donor-elite complementarity than to the donor value, and appeared more adapted to long-term objective. We extended this approach to the selection of a set of donors complementing an elite population. We defined a crossing plan between identified donors and elite recipients. Our results illustrated how collaborative projects based on diversity panels including both public resources and elite germplasm can contribute to a better characterization of genetic resources in view of their use to enrich elite germplasm.
Asunto(s)
Conducta Cooperativa , Genómica , Fitomejoramiento , Zea mays/genética , Genotipo , Haploidia , Modelos Genéticos , Sitios de Carácter Cuantitativo/genéticaRESUMEN
Through the local selection of landraces, humans have guided the adaptation of crops to a vast range of climatic and ecological conditions. This is particularly true of maize, which was domesticated in a restricted area of Mexico but now displays one of the broadest cultivated ranges worldwide. Here, we sequenced 67 genomes with an average sequencing depth of 18x to document routes of introduction, admixture and selective history of European maize and its American counterparts. To avoid the confounding effects of recent breeding, we targeted germplasm (lines) directly derived from landraces. Among our lines, we discovered 22,294,769 SNPs and between 0.9% to 4.1% residual heterozygosity. Using a segmentation method, we identified 6,978 segments of unexpectedly high rate of heterozygosity. These segments point to genes potentially involved in inbreeding depression, and to a lesser extent to the presence of structural variants. Genetic structuring and inferences of historical splits revealed 5 genetic groups and two independent European introductions, with modest bottleneck signatures. Our results further revealed admixtures between distinct sources that have contributed to the establishment of 3 groups at intermediate latitudes in North America and Europe. We combined differentiation- and diversity-based statistics to identify both genes and gene networks displaying strong signals of selection. These include genes/gene networks involved in flowering time, drought and cold tolerance, plant defense and starch properties. Overall, our results provide novel insights into the evolutionary history of European maize and highlight a major role of admixture in environmental adaptation, paralleling recent findings in humans.
Asunto(s)
Adaptación Fisiológica/genética , Genes de Plantas/genética , Fitomejoramiento/métodos , Zea mays/genética , Europa (Continente) , Variación Genética , Genoma de Planta/genética , Geografía , Heterocigoto , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Modelos Genéticos , Filogenia , Polimorfismo de Nucleótido Simple , Selección Genética , Estados Unidos , Zea mays/clasificaciónRESUMEN
BACKGROUND: Single Nucleotide Polymorphism (SNP) array and re-sequencing technologies have different properties (e.g. calling rate, minor allele frequency profile) and drawbacks (e.g. ascertainment bias). This lead us to study their complementarity and the consequences of using them separately or combined in diversity analyses and Genome-Wide Association Studies (GWAS). We performed GWAS on three traits (grain yield, plant height and male flowering time) measured in 22 environments on a panel of 247 F1 hybrids obtained by crossing 247 diverse dent maize inbred lines with a same flint line. The 247 lines were genotyped using three genotyping technologies (Genotyping-By-Sequencing, Illumina Infinium 50 K and Affymetrix Axiom 600 K arrays). RESULTS: The effects of ascertainment bias of the 50 K and 600 K arrays were negligible for deciphering global genetic trends of diversity and for estimating relatedness in this panel. We developed an original approach based on linkage disequilibrium (LD) extent in order to determine whether SNPs significantly associated with a trait and that are physically linked should be considered as a single Quantitative Trait Locus (QTL) or several independent QTLs. Using this approach, we showed that the combination of the three technologies, which have different SNP distributions and densities, allowed us to detect more QTLs (gain in power) and potentially refine the localization of the causal polymorphisms (gain in resolution). CONCLUSIONS: Conceptually different technologies are complementary for detecting QTLs by tagging different haplotypes in association studies. Considering LD, marker density and the combination of different technologies (SNP-arrays and re-sequencing), the genotypic data available were most likely enough to well represent polymorphisms in the centromeric regions, whereas using more markers would be beneficial for telomeric regions.
Asunto(s)
Estudio de Asociación del Genoma Completo/métodos , Técnicas de Genotipaje , Haplotipos , Análisis de Secuencia por Matrices de Oligonucleótidos , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Zea mays/genética , Alelos , Biodiversidad , Cromosomas de las Plantas , Marcadores Genéticos , Genoma de Planta , Desequilibrio de Ligamiento , Zea mays/crecimiento & desarrolloRESUMEN
KEY MESSAGE: Population structure affects genomic selection efficiency as well as the ability to forecast accuracy using standard GBLUP. Genomic prediction models usually assume that the individuals used for calibration belong to the same population as those to be predicted. Most of the a priori indicators of precision, such as the coefficient of determination (CD), were derived from those same models. But genetic structure is a common feature in plant species, and it may impact genomic selection efficiency and the ability to forecast prediction accuracy. We investigated the impact of genetic structure in a dent maize panel ("Amaizing Dent") using different scenarios including within- or across-group predictions. For a given training set size, the best accuracies were achieved when predicting individuals using a model calibrated on the same genetic group. Nevertheless, a diverse training set representing all the groups had a certain predictive efficiency for all the validation sets, and adding extra-group individuals was almost always beneficial. It underlines the potential of such a generic training set for dent maize genomic selection applications. Alternative prediction models, taking genetic structure explicitly into account, did not improve the prediction accuracy compared to GBLUP. We also investigated the ability of different indicators of precision to forecast accuracy in the within- or across-group scenarios. There was a global encouraging trend of the CD to differentiate scenarios, although there were specific combinations of target populations and traits where the efficiency of this indicator proved to be null. One hypothesis to explain such erratic performances is the impact of genetic structure through group-specific allele diversity at QTLs rather than group-specific allele effects.
Asunto(s)
Modelos Genéticos , Fitomejoramiento , Zea mays/genética , Alelos , Genómica , Genotipo , Fenotipo , Sitios de Carácter CuantitativoRESUMEN
KEY MESSAGE: Silage quality traits of maize hybrids between the Dent and Flint heterotic groups mostly involved QTL specific of each parental group, some of them showing unfavorable pleiotropic effects on yield. Maize (Zea mays L.) is commonly used as silage for cattle feeding in Northern Europe. In addition to biomass production, improving whole-plant digestibility is a major breeding objective. To identify loci involved in the general (GCA, parental values) and specific combining ability (SCA, cross-specific value) components of hybrid value, we analyzed an incomplete factorial design of 951 hybrids obtained by crossing inbred lines issued from two multiparental connected populations, each specific to one of the heterotic groups used for silage in Europe ("Dent" and "Flint"). Inbred lines were genotyped for approximately 20K single nucleotide polymorphisms, and hybrids were phenotyped in eight environments for seven silage quality traits measured by near-infrared spectroscopy, biomass yield and precocity (partly analyzed in a previous study). We estimated variance components for GCA and SCA and their interaction with environment. We performed QTL detection using different models adapted to this hybrid population. Strong family effects and a predominance of GCA components compared to SCA were found for all traits. In total, 230 QTL were detected, with only two showing SCA effects significant at the whole-genome level. More than 80% of GCA QTL were specific of one heterotic group. QTL explained individually less than 5% of the phenotypic variance. QTL co-localizations and correlation between QTL effects of quality and productivity traits suggest at least partial pleiotropic effects. This work opens new prospects for improving maize hybrid performances for both biomass productivity and quality accounting for complementarities between heterotic groups.
Asunto(s)
Sitios de Carácter Cuantitativo , Zea mays/genética , Mapeo Cromosómico , Hibridación Genética , Zea mays/crecimiento & desarrolloRESUMEN
KEY MESSAGE: We review and propose easily implemented and affordable indicators to assess the genetic diversity and the potential of a breeding population and propose solutions for its long-term management. Successful plant breeding programs rely on balanced efforts between short-term goals to develop competitive cultivars and long-term goals to improve and maintain diversity in the genetic pool. Indicators of the sustainability of response to selection in breeding pools are of key importance in this context. We reviewed and proposed sets of indicators based on temporal phenotypic and genotypic data and applied them on an early maize grain program implying two breeding pools (Dent and Flint) selected in a reciprocal manner. Both breeding populations showed a significant positive genetic gain summing up to 1.43 qx/ha/year but contrasted evolutions of genetic variance. Advances in high-throughput genotyping permitted the identification of regions of low diversity, mainly localized in pericentromeric regions. Observed changes in genetic diversity were multiple, reflecting a complex breeding system. We estimated the impact of linkage disequilibrium (LD) and of allelic diversity on the additive genetic variance at a genome-wide and chromosome-wide scale. Consistently with theoretical expectation under directional selection, we found a negative contribution of LD to genetic variance, which was unevenly distributed between chromosomes. This suggests different chromosome selection histories and underlines the interest to recombine specific chromosome regions. All three sets of indicators valorize in house data and are easy to implement in the era of genomic selection in every breeding program.
Asunto(s)
Variación Genética , Genoma de Planta , Zea mays/genética , Cruzamiento , Europa (Continente) , Fenotipo , Evaluación de Programas y Proyectos de SaludRESUMEN
BACKGROUND: Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level and the extent of presence/absence variation and relative chromosomal landscape of inbred-specific regions remain to be elucidated. RESULTS: De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed Linkage Disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups. CONCLUSIONS: We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. Pattern of linkage disequilibrium within PAVs strikingly differs from this of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.