RESUMEN
KEY MESSAGE: Association mapping with immortalized lines of landraces offers several advantages including a high mapping resolution, as demonstrated here in maize by identifying the causal variants underlying QTL for oil content and the metabolite allantoin. Landraces are traditional varieties of crops that present a valuable yet largely untapped reservoir of genetic variation to meet future challenges of agriculture. Here, we performed association mapping in a panel comprising 358 immortalized maize lines from six European Flint landraces. Linkage disequilibrium decayed much faster in the landraces than in the elite lines included for comparison, permitting a high mapping resolution. We demonstrate this by fine-mapping a quantitative trait locus (QTL) for oil content down to the phenylalanine insertion F469 in DGAT1-2 as the causal variant. For the metabolite allantoin, related to abiotic stress response, we identified promoter polymorphisms and differential expression of an allantoinase as putative cause of variation. Our results demonstrate the power of this approach to dissect QTL potentially down to the causal variants, toward the utilization of natural or engineered alleles in breeding. Moreover, we provide guidelines for studies using ancestral landraces for crop genetic research and breeding.
Asunto(s)
Biblioteca de Genes , Genes de Plantas , Sitios de Carácter Cuantitativo , Zea mays/genética , Estudios de Asociación Genética , Desequilibrio de Ligamiento , Fenotipo , Fitomejoramiento , Especificidad de la EspecieRESUMEN
KEY MESSAGE: Inclusion of historical training data improved the genomics-based prediction of performance of maize hybrids, the extent depending on the phenotypic trait and genotype-by-year interaction. Prediction of hybrid performance using existing phenotypic data on previous hybrids combined with molecular data collected on the parent lines allows to identify the most promising candidates from a huge number of possible hybrids at an early stage. Phenotypic data on yield and dry matter of 1970 grain maize hybrids from 19 years of a public breeding program were aggregated considering the underlying structure of factorial sets of hybrids. Pedigree records and 50 K SNP data were collected on their 170 Dent and 127 Flint parent lines. The performance of untested hybrids was predicted by best linear unbiased predictors (BLUP) on basis of pedigree or genomic data. For composition of training sets (TRN) and test sets (TST), three schemes for collecting factorials from specific years were employed which resulted in 490 scenarios. For each scenario, the predictive ability and genomic relationship between TRN and TST hybrids were determined. For extended TRNs, where earlier years were successively added to the TRN, the maximum relationship increased and the predictive ability improved, with the extent of the latter depending on the phenotypic trait and its genotype-by-year interaction. Genomic BLUP outperformed pedigree BLUP and better utilized the early years' data, especially for prediction of hybrids from factorials in a more distant future. This study on hybrid prediction in grain maize illustrated that including historical phenotypic data for training, although consisting of less related genotypes, can improve genomic prediction and enables optimization of hybrid variety development.
Asunto(s)
Genómica/métodos , Hibridación Genética , Zea mays/genética , AgriculturaRESUMEN
BACKGROUND: Small RNA (sRNA) sequences are known to have a broad impact on gene regulation by various mechanisms. Their performance for the prediction of hybrid traits has not yet been analyzed. Our objective was to analyze the relation of parental sRNA expression with the performance of their hybrids, to develop a sRNA-based prediction approach, and to compare it to more common SNP and mRNA transcript based predictions using a factorial mating scheme of a maize hybrid breeding program. RESULTS: Correlation of genomic differences and messenger RNA (mRNA) or sRNA expression differences between parental lines with hybrid performance of their hybrids revealed that sRNAs showed an inverse relationship in contrast to the other two data types. We associated differences for SNPs, mRNA and sRNA expression between parental inbred lines with the performance of their hybrid combinations and developed two prediction approaches using distance measures based on associated markers. Cross-validations revealed parental differences in sRNA expression to be strong predictors for hybrid performance for grain yield in maize, comparable to genomic and mRNA data. The integration of both positively and negatively associated markers in the prediction approaches enhanced the prediction accurary. The associated sRNAs belong predominantly to the canonical size classes of 22- and 24-nt that show specific genomic mapping characteristics. CONCLUSION: Expression profiles of sRNA are a promising alternative to SNPs or mRNA expression profiles for hybrid prediction, especially for plant species without reference genome or transcriptome information. The characteristics of the sRNAs we identified suggest that association studies based on breeding populations facilitate the identification of sRNAs involved in hybrid performance.
Asunto(s)
Hibridación Genética , ARN Pequeño no Traducido/genética , Zea mays/genética , Cruzamiento , Perfilación de la Expresión Génica , Genómica , Polimorfismo de Nucleótido Simple , ARN Mensajero/genética , Zea mays/crecimiento & desarrolloRESUMEN
Heterosis refers to a quantitative phenomenon in which F1 hybrid trait values exceed the mean of the parental values in a positive direction. Generally, it is dependent on a high degree of heterozygosity, which is maintained in hybrid breeding by developing parental lines in separate, genetically distinct heterotic groups. The mobility of small RNAs (sRNAs) that mediate epigenetic regulation of gene expression renders them promising candidates for modulating the action of combined diverse genomes in trans-and evidence already indicates their contribution to transgressive phenotypes. By sequencing small RNA libraries of a panel of 21 maize parental inbred lines we found a low overlap of 35% between the sRNA populations from both distinct heterotic groups. Surprisingly, in contrast to genetic or gene expression variation, parental sRNA expression variation is negatively correlated with grain yield (GY) heterosis. Among 0.595 million expressed sRNAs, we identified 9,767, predominantly 22- and 24-nt long sRNAs, which showed an association of their differential expression between parental lines and GY heterosis of the respective hybrids. Of these sRNAs, 3,485 or 6,282 showed an association with high or low GY heterosis, respectively, thus the low heterosis associated group prevailing at 64%. The heterosis associated sRNAs map more frequently to genes that show differential expression between parental lines than reference sets. Together these findings suggest that trans-chromosomal actions of sRNAs in hybrids might add up to a negative contribution in heterosis formation, mediated by unfavorable gene expression regulation. We further revealed an exclusive accumulation of 22-nt sRNAs that are associated with low GY heterosis in pericentromeric genomic regions. That recombinational suppression led to this enrichment is indicated by its close correlation with low recombination rates. The existence of this enrichment, which we hypothesize resulted from the separated breeding of inbred lines within heterotic groups, may have implications for hybrid breeding strategies addressing the recombinational constraints characteristic of complex crop genomes.
RESUMEN
The ability to predict the agronomic performance of single-crosses with high precision is essential for selecting superior candidates for hybrid breeding. With recent technological advances, thousands of new parent lines, and, consequently, millions of new hybrid combinations are possible in each breeding cycle, yet only a few hundred can be produced and phenotyped in multi-environment yield trials. Well established prediction approaches such as best linear unbiased prediction (BLUP) using pedigree data and whole-genome prediction using genomic data are limited in capturing epistasis and interactions occurring within and among downstream biological strata such as transcriptome and metabolome. Because mRNA and small RNA (sRNA) sequences are involved in transcriptional, translational and post-translational processes, we expect them to provide information influencing several biological strata. However, using sRNA data of parent lines to predict hybrid performance has not yet been addressed. Here, we gathered genomic, transcriptomic (mRNA and sRNA) and metabolomic data of parent lines to evaluate the ability of the data to predict the performance of untested hybrids for important agronomic traits in grain maize. We found a considerable interaction for predictive ability between predictor and trait, with mRNA data being a superior predictor for grain yield and genomic data for grain dry matter content, while sRNA performed relatively poorly for both traits. Combining mRNA and genomic data as predictors resulted in high predictive abilities across both traits and combining other predictors improved prediction over that of the individual predictors alone. We conclude that downstream "omics" can complement genomics for hybrid prediction, and, thereby, contribute to more efficient selection of hybrid candidates.
Asunto(s)
Estudios de Asociación Genética , Genoma de Planta , Genómica , Zea mays/genética , Algoritmos , Genética de Población , Genómica/métodos , Hibridación Genética , Metabolómica , Modelos Genéticos , Fitomejoramiento , Sitios de Carácter Cuantitativo , Carácter Cuantitativo Heredable , Selección Genética , TranscriptomaRESUMEN
KEY MESSAGE: Complementing genomic data with other "omics" predictors can increase the probability of success for predicting the best hybrid combinations using complex agronomic traits. Accurate prediction of traits with complex genetic architecture is crucial for selecting superior candidates in animal and plant breeding and for guiding decisions in personalized medicine. Whole-genome prediction has revolutionized these areas but has inherent limitations in incorporating intricate epistatic interactions. Downstream "omics" data are expected to integrate interactions within and between different biological strata and provide the opportunity to improve trait prediction. Yet, predicting traits from parents to progeny has not been addressed by a combination of "omics" data. Here, we evaluate several "omics" predictors-genomic, transcriptomic and metabolic data-measured on parent lines at early developmental stages and demonstrate that the integration of transcriptomic with genomic data leads to higher success rates in the correct prediction of untested hybrid combinations in maize. Despite the high predictive ability of genomic data, transcriptomic data alone outperformed them and other predictors for the most complex heterotic trait, dry matter yield. An eQTL analysis revealed that transcriptomic data integrate genomic information from both, adjacent and distant sites relative to the expressed genes. Together, these findings suggest that downstream predictors capture physiological epistasis that is transmitted from parents to their hybrid offspring. We conclude that the use of downstream "omics" data in prediction can exploit important information beyond structural genomics for leveraging the efficiency of hybrid breeding.
Asunto(s)
Zea mays/genética , Mapeo Cromosómico , Genómica , Vigor Híbrido , Metabolómica , Modelos Genéticos , Fenotipo , Fitomejoramiento , Sitios de Carácter Cuantitativo , Carácter Cuantitativo Heredable , TranscriptomaRESUMEN
Thousands of landraces are stored in seed banks as "gold reserves" for future use in plant breeding. In many crops, their utilization is hampered because they represent heterogeneous populations of heterozygous genotypes, which harbor a high genetic load. We show, with high-density genotyping in five landraces of maize, that libraries of doubled-haploid (DH) lines capture the allelic diversity of genetic resources in an unbiased way. By comparing allelic differentiation between heterozygous plants from the original landraces and 266 derived DH lines, we find conclusive evidence that, in the DH production process, sampling of alleles is random across the entire allele frequency spectrum, and purging of landraces from their genetic load does not act on specific genomic regions. Based on overall process efficiency, we show that generating DH lines is feasible for genetic material that has never been selected for inbreeding tolerance. We conclude that libraries of DH lines will make genetic resources accessible to crop improvement by linking molecular inventories of seed banks with meaningful phenotypes.
Asunto(s)
Haploidia , Fitomejoramiento/métodos , Banco de Semillas , Zea mays/genética , Alelos , Bases de Datos de Ácidos Nucleicos , Carga Genética , Heterocigoto , Desequilibrio de Ligamiento , Polimorfismo GenéticoRESUMEN
KEY MESSAGE: Six quantitative trait loci (QTL) for Gibberella ear rot resistance in maize were tested in two different genetic backgrounds; three QTL displayed an effect in few near isogenic line pairs. Few quantitative trait loci (QTL) mapping studies for Gibberella ear rot (GER) have been conducted, but no QTL have been verified so far. QTL validation is prudent before their implementation into marker-assisted selection (MAS) programs. Our objectives were to (1) validate six QTL for GER resistance, (2) evaluate the QTL across two genetic backgrounds, (3) investigate the genetic background outside the targeted introgressions. Pairs of near isogenic lines (NILs) segregating for a single QTL (Qger1, Qger2, Qger10, Qger13, Qger16, or Qger21) were developed by recurrent backcross until generation BC3S2. Donor parents (DP) carrying QTL were backcrossed to a susceptible (UH009) and a moderately resistant (UH007) recurrent parent. MAS was performed using five SNP markers covering a region of 40 cM around each QTL. All NILs were genotyped with the MaizeSNP50 assay and phenotyped for GER severity and deoxynivalenol and zearalenone content. Traits were significantly (P < 0.001) intercorrelated. Out of 34 NIL pairs with the UH009 genetic background, three pairs showed significant differences in at least one trait for three QTL (Qger1, Qger2, Qger13). Out of 25 NIL pairs with the UH007 genetic background, five pairs showed significant differences in at least one trait for two QTL (Qger2, Qger21). However, Qger16, Qger10 and Qger13 were most likely false positives. The genetic background possibly affected NIL pairs comparisons due to linkage drag and/or epistasis with residual loci from the DP in non-target regions. In conclusion, validation rates were disappointingly low, which further indicates that GER resistance is controlled by many low-effect QTL.
Asunto(s)
Resistencia a la Enfermedad/genética , Gibberella , Enfermedades de las Plantas/genética , Sitios de Carácter Cuantitativo , Zea mays/genética , Mapeo Cromosómico , Cruzamientos Genéticos , Ligamiento Genético , Genotipo , Fenotipo , Fitomejoramiento , Enfermedades de las Plantas/microbiología , Polimorfismo de Nucleótido Simple , Tricotecenos/análisis , Zea mays/microbiología , Zearalenona/análisisRESUMEN
In vivo haploid induction (HI) triggered by pollination with special intraspecific genotypes, called inducers, is unique to Zea maysL. within the plant kingdom and has revolutionized maize breeding during the last decade. However, the molecular mechanisms underlying HI in maize are still unclear. To investigate the genetic basis of HI, we developed a new approach for genome-wide association studies (GWAS), termed conditional haplotype extension (CHE) test that allows detection of selective sweeps even under almost perfect confounding of population structure and trait expression. Here, we applied this test to identify genomic regions required for HI expression and dissected the combined support interval (50.34 Mb) of the QTL qhir1, detected in a previous study, into two closely linked genomic segments relevant for HI expression. The first, termed qhir11(0.54 Mb), comprises an already fine-mapped region but was not diagnostic for differentiating inducers and noninducers. The second segment, termed qhir12(3.97 Mb), had a haplotype allele common to all 53 inducer lines but not found in any of the 1482 noninducers. By comparing resequencing data of one inducer with 14 noninducers, we detected in the qhir12 region three candidate genes involved in DNA or amino acid binding, however, none for qhir11 We propose that the CHE test can be utilized in introgression breeding and different fields of genetics to detect selective sweeps in heterogeneous genetic backgrounds.
Asunto(s)
Genoma de Planta , Estudio de Asociación del Genoma Completo , Genómica , Haploidia , Zea mays/genética , Variación Genética , Genética de Población , Estudio de Asociación del Genoma Completo/métodos , Genómica/métodos , Genotipo , Fenotipo , Fitomejoramiento , Polimorfismo de Nucleótido Simple , Sitios de Carácter CuantitativoRESUMEN
KEY MESSAGE: QTL analysis for Fusarium resistance traits with multiple connected families detected more QTL than single-family analysis. Prediction accuracy was tightly associated with the kinship of the validation and training set. ABSTRACT: QTL mapping has recently shifted from analysis of single families to multiple, connected families and several biometric models have been suggested. Using a high-density consensus map with 2472 marker loci, we performed QTL mapping with five connected bi-parental families with 639 doubled-haploid (DH) lines in maize for ear rot resistance and analyzed traits DON, Gibberella ear rot severity (GER), and days to silking (DS). Five biometric models differing in the assumption about the number and effects of alleles at QTL were compared. Model 2 to 5 performing joint analyses across all families and using linkage and/or linkage disequilibrium (LD) information identified all and even further QTL than Model 1 (single-family analyses) and generally explained a higher proportion pG of the genotypic variance for all three traits. QTL for DON and GER were mostly family specific, but several QTL for DS occurred in multiple families. Many QTL displayed large additive effects and most alleles increasing resistance originated from a resistant parent. Interactions between detected QTL and genetic background (family) occurred rarely and were comparatively small. Detailed analysis of three fully connected families yielded higher pG values for Model 3 or 4 than for Model 2 and 5, irrespective of the size NTS of the training set (TS). In conclusion, Model 3 and 4 can be recommended for QTL-based prediction with larger families. Including a sufficiently large number of full sibs in the TS helped to increase QTL-based prediction accuracy (rVS) for various scenarios differing in the composition of the TS.
Asunto(s)
Resistencia a la Enfermedad/genética , Modelos Genéticos , Enfermedades de las Plantas/genética , Sitios de Carácter Cuantitativo , Zea mays/genética , Alelos , Mapeo Cromosómico , Cruzamientos Genéticos , Epistasis Genética , Fusarium , Repeticiones de Microsatélite , Fenotipo , Enfermedades de las Plantas/microbiología , Polimorfismo de Nucleótido Simple , Zea mays/microbiologíaRESUMEN
Maize (Zea mays L.) serves as model plant for heterosis research and is the crop where hybrid breeding was pioneered. We analyzed genomic and phenotypic data of 1254 hybrids of a typical maize hybrid breeding program based on the important Dent × Flint heterotic pattern. Our main objectives were to investigate genome properties of the parental lines (e.g., allele frequencies, linkage disequilibrium, and phases) and examine the prospects of genomic prediction of hybrid performance. We found high consistency of linkage phases and large differences in allele frequencies between the Dent and Flint heterotic groups in pericentromeric regions. These results can be explained by the Hill-Robertson effect and support the hypothesis of differential fixation of alleles due to pseudo-overdominance in these regions. In pericentromeric regions we also found indications for consistent marker-QTL linkage between heterotic groups. With prediction methods GBLUP and BayesB, the cross-validation prediction accuracy ranged from 0.75 to 0.92 for grain yield and from 0.59 to 0.95 for grain moisture. The prediction accuracy of untested hybrids was highest, if both parents were parents of other hybrids in the training set, and lowest, if none of them were involved in any training set hybrid. Optimizing the composition of the training set in terms of number of lines and hybrids per line could further increase prediction accuracy. We conclude that genomic prediction facilitates a paradigm shift in hybrid breeding by focusing on the performance of experimental hybrids rather than the performance of parental lines in test crosses.
Asunto(s)
Genoma de Planta , Hibridación Genética/genética , Zea mays/genética , Cruzamiento , Frecuencia de los Genes , Marcadores Genéticos , Vigor Híbrido/genética , Desequilibrio de Ligamiento , Modelos Genéticos , Fenotipo , Sitios de Carácter CuantitativoRESUMEN
BACKGROUND: The identification of QTL involved in heterosis formation is one approach to unravel the not yet fully understood genetic basis of heterosis - the improved agronomic performance of hybrid F1 plants compared to their inbred parents. The identification of candidate genes underlying a QTL is important both for developing markers and determining the molecular genetic basis of a trait, but remains difficult owing to the large number of genes often contained within individual QTL. To address this problem in heterosis analysis, we applied a meta-analysis strategy for grain yield (GY) of Zea mays L. as example, incorporating QTL-, hybrid field-, and parental gene expression data. RESULTS: For the identification of genes underlying known heterotic QTL, we made use of tight associations between gene expression pattern and the trait of interest, identified by correlation analyses. Using this approach genes strongly associated with heterosis for GY were discovered to be clustered in pericentromeric regions of the complex maize genome. This suggests that expression differences of sequences in recombination-suppressed regions are important in the establishment of heterosis for GY in F1 hybrids and also in the conservation of heterosis for GY across genotypes. Importantly functional analysis of heterosis-associated genes from these genomic regions revealed over-representation of a number of functional classes, identifying key processes contributing to heterosis for GY. Based on the finding that the majority of the analyzed heterosis-associated genes were addtitively expressed, we propose a model referring to the influence of cis-regulatory variation on heterosis for GY by the compensation of fixed detrimental expression levels in parents. CONCLUSIONS: The study highlights the utility of a meta-analysis approach that integrates phenotypic and multi-level molecular data to unravel complex traits in plants. It provides prospects for the identification of genes relevant for QTL, and also suggests a model for the potential role of additive expression in the formation and conservation of heterosis for GY via dominant, multigenic quantitative trait loci. Our findings contribute to a deeper understanding of the multifactorial phenomenon of heterosis, and thus to the breeding of new high yielding varieties.
Asunto(s)
Centrómero/genética , Regulación de la Expresión Génica de las Plantas , Genoma de Planta/genética , Vigor Híbrido/genética , Zea mays/genética , Análisis de Varianza , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Simulación por Computador , Genes de Plantas , Hibridación Genética , Endogamia , Anotación de Secuencia Molecular , Análisis de Secuencia por Matrices de Oligonucleótidos , Sitios de Carácter Cuantitativo/genética , Semillas/crecimiento & desarrolloRESUMEN
High-density genotyping is extensively exploited in genome-wide association mapping studies and genomic selection in maize. By contrast, linkage mapping studies were until now mostly based on low-density genetic maps and theoretical results suggested this to be sufficient. This raises the question, if an increase in marker density would be an overkill for linkage mapping in biparental populations, or if important QTL mapping parameters would benefit from it. In this study, we addressed this question using experimental data and a simulation based on linkage maps with marker densities of 1, 2, and 5 cM. QTL mapping was performed for six diverse traits in a biparental population with 204 doubled haploid maize lines and in a simulation study with varying QTL effects and closely linked QTL for different population sizes. Our results showed that high-density maps neither improved the QTL detection power nor the predictive power for the proportion of explained genotypic variance. By contrast, the precision of QTL localization, the precision of effect estimates of detected QTL, especially for small and medium sized QTL, as well as the power to resolve closely linked QTL profited from an increase in marker density from 5 to 1 cM. In conclusion, the higher costs for high-density genotyping are compensated for by more precise estimates of parameters relevant for knowledge-based breeding, thus making an increase in marker density for linkage mapping attractive.
Asunto(s)
Mapeo Cromosómico , Simulación por Computador , Técnicas de Genotipaje/métodos , Sitios de Carácter Cuantitativo/genética , Zea mays/genética , Cromosomas de las Plantas/genética , Cruzamientos Genéticos , Marcadores Genéticos , Genética de Población , Escala de LodRESUMEN
Chilling sensitivity of maize is a strong limitation for its cultivation in the cooler areas of the northern and southern hemisphere because reduced growth in early stages impairs on later biomass accumulation. Efficient breeding for chilling tolerance is hampered by both the complex physiological response of maize to chilling temperatures and the difficulty to accurately measure chilling tolerance in the field under fluctuating climatic conditions. For this research, we used genome-wide association (GWA) mapping to identify genes underlying chilling tolerance under both controlled and field conditions in a broad germplasm collection of 375 maize inbred lines genotyped with 56 110 single nucleotide polymorphism (SNP). We identified 19 highly significant association signals explaining between 5.7 and 52.5% of the phenotypic variance observed for early growth and chlorophyll fluorescence parameters. The allelic effect of several SNPs identified for early growth was associated with temperature and incident radiation. Candidate genes involved in ethylene signalling, brassinolide, and lignin biosynthesis were found in their vicinity. The frequent involvement of candidate genes into signalling or gene expression regulation underlines the complex response of photosynthetic performance and early growth to climatic conditions, and supports pleiotropism as a major cause of co-locations of quantitative trait loci for these highly polygenic traits.
Asunto(s)
Adaptación Fisiológica/genética , Mapeo Cromosómico , Frío , Estudio de Asociación del Genoma Completo , Endogamia , Zea mays/crecimiento & desarrollo , Zea mays/genética , Agricultura , Cromosomas de las Plantas/genética , Clima , Frecuencia de los Genes/genética , Interacción Gen-Ambiente , Estudios de Asociación Genética , Genotipo , Desequilibrio de Ligamiento/genética , Fenotipo , Fotosíntesis/fisiología , Análisis de Componente Principal , Sitios de Carácter Cuantitativo/genéticaRESUMEN
Recent advances in high-throughput sequencing technologies have triggered a shift toward single-nucleotide polymorphism (SNP) markers. A systematic bias can be introduced if SNPs are ascertained in a small panel of genotypes and then used for characterizing a larger population (ascertainment bias). With the objective of evaluating a potential ascertainment bias of the Illumina MaizeSNP50 array with respect to elite European maize dent and flint inbred lines, we compared the genetic diversity among these materials based on 731 amplified fragment length polymorphisms (AFLPs), 186 simple sequence repeats (SSRs), 41,434 SNPs of the MaizeSNP50 array (SNP-A), and two subsets of it, i.e., 30,068 Panzea (SNP-P) and 11,366 Syngenta markers (SNP-S). We evaluated the bias effects on major allele frequency, allele number, gene diversity, modified Roger's distance (MRD), and on molecular variance (AMOVA). We revealed ascertainment bias in SNP-A, compared to AFLPs and SSRs. It affected especially European flint lines analyzed with markers (SNP-S) specifically developed to maximize differences among North American dent germplasm. The bias affected all genetic parameters, but did not substantially alter the relative distances between inbred lines within groups. For these reasons, we conclude that the SNP markers of the MaizeSNP50 array can be employed for breeding purposes in the investigated material. However, attention should be paid in case of comparisons between genotypes belonging to different heterotic groups. In this case, it is advisable to prefer a marker subset with potentially low ascertainment bias, like in our case the SNP-P marker set.
Asunto(s)
Variación Genética , Polimorfismo de Nucleótido Simple , Zea mays/genética , Alelos , Análisis del Polimorfismo de Longitud de Fragmentos Amplificados , Cruzamientos Genéticos , Europa (Continente) , Frecuencia de los Genes , Marcadores Genéticos , Genotipo , Vigor Híbrido , Modelos EstadísticosRESUMEN
Identifying high performing hybrids is an essential part of every maize breeding program. Genomic prediction of maize hybrid performance allows to identify promising hybrids, when they themselves or other hybrids produced from their parents were not tested in field trials. Using simulations, we investigated the effects of marker density (10, 1, 0.3 marker per mega base pair, Mbp(-1)), convergent or divergent parental populations, number of parents tested in other combinations (2, 1, 0), genetic model (including population-specific and/or dominance marker effects or not), and estimation method (GBLUP or BayesB) on the prediction accuracy. We based our simulations on marker genotypes of Central European flint and dent inbred lines from an ongoing maize breeding program. To simulate convergent or divergent parent populations, we generated phenotypes by assigning QTL to markers with similar or very different allele frequencies in both pools, respectively. Prediction accuracies increased with marker density and number of parents tested and were higher under divergent compared with convergent parental populations. Modeling marker effects as population-specific slightly improved prediction accuracy under lower marker densities (1 and 0.3 Mbp(-1)). This indicated that modeling marker effects as population-specific will be most beneficial under low linkage disequilibrium. Incorporating dominance effects improved prediction accuracies considerably for convergent parent populations, where dominance results in major contributions of SCA effects to the genetic variance among inter-population hybrids. While the general trends regarding the effects of the aforementioned influence factors on prediction accuracy were similar for GBLUP and BayesB, the latter method produced significantly higher accuracies for models incorporating dominance.
Asunto(s)
Genes Dominantes , Genoma de Planta , Hibridación Genética , Zea mays/genética , Identificación Biométrica/métodos , Cruzamiento , Frecuencia de los Genes , Marcadores Genéticos , Variación Genética , Genómica/métodos , Genotipo , Desequilibrio de Ligamiento , Modelos Genéticos , Fenotipo , Sitios de Carácter CuantitativoRESUMEN
The performance of hybrids can be predicted with gene expression data from their parental inbred lines. Implementing such prediction approaches in breeding programs promises to increase the efficiency of hybrid breeding. The objectives of our study were to compare the accuracy of prediction models employing multiple linear regression (MLR), partial least squares regression (PLS), support vector machine regression (SVM), and transcriptome-based distances (D(B)). For a factorial of 7 flint and 14 dent maize lines, the grain yield of the hybrids was assessed and the gene expression of the parental lines was profiled with a 56k microarray. The accuracy of the prediction models was measured by the correlation between predicted and observed yield employing two cross-validation schemes. The first modeled the prediction of hybrids when testcross data are available for both parental lines (type 2 hybrids), and the second modeled the prediction of hybrids when no testcross data for the parental lines were available (type 0 hybrids). MLR, SVM, and PLS resulted in a high correlation between predicted and observed yield for type 2 hybrids, whereas for type 0 hybrids D(B) had greater prediction accuracy. The regression methods were robust to the choice of the set of profiled genes and required only a few hundred genes. In contrast, for an accurate hybrid prediction with D(B), 1,000-1,500 genes were required, and the prediction accuracy depended strongly on the set of profiled genes. We conclude that for prediction within one set of genetic material MLR is a promising approach, and for transferring prediction models from one set of genetic material to a related one, the transcriptome-based distance D(B) is most promising.
Asunto(s)
Cruzamiento/métodos , Perfilación de la Expresión Génica , Hibridación Genética/genética , Modelos Genéticos , Zea mays/genética , Análisis de los Mínimos Cuadrados , Máquina de Vectores de Soporte , Zea mays/metabolismoRESUMEN
BACKGROUND: The importance of maize for human and animal nutrition, but also as a source for bio-energy is rapidly increasing. Maize yield is a quantitative trait controlled by many genes with small effects, spread throughout the genome. The precise location of the genes and the identity of the gene networks underlying maize grain yield is unknown. The objective of our study was to contribute to the knowledge of these genes and gene networks by transcription profiling with microarrays. RESULTS: We assessed the grain yield and grain dry matter content (an indicator for early maturity) of 98 maize hybrids in multi-environment field trials. The gene expression in seedlings of the parental inbred lines, which have four different genetic backgrounds, was assessed with genome-scale oligonucleotide arrays. We identified genes associated with grain yield and grain dry matter content using a newly developed two-step correlation approach and found overlapping gene networks for both traits. The underlying metabolic pathways and biological processes were elucidated. Genes involved in sucrose degradation and glycolysis, as well as genes involved in cell expansion and endocycle were found to be associated with grain yield. CONCLUSIONS: Our results indicate that the capability of providing energy and substrates, as well as expanding the cell at the seedling stage, highly influences the grain yield of hybrids. Knowledge of these genes underlying grain yield in maize can contribute to the development of new high yielding varieties.
Asunto(s)
Biomasa , Perfilación de la Expresión Génica/métodos , Redes y Vías Metabólicas/genética , Plantones/genética , Semillas/crecimiento & desarrollo , Zea mays/crecimiento & desarrollo , Zea mays/genética , Regulación de la Expresión Génica de las Plantas , Genes de Plantas , Glucólisis , Hibridación Genética , Modelos Genéticos , Carácter Cuantitativo Heredable , Plantones/crecimiento & desarrollo , Semillas/genética , Sacarosa/metabolismoRESUMEN
The identification of superior hybrids is important for the success of a hybrid breeding program. However, field evaluation of all possible crosses among inbred lines requires extremely large resources. Therefore, efforts have been made to predict hybrid performance (HP) by using field data of related genotypes and molecular markers. In the present study, the main objective was to assess the usefulness of pedigree information in combination with the covariance between general combining ability (GCA) and per se performance of parental lines for HP prediction. In addition, we compared the prediction efficiency of AFLP and SSR marker data, estimated marker effects separately for reciprocal allelic configurations (among heterotic groups) of heterozygous marker loci in hybrids, and imputed missing AFLP marker data for marker-based HP prediction. Unbalanced field data of 400 maize dent x flint hybrids from 9 factorials and of 79 inbred parents were subjected to joint analyses with mixed linear models. The inbreds were genotyped with 910 AFLP and 256 SSR markers. Efficiency of prediction (R (2)) was estimated by cross-validation for hybrids having no or one parent evaluated in testcrosses. Best linear unbiased prediction of GCA and specific combining ability resulted in the highest efficiencies for HP prediction for both traits (R (2) = 0.6-0.9), if pedigree and line per se data were used. However, without such data, HP for grain yield was more efficiently predicted using molecular markers. The additional modifications of the marker-based approaches had no clear effect. Our study showed the high potential of joint analyses of hybrids and parental inbred lines for the prediction of performance of untested hybrids.
Asunto(s)
Vigor Híbrido/genética , Modelos Genéticos , Zea mays/genética , Marcadores Genéticos , Hibridación Genética , EndogamiaRESUMEN
Maize (Zea mays L.) breeders are concerned about the narrowing of the genetic base of elite germplasm. To reverse this trend, elite germplasm from other geographic regions can be introgressed, but due to lack of adaptation it is difficult to assess their breeding potential in the targeted environment. The objectives of this study were to (1) investigate the relationship between European and US maize germplasm, (2) examine the suitability of different mega-environments and measures of performance to assess the breeding potential of exotics, and (3) study the relationship of genetic distance with mid-parent heterosis (MPH). Eight European inbreds from the Dent and Flint heterotic groups, 11 US inbreds belonging to Stiff Stalk (SS), non-Stiff Stalk (NSS), and CIMMYT Pool 41, and their 88 factorial crosses in F(1) and F(2) generations were evaluated for grain yield and dry matter concentration. The experiments were conducted in three mega-environments: Central Europe (target mega-environment), US Cornbelt (mega-environment where donor lines were developed), and Southeast Europe (an intermediate mega-environment). The inbreds were also fingerprinted with 266 SSR markers. Suitable criteria to identify promising exotic germplasm were F(1) hybrid performance in the targeted mega-environment and F(1) and parental performance in the intermediate mega-environment. Marker-based genetic distances reflected relatedness among the inbreds, but showed no association with MPH. Based on genetic distance, MPH, and F(1) performance, we suggest to introgress SS germplasm into European Dents and NSS into European Flints, in order to exploit the specific adaptation of European flint germplasm and the excellent combining ability of US germplasm in European maize breeding programs.