Your browser doesn't support javascript.
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 111
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Theor Popul Biol ; 132: 16-23, 2020 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-31991144

RESUMO

Whole genome epistasis models with interactions between different loci can be approximated by genomic relationship models based on Hadamard powers of the additive genomic relationship. We illustrate that the quality of this approximation reduces when the degree of interaction d increases. Moreover, considering relationship models defined as weighted sum of interactions of different degree, we investigate the impact of this decreasing quality of approximation of the summands on the approximation of the weighted sum. Our results indicate that these approximations remain on a reliable level, but their quality reduces when the weights of interactions of higher degrees do not decrease quickly.

2.
Front Plant Sci ; 10: 1311, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31787990

RESUMO

Although durum wheat (Triticum turgidum var. durum Desf.) is a minor cereal crop representing just 5-7% of the world's total wheat crop, it is a staple food in Mediterranean countries, where it is used to produce pasta, couscous, bulgur and bread. In this paper, we cover multi-trait prediction of grain yield (GY), days to heading (DH) and plant height (PH) of 270 durum wheat lines that were evaluated in 43 environments (country-location-year combinations) across a broad range of water regimes in the Mediterranean Basin and other locations. Multi-trait prediction analyses were performed by implementing a multi-trait deep learning model (MTDL) with a feed-forward network topology and a rectified linear unit activation function with a grid search approach for the selection of hyper-parameters. The results of the multi-trait deep learning method were also compared with univariate predictions of the genomic best linear unbiased predictor (GBLUP) method and the univariate counterpart of the multi-trait deep learning method (UDL). All models were implemented with and without the genotype × environment interaction term. We found that the best predictions were observed without the genotype × environment interaction term in the UDL and MTDL methods. However, under the GBLUP method, the best predictions were observed when the genotype × environment interaction term was taken into account. We also found that in general the best predictions were observed under the GBLUP model; however, the predictions of the MTDL were very similar to those of the GBLUP model. This result provides more evidence that the GBLUP model is a powerful approach for genomic prediction, but also that the deep learning method is a practical approach for predicting univariate and multivariate traits in the context of genomic selection.

3.
Front Plant Sci ; 10: 1502, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31824533

RESUMO

Genomic selection predicts the genomic estimated breeding values (GEBVs) of individuals not previously phenotyped. Several studies have investigated the accuracy of genomic predictions in maize but there is little empirical evidence on the practical performance of lines selected based on phenotype in comparison with those selected solely on GEBVs in advanced testcross yield trials. The main objectives of this study were to (1) empirically compare the performance of tropical maize hybrids selected through phenotypic selection (PS) and genomic selection (GS) under well-watered (WW) and managed drought stress (WS) conditions in Kenya, and (2) compare the cost-benefit analysis of GS and PS. For this study, we used two experimental maize data sets (stage I and stage II yield trials). The stage I data set consisted of 1492 doubled haploid (DH) lines genotyped with rAmpSeq SNPs. A subset of these lines (855) representing various DH populations within the stage I cohort was crossed with an individual single-cross tester chosen to complement each population. These testcross hybrids were evaluated in replicated trials under WW and WS conditions for grain yield and other agronomic traits, while the remaining 637 DH lines were predicted using the 855 lines as a training set. The second data set (stage II) consists of 348 DH lines from the first data set. Among these 348 best DH lines, 172 lines selected were solely based on GEBVs, and 176 lines were selected based on phenotypic performance. Each of the 348 DH lines were crossed with three common testers from complementary heterotic groups, and the resulting 1042 testcross hybrids and six commercial checks were evaluated in four to five WW locations and one WS condition in Kenya. For stage I trials, the cross-validated prediction accuracy for grain yield was 0.67 and 0.65 under WW and WS conditions, respectively. We found similar responses to selection using PS and GS for grain yield other agronomic traits under WW and WS conditions. The top 15% of hybrids advanced through GS and PS gave 21%-23% higher grain yield under WW and 51%-52% more grain yield under WS than the mean of the checks. The GS reduced the cost by 32% over the PS with similar selection gains. We concluded that the use of GS for yield under WW and WS conditions in maize can produce selection candidates with similar performance as those generated from conventional PS, but at a lower cost, and therefore, should be incorporated into maize breeding pipelines to increase breeding program efficiency.

4.
PLoS One ; 14(11): e0224631, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31710611

RESUMO

For doubled haploid (DH) production in maize, F1 generation has been the most frequently used for haploid induction due to facility in the process. However, using F2 generation would be a good alternative to increase genetic variability owing to the additional recombination in meiosis. Our goals were to compare the effect of F1 and F2 generations on DH production in tropical germplasm, evaluating the R1-navajo expression in seeds, the working steps of the methodology, and the genetic variability of the DH lines obtained. Sources germplasm in F1 and F2 generations were crossed with the tropicalized haploid inducer LI-ESALQ. After harvest, for both induction crosses were calculated the haploid induction rate (HIR), diploid seed rate (DSR), and inhibition seed rate (ISR) using the total number of seeds obtained. In order to study the effectiveness of the DH working steps in each generation, the percentage per se and the relative percentage were verified. In addition, SNP markers were obtained for genetic variability studies. Results showed that the values for HIR, ISR, and DSR were 1.23%, 23.48%, and 75.21% for F1 and 1.78%, 15.82%, and 82.38% for F2, respectively. The effectiveness of the DH working step showed the same percentage per se value (0.4%) for F1 and F2, while the relative percentage was 27.2% for F1 and 22.4% for F2. Estimates of population parameters in DH lines from F1 were higher than F2. Furthermore, population structure and kinship analyses showed that one additional generation was not sufficient to create new genotype subgroups. Additionally, the relative efficiency of the response to selection in the F1 was 31.88% higher than F2 due to the number of cycles that are used to obtain the DH. Our results showed that in tropical maize, the use of F1 generation is recommended due to a superior balance between time and genetic variability.

5.
BMC Plant Biol ; 19(1): 520, 2019 Nov 27.
Artigo em Inglês | MEDLINE | ID: mdl-31775638

RESUMO

BACKGROUND: Germplasm banks maintain collections representing the most comprehensive catalogue of native genetic diversity available for crop improvement. Users of germplasm banks are interested in a fixed number of samples representing as broadly as possible the diversity present in the wider collection. A relevant question is whether it is necessary to develop completely independent germplasm samples or it is possible to select nested sets from a pre-defined core set panel not from the whole collection. We used data from 15,384, maize landraces stored in the CIMMYT germplasm bank to study the impact on 8 diversity criteria and the sample representativeness of: (1) two core selection strategies, a statistical sampling (DM), or a numerical maximization method (CH); (2) selecting samples of varying sizes; and (3) selecting samples of different sizes independently of each other or in a nested manner. RESULTS: Sample sizes greater than 10% of the whole population size retained more than 75% of the polymorphic markers for all selection strategies and types of sample; lower sample sizes showed more variability (instability) among repetitions; the strongest effect of sample size was observed on the CH-independent combination. Independent and nested samples showed similar performance for all the criteria for the DM method, but there were differences between them for the CH method. The DM method achieved better approximations to the known values in the population than the CH method; 2-d multidimensional scaling plots of the collection and samples highlighted tendency of sample selection towards the extremes of diversity in the CH method, compared with sampling more representative of the overall genotypic distribution of diversity under the DM method. CONCLUSIONS: The use of core subsets of size greater than or equal to 10% of the whole collection satisfied well the requirement of representativeness and diversity. Nested samples showed similar diversity and representativeness characteristics as independent samples offering a cost effective method of sample definition for germplasm banks. For most criteria assessed the DM method achieved better approximations to the known values in the whole population than the CH method, that is, it generated more statistically representative samples from collections.

6.
G3 (Bethesda) ; 9(12): 3981-3994, 2019 12 03.
Artigo em Inglês | MEDLINE | ID: mdl-31570501

RESUMO

The constrained linear genomic selection index (CLGSI) is a linear combination of genomic estimated breeding values useful for predicting the net genetic merit, which in turn is a linear combination of true unobservable breeding values of the traits weighted by their respective economic values. The CLGSI is the most general genomic index and allows imposing constraints on the expected genetic gain per trait to make some traits change their mean values based on a predetermined level, while the rest of them remain without restrictions. In addition, it includes the unconstrained linear genomic index as a particular case. Using two real datasets and simulated data for seven selection cycles, we compared the theoretical results of the CLGSI with the theoretical results of the constrained linear phenotypic selection index (CLPSI). The criteria used to compare CLGSI vs. CLPSI efficiency were the estimated expected genetic gain per trait values, the selection response, and the interval between selection cycles. The results indicated that because the interval between selection cycles is shorter for the CLGSI than for the CLPSI, CLGSI is more efficient than CLPSI per unit of time, but its efficiency could be lower per selection cycle. Thus, CLGSI is a good option for performing genomic selection when there are genotyped candidates for selection.

7.
Nat Genet ; 51(10): 1530-1539, 2019 10.
Artigo em Inglês | MEDLINE | ID: mdl-31548720

RESUMO

Bread wheat improvement using genomic tools is essential for accelerating trait genetic gains. Here we report the genomic predictabilities of 35 key traits and demonstrate the potential of genomic selection for wheat end-use quality. We also performed a large genome-wide association study that identified several significant marker-trait associations for 50 traits evaluated in South Asia, Africa and the Americas. Furthermore, we built a reference wheat genotype-phenotype map, explored allele frequency dynamics over time and fingerprinted 44,624 wheat lines for trait-associated markers, generating over 7.6 million data points, which together will provide a valuable resource to the wheat community for enhancing productivity and stress resilience.


Assuntos
Resistência à Doença/genética , Genômica/métodos , Locos de Características Quantitativas , Estresse Fisiológico/imunologia , Triticum/crescimento & desenvolvimento , Triticum/imunologia , Ascomicetos/fisiologia , Mapeamento Cromossômico , Grão Comestível/genética , Grão Comestível/crescimento & desenvolvimento , Estudos de Associação Genética , Marcadores Genéticos , Genoma de Planta , Estudo de Associação Genômica Ampla , Melhoramento Vegetal , Doenças das Plantas/genética , Doenças das Plantas/microbiologia , Seleção Genética , Estresse Fisiológico/genética , Triticum/genética
8.
G3 (Bethesda) ; 9(10): 3381-3393, 2019 10 07.
Artigo em Inglês | MEDLINE | ID: mdl-31427455

RESUMO

In this paper we propose a Bayesian multi-output regressor stacking (BMORS) model that is a generalization of the multi-trait regressor stacking method. The proposed BMORS model consists of two stages: in the first stage, a univariate genomic best linear unbiased prediction (GBLUP including genotype × environment interaction GE) model is implemented for each of the L traits under study; then the predictions of all traits are included as covariates in the second stage, by implementing a Ridge regression model. The main objectives of this research were to study alternative models to the existing multi-trait multi-environment (BMTME) model with respect to (1) genomic-enabled prediction accuracy, and (2) potential advantages in terms of computing resources and implementation. We compared the predictions of the BMORS model to those of the univariate GBLUP model using 7 maize and wheat datasets. We found that the proposed BMORS produced similar predictions to the univariate GBLUP model and to the BMTME model in terms of prediction accuracy; however, the best predictions were obtained under the BMTME model. In terms of computing resources, we found that the BMORS is at least 9 times faster than the BMTME method. Based on our empirical findings, the proposed BMORS model is an alternative for predicting multi-trait and multi-environment data, which are very common in genomic-enabled prediction in plant and animal breeding programs.

9.
G3 (Bethesda) ; 9(9): 2913-2924, 2019 09 04.
Artigo em Inglês | MEDLINE | ID: mdl-31289023

RESUMO

Kernel methods are flexible and easy to interpret and have been successfully used in genomic-enabled prediction of various plant species. Kernel methods used in genomic prediction comprise the linear genomic best linear unbiased predictor (GBLUP or GB) kernel, and the Gaussian kernel (GK). In general, these kernels have been used with two statistical models: single-environment and genomic × environment (GE) models. Recently near infrared spectroscopy (NIR) has been used as an inexpensive and non-destructive high-throughput phenotyping method for predicting unobserved line performance in plant breeding trials. In this study, we used a non-linear arc-cosine kernel (AK) that emulates deep learning artificial neural networks. We compared AK prediction accuracy with the prediction accuracy of GB and GK kernel methods in four genomic data sets, one of which also includes pedigree and NIR information. Results show that for all four data sets, AK and GK kernels achieved higher prediction accuracy than the linear GB kernel for the single-environment and GE multi-environment models. In addition, AK achieved similar or slightly higher prediction accuracy than the GK kernel. For all data sets, the GE model achieved higher prediction accuracy than the single-environment model. For the data set that includes pedigree, markers and NIR, results show that the NIR wavelength alone achieved lower prediction accuracy than the genomic information alone; however, the pedigree plus NIR information achieved only slightly lower prediction accuracy than the marker plus the NIR high-throughput data.


Assuntos
Genômica/métodos , Modelos Genéticos , Melhoramento Vegetal/métodos , Espectrofotometria/métodos , Bases de Dados Genéticas , Aprendizado Profundo , Genômica/estatística & dados numéricos , Fenótipo , Espectrofotometria/estatística & dados numéricos , Triticum/genética , Zea mays/genética
10.
G3 (Bethesda) ; 9(9): 2925-2934, 2019 09 04.
Artigo em Inglês | MEDLINE | ID: mdl-31300481

RESUMO

Genome-enabled prediction plays an essential role in wheat breeding because it has the potential to increase the rate of genetic gain relative to traditional phenotypic and pedigree-based selection. Since the performance of wheat lines is highly influenced by environmental stimuli, it is important to accurately model the environment and its interaction with genetic factors in prediction models. Arguably, multi-environmental best linear unbiased prediction (BLUP) may deliver better prediction performance than single-environment genomic BLUP. We evaluated pedigree and genome-based prediction using 35,403 wheat lines from the Global Wheat Breeding Program of the International Maize and Wheat Improvement Center (CIMMYT). We implemented eight statistical models that included genome-wide molecular marker and pedigree information as prediction inputs in two different validation schemes. All models included main effects, but some considered interactions between the different types of pedigree and genomic covariates via Hadamard products of similarity kernels. Pedigree models always gave better prediction of new lines in observed environments than genome-based models when only main effects were fitted. However, for all traits, the highest predictive abilities were obtained when interactions between pedigree, genomes, and environments were included. When new lines were predicted in unobserved environments, in almost all trait/year combinations, the marker main-effects model was the best. These results provide strong evidence that the different sources of genetic information (molecular markers and pedigree) are not equally useful at different stages of the breeding pipelines, and can be employed differentially to improve the design and prediction of the outcome of future breeding programs.


Assuntos
Genoma de Planta , Modelos Genéticos , Triticum/fisiologia , Interação Gene-Ambiente , Marcadores Genéticos , Fenótipo , Melhoramento Vegetal , Distribuição Aleatória , Reprodutibilidade dos Testes , Triticum/genética
11.
G3 (Bethesda) ; 9(8): 2425-2428, 2019 08 08.
Artigo em Inglês | MEDLINE | ID: mdl-31201204

RESUMO

The dna is the fundamental basis of genetic information, just as bits are for computers. Whenever computers are used to represent genetic data, the computational encoding must be efficient to allow the representation of processes driving the inheritance and variability. This is especially important across simulations in view of the increasing complexity and dimensions brought by genomics. This paper introduces a new binary representation of genetic information. Algorithms as bitwise operations that mimic the inheritance of a wide range of polymorphisms are also presented. Different kinds and mixtures of polymorphisms are discussed and exemplified. Proposed algorithms and data structures were implemented in C++ programming language and is available to end users in the R package "isqg" which is available at the R repository (cran). Supplementary data are available online.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Software , Algoritmos
12.
PLoS One ; 14(6): e0217571, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31173600

RESUMO

Several studies have shown differences in the abilities of maize genotypes to facilitate or impede Azospirillum brasilense colonization and to receive benefits from this association. Hence, our aim was to study the genetic control, heterosis effect and the prediction accuracy of the shoot and root traits of maize in response to A. brasilense. For that, we evaluated 118 hybrids under two contrasting scenarios: i) N stress (control) and ii) N stress plus A. brasilense inoculation. The diallel analyses were performed using mixed model equations, and the genomic prediction models accounted for the general and specific combining ability (GCA and SCA, respectively) and the presence or not of G×E effects. In addition, the genomic models were fitted considering parametric (G-BLUP) and semi-parametric (RKHS) kernels. The genotypes showed significant inoculation effect for five root traits, and the GCA and SCA were significant for both. The GCA in the inoculated treatment presented a greater magnitude than the control, whereas the opposite was observed for SCA. Heterosis was weakly influenced by the inoculation, and the heterozygosity and N status in the plant can have a role in the benefits that can be obtained from this Plant Growth-Promoting Bacteria (PGPB). Prediction accuracies for N stress plus A. brasilense ranged from 0.42 to 0.78, depending on the scenario and trait, and were higher, in most cases, than the non-inoculated treatment. Finally, our findings provide an understanding of the quantitative variation of maize responsiveness to A. brasilense and important insights to be applied in maize breeding aiming the development of superior hybrids for this association.


Assuntos
Azospirillum brasilense/fisiologia , Genômica/métodos , Vigor Híbrido/genética , Zea mays/genética , Redes Reguladoras de Genes , Genoma de Planta , Heterozigoto , Hibridização Genética , Endogamia , Fenótipo , Raízes de Plantas/genética , Raízes de Plantas/crescimento & desenvolvimento , Característica Quantitativa Herdável , Estresse Fisiológico/genética
13.
Nat Genet ; 51(5): 857-864, 2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-31036963

RESUMO

We report a map of 4.97 million single-nucleotide polymorphisms of the chickpea from whole-genome resequencing of 429 lines sampled from 45 countries. We identified 122 candidate regions with 204 genes under selection during chickpea breeding. Our data suggest the Eastern Mediterranean as the primary center of origin and migration route of chickpea from the Mediterranean/Fertile Crescent to Central Asia, and probably in parallel from Central Asia to East Africa (Ethiopia) and South Asia (India). Genome-wide association studies identified 262 markers and several candidate genes for 13 traits. Our study establishes a foundation for large-scale characterization of germplasm and population genomics, and a resource for trait dissection, accelerating genetic gains in future chickpea breeding.


Assuntos
Cicer/genética , Cicer/classificação , DNA de Plantas/genética , Domesticação , Marcadores Genéticos , Variação Genética , Genoma de Planta , Estudo de Associação Genômica Ampla , Sequenciamento de Nucleotídeos em Larga Escala , Desequilíbrio de Ligação , Filogenia , Melhoramento Vegetal , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Análise de Sequência de DNA
14.
Theor Appl Genet ; 132(8): 2381-2399, 2019 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-31098757

RESUMO

KEY MESSAGE: Analysis of the genetic architecture of MCMV and MLN resistance in maize doubled-haploid populations revealed QTLs with major effects on chromosomes 3 and 6 that were consistent across genetic backgrounds and environments. Two major-effect QTLs, qMCMV3-108/qMLN3-108 and qMCMV6-17/qMLN6-17, were identified as conferring resistance to both MCMV and MLN. Maize lethal necrosis (MLN) is a serious threat to the food security of maize-growing smallholders in sub-Saharan Africa. The ability of the maize chlorotic mottle virus (MCMV) to interact with other members of the Potyviridae causes severe yield losses in the form of MLN. The objective of the present study was to gain insights and validate the genetic architecture of resistance to MCMV and MLN in maize. We applied linkage mapping to three doubled-haploid populations and a genome-wide association study (GWAS) on 380 diverse maize lines. For all the populations, phenotypic variation for MCMV and MLN was significant, and heritability was moderate to high. Linkage mapping revealed 13 quantitative trait loci (QTLs) for MCMV resistance and 12 QTLs conferring MLN resistance. One major-effect QTL, qMCMV3-108/qMLN3-108, was consistent across populations for both MCMV and MLN resistance. Joint linkage association mapping (JLAM) revealed 18 and 21 main-effect QTLs for MCMV and MLN resistance, respectively. Another major-effect QTL, qMCMV6-17/qMLN6-17, was detected for both MCMV and MLN resistance. The GWAS revealed a total of 54 SNPs (MCMV-13 and MLN-41) significantly associated (P ≤ 5.60 × 10-05) with MCMV and MLN resistance. Most of the GWAS-identified SNPs were within or adjacent to the QTLs detected through linkage mapping. The prediction accuracy for within populations as well as the combined populations is promising; however, the accuracy was low across populations. Overall, MCMV resistance is controlled by a few major and many minor-effect loci and seems more complex than the genetic architecture for MLN resistance.


Assuntos
Ligação Genética , Genoma de Planta , Estudo de Associação Genômica Ampla , Doenças das Plantas/virologia , Sementes/genética , Tombusviridae/genética , Zea mays/genética , Zea mays/virologia , Alelos , Área Sob a Curva , Fenótipo , Doenças das Plantas/genética , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Clima Tropical
15.
Plant Genome ; 12(1)2019 03.
Artigo em Inglês | MEDLINE | ID: mdl-30951082

RESUMO

In this study, we used genotype × environment interactions (G×E) models for hybrid prediction, where similarity between lines was assessed by pedigree and molecular markers, and similarity between environments was accounted for by environmental covariables. We use five genomic and pedigree models (M1-M5) under four cross-validation (CV) schemes: prediction of hybrids when the training set (i) includes hybrids of all males and females evaluated only in some environments (T2FM), (ii) excludes all progenies from a randomly selected male (T1M), (iii) includes all progenies from 20% randomly selected females in combination with all males (T1F), and (iv) includes one randomly selected male plus 40% randomly selected females that were crossed with it (T0FM). Models were tested on a total of 1888 wheat ( L.) hybrids including 18 males and 667 females in three consecutive years. For grain yield, the most complex model (M5) under T2FM had slightly higher prediction accuracy than the less complex model. For T1F, the prediction accuracy of hybrids for grain yield and other traits of the most complete model was 0.50 to 0.55. For T1M, Model M3 exhibited high prediction accuracies for flowering traits (0.71), whereas the more complex model (M5) demonstrated high accuracy for grain yield (0.5). For T0FM, the prediction accuracy for grain yield of Model M5 was 0.61. Including genomic and pedigree gave relatively high prediction accuracy even when both parents were untested. Results show that it is possible to predict unobserved hybrids when modeling genomic general combining ability (GCA) and specific combining ability (SCA) and their interactions with environments.


Assuntos
Hibridização Genética , Modelos Genéticos , Triticum/genética , Interação Gene-Ambiente , Linhagem , Melhoramento Vegetal
16.
G3 (Bethesda) ; 9(5): 1355-1369, 2019 05 07.
Artigo em Inglês | MEDLINE | ID: mdl-30819822

RESUMO

Evidence that genomic selection (GS) is a technology that is revolutionizing plant breeding continues to grow. However, it is very well documented that its success strongly depends on statistical models, which are used by GS to perform predictions of candidate genotypes that were not phenotyped. Because there is no universally better model for prediction and models for each type of response variable are needed (continuous, binary, ordinal, count, etc.), an active area of research aims to develop statistical models for the prediction of univariate and multivariate traits in GS. However, most of the models developed so far are for univariate and continuous (Gaussian) traits. Therefore, to overcome the lack of multivariate statistical models for genome-based prediction by improving the original version of the BMTME, we propose an improved Bayesian multi-trait and multi-environment (BMTME) R package for analyzing breeding data with multiple traits and multiple environments. We also introduce Bayesian multi-output regressor stacking (BMORS) functions that are considerably efficient in terms of computational resources. The package allows parameter estimation and evaluates the prediction performance of multi-trait and multi-environment data in a reliable, efficient and user-friendly way. We illustrate the use of the BMTME with real toy datasets to show all the facilities that the software offers the user. However, for large datasets, the BME() and BMTME() functions of the BMTME R package are very intense in terms of computing time; on the other hand, less intensive computing is required with BMORS functions BMORS() and BMORS_Env() that are also included in the BMTME package.


Assuntos
Teorema de Bayes , Biologia Computacional/métodos , Interação Gene-Ambiente , Genômica/métodos , Característica Quantitativa Herdável , Software , Algoritmos , Modelos Estatísticos , Zea mays/genética
17.
G3 (Bethesda) ; 9(5): 1545-1556, 2019 05 07.
Artigo em Inglês | MEDLINE | ID: mdl-30858235

RESUMO

Multiple-trait experiments with mixed phenotypes (binary, ordinal and continuous) are not rare in animal and plant breeding programs. However, there is a lack of statistical models that can exploit the correlation between traits with mixed phenotypes in order to improve prediction accuracy in the context of genomic selection (GS). For this reason, when breeders have mixed phenotypes, they usually analyze them using univariate models, and thus are not able to exploit the correlation between traits, which many times helps improve prediction accuracy. In this paper we propose applying deep learning for analyzing multiple traits with mixed phenotype data in terms of prediction accuracy. The prediction performance of multiple-trait deep learning with mixed phenotypes (MTDLMP) models was compared to the performance of univariate deep learning (UDL) models. Both models were evaluated using predictors with and without the genotype × environment (G×E) interaction term (I and WI, respectively). The metric used for evaluating prediction accuracy was Pearson's correlation for continuous traits and the percentage of cases correctly classified (PCCC) for binary and ordinal traits. We found that a modest gain in prediction accuracy was obtained only in the continuous trait under the MTDLMP model compared to the UDL model, whereas for the other traits (1 binary and 2 ordinal) we did not find any difference between the two models. In both models we observed that the prediction performance was better for WI than for I. The MTDLMP model is a good alternative for performing simultaneous predictions of mixed phenotypes (binary, ordinal and continuous) in the context of GS.


Assuntos
Aprendizado Profundo , Estudos de Associação Genética , Genoma , Genômica , Modelos Genéticos , Fenótipo , Característica Quantitativa Herdável , Algoritmos , Genoma de Planta , Genômica/métodos , Genótipo , Melhoramento Vegetal , Reprodutibilidade dos Testes , Seleção Genética
18.
Theor Appl Genet ; 132(5): 1587-1606, 2019 May.
Artigo em Inglês | MEDLINE | ID: mdl-30747261

RESUMO

KEY MESSAGE: Current genome-enabled prediction models assumed errors normally distributed, which are sensitive to outliers. We propose a model with errors assumed to follow a Laplace distribution to deal better with outliers. Current genome-enabled prediction models use regressions that fit the expected value (mean) of a response variable with errors assumed normally distributed, which are often sensitive to outliers, either genetic or environmental. For this reason, we propose a robust Bayesian genome median regression (BGMR) model that fits regressions to the medians of a distribution, with errors assumed to follow a Laplace distribution to deal better with outliers. The BGMR model was evaluated under a Bayesian framework with Markov Chain Monte Carlo sampling using a location-scale mixture representation of the Laplace distribution. The BGMR was implemented with two simulated and two real genomic data sets, and we compared its prediction performance with that of a conventional genomic best linear unbiased prediction (GBLUP) model and the Laplace maximum a posteriori (LMAP) method. The prediction accuracies of BGMR were higher than those of the GBLUP and LMAP methods when there were outliers. The BGMR model could be useful to breeders who need to predict and select genotypes based on data with unknown outliers.


Assuntos
Cruzamento , Genoma de Planta , Modelos Teóricos , Plantas/genética , Teorema de Bayes , Simulação por Computador , Cadeias de Markov , Método de Monte Carlo , Análise de Regressão
19.
Front Plant Sci ; 10: 30, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30778360

RESUMO

Aflatoxin contamination of maize grain and products causes serious health problems for consumers worldwide, and especially in low- and middle-income countries where monitoring and safety standards are inconsistently implemented. Vitamin A deficiency (VAD) also compromises the health of millions of maize consumers in several regions of the world including large parts of sub-Saharan Africa. We investigated whether provitamin A (proVA) enriched maize can simultaneously contribute to alleviate both of these health concerns. We studied aflatoxin accumulation in grain of 120 maize hybrids formed by crossing 3 Aspergillus flavus resistant and three susceptible lines with 20 orange maize lines with low to high carotenoids concentrations. The hybrids were grown in replicated, artificially-inoculated field trials at five environments. Grain of hybrids with larger concentrations of beta-carotene (BC), beta-cryptoxanthin (BCX) and total proVA had significantly less aflatoxin contamination than hybrids with lower carotenoids concentrations. Aflatoxin contamination had negative genetic correlation with BCX (-0.28, p < 0.01), BC (-0.18, p < 0.05), and proVA (-0.23, p < 0.05). The relative ease of breeding for increased proVA carotenoid concentrations as compared to breeding for aflatoxin resistance in maize suggests using the former as a component of strategies to combat aflatoxin contamination problems for maize. Our findings indicate that proVA enriched maize can be particularly beneficial where the health burdens of exposure to aflatoxin and prevalence of VAD converge with high rates of maize consumption.

20.
Theor Appl Genet ; 132(6): 1705-1720, 2019 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-30778634

RESUMO

Genomic selection (GS) models have been validated for many quantitative traits in wheat (Triticum aestivum L.) breeding. However, those models are mostly constrained within the same growing cycle and the extension of GS to the case of across cycles has been a challenge, mainly due to the low predictive accuracy resulting from two factors: reduced genetic relationships between different families and augmented environmental variances between cycles. Using the data collected from diverse field conditions at the International Wheat and Maize Improvement Center, we evaluated GS for grain yield in three elite yield trials across three wheat growing cycles. The objective of this project was to employ the secondary traits, canopy temperature, and green normalized difference vegetation index, which are closely associated with grain yield from high-throughput phenotyping platforms, to improve prediction accuracy for grain yield. The ability to predict grain yield was evaluated reciprocally across three cycles with or without secondary traits. Our results indicate that prediction accuracy increased by an average of 146% for grain yield across cycles with secondary traits. In addition, our results suggest that secondary traits phenotyped during wheat heading and early grain filling stages were optimal for enhancing the prediction accuracy for grain yield.


Assuntos
Genética Populacional , Genoma de Planta , Genômica/métodos , Melhoramento Vegetal/métodos , Seleção Genética , Triticum/genética , Marcadores Genéticos , Fenótipo , Triticum/crescimento & desenvolvimento
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA