Your browser doesn't support javascript.
loading
Montrer: 20 | 50 | 100
Résultats 1 - 20 de 20
Filtrer
Plus de filtres











Gamme d'année
1.
Front Plant Sci ; 15: 1373318, 2024.
Article de Anglais | MEDLINE | ID: mdl-39086911

RÉSUMÉ

Coffee Breeding programs have traditionally relied on observing plant characteristics over years, a slow and costly process. Genomic selection (GS) offers a DNA-based alternative for faster selection of superior cultivars. Stacking Ensemble Learning (SEL) combines multiple models for potentially even more accurate selection. This study explores SEL potential in coffee breeding, aiming to improve prediction accuracy for important traits [yield (YL), total number of the fruits (NF), leaf miner infestation (LM), and cercosporiosis incidence (Cer)] in Coffea Arabica. We analyzed data from 195 individuals genotyped for 21,211 single-nucleotide polymorphism (SNP) markers. To comprehensively assess model performance, we employed a cross-validation (CV) scheme. Genomic Best Linear Unbiased Prediction (GBLUP), multivariate adaptive regression splines (MARS), Quantile Random Forest (QRF), and Random Forest (RF) served as base learners. For the meta-learner within the SEL framework, various options were explored, including Ridge Regression, RF, GBLUP, and Single Average. The SEL method was able to predict the predictive ability (PA) of important traits in Coffea Arabica. SEL presented higher PA compared with those obtained for all base learner methods. The gains in PA in relation to GBLUP were 87.44% (the ratio between the PA obtained from best Stacking model and the GBLUP), 37.83%, 199.82%, and 14.59% for YL, NF, LM and Cer, respectively. Overall, SEL presents a promising approach for GS. By combining predictions from multiple models, SEL can potentially enhance the PA of GS for complex traits.

2.
Sci Rep ; 14(1): 1062, 2024 01 11.
Article de Anglais | MEDLINE | ID: mdl-38212638

RÉSUMÉ

In the context of multi-environment trials (MET), genomic prediction is proposed as a tool that allows the prediction of the phenotype of single cross hybrids that were not tested in field trials. This approach saves time and costs compared to traditional breeding methods. Thus, this study aimed to evaluate the genomic prediction of single cross maize hybrids not tested in MET, grain yield and female flowering time. We also aimed to propose an application of machine learning methodologies in MET in the prediction of hybrids and compare their performance with Genomic best linear unbiased prediction (GBLUP) with non-additive effects. Our results highlight that both methodologies are efficient and can be used in maize breeding programs to accurately predict the performance of hybrids in specific environments. The best methodology is case-dependent, specifically, to explore the potential of GBLUP, it is important to perform accurate modeling of the variance components to optimize the prediction of new hybrids. On the other hand, machine learning methodologies can capture non-additive effects without making any assumptions at the outset of the model. Overall, predicting the performance of new hybrids that were not evaluated in any field trials was more challenging than predicting hybrids in sparse test designs.


Sujet(s)
Hybridation génétique , Zea mays , Génotype , Zea mays/génétique , Génome végétal , Amélioration des plantes , Phénotype , Génomique/méthodes , Apprentissage machine , Modèles génétiques
3.
Theor Appl Genet ; 137(1): 9, 2023 Dec 15.
Article de Anglais | MEDLINE | ID: mdl-38102495

RÉSUMÉ

KEY MESSAGE: An approach for handling visual scores with potential errors and subjectivity in scores was evaluated in simulated and blueberry recurrent selection breeding schemes to assist breeders in their decision-making. Most genomic prediction methods are based on assumptions of normality due to their simplicity and ease of implementation. However, in plant and animal breeding, continuous traits are often visually scored as categorical traits and analyzed as a Gaussian variable, thus violating the normality assumption, which could affect the prediction of breeding values and the estimation of genetic parameters. In this study, we examined the main challenges of visual scores for genomic prediction and genetic parameter estimation using mixed models, Bayesian, and machine learning methods. We evaluated these approaches using simulated and real breeding data sets. Our contribution in this study is a five-fold demonstration: (i) collecting data using an intermediate number of categories (1-3 and 1-5) is the best strategy, even considering errors associated with visual scores; (ii) Linear Mixed Models and Bayesian Linear Regression are robust to the normality violation, but marginal gains can be achieved when using Bayesian Ordinal Regression Models (BORM) and Random Forest Classification; (iii) genetic parameters are better estimated using BORM; (iv) our conclusions using simulated data are also applicable to real data in autotetraploid blueberry; and (v) a comparison of continuous and categorical phenotypes found that investing in the evaluation of 600-1000 categorical data points with low error, when it is not feasible to collect continuous phenotypes, is a strategy for improving predictive abilities. Our findings suggest the best approaches for effectively using visual scores traits to explore genetic information in breeding programs and highlight the importance of investing in the training of evaluator teams and in high-quality phenotyping.


Sujet(s)
Hérédité multifactorielle , Amélioration des plantes , Animaux , Théorème de Bayes , Génome , Génomique/méthodes , Phénotype , Modèles génétiques
4.
Sci Rep ; 13(1): 9585, 2023 06 13.
Article de Anglais | MEDLINE | ID: mdl-37311810

RÉSUMÉ

The aim of this study was to evaluate the performance of Quantile Regression (QR) in Genome-Wide Association Studies (GWAS) regarding the ability to detect QTLs (Quantitative Trait Locus) associated with phenotypic traits of interest, considering different population sizes. For this, simulated data was used, with traits of different levels of heritability (0.30 and 0.50), and controlled by 3 and 100 QTLs. Populations of 1,000 to 200 individuals were defined, with a random reduction of 100 individuals for each population. The power of detection of QTLs and the false positive rate were obtained by means of QR considering three different quantiles (0.10, 0.50 and 0.90) and also by means of the General Linear Model (GLM). In general, it was observed that the QR models showed greater power of detection of QTLs in all scenarios evaluated and a relatively low false positive rate in scenarios with a greater number of individuals. The models with the highest detection power of true QTLs at the extreme quantils (0.10 and 0.90) were the ones with the highest detection power of true QTLs. In contrast, the analysis based on the GLM detected few (scenarios with larger population size) or no QTLs in the evaluated scenarios. In the scenarios with low heritability, QR obtained a high detection power. Thus, it was verified that the use of QR in GWAS is effective, allowing the detection of QTLs associated with traits of interest even in scenarios with few genotyped and phenotyped individuals.


Sujet(s)
Étude d'association pangénomique , Locus de caractère quantitatif , Humains , Densité de population , Locus de caractère quantitatif/génétique , Génotype , Modèles linéaires
5.
Ciênc. rural (Online) ; 53(10): e20220327, 2023. tab, graf
Article de Anglais | VETINDEX | ID: biblio-1418792

RÉSUMÉ

Quantile Random Forest (QRF) is a non-parametric methodology that combines the advantages of Random Forest (RF) and Quantile Regression (QR). Specifically, this approach can explore non-linear functions, determining the probability distribution of a response variable and extracting information from different quantiles instead of just predicting the mean. This evaluated the performance of the QRF in the genomic prediction for complex traits (epistasis and dominance). In addition, compare the accuracies obtained with those derived from the G-BLUP. The simulation created an F2 population with 1,000 individuals and genotyped for 4,010 SNP markers. Besides, twelve traits were simulated from a model considering additive and non-additive effects, QTL (Quantitative trait loci) numbers ranging from eight to 120, and heritability of 0.3, 0.5, or 0.8. For training and validation, the 5-fold cross-validation approach was used. For each fold, the accuracies of all the proposed models were calculated: QRF in five different quantiles and three G-BLUP models (additive effect, additive and epistatic effects, additive and dominant effects). Finally, the predictive performance of these methodologies was compared. In all scenarios, the QRF accuracies were equal to or greater than the methodologies evaluated and proved to be an alternative tool to predict genetic values in complex traits.


Quantile Random Forest (QRF) é uma metodologia não paramétrica, que combina as vantagens do Random Forest (RF) e da Regressão Quantílica (QR). Especificamente, essa abordagem pode explorar funções não lineares, determinando a distribuição de probabilidade de uma variável resposta e extraindo informações de diferentes quantis em vez de apenas prever a média. O objetivo deste trabalho foi avaliar o desempenho do QRF em predizer o valor genético genômico para características com arquitetura genética não aditiva (epistasia e dominância). Adicionalmente, as acurácias obtidas foram comparadas com aquelas advindas do G-BLUP. A simulação criou uma população F2 com 1.000 indivíduos genotipados para 4.010 marcadores SNP. Além disso, doze características foram simuladas a partir de um modelo considerando efeitos aditivos e não aditivos, com número de QTL (Quantitative trait loci) variando de oito a 120 e herdabilidade de 0,3, 0,5 ou 0,8. Para treinamento e validação foi usada a abordagem da validação cruzada 5-fold. Para cada um dos folds foram calculadas as acurácias de todos os modelos propostos: QRF em cinco quantis diferentes e três modelos do G-BLUP (com efeito aditivo, aditivo e epistático, aditivo e dominante). Por fim, o desempenho preditivo dessas metodologias foi comparado. Em todos os cenários, as acurácias do QRF foram iguais ou superiores às metodologias avaliadas e mostrou ser uma ferramenta alternativa para predizer valores genéticos em características complexas.


Sujet(s)
Sélection génétique , Génome , Génomique , Épistasie , Forêts aléatoires
6.
Ciênc. rural (Online) ; 53(10): e20220327, 2023. tab, graf
Article de Anglais | LILACS-Express | VETINDEX | ID: biblio-1430203

RÉSUMÉ

ABSTRACT: Quantile Random Forest (QRF) is a non-parametric methodology that combines the advantages of Random Forest (RF) and Quantile Regression (QR). Specifically, this approach can explore non-linear functions, determining the probability distribution of a response variable and extracting information from different quantiles instead of just predicting the mean. This evaluated the performance of the QRF in the genomic prediction for complex traits (epistasis and dominance). In addition, compare the accuracies obtained with those derived from the G-BLUP. The simulation created an F2 population with 1,000 individuals and genotyped for 4,010 SNP markers. Besides, twelve traits were simulated from a model considering additive and non-additive effects, QTL (Quantitative trait loci) numbers ranging from eight to 120, and heritability of 0.3, 0.5, or 0.8. For training and validation, the 5-fold cross-validation approach was used. For each fold, the accuracies of all the proposed models were calculated: QRF in five different quantiles and three G-BLUP models (additive effect, additive and epistatic effects, additive and dominant effects). Finally, the predictive performance of these methodologies was compared. In all scenarios, the QRF accuracies were equal to or greater than the methodologies evaluated and proved to be an alternative tool to predict genetic values in complex traits.


RESUMO: Quantile Random Forest (QRF) é uma metodologia não paramétrica, que combina as vantagens do Random Forest (RF) e da Regressão Quantílica (QR). Especificamente, essa abordagem pode explorar funções não lineares, determinando a distribuição de probabilidade de uma variável resposta e extraindo informações de diferentes quantis em vez de apenas prever a média. O objetivo deste trabalho foi avaliar o desempenho do QRF em predizer o valor genético genômico para características com arquitetura genética não aditiva (epistasia e dominância). Adicionalmente, as acurácias obtidas foram comparadas com aquelas advindas do G-BLUP. A simulação criou uma população F2 com 1.000 indivíduos genotipados para 4.010 marcadores SNP. Além disso, doze características foram simuladas a partir de um modelo considerando efeitos aditivos e não aditivos, com número de QTL (Quantitative trait loci) variando de oito a 120 e herdabilidade de 0,3, 0,5 ou 0,8. Para treinamento e validação foi usada a abordagem da validação cruzada 5-fold. Para cada um dos folds foram calculadas as acurácias de todos os modelos propostos: QRF em cinco quantis diferentes e três modelos do G-BLUP (com efeito aditivo, aditivo e epistático, aditivo e dominante). Por fim, o desempenho preditivo dessas metodologias foi comparado. Em todos os cenários, as acurácias do QRF foram iguais ou superiores às metodologias avaliadas e mostrou ser uma ferramenta alternativa para predizer valores genéticos em características complexas.

7.
Sci. agric ; 80: e20220056, 2023. tab, ilus
Article de Anglais | VETINDEX | ID: biblio-1410169

RÉSUMÉ

Among the multi-trait models selected to study several traits and environments jointly, the Bayesian framework has been a preferred tool when constructing a more complex and biologically realistic model. In most cases, non-informative prior distributions are adopted in studies using the Bayesian approach. However, the Bayesian approach presents more accurate estimates when informative prior distributions are used. The present study was developed to evaluate the efficiency and applicability of multi-trait multi-environment (MTME) models within a Bayesian framework utilizing a strategy for eliciting informative prior distribution using previous data on rice. The study involved data pertaining to rice (Oryza sativa L.) genotypes in three environments and five crop seasons (2010/2011 until 2014/2015) for the following traits: grain yield (GY), flowering in days (FLOR) and plant height (PH). Variance components, genetic and non-genetic parameters were estimated using the Bayesian method. In general, the informative prior distribution in Bayesian MTME models provided higher estimates of individual narrow-sense heritability and variance components, as well as minor lengths for the highest probability density interval (HPD), compared to their respective non-informative prior distribution analyses. More informative prior distributions make it possible to detect genetic correlations between traits, which cannot be achieved with non-informative prior distributions. Therefore, this mechanism presented to update knowledge for an elicitation of an informative prior distribution can be efficiently applied in rice breeding programs.


Sujet(s)
Oryza/croissance et développement , Aliment génétiquement modifié/statistiques et données numériques
8.
Sci. agric ; 79(6): e20200397, 2022. tab
Article de Anglais | VETINDEX | ID: biblio-1347913

RÉSUMÉ

The principal component regression (PCR) and the independent component regression (ICR) are dimensionality reduction methods and extremely important in genomic prediction. These methods require the choice of the number of components to be inserted into the model. For PCR, there are formal criteria; however, for ICR, the adopted criterion chooses the number of independent components (ICs) associated to greater accuracy and requires high computational time. In this study, seven criteria based on the number of principal components (PCs) and methods of variable selection to guide this choice in ICR are proposed and evaluated in simulated and real data. For both datasets, the most efficient criterion and that drastically reduced computational time determined that the number of ICs should be equal to the number of PCs to reach a higher accuracy value. In addition, the criteria did not recover the simulated heritability and generated biased genomic values.


Sujet(s)
Oryza/génétique , Amélioration des plantes/méthodes , Analyse de régression , Prévision/méthodes
9.
PLoS One ; 16(1): e0243666, 2021.
Article de Anglais | MEDLINE | ID: mdl-33400704

RÉSUMÉ

This study assessed the efficiency of Genomic selection (GS) or genome-wide selection (GWS), based on Regularized Quantile Regression (RQR), in the selection of genotypes to breed autogamous plant populations with oligogenic traits. To this end, simulated data of an F2 population were used, with traits with different heritability levels (0.10, 0.20 and 0.40), controlled by four genes. The generations were advanced (up to F6) at two selection intensities (10% and 20%). The genomic genetic value was computed by RQR for different quantiles (0.10, 0.50 and 0.90), and by the traditional GWS methods, specifically RR-BLUP and BLASSO. A second objective was to find the statistical methodology that allows the fastest fixation of favorable alleles. In general, the results of the RQR model were better than or equal to those of traditional GWS methodologies, achieving the fixation of favorable alleles in most of the evaluated scenarios. At a heritability level of 0.40 and a selection intensity of 10%, RQR (0.50) was the only methodology that fixed the alleles quickly, i.e., in the fourth generation. Thus, it was concluded that the application of RQR in plant breeding, to simulated autogamous plant populations with oligogenic traits, could reduce time and consequently costs, due to the reduction of selfing generations to fix alleles in the evaluated scenarios.


Sujet(s)
Simulation numérique , Génome végétal , Modèles génétiques , Plantes/génétique , Sélection génétique , Marqueurs génétiques , Génotype , Amélioration des plantes , Caractère quantitatif héréditaire
10.
Sci. agric ; 78(4): 1-8, 2021. ilus, graf, tab
Article de Anglais | VETINDEX | ID: biblio-1497961

RÉSUMÉ

Genomic selection (GS) emphasizes the simultaneous prediction of the genetic effects of thousands of scattered markers over the genome. Several statistical methodologies have been used in GS for the prediction of genetic merit. In general, such methodologies require certain assumptions about the data, such as the normality of the distribution of phenotypic values. To circumvent the non-normality of phenotypic values, the literature suggests the use of Bayesian Generalized Linear Regression (GBLASSO). Another alternative is the models based on machine learning, represented by methodologies such as Artificial Neural Networks (ANN), Decision Trees (DT) and related possible refinements such as Bagging, Random Forest and Boosting. This study aimed to use DT and its refinements for predicting resistance to orange rust in Arabica coffee. Additionally, DT and its refinements were used to identify the importance of markers related to the characteristic of interest. The results were compared with those from GBLASSO and ANN. Data on coffee rust resistance of 245 Arabica coffee plants genotyped for 137 markers were used. The DT refinements presented equal or inferior values of Apparent Error Rate compared to those obtained by DT, GBLASSO, and ANN. Moreover, DT refinements were able to identify important markers for the characteristic of interest. Out of 14 of the most important markers analyzed in each methodology, 9.3 markers on average were in regions of quantitative trait loci (QTLs) related to resistance to disease listed in the literature.


Sujet(s)
Coffea/génétique , Coffea/parasitologie , Champignons/croissance et développement , Champignons/pathogénicité , Intelligence artificielle
11.
Sci. agric. ; 78(4): 1-8, 2021. ilus, graf, tab
Article de Anglais | VETINDEX | ID: vti-31520

RÉSUMÉ

Genomic selection (GS) emphasizes the simultaneous prediction of the genetic effects of thousands of scattered markers over the genome. Several statistical methodologies have been used in GS for the prediction of genetic merit. In general, such methodologies require certain assumptions about the data, such as the normality of the distribution of phenotypic values. To circumvent the non-normality of phenotypic values, the literature suggests the use of Bayesian Generalized Linear Regression (GBLASSO). Another alternative is the models based on machine learning, represented by methodologies such as Artificial Neural Networks (ANN), Decision Trees (DT) and related possible refinements such as Bagging, Random Forest and Boosting. This study aimed to use DT and its refinements for predicting resistance to orange rust in Arabica coffee. Additionally, DT and its refinements were used to identify the importance of markers related to the characteristic of interest. The results were compared with those from GBLASSO and ANN. Data on coffee rust resistance of 245 Arabica coffee plants genotyped for 137 markers were used. The DT refinements presented equal or inferior values of Apparent Error Rate compared to those obtained by DT, GBLASSO, and ANN. Moreover, DT refinements were able to identify important markers for the characteristic of interest. Out of 14 of the most important markers analyzed in each methodology, 9.3 markers on average were in regions of quantitative trait loci (QTLs) related to resistance to disease listed in the literature.(AU)


Sujet(s)
Coffea/génétique , Coffea/parasitologie , Champignons/croissance et développement , Champignons/pathogénicité , Intelligence artificielle
12.
Ci. Rural ; 50(1): e20180385, Jan. 31, 2020. tab, graf
Article de Anglais | VETINDEX | ID: vti-24970

RÉSUMÉ

The objective of this study was to adjust nonlinear quantile regression models for the study of dry matter accumulation in garlic plants over time, and to compare them to models fitted by the ordinary least squares method. The total dry matter of nine garlic accessions belonging to the Vegetable Germplasm Bank of Universidade Federal de Viçosa (BGH/UFV) was measured in four stages (60, 90, 120 and 150 days after planting), and those values were used for the nonlinear regression models fitting. For each accession, there was an adjustment of one model of quantile regression (τ=0.5) and one based on the least squares method. The nonlinear regression model fitted was the Logistic. The Akaike Information Criterion was used to evaluate the goodness of fit of the models. Accessions were grouped using the UPGMA algorithm, with the estimates of the parameters with biological interpretation as variables. The nonlinear quantile regression is efficient for the adjustment of models for dry matter accumulation in garlic plants over time. The estimated parameters are more uniform and robust in the presence of asymmetry in the distribution of the data, heterogeneous variances, and outliers.(AU)


Este trabalho teve como objetivo ajustar modelos de regressão quantílica não linear para o estudo do acúmulo de matéria seca total em plantas de alho ao longo do tempo, e compará-los com modelos ajustados pelo método dos mínimos quadrados. A matéria seca total de nove acessos de alho pertencentes ao Banco de Germoplasma de Hortaliças da Universidade Federal de Viçosa (BGH/UFV) foi avaliada em quatro períodos (60, 90, 120 e 150 dias após plantio), e estes valores foram utilizados para o ajuste de modelos de regressão - não linear - logística. Para cada acesso, foram ajustados um modelo de regressão quantílica (τ=0,5) e um modelo pela metodologia dos mínimos quadrados. Para avaliar a qualidade de ajuste dos modelos foi utilizado o Critério de Informação de Akaike. Os acessos foram agrupados pelo algoritmo UPGMA, utilizando as estimativas dos parâmetros com interpretação biológica como variáveis. A regressão quantílica não linear foi eficiente no ajuste de modelos para descrição do acúmulo de matéria seca ao longo do tempo. As estimativas de parâmetros foram mais uniformes e robustas na presença de dados assimétricos, variâncias heterogêneas e de valores discrepantes.(AU)


Sujet(s)
Analyse de régression , Ail , 24444
13.
Ciênc. rural (Online) ; 50(1): e20180385, 2020. tab, graf
Article de Anglais | LILACS-Express | LILACS | ID: biblio-1055840

RÉSUMÉ

ABSTRACT: The objective of this study was to adjust nonlinear quantile regression models for the study of dry matter accumulation in garlic plants over time, and to compare them to models fitted by the ordinary least squares method. The total dry matter of nine garlic accessions belonging to the Vegetable Germplasm Bank of Universidade Federal de Viçosa (BGH/UFV) was measured in four stages (60, 90, 120 and 150 days after planting), and those values were used for the nonlinear regression models fitting. For each accession, there was an adjustment of one model of quantile regression (τ=0.5) and one based on the least squares method. The nonlinear regression model fitted was the Logistic. The Akaike Information Criterion was used to evaluate the goodness of fit of the models. Accessions were grouped using the UPGMA algorithm, with the estimates of the parameters with biological interpretation as variables. The nonlinear quantile regression is efficient for the adjustment of models for dry matter accumulation in garlic plants over time. The estimated parameters are more uniform and robust in the presence of asymmetry in the distribution of the data, heterogeneous variances, and outliers.


RESUMO: Este trabalho teve como objetivo ajustar modelos de regressão quantílica não linear para o estudo do acúmulo de matéria seca total em plantas de alho ao longo do tempo, e compará-los com modelos ajustados pelo método dos mínimos quadrados. A matéria seca total de nove acessos de alho pertencentes ao Banco de Germoplasma de Hortaliças da Universidade Federal de Viçosa (BGH/UFV) foi avaliada em quatro períodos (60, 90, 120 e 150 dias após plantio), e estes valores foram utilizados para o ajuste de modelos de regressão - não linear - logística. Para cada acesso, foram ajustados um modelo de regressão quantílica (τ=0,5) e um modelo pela metodologia dos mínimos quadrados. Para avaliar a qualidade de ajuste dos modelos foi utilizado o Critério de Informação de Akaike. Os acessos foram agrupados pelo algoritmo UPGMA, utilizando as estimativas dos parâmetros com interpretação biológica como variáveis. A regressão quantílica não linear foi eficiente no ajuste de modelos para descrição do acúmulo de matéria seca ao longo do tempo. As estimativas de parâmetros foram mais uniformes e robustas na presença de dados assimétricos, variâncias heterogêneas e de valores discrepantes.

14.
PLoS One ; 14(4): e0215315, 2019.
Article de Anglais | MEDLINE | ID: mdl-30998705

RÉSUMÉ

At present, single-trait best linear unbiased prediction (BLUP) is the standard method for genetic selection in soybean. However, when genetic selection is performed based on two or more genetically correlated traits and these are analyzed individually, selection bias may arise. Under these conditions, considering the correlation structure between the evaluated traits may provide more-accurate genetic estimates for the evaluated parameters, even under environmental influences. The present study was thus developed to examine the efficiency and applicability of multi-trait multi-environment (MTME) models by the residual maximum likelihood (REML/BLUP) and Bayesian approaches in the genetic selection of segregating soybean progeny. The study involved data pertaining to 203 soybean F2:4 progeny assessed in two environments for the following traits: number of days to maturity (DM), 100-seed weight (SW), and average seed yield per plot (SY). Variance components and genetic and non-genetic parameters were estimated via the REML/BLUP and Bayesian methods. The variance components estimated and the breeding values and genetic gains predicted with selection through the Bayesian procedure were similar to those obtained by REML/BLUP. The frequentist and Bayesian MTME models provided higher estimates of broad-sense heritability per plot (or heritability of total effects of progeny; [Formula: see text]) and mean accuracy of progeny than their respective single-trait versions. Bayesian analysis provided the credibility intervals for the estimates of [Formula: see text]. Therefore, MTME led to greater predicted gains from selection. On this basis, this procedure can be efficiently applied in the genetic selection of segregating soybean progeny.


Sujet(s)
Interaction entre gènes et environnement , Génotype , Glycine max/génétique , Modèles génétiques , Hérédité multifactorielle , Sélection génétique
15.
Ci. Rural ; 49(3): e20180045, Mar. 14. tab, graf
Article de Anglais | VETINDEX | ID: vti-17519

RÉSUMÉ

The aim of this study was to use quantile regression (QR) to characterize the effect of the adaptability parameter throughout the distribution of the productivity variable on black bean cultivars launched by different national research institutes (research centers) over the last 50 years. For this purpose, 40 cultivars developed by Brazilian genetic improvement programs between 1959 and 2013 were used. Initially, QR models were adjusted considering three quantiles (τ = 0.2, 0.5 and 0.8). Subsequently, with the confidence intervals, quantile models τ = 0.2 and 0.8 (QR0.2 and QR0.8) showed differences regarding the parameter of adaptability and average productivity. Finally, by grouping the cultivars into one of the two groups defined from QR0.2 and QR0.8, it was reported that the younger cultivars were associated to the quantile τ = 0.8, i.e., those with higher yields and more responsive conditions indicating that genetic improvement over the last 50 years resulted in an increase in both the productivity and the adaptability of cultivars.(AU)


Neste estudo objetivou-se utilizar a regressão quantílica (RQ) para caracterizar o efeito do parâmetro de adaptabilidade ao longo de toda a distribuição da variável produtividade em cultivares de feijão preto lançadas por diferentes instituições nacionais de pesquisa nos últimos 50 anos. Para tanto utilizou-se 40 cultivares desenvolvidas pelos programas brasileiros de melhoramento genético entre os anos de 1959 a 2013. Inicialmente foram ajustados modelos de RQ considerando três quantis (τ=0,2, 0,5, 0,8). Posteriormente, com os intervalos de confiança verificou-se que os modelos quantílicos τ=0,2 e 0,8 (RQ0,2 e RQ0,8) apresentaram diferenças quanto ao parâmetro de adaptabilidade e produtividade média. Finalmente, por meio do agrupamento das cultivares em um dos dois grupos definidos a partir de RQ0,2 e RQ0,8, constatou-se que as cultivares mais novas foram associadas ao quantil τ = 0,8, ou seja, aquelas com maiores produtividades e mais responsivas as condições ambientais indicando que o melhoramento ao longo dos últimos 50 anos possibilitou o incremento tanto na produtividade quanto na adaptabilidade das cultivares.(AU)

16.
Ciênc. rural (Online) ; 49(3): e20180045, 2019. tab, graf
Article de Anglais | LILACS | ID: biblio-1045320

RÉSUMÉ

ABSTRACT: The aim of this study was to use quantile regression (QR) to characterize the effect of the adaptability parameter throughout the distribution of the productivity variable on black bean cultivars launched by different national research institutes (research centers) over the last 50 years. For this purpose, 40 cultivars developed by Brazilian genetic improvement programs between 1959 and 2013 were used. Initially, QR models were adjusted considering three quantiles (τ = 0.2, 0.5 and 0.8). Subsequently, with the confidence intervals, quantile models τ = 0.2 and 0.8 (QR0.2 and QR0.8) showed differences regarding the parameter of adaptability and average productivity. Finally, by grouping the cultivars into one of the two groups defined from QR0.2 and QR0.8, it was reported that the younger cultivars were associated to the quantile τ = 0.8, i.e., those with higher yields and more responsive conditions indicating that genetic improvement over the last 50 years resulted in an increase in both the productivity and the adaptability of cultivars.


RESUMO: Neste estudo objetivou-se utilizar a regressão quantílica (RQ) para caracterizar o efeito do parâmetro de adaptabilidade ao longo de toda a distribuição da variável produtividade em cultivares de feijão preto lançadas por diferentes instituições nacionais de pesquisa nos últimos 50 anos. Para tanto utilizou-se 40 cultivares desenvolvidas pelos programas brasileiros de melhoramento genético entre os anos de 1959 a 2013. Inicialmente foram ajustados modelos de RQ considerando três quantis (τ=0,2, 0,5, 0,8). Posteriormente, com os intervalos de confiança verificou-se que os modelos quantílicos τ=0,2 e 0,8 (RQ0,2 e RQ0,8) apresentaram diferenças quanto ao parâmetro de adaptabilidade e produtividade média. Finalmente, por meio do agrupamento das cultivares em um dos dois grupos definidos a partir de RQ0,2 e RQ0,8, constatou-se que as cultivares mais novas foram associadas ao quantil τ = 0,8, ou seja, aquelas com maiores produtividades e mais responsivas as condições ambientais indicando que o melhoramento ao longo dos últimos 50 anos possibilitou o incremento tanto na produtividade quanto na adaptabilidade das cultivares.

17.
PLoS One ; 13(1): e0190303, 2018.
Article de Anglais | MEDLINE | ID: mdl-29300788

RÉSUMÉ

Flowering is an important agronomic trait. Quantile regression (QR) can be used to fit models for all portions of a probability distribution. In Genome-wide association studies (GWAS), QR can estimate SNP (Single Nucleotide Polymorphism) effects on each quantile of interest. The objectives of this study were to estimate genetic parameters and to use QR to identify genomic regions for phenological traits (Days to first flower-DFF; Days for flowering-DTF; Days to end of flowering-DEF) in common bean. A total of 80 genotypes of common beans, with 3 replicates were raised at 4 locations and seasons. Plants were genotyped for 384 SNPs. Traditional single-SNP and 9 QR models, ranging from equally spaced quantiles (τ) 0.1 to 0.9, were used to associate SNPs to phenotype. Heritabilities were moderate high, ranging from 0.32 to 0.58. Genetic and phenotypic correlations were all high, averaging 0.66 and 0.98, respectively. Traditional single-SNP GWAS model was not able to find any SNP-trait association. On the other hand, when using QR methodology considering one extreme quantile (τ = 0.1) we found, respectively 1 and 7, significant SNPs associated for DFF and DTF. Significant SNPs were found on Pv01, Pv02, Pv03, Pv07, Pv10 and Pv11 chromosomes. We investigated potential candidate genes in the region around these significant SNPs. Three genes involved in the flowering pathways were identified, including Phvul.001G214500, Phvul.007G229300 and Phvul.010G142900.1 on Pv01, Pv07 and Pv10, respectively. These results indicate that GWAS-based QR was able to enhance the understanding on genetic architecture of phenological traits (DFF and DTF) in common bean.


Sujet(s)
Fabaceae/génétique , Fleurs/génétique , Gènes de plante , Étude d'association pangénomique , Polymorphisme de nucléotide simple
18.
Ciênc. rural (Online) ; 48(1): e20170322, 2018. tab, graf
Article de Anglais | LILACS | ID: biblio-1044971

RÉSUMÉ

ABSTRACT: Plant growth analyses are important because they generate information on the demand and necessary care for each development stage of a plant. Nonlinear regression models are appropriate for the description of curves of growth, since they include parameters with practical biological interpretation. However, these models present information in terms of the conditional mean, and they are subject to problems in the adjustment caused by possible outliers or asymmetry in the distribution of the data. Quantile regression can solve these problems, and it allows the estimation of different quantiles, generating more complete and robust results. The objective of this research was to adjust a nonlinear quantile regression model for the study of dry matter accumulation in garlic plants (Allium sativum L.) over time, estimating parameters at three different quantiles and classifying each garlic accession according to its growth rate and asymptotic weight. The nonlinear regression model fitted was a Logistic model, and 30 garlic accessions were evaluated. These 30 accessions were divided based on the model with the closest quantile estimates; 12 accessions were classified as of lesser interest for planting, 6 were classified as intermediate, and 12 were classified as of greater interest for planting.


RESUMO: Análises de crescimento de plantas são importantes, pois geram informações sobre a demanda e os cuidados necessários para cada etapa de seu desenvolvimento. Modelos de regressão não linear são apropriados para descrever curvas de crescimento por apresentarem parâmetros com interpretação prática biológica. Entretanto, estes modelos apresentam informações em termos médios, e estão sujeitos a problemas no ajuste proporcionados por possíveis valores extremos ou assimetria na distribuição dos dados. A regressão quantílica pode contornar estes problemas, e ainda permite estimativas de diferentes quantis, gerando resultados mais completos e robustos. Assim, o objetivo deste trabalho foi ajustar um modelo de regressão quantílica não linear para o estudo do acúmulo de matéria seca em plantas de alho (Allium sativum L.) ao longo do tempo, estimando seus parâmetros em três diferentes quantis e classificando cada acesso de alho de acordo com sua taxa de crescimento e peso assintótico. O modelo de regressão não linear ajustado foi o Logístico, e foram utilizados 30 acessos de alho. Estes foram divididos de acordo com a curva do quantil de estimativas mais próximas, sendo classificados 12 acessos como de baixo interesse para o plantio, 6 de interesse intermediário e 12 como de alto interesse.

19.
Ci. Rural ; 48(1)2018. tab, graf
Article de Anglais | VETINDEX | ID: vti-734849

RÉSUMÉ

Plant growth analyses are important because they generate information on the demand and necessary care for each development stage of a plant. Nonlinear regression models are appropriate for the description of curves of growth, since they include parameters with practical biological interpretation. However, these models present information in terms of the conditional mean, and they are subject to problems in the adjustment caused by possible outliers or asymmetry in the distribution of the data. Quantile regression can solve these problems, and it allows the estimation of different quantiles, generating more complete and robust results. The objective of this research was to adjust a nonlinear quantile regression model for the study of dry matter accumulation in garlic plants (Allium sativum L.) over time, estimating parameters at three different quantiles and classifying each garlic accession according to its growth rate and asymptotic weight. The nonlinear regression model fitted was a Logistic model, and 30 garlic accessions were evaluated. These 30 accessions were divided based on the model with the closest quantile estimates; 12 accessions were classified as of lesser interest for planting, 6 were classified as intermediate, and 12 were classified as of greater interest for planting.(AU)


Análises de crescimento de plantas são importantes, pois geram informações sobre a demanda e os cuidados necessários para cada etapa de seu desenvolvimento. Modelos de regressão não linear são apropriados para descrever curvas de crescimento por apresentarem parâmetros com interpretação prática biológica. Entretanto, estes modelos apresentam informações em termos médios, e estão sujeitos a problemas no ajuste proporcionados por possíveis valores extremos ou assimetria na distribuição dos dados. A regressão quantílica pode contornar estes problemas, e ainda permite estimativas de diferentes quantis, gerando resultados mais completos e robustos. Assim, o objetivo deste trabalho foi ajustar um modelo de regressão quantílica não linear para o estudo do acúmulo de matéria seca em plantas de alho (Allium sativum L.) ao longo do tempo, estimando seus parâmetros em três diferentes quantis e classificando cada acesso de alho de acordo com sua taxa de crescimento e peso assintótico. O modelo de regressão não linear ajustado foi o Logístico, e foram utilizados 30 acessos de alho. Estes foram divididos de acordo com a curva do quantil de estimativas mais próximas, sendo classificados 12 acessos como de baixo interesse para o plantio, 6 de interesse intermediário e 12 como de alto interesse.(AU)


Sujet(s)
Ail/croissance et développement , Produits agricoles/croissance et développement , Analyse de régression , Modèles logistiques , Développement des plantes , 24444
20.
PLoS One ; 12(7): e0181195, 2017.
Article de Anglais | MEDLINE | ID: mdl-28715507

RÉSUMÉ

Gene expression time series (GETS) analysis aims to characterize sets of genes according to their longitudinal patterns of expression. Due to the large number of genes evaluated in GETS analysis, an useful strategy to summarize biological functional processes and regulatory mechanisms is through clustering of genes that present similar expression pattern over time. Traditional cluster methods usually ignore the challenges in GETS, such as the lack of data normality and small number of temporal observations. Independent Component Analysis (ICA) is a statistical procedure that uses a transformation to convert raw time series data into sets of values of independent variables, which can be used for cluster analysis to identify sets of genes with similar temporal expression patterns. ICA allows clustering small series of distribution-free data while accounting for the dependence between subsequent time-points. Using temporal simulated and real (four libraries of two pig breeds at 21, 40, 70 and 90 days of gestation) RNA-seq data set we present a methodology (ICAclust) that jointly considers independent components analysis (ICA) and a hierarchical method for clustering GETS. We compare ICAclust results with those obtained for K-means clustering. ICAclust presented, on average, an absolute gain of 5.15% over the best K-means scenario. Considering the worst scenario for K-means, the gain was of 84.85%, when compared with the best ICAclust result. For the real data set, genes were grouped into six distinct clusters with 89, 51, 153, 67, 40, and 58 genes each, respectively. In general, it can be observed that the 6 clusters presented very distinct expression patterns. Overall, the proposed two-step clustering method (ICAclust) performed well compared to K-means, a traditional method used for cluster analysis of temporal gene expression data. In ICAclust, genes with similar expression pattern over time were clustered together.


Sujet(s)
Algorithmes , Analyse de profil d'expression de gènes , Séquençage par oligonucléotides en batterie , ARN , Animaux , Analyse de regroupements , Simulation numérique , Analyse de profil d'expression de gènes/méthodes , Régulation de l'expression des gènes au cours du développement , Modèles moléculaires , Muscles squelettiques/embryologie , Muscles squelettiques/métabolisme , Séquençage par oligonucléotides en batterie/méthodes , ARN/métabolisme , Suidae , Facteurs temps
SÉLECTION CITATIONS
DÉTAIL DE RECHERCHE