Your browser doesn't support javascript.
loading
Data Augmentation Enhances Plant-Genomic-Enabled Predictions.
Montesinos-López, Osval A; Solis-Camacho, Mario Alberto; Crespo-Herrera, Leonardo; Saint Pierre, Carolina; Huerta Prado, Gloria Isabel; Ramos-Pulido, Sofia; Al-Nowibet, Khalid; Fritsche-Neto, Roberto; Gerard, Guillermo; Montesinos-López, Abelardo; Crossa, José.
Afiliação
  • Montesinos-López OA; Facultad de Telemática, Universidad de Colima, Colima 28040, Colima, Mexico.
  • Solis-Camacho MA; Facultad de Telemática, Universidad de Colima, Colima 28040, Colima, Mexico.
  • Crespo-Herrera L; International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera Mexico-Veracruz, Texcoco 52640, Edo. de México, Mexico.
  • Saint Pierre C; International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera Mexico-Veracruz, Texcoco 52640, Edo. de México, Mexico.
  • Huerta Prado GI; Independent Researcher, Zinacatepec 75960, Puebla, Mexico.
  • Ramos-Pulido S; Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, Guadalajara 44430, Jalisco, Mexico.
  • Al-Nowibet K; Distinguish Scientist Fellowship Program and Department of Statistics and Operations Research, King Saud University, Riyah 11451, Saudi Arabia.
  • Fritsche-Neto R; Louisiana State University, Baton Rouge, LA 70803, USA.
  • Gerard G; International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera Mexico-Veracruz, Texcoco 52640, Edo. de México, Mexico.
  • Montesinos-López A; Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, Guadalajara 44430, Jalisco, Mexico.
  • Crossa J; International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera Mexico-Veracruz, Texcoco 52640, Edo. de México, Mexico.
Genes (Basel) ; 15(3)2024 02 24.
Article em En | MEDLINE | ID: mdl-38540344
ABSTRACT
Genomic selection (GS) is revolutionizing plant breeding. However, its practical implementation is still challenging, since there are many factors that affect its accuracy. For this reason, this research explores data augmentation with the goal of improving its accuracy. Deep neural networks with data augmentation (DA) generate synthetic data from the original training set to increase the training set and to improve the prediction performance of any statistical or machine learning algorithm. There is much empirical evidence of their success in many computer vision applications. Due to this, DA was explored in the context of GS using 14 real datasets. We found empirical evidence that DA is a powerful tool to improve the prediction accuracy, since we improved the prediction accuracy of the top lines in the 14 datasets under study. On average, across datasets and traits, the gain in prediction performance of the DA approach regarding the Conventional method in the top 20% of lines in the testing set was 108.4% in terms of the NRMSE and 107.4% in terms of the MAAPE, but a worse performance was observed on the whole testing set. We encourage more empirical evaluations to support our findings.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Genoma de Planta / Genômica Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Genoma de Planta / Genômica Idioma: En Ano de publicação: 2024 Tipo de documento: Article