Your browser doesn't support javascript.
loading
Machine learning assists prediction of genes responsible for plant specialized metabolite biosynthesis by integrating multi-omics data.
Bai, Wenhui; Li, Cheng; Li, Wei; Wang, Hai; Han, Xiaohong; Wang, Peipei; Wang, Li.
Affiliation
  • Bai W; College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan, 030024, China.
  • Li C; Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China, 518000, Shenzhen.
  • Li W; Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China, 518000, Shenzhen.
  • Wang H; Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China, 518000, Shenzhen.
  • Han X; National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization, Joint Laboratory for International Cooperation in Crop Molecular Breeding, China Agricultural University, Beijing, 100193, China.
  • Wang P; College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan, 030024, China. hanxiaohong@tyut.edu.cn.
  • Wang L; Kunpeng Institute of Modern Agriculture at Foshan, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China. wangpeipei02@caas.cn.
BMC Genomics ; 25(1): 418, 2024 Apr 29.
Article in En | MEDLINE | ID: mdl-38679745
ABSTRACT

BACKGROUND:

Plant specialized (or secondary) metabolites (PSM), also known as phytochemicals, natural products, or plant constituents, play essential roles in interactions between plants and environment. Although many research efforts have focused on discovering novel metabolites and their biosynthetic genes, the resolution of metabolic pathways and identified biosynthetic genes was limited by rudimentary analysis approaches and enormous number of candidate genes.

RESULTS:

Here we integrated state-of-the-art automated machine learning (ML) frame AutoGluon-Tabular and multi-omics data from Arabidopsis to predict genes encoding enzymes involved in biosynthesis of plant specialized metabolite (PSM), focusing on the three main PSM categories terpenoids, alkaloids, and phenolics. We found that the related features of genomics and proteomics were the top two crucial categories of features contributing to the model performance. Using only these key features, we built a new model in Arabidopsis, which performed better than models built with more features including those related with transcriptomics and epigenomics. Finally, the built models were validated in maize and tomato, and models tested for maize and trained with data from two other species exhibited either equivalent or superior performance to intraspecies predictions.

CONCLUSIONS:

Our external validation results in grape and poppy on the one hand implied the applicability of our model to the other species, and on the other hand showed enormous potential to improve the prediction of enzymes synthesizing PSM with the inclusion of valid data from a wider range of species.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Arabidopsis / Genomics / Machine Learning Language: En Journal: BMC Genomics / BMC genomics Journal subject: GENETICA Year: 2024 Document type: Article Affiliation country: China Country of publication: Reino Unido

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Arabidopsis / Genomics / Machine Learning Language: En Journal: BMC Genomics / BMC genomics Journal subject: GENETICA Year: 2024 Document type: Article Affiliation country: China Country of publication: Reino Unido