Your browser doesn't support javascript.
loading
Learning to Classify Organic and Conventional Wheat - A Machine Learning Driven Approach Using the MeltDB 2.0 Metabolomics Analysis Platform.
Kessler, Nikolas; Bonte, Anja; Albaum, Stefan P; Mäder, Paul; Messmer, Monika; Goesmann, Alexander; Niehaus, Karsten; Langenkämper, Georg; Nattkemper, Tim W.
Afiliación
  • Kessler N; Biodata Mining Group, Faculty of Technology, Bielefeld University , Bielefeld , Germany ; Bioinformatics Resource Facility, Center for Biotechnology, Bielefeld University , Bielefeld , Germany.
  • Bonte A; Department of Safety and Quality of Cereals, Max Rubner-Institut , Detmold , Germany.
  • Albaum SP; Bioinformatics Resource Facility, Center for Biotechnology, Bielefeld University , Bielefeld , Germany.
  • Mäder P; Department of Soil Sciences, Research Institute of Organic Agriculture (FiBL) , Frick , Switzerland.
  • Messmer M; Department of Crop Sciences, Research Institute of Organic Agriculture (FiBL) , Frick , Switzerland.
  • Goesmann A; Bioinformatics and Systems Biology, Justus-Liebig-University Gießen , Gießen , Germany.
  • Niehaus K; Department of Proteome and Metabolome Research, Faculty of Biology, Center for Biotechnology, Bielefeld University , Bielefeld , Germany.
  • Langenkämper G; Department of Safety and Quality of Cereals, Max Rubner-Institut , Detmold , Germany.
  • Nattkemper TW; Biodata Mining Group, Faculty of Technology, Bielefeld University , Bielefeld , Germany.
Article en En | MEDLINE | ID: mdl-25853128
ABSTRACT
We present results of our machine learning approach to the problem of classifying GC-MS data originating from wheat grains of different farming systems. The aim is to investigate the potential of learning algorithms to classify GC-MS data to be either from conventionally grown or from organically grown samples and considering different cultivars. The motivation of our work is rather obvious nowadays increased demand for organic food in post-industrialized societies and the necessity to prove organic food authenticity. The background of our data set is given by up to 11 wheat cultivars that have been cultivated in both farming systems, organic and conventional, throughout 3 years. More than 300 GC-MS measurements were recorded and subsequently processed and analyzed in the MeltDB 2.0 metabolomics analysis platform, being briefly outlined in this paper. We further describe how unsupervised (t-SNE, PCA) and supervised (SVM) methods can be applied for sample visualization and classification. Our results clearly show that years have most and wheat cultivars have second-most influence on the metabolic composition of a sample. We can also show that for a given year and cultivar, organic and conventional cultivation can be distinguished by machine-learning algorithms.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Front Bioeng Biotechnol Año: 2015 Tipo del documento: Article País de afiliación: Alemania Pais de publicación: CH / SUIZA / SUÍÇA / SWITZERLAND

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Front Bioeng Biotechnol Año: 2015 Tipo del documento: Article País de afiliación: Alemania Pais de publicación: CH / SUIZA / SUÍÇA / SWITZERLAND