Your browser doesn't support javascript.
loading
A Multi-Label Classifier for Predicting the Most Appropriate Instrumental Method for the Analysis of Contaminants of Emerging Concern.
Alygizakis, Nikiforos; Konstantakos, Vasileios; Bouziotopoulos, Grigoris; Kormentzas, Evangelos; Slobodnik, Jaroslav; Thomaidis, Nikolaos S.
Afiliación
  • Alygizakis N; Laboratory of Analytical Chemistry, Department of Chemistry, National and Kapodistrian University of Athens, Panepistimiopolis Zografou, 15771 Athens, Greece.
  • Konstantakos V; Environmental Institute, Okruzná 784/42, 97241 Kos, Slovakia.
  • Bouziotopoulos G; National Centre for Scientific Research "Demokritos", Institute of Informatics and Telecommunications, 15341 Agia Paraskevi, Greece.
  • Kormentzas E; Department of Informatics, National and Kapodistrian University of Athens, Panepistimiopolis Zografou, 15771 Athens, Greece.
  • Slobodnik J; Cognity S.A., Leof. Kifisias 42, 15125 Marousi Athens, Greece.
  • Thomaidis NS; Environmental Institute, Okruzná 784/42, 97241 Kos, Slovakia.
Metabolites ; 12(3)2022 Feb 23.
Article en En | MEDLINE | ID: mdl-35323641
ABSTRACT
Liquid chromatography-high resolution mass spectrometry (LC-HRMS) and gas chromatography-high resolution mass spectrometry (GC-HRMS) have revolutionized analytical chemistry among many other disciplines. These advanced instrumentations allow to theoretically capture the whole chemical universe that is contained in samples, giving unimaginable opportunities to the scientific community. Laboratories equipped with these instruments produce a lot of data daily that can be digitally archived. Digital storage of data opens up the opportunity for retrospective suspect screening investigations for the occurrence of chemicals in the stored chromatograms. The first step of this approach involves the prediction of which data is more appropriate to be searched. In this study, we built an optimized multi-label classifier for predicting the most appropriate instrumental method (LC-HRMS or GC-HRMS or both) for the analysis of chemicals in digital specimens. The approach involved the generation of a baseline model based on the knowledge that an expert would use and the generation of an optimized machine learning model. A multi-step feature selection approach, a model selection strategy, and optimization of the classifier's hyperparameters led to a model with accuracy that outperformed the baseline implementation. The models were used to predict the most appropriate instrumental technique for new substances. The scripts are available at GitHub and the dataset at Zenodo.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Metabolites Año: 2022 Tipo del documento: Article País de afiliación: Grecia

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Metabolites Año: 2022 Tipo del documento: Article País de afiliación: Grecia
...