Your browser doesn't support javascript.
loading
Efficient cross-validation traversals in feature subset selection.
Lausser, Ludwig; Szekely, Robin; Schmid, Florian; Maucher, Markus; Kestler, Hans A.
Afiliación
  • Lausser L; Institute of Medical Systems Biology, Ulm University, Ulm, Germany.
  • Szekely R; Faculty of Computer Science, Technische Hochschule Ingolstadt, Ingolstadt, Germany.
  • Schmid F; Institute of Medical Systems Biology, Ulm University, Ulm, Germany.
  • Maucher M; Institute of Medical Systems Biology, Ulm University, Ulm, Germany.
  • Kestler HA; Institute of Medical Systems Biology, Ulm University, Ulm, Germany.
Sci Rep ; 12(1): 21485, 2022 12 12.
Article en En | MEDLINE | ID: mdl-36509882
ABSTRACT
Sparse and robust classification models have the potential for revealing common predictive patterns that not only allow for categorizing objects into classes but also for generating mechanistic hypotheses. Identifying a small and informative subset of features is their main ingredient. However, the exponential search space of feature subsets and the heuristic nature of selection algorithms limit the coverage of these analyses, even for low-dimensional datasets. We present methods for reducing the computational complexity of feature selection criteria allowing for higher efficiency and coverage of screenings. We achieve this by reducing the preparation costs of high-dimensional subsets [Formula see text] to those of one-dimensional ones [Formula see text]. Our methods are based on a tight interaction between a parallelizable cross-validation traversal strategy and distance-based classification algorithms and can be used with any product distance or kernel. We evaluate the traversal strategy exemplarily in exhaustive feature subset selection experiments (perfect coverage). Its runtime, fitness landscape, and predictive performance are analyzed on publicly available datasets. Even in low-dimensional settings, we achieve approximately a 15-fold increase in exhaustively generating distance matrices for feature combinations bringing a new level of evaluations into reach.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proyectos de Investigación / Algoritmos Tipo de estudio: Prognostic_studies Idioma: En Revista: Sci Rep Año: 2022 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proyectos de Investigación / Algoritmos Tipo de estudio: Prognostic_studies Idioma: En Revista: Sci Rep Año: 2022 Tipo del documento: Article País de afiliación: Alemania
...