Your browser doesn't support javascript.
loading
Improving the combination of results in the ensembles of prototype selectors.
Cerruela-García, Gonzalo; de Haro-García, Aida; Toledano, José Pérez-Parras; García-Pedrajas, Nicolás.
Afiliação
  • Cerruela-García G; University of Córdoba, Department of Computing and Numerical Analysis, Edificio Einstein, Campus Universitario de Rabanales, 14071 Córdoba, Spain. Electronic address: gcerruela@uco.es.
  • de Haro-García A; University of Córdoba, Department of Computing and Numerical Analysis, Edificio Einstein, Campus Universitario de Rabanales, 14071 Córdoba, Spain. Electronic address: adeharo@uco.es.
  • Toledano JP; University of Córdoba, Department of Computing and Numerical Analysis, Edificio Einstein, Campus Universitario de Rabanales, 14071 Córdoba, Spain. Electronic address: jperezparras@uco.es.
  • García-Pedrajas N; University of Córdoba, Department of Computing and Numerical Analysis, Edificio Einstein, Campus Universitario de Rabanales, 14071 Córdoba, Spain. Electronic address: npedrajas@uco.es.
Neural Netw ; 118: 175-191, 2019 Oct.
Article em En | MEDLINE | ID: mdl-31299623
ABSTRACT
Prototype selection is one of the most common preprocessing tasks in data mining applications. The vast amounts of data that we must handle in practical problems render the removal of noisy, redundant or useless instances a convenient first step for any real-world application. Many algorithms have been proposed for prototype selection. For difficult problems, however, the use of only a single method would unlikely achieve the desired performance. Similar to the problem of classification, ensembles of prototype selectors have been proposed to overcome the limitations of single algorithms. In ensembles of prototype selectors, the usual combination method is based on a voting scheme coupled with an acceptance threshold. However, this method is suboptimal, because the relationships among the prototypes are not taken into account. In this paper, we propose a different approach, in which we consider not only the number of times every prototype has been selected but also the subsets of prototypes that are selected. With this additional information we develop GEEBIES, which is a new way of combining the results of ensembles of prototype selectors. In a large set of problems, we show that our proposal outperforms the standard boosting approach. A way of scaling up our method to large datasets is also proposed and experimentally tested.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Bases de Dados Factuais / Estudo de Prova de Conceito Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Bases de Dados Factuais / Estudo de Prova de Conceito Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2019 Tipo de documento: Article