Performance Analysis and Architecture of a Clustering Hybrid Algorithm Called FA+GA-DBSCAN Using Artificial Datasets.

Perafan-Lopez, Juan Carlos; Ferrer-Gregory, Valeria Lucía; Nieto-Londoño, César; Sierra-Pérez, Julián

Perafan-Lopez, Juan Carlos; Ferrer-Gregory, Valeria Lucía; Nieto-Londoño, César; Sierra-Pérez, Julián.

Afiliação

Perafan-Lopez JC; Grupo de Investigación en Ingeniería Aeroespacial, Universidad Pontificia Bolivariana, Medellín 050031, Colombia.
Ferrer-Gregory VL; Semillero de Investigación en Ingeniería Aeroespacial, Universidad Pontificia Bolivariana, Medellín 050031, Colombia.
Nieto-Londoño C; Grupo de Investigación en Energía y Termodinámica, Universidad Pontificia Bolivariana, Medellín 050031, Colombia.
Sierra-Pérez J; Grupo de Investigación en Ingeniería Aeroespacial, Universidad Pontificia Bolivariana, Medellín 050031, Colombia.

Entropy (Basel) ; 24(7)2022 Jun 25.

Article em En | MEDLINE | ID: mdl-35885099

ABSTRACT

ABSTRACT

Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is a widely used algorithm for exploratory clustering applications. Despite the DBSCAN algorithm being considered an unsupervised pattern recognition method, it has two parameters that must be tuned prior to the clustering process in order to reduce uncertainties, the minimum number of points in a clustering segmentation MinPts, and the radii around selected points from a specific dataset Eps. This article presents the performance of a clustering hybrid algorithm for automatically grouping datasets into a two-dimensional space using the well-known algorithm DBSCAN. Here, the function nearest neighbor and a genetic algorithm were used for the automation of parameters MinPts and Eps. Furthermore, the Factor Analysis (FA) method was defined for pre-processing through a dimensionality reduction of high-dimensional datasets with dimensions greater than two. Finally, the performance of the clustering algorithm called FA+GA-DBSCAN was evaluated using artificial datasets. In addition, the precision and Entropy of the clustering hybrid algorithm were measured, which showed there was less probability of error in clustering the most condensed datasets.

Palavras-chave

DBSCAN; clustering; entropy; factor analysis; genetic algorithm; pattern recognition

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article