Your browser doesn't support javascript.
loading
Fast Fusion Clustering via Double Random Projection.
Wang, Hongni; Li, Na; Zhou, Yanqiu; Yan, Jingxin; Jiang, Bei; Kong, Linglong; Yan, Xiaodong.
  • Wang H; School of Statistics and Mathematics, Shandong University of Finance and Economics, Jinan 250014, China.
  • Li N; School of Statistics and Mathematics, Shandong University of Finance and Economics, Jinan 250014, China.
  • Zhou Y; School of Science, Guangxi University of Science and Technology, Liuzhou 545006, China.
  • Yan J; Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China.
  • Jiang B; Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, AB T6G 2G1, Canada.
  • Kong L; Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, AB T6G 2G1, Canada.
  • Yan X; Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan 250100, China.
Entropy (Basel) ; 26(5)2024 Apr 28.
Article en En | MEDLINE | ID: mdl-38785624
ABSTRACT
In unsupervised learning, clustering is a common starting point for data processing. The convex or concave fusion clustering method is a novel approach that is more stable and accurate than traditional methods such as k-means and hierarchical clustering. However, the optimization algorithm used with this method can be slowed down significantly by the complexity of the fusion penalty, which increases the computational burden. This paper introduces a random projection ADMM algorithm based on the Bernoulli distribution and develops a double random projection ADMM method for high-dimensional fusion clustering. These new approaches significantly outperform the classical ADMM algorithm due to their ability to significantly increase computational speed by reducing complexity and improving clustering accuracy by using multiple random projections under a new evaluation criterion. We also demonstrate the convergence of our new algorithm and test its performance on both simulated and real data examples.
Palabras clave