Fast Fusion Clustering via Double Random Projection

Author:

Wang Hongni1,Li Na1,Zhou Yanqiu2,Yan Jingxin3,Jiang Bei4,Kong Linglong4ORCID,Yan Xiaodong5ORCID

Affiliation:

1. School of Statistics and Mathematics, Shandong University of Finance and Economics, Jinan 250014, China

2. School of Science, Guangxi University of Science and Technology, Liuzhou 545006, China

3. Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China

4. Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, AB T6G 2G1, Canada

5. Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan 250100, China

Abstract

In unsupervised learning, clustering is a common starting point for data processing. The convex or concave fusion clustering method is a novel approach that is more stable and accurate than traditional methods such as k-means and hierarchical clustering. However, the optimization algorithm used with this method can be slowed down significantly by the complexity of the fusion penalty, which increases the computational burden. This paper introduces a random projection ADMM algorithm based on the Bernoulli distribution and develops a double random projection ADMM method for high-dimensional fusion clustering. These new approaches significantly outperform the classical ADMM algorithm due to their ability to significantly increase computational speed by reducing complexity and improving clustering accuracy by using multiple random projections under a new evaluation criterion. We also demonstrate the convergence of our new algorithm and test its performance on both simulated and real data examples.

Funder

National Key R&D Program of China

the National Natural Science Foundation of China

the National Statistical Science Research Project

Jinan Science and Technology Bureau

the China Academy of Engineering Science and Technology Development Strategy Shandong Research Institute Consulting Research Project

the State Scholarship Fund from China Scholarship Council

the Alberta Machine Intelligence Institute

Natural Sciences and Engineering Council of Canada

Canada Research Chair program from NSERC

Publisher

MDPI AG

Reference34 articles.

1. CDLSTM: A novel model for climate change forecasting;Haq;Comput. Mater. Contin.,2022

2. SMOTEDNN: A novel model for air pollution forecasting and AQI classification;Haq;Comput. Mater. Contin.,2022

3. Instability of hierarchical cluster analysis due to input order of the data: The PermuCLUSTER solution;Spaans;Psychol. Methods,2005

4. Survey of clustering algorithms;Xu;IEEE Trans. Neural Netw.,2005

5. High-dimensional integrative analysis with homogeneity and sparsity recovery;Yang;J. Multivar. Anal.,2019

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3