Subspace clustering using ensembles of K-subspaces

Author:

Lipor John1,Hong David2,Tan Yan Shuo3,Balzano Laura4

Affiliation:

1. Department of Electrical and Computer Engineering, Portland State University, 1900 SW Fourth Ave Suite 160, Portland, OR 97201, USA

2. Wharton Statistics, University of Pennsylvania, 400 Jon M. Huntsman Hall, 3730 Walnut St., Philadelphia, PA 19104, USA

3. Department of Statistics, University of California, Berkeley, 367 Evans Hall, University Dr Berkeley, CA 94720, USA

4. Department of Electrical Engineering and Computer Science University of Michigan, Ann Arbor, 1301 Beal Ave Ann Arbor, MI 48109, USA

Abstract

Abstract Subspace clustering is the unsupervised grouping of points lying near a union of low-dimensional linear subspaces. Algorithms based directly on geometric properties of such data tend to either provide poor empirical performance, lack theoretical guarantees or depend heavily on their initialization. We present a novel geometric approach to the subspace clustering problem that leverages ensembles of the $K$-subspace (KSS) algorithm via the evidence accumulation clustering framework. Our algorithm, referred to as ensemble $K$-subspaces (EKSSs), forms a co-association matrix whose $(i,j)$th entry is the number of times points $i$ and $j$ are clustered together by several runs of KSS with random initializations. We prove general recovery guarantees for any algorithm that forms an affinity matrix with entries close to a monotonic transformation of pairwise absolute inner products. We then show that a specific instance of EKSS results in an affinity matrix with entries of this form, and hence our proposed algorithm can provably recover subspaces under similar conditions to state-of-the-art algorithms. The finding is, to the best of our knowledge, the first recovery guarantee for evidence accumulation clustering and for KSS variants. We show on synthetic data that our method performs well in the traditionally challenging settings of subspaces with large intersection, subspaces with small principal angles and noisy data. Finally, we evaluate our algorithm on six common benchmark datasets and show that unlike existing methods, EKSS achieves excellent empirical performance when there are both a small and large number of points per subspace.

Funder

National Science Foundation

Graduate Research Fellowship Program

Defense Advanced Research Projects Agency

U.S. Army Engineer Research and Development Center

Portland State University

BIGDATA Information and Intelligend Systems

Dean’s Fund for Postdoctoral Research of the Wharton School

Simons Institute for the Theory of Computing

Computing and Communication Foundations

Institute of Ismaili Studies

Air Force Office of Scientific Research

Army Research Office

Publisher

Oxford University Press (OUP)

Subject

Applied Mathematics,Computational Theory and Mathematics,Numerical Analysis,Statistics and Probability,Analysis

Reference57 articles.

1. K-means projective clustering;Agarwal,2004

2. K-Subspaces with missing data;Balzano,2012

3. Lambertian reflectance and linear subspaces;Basri;IEEE Trans. Pattern Anal. Mach. Intell.,2003

4. k-Plane clustering;Bradley;J. Global Optim.,2000

5. Invariant scattering convolution networks;Bruna;IEEE Trans. Pattern Anal. Mach. Intell.,2013

Cited by 8 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A comprehensive investigation of clustering algorithms for User and Entity Behavior Analytics;Frontiers in Big Data;2024-05-09

2. Aggregation of Random Initializations for Robust Linear Clustering;Advances in Intelligent Systems and Computing;2024

3. K-Subspaces for Sequential Data;2023 IEEE 9th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP);2023-12-10

4. Large-scale non-negative subspace clustering based on Nyström approximation;Information Sciences;2023-08

5. A Sparse Framework for Robust Possibilistic K-Subspace Clustering;IEEE Transactions on Fuzzy Systems;2023-04

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3