Comparing Clustering with Pairwise and Relative Constraints-Reference-Cited by-同舟云学术

Comparing Clustering with Pairwise and Relative Constraints

Published:2016-12-26 Issue:2 Volume:11 Page:1-26
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Pei Yuanli¹,Fern Xiaoli Z.¹,Tjahja Teresa Vania¹,Rosales Rómer²

Affiliation:

1. Oregon State University, Corvallis, OR

2. LinkedIn, Mountain View, CA

Abstract

Clustering can be improved with the help of side information about the similarity relationships among instances. Such information has been commonly represented by two types of constraints: pairwise constraints and relative constraints, regarding similarities about instance pairs and triplets, respectively. Prior work has mostly considered these two types of constraints separately and developed individual algorithms to learn from each type. In practice, however, it is critical to understand/compare the usefulness of the two types of constraints as well as the cost of acquiring them, which has not been studied before. This paper provides an extensive comparison of clustering with these two types of constraints. Specifically, we compare their impacts both on human users that provide such constraints and on the learning system that incorporates such constraints into clustering. In addition, to ensure that the comparison of clustering is performed on equal ground (without the potential bias introduced by different learning algorithms), we propose a probabilistic semi-supervised clustering framework that can learn from either type of constraints. Our experiments demonstrate that the proposed semi-supervised clustering framework is highly effective at utilizing both types of constraints to aid clustering. Our user study provides valuable insights regarding the impact of the constraints on human users, and our experiments on clustering with the human-labeled constraints reveal that relative constraint is often more efficient at improving clustering.

Funder

National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2996467

Reference52 articles.

1. A Kernel-Learning Approach to Semi-supervised Clustering with Relative Distance Comparisons

2. Hierarchical constraints

3. A probabilistic framework for semi-supervised clustering

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Semi-supervised nonnegative matrix factorization with pairwise constraints for image clustering;International Journal of Machine Learning and Cybernetics;2022-09-10

2. Consistency regularization for deep semi-supervised clustering with pairwise constraints;International Journal of Machine Learning and Cybernetics;2022-07-07

3. Semi-supervised clustering under a compact-cluster assumption;IEEE Transactions on Knowledge and Data Engineering;2022

4. A classification-based approach to semi-supervised clustering with pairwise constraints;Neural Networks;2020-07

5. Safety-aware Graph-based Semi-Supervised Learning;Expert Systems with Applications;2018-10