Large Language Models Enable Few-Shot Clustering

Authors:

Vijay Viswanathan1, Kiril Gashteovski2,3, Carolin Lawrence2, Tongshuang Wu1, Graham Neubig1

Affiliation:

1. Carnegie Mellon University, USA

2. NEC Laboratories Europe, Germany

3. Center for Advanced Interdisciplinary Research, Ss. Cyril and Methodius University of Skopje, North Macedonia

Abstract

Unlike traditional unsupervised clustering, semi-supervised clustering allows users to provide meaningful structure to the data, which helps the clustering algorithm match the user’s intent. Existing approaches to semi-supervised clustering require a significant amount of feedback from an expert to improve the clusters. In this paper, we ask whether a large language model (LLM) can amplify an expert’s guidance to enable query-efficient, few-shot semi-supervised text clustering. We show that LLMs are surprisingly effective at improving clustering. We explore three stages where LLMs can be incorporated into clustering: before clustering (improving input features), during clustering (by providing constraints to the clusterer), and after clustering (using an LLM for post-correction). We find that incorporating LLMs in the first two stages routinely provides significant improvements in cluster quality, and that LLMs enable a user to make trade-offs between cost and accuracy to produce desired clusters. We release our code and LLM prompts for the public to use.
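The "during clustering" stage described above can be illustrated with a minimal sketch: an LLM is queried on a small number of text pairs, its yes/no judgments become must-link and cannot-link constraints, and a constraint-aware k-means penalizes assignments that violate them. The `llm_same_cluster` oracle below is a hypothetical stand-in (here a toy keyword-overlap heuristic, not the paper's actual prompts), and the penalized assignment loop is a simplified PCKMeans-style variant, not the authors' exact implementation.

```python
import numpy as np

def llm_same_cluster(a: str, b: str) -> bool:
    # Hypothetical stand-in for a few-shot LLM pairwise prompt
    # ("Do these two texts belong to the same cluster? Yes/No").
    # Toy heuristic: shared keyword -> same cluster.
    return bool(set(a.lower().split()) & set(b.lower().split()))

def pairwise_constraints(texts, pairs):
    # Turn LLM judgments on a small budget of pairs into
    # must-link / cannot-link constraint sets.
    must, cannot = [], []
    for i, j in pairs:
        (must if llm_same_cluster(texts[i], texts[j]) else cannot).append((i, j))
    return must, cannot

def constrained_kmeans(X, k, must, cannot, n_iter=20, w=10.0, seed=0):
    # k-means where each point's assignment cost is its distance to a
    # centroid plus a penalty w for every constraint the assignment violates.
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), k, replace=False)].astype(float)
    labels = np.zeros(len(X), dtype=int)
    for _ in range(n_iter):
        for i in range(len(X)):
            cost = np.linalg.norm(centers - X[i], axis=1)
            for a, b in must:
                j = b if a == i else a if b == i else None
                if j is not None:  # penalize clusters that split a must-link pair
                    cost = cost + w * (np.arange(k) != labels[j])
            for a, b in cannot:
                j = b if a == i else a if b == i else None
                if j is not None:  # penalize the cluster holding a cannot-link partner
                    cost = cost + w * (np.arange(k) == labels[j])
            labels[i] = int(np.argmin(cost))
        for c in range(k):  # standard centroid update
            if np.any(labels == c):
                centers[c] = X[labels == c].mean(axis=0)
    return labels

# Toy example: four texts with illustrative 2-D "embeddings".
texts = ["apple fruit", "banana fruit", "python code", "java code"]
X = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
must, cannot = pairwise_constraints(texts, [(0, 1), (2, 3), (1, 2)])
labels = constrained_kmeans(X, k=2, must=must, cannot=cannot)
```

Only three LLM queries are spent here, yet they pin down the intended grouping; this is the query-efficiency trade-off the abstract refers to.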

Publisher

MIT Press


Cited by 6 articles.

1. Consumer segmentation with large language models;Journal of Retailing and Consumer Services;2025-01

2. Classifying User Roles in Online News Forums: A Model for User Interaction and Behavior Analysis;Adjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization;2024-06-27

3. Large Language Model-assisted Clustering and Concept Identification of Engineering Design Data;2024 IEEE Conference on Artificial Intelligence (CAI);2024-06-25

4. Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11

5. Beyond Words: A Comparative Analysis of LLM Embeddings for Effective Clustering;Lecture Notes in Computer Science;2024
