Affiliation:
1. ISTI-CNR, Pisa, Italy
2. Università Ca' Foscari Venezia, Italy
Abstract
Although Web search engines still answer user queries with lists of
ten blue links
to webpages, people are increasingly issuing queries to accomplish their daily tasks (e.g.,
finding a recipe
,
booking a flight
,
reading online news
, etc.). In this work, we propose a two-step methodology for discovering tasks that users try to perform through search engines. First, we identify
user tasks
from individual user sessions stored in search engine query logs. In our vision, a user task is a set of possibly noncontiguous queries (within a user search session), which refer to the same need. Second, we discover
collective tasks
by aggregating similar user tasks, possibly performed by distinct users. To discover user tasks, we propose query similarity functions based on unsupervised and supervised learning approaches. We present a set of query clustering methods that exploit these functions in order to detect user tasks. All the proposed solutions were evaluated on a manually-built ground truth, and two of them performed better than state-of-the-art approaches. To detect collective tasks, we propose four methods that cluster previously discovered user tasks, which in turn are represented by the bag-of-words extracted from their composing queries. These solutions were also evaluated on another manually-built ground truth.
Funder
European Commission
Ministero dell'Istruzione, dell'Università e della Ricerca
Seventh Framework Programme
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Science Applications,General Business, Management and Accounting,Information Systems
Cited by
46 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献