Diversifying Query Auto-Completion

Author:

Cai Fei1,Reinanda Ridho2,Rijke Maarten De2

Affiliation:

1. National University of Defense Technology and University of Amsterdam, The Netherlands

2. University of Amsterdam, Amsterdam, The Netherlands

Abstract

Query auto-completion assists web search users in formulating queries with a few keystrokes, helping them to avoid spelling mistakes and to produce clear query expressions, and so on. Previous work on query auto-completion mainly centers around returning a list of completions to users, aiming to push queries that are most likely intended by the user to the top positions but ignoring the redundancy among the query candidates in the list. Thus, semantically related queries matching the input prefix are often returned together. This may push valuable suggestions out of the list, given that only a limited number of candidates can be shown to the user, which may result in a less than optimal search experience. In this article, we consider the task of diversifying query auto-completion, which aims to return the correct query completions early in a ranked list of candidate completions and at the same time reduce the redundancy among query auto-completion candidates. We develop a greedy query selection approach that predicts query completions based on the current search popularity of candidate completions and on the aspects of previous queries in the same search session. The popularity of completion candidates at query time can be directly aggregated from query logs. However, query aspects are implicitly expressed by previous clicked documents in the search context. To determine the query aspect, we categorize clicked documents of a query using a hierarchy based on the open directory project. Bayesian probabilistic matrix factorization is applied to derive the distribution of queries over all aspects. We quantify the improvement of our greedy query selection model against a state-of-the-art baseline using two large-scale, real-world query logs and show that it beats the baseline in terms of well-known metrics used in query auto-completion and diversification. In addition, we conduct a side-by-side experiment to verify the effectiveness of our proposal.

Funder

Elsevier

Netherlands Institute for Sound and Vision

Netherlands Organisation for Scientific Research

Amsterdam Data Science

Netherlands eScience Center

Innovation Foundation of NUDT for Postgraduate

Dutch national program COMMIT

Microsoft Research Ph.D. program

Bloomberg Research Grant program

ESF Research Network Program ELIAS

European Community's Seventh Framework Programme

Royal Dutch Academy of Sciences (KNAW) under the Elite Network Shifts project

Yahoo Faculty Research and Engagement Program; and Yandex

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Cited by 38 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Result Diversification in Search and Recommendation: A Survey;IEEE Transactions on Knowledge and Data Engineering;2024-10

2. End-to-end pseudo relevance feedback based vertical web search queries recommendation;Multimedia Tools and Applications;2024-02-21

3. Auto-Complete: A Hidden Recommendation Engine;2024

4. DIPT: Diversified Personalized Transformer for QAC systems;2023 13th International Conference on Computer and Knowledge Engineering (ICCKE);2023-11-01

5. Interpassivity instead of interactivity? The uses and gratifications of automated features;Behaviour & Information Technology;2023-03-02

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3