Clarifying Ambiguous Keywords with Personal Word Embeddings for Personalized Search-Reference-Cited by-同舟云学术

Clarifying Ambiguous Keywords with Personal Word Embeddings for Personalized Search

Published:2022-07-31 Issue:3 Volume:40 Page:1-29
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Yao Jing¹,Dou Zhicheng²,Wen Ji-Rong³

Affiliation:

1. School of Information, Renmin University of China, Beijing

2. Gaoling School of Artificial Intelligence, Renmin University of China, Beijing

3. Beijing Key Laboratory of Big Data Management and Analysis Methods, Key Laboratory of Data Engineering and Knowledge Engineering, MOE, Beijing

Abstract

Personalized search tailors document ranking lists for each individual user based on her interests and query intent to better satisfy the user’s information need. Many personalized search models have been proposed. They first build a user interest profile from the user’s search history, and then re-rank the documents based on the personalized matching scores between the created profile and candidate documents. In this article, we attempt to solve the personalized search problem from an alternative perspective of clarifying the user’s intention of the current query. We know that there are many ambiguous words in natural language such as “Apple.” People with different knowledge backgrounds and interests have personalized understandings of these words. Therefore, we propose a personalized search model with personal word embeddings for each individual user that mainly contain the word meanings that the user already knows and can reflect the user interests. To learn great personal word embeddings, we design a pre-training model that captures both the textual information of the query log and the information about user interests contained in the click-through data represented as a graph structure. With personal word embeddings, we obtain the personalized word and context-aware representations of the query and documents. Furthermore, we also employ the current session as the short-term search context to dynamically disambiguate the current query. Finally, we use a matching model to calculate the matching score between the personalized query and document representations for ranking. Experimental results on two large-scale query logs show that our designed model significantly outperforms state-of-the-art personalization models.

Funder

National Natural Science Foundation of China

Beijing Outstanding Young Scientist Program

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/3470564

Reference53 articles.

1. Context Attentive Document Ranking and Query Suggestion

2. Toward word embedding for personalized information retrieval;Amer Nawal Ould;CoRR,2016

3. Inferring and using location metadata to personalize web search

4. Modeling the impact of short- and long-term behavior on search personalization

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Contrastive Learning for User Sequence Representation in Personalized Product Search;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04

2. Distilling Semantic Concept Embeddings from Contrastively Fine-Tuned Language Models;Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval;2023-07-18