Exploiting query logs for cross-lingual query suggestions

Author:

Gao Wei1,Niu Cheng2,Nie Jian-Yun3,Zhou Ming2,Wong Kam-Fai1,Hon Hsiao-Wuen2

Affiliation:

1. The Chinese University of Hong Kong, Hong Kong, China

2. Microsoft Research Asia, Beijing

3. Université de Montréal, Montréal, Canada

Abstract

Query suggestion aims to suggest relevant queries for a given query, which helps users better specify their information needs. Previous work on query suggestion has been limited to the same language. In this article, we extend it to cross-lingual query suggestion (CLQS): for a query in one language, we suggest similar or relevant queries in other languages. This is very important to the scenarios of cross-language information retrieval (CLIR) and other related cross-lingual applications. Instead of relying on existing query translation technologies for CLQS, we present an effective means to map the input query of one language to queries of the other language in the query log. Important monolingual and cross-lingual information such as word translation relations and word co-occurrence statistics, and so on, are used to estimate the cross-lingual query similarity with a discriminative model. Benchmarks show that the resulting CLQS system significantly outperforms a baseline system that uses dictionary-based query translation. Besides, we evaluate CLQS with French-English and Chinese-English CLIR tasks on TREC-6 and NTCIR-4 collections, respectively. The CLIR experiments using typical retrieval models demonstrate that the CLQS-based approach has significantly higher effectiveness than several traditional query translation methods. We find that when combined with pseudo-relevance feedback, the effectiveness of CLIR using CLQS is enhanced for different pairs of languages.

Funder

Innovation and Technology Commmission

Chinese University of Hong Kong

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Reference48 articles.

1. Phrasal translation and query expansion techniques for cross-language information retrieval

2. Resolving ambiguity for cross-language retrieval

3. Chang C. C. and Lin C. 2001. LIBSVM: a library for support vector machines (version 2.3). http://www.csie.ntu.edu.tw/~cjlin/libsvm/ 10.1145/1961189.1961199 Chang C. C. and Lin C. 2001. LIBSVM: a library for support vector machines (version 2.3). http://www.csie.ntu.edu.tw/~cjlin/libsvm/ 10.1145/1961189.1961199

Cited by 21 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Query Expansion Using Proposed Location-Based Algorithm for Hindi–English CLIR: Analyzing Three Test Collections;International Journal of Pattern Recognition and Artificial Intelligence;2024-04

2. Multilingual News Search—A Comparative User Study of Desktop and Mobile Interfaces;International Journal of Human–Computer Interaction;2023-08-02

3. A Hybrid Framework for Session Context Modeling;ACM Transactions on Information Systems;2021-07-26

4. Towards a Better Understanding of Query Reformulation Behavior in Web Search;Proceedings of the Web Conference 2021;2021-04-19

5. How do multilingual users search? An investigation of query and result list language choices;Journal of the Association for Information Science and Technology;2020-12-28

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3