Intent Identification by Semantically Analyzing the Search Query
-
Published:2024-02-22
Issue:1
Volume:5
Page:292-314
-
ISSN:2673-3951
-
Container-title:Modelling
-
language:en
-
Short-container-title:Modelling
Author:
Sultana Tangina12ORCID, Mandal Ashis Kumar34ORCID, Saha Hasi3ORCID, Sultan Md. Nahid3ORCID, Hossain Md. Delowar23ORCID
Affiliation:
1. Department of Electronics and Communication Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur 5200, Bangladesh 2. Department of Computer Science and Engineering, Global Campus, Kyung Hee University, Yongin-si 1732, Republic of Korea 3. Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur 5200, Bangladesh 4. Department of Computer Science, University of Saskatchewan, Saskatoon, SK S7N 5A2, Canada
Abstract
Understanding and analyzing the search intent of a user semantically based on their input query has emerged as an intriguing challenge in recent years. It suffers from small-scale human-labeled training data that produce a very poor hypothesis of rare words. The majority of data portals employ keyword-driven search functionality to explore content within their repositories. However, the keyword-based search cannot identify the users’ search intent accurately. Integrating a query-understandable framework into keyword search engines has the potential to enhance their performance, bridging the gap in interpreting the user’s search intent more effectively. In this study, we have proposed a novel approach that focuses on spatial and temporal information, phrase detection, and semantic similarity recognition to detect the user’s intent from the search query. We have used the n-gram probabilistic language model for phrase detection. Furthermore, we propose a probability-aware gated mechanism for RoBERTa (Robustly Optimized Bidirectional Encoder Representations from Transformers Approach) embeddings to semantically detect the user’s intent. We analyze and compare the performance of the proposed scheme with the existing state-of-the-art schemes. Furthermore, a detailed case study has been conducted to validate the model’s proficiency in semantic analysis, emphasizing its adaptability and potential for real-world applications where nuanced intent understanding is crucial. The experimental result demonstrates that our proposed system can significantly improve the accuracy for detecting the users’ search intent as well as the quality of classification during search.
Reference45 articles.
1. Cheung, J.C.K., and Li, X. (2012, January 8–12). Sequence clustering and labeling for unsupervised query intent discovery. Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, Seattle, WA, USA. 2. Hu, J., Wang, G., Lochovsky, F., Sun, J.T., and Chen, Z. (2009, January 20–24). Understanding user’s query intent with Wikipedia. Proceedings of the 18th International Conference on World Wide Web, Madrid, Spain. 3. Shneiderman, B., Byrd, D., and Croft, W.B. (2024, February 21). Clarifying Search: A User-Interface Framework for Text Searches. D-Lib Magazine. Available online: https://dl.acm.org/doi/abs/10.5555/865578. 4. A taxonomy of web search;Broder;ACM Sigir Forum,2002 5. Cao, H., Hu, D.H., Shen, D., Jiang, D., Sun, J.T., Chen, E., and Yang, Q. (2009, January 19–23). Context-aware query classification. Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, MA, USA.
|
|