Boolean interpretation, matching, and ranking of natural language queries in product selection systems

Author:

Moulton Matthew,Ng Yiu-Kai

Abstract

AbstractE-commerce is a massive sector in the US economy, generating $767.7 billion in revenue in 2021. E-commerce sites maximize their revenue by helping customers find, examine, and purchase products. To help users easily find the most relevant products in the database for their individual needs, e-commerce sites are equipped with a product retrieval system. Many of these modern retrieval systems parse user-specified constraints or keywords embedded in a simple natural language query, which is generally easier and faster for the customer to specify their needs than navigating a product specification form, and does not require the seller to design or develop such a form. These natural language product retrieval systems, however, suffer from low relevance in retrieved products, especially for complex constraints specified on products. The reduced accuracy is in part due to under-utilizing the rich semantics of natural language, specifically queries that include Boolean operators, and lacking of the ranking on partially-matched relevant results that could be of interest to the customers. This undesirable effect costs e-commerce vendors to lose sales on their merchandise. In solving this problem, we propose a novel product retrieval system, called $${\textit{QuePR}}$$ QuePR , that parses arbitrarily simple and complex natural language queries with(out) Boolean operators, utilizes combinatorial numeric and content-based matching to extract relevant products from a database, and ranks retrieved resultant products by relevance before presenting them to the end-user. The advantages of $${\textit{QuePR}}$$ QuePR are its ability to process explicit and implicit Boolean operators in queries, handle natural language queries using similarity measures on partially-matched records, and perform best guess or match on ambiguous or incomplete queries. $${\textit{QuePR}}$$ QuePR is unique, easy to use, and scalable to all product categories. To verify the accuracy of $${\textit{QuePR}}$$ QuePR in retrieving relevant products on different product domains, we have conducted different performance analyses and compared $${\textit{QuePR}}$$ QuePR with other ranking and retrieval systems. The empirical results verify that $${\textit{QuePR}}$$ QuePR outperforms others while maintaining an optimal runtime speed.

Publisher

Springer Science and Business Media LLC

Reference51 articles.

1. Scrapehero. How many products does Amazon sell? (2021). https://www.scrapehero.com/how-many-products-does-amazon-sell-march-2021/.

2. Statista. Number of digital buyers in the United States from 2017 to 2025. https://www-statista-com.erl.lib.byu.edu/statistics/273957/number-of-digital-buyers-in-the-unitedstates/.

3. Wu J. A design methodology for form-based knowledge reuse and representation. Inf Manag. 2009;46(7):365–75.

4. Nambiar U, Kambhampati S. Answering imprecise queries over autonomous web databases. In: Proceedings of the 22nd international conference on data engineering (ICDE’06). IEEE; 2006. p. 45.

5. Sugiki K, Matsubara S. Product retrieval based on semantic similarity of consumer reviews to natural language query. Int J Knowl Web Intell. 2010;1(3–4):209–26.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3