Author:
Moulton Matthew,Ng Yiu-Kai
Abstract
AbstractE-commerce is a massive sector in the US economy, generating $767.7 billion in revenue in 2021. E-commerce sites maximize their revenue by helping customers find, examine, and purchase products. To help users easily find the most relevant products in the database for their individual needs, e-commerce sites are equipped with a product retrieval system. Many of these modern retrieval systems parse user-specified constraints or keywords embedded in a simple natural language query, which is generally easier and faster for the customer to specify their needs than navigating a product specification form, and does not require the seller to design or develop such a form. These natural language product retrieval systems, however, suffer from low relevance in retrieved products, especially for complex constraints specified on products. The reduced accuracy is in part due to under-utilizing the rich semantics of natural language, specifically queries that include Boolean operators, and lacking of the ranking on partially-matched relevant results that could be of interest to the customers. This undesirable effect costs e-commerce vendors to lose sales on their merchandise. In solving this problem, we propose a novel product retrieval system, called $${\textit{QuePR}}$$
QuePR
, that parses arbitrarily simple and complex natural language queries with(out) Boolean operators, utilizes combinatorial numeric and content-based matching to extract relevant products from a database, and ranks retrieved resultant products by relevance before presenting them to the end-user. The advantages of $${\textit{QuePR}}$$
QuePR
are its ability to process explicit and implicit Boolean operators in queries, handle natural language queries using similarity measures on partially-matched records, and perform best guess or match on ambiguous or incomplete queries. $${\textit{QuePR}}$$
QuePR
is unique, easy to use, and scalable to all product categories. To verify the accuracy of $${\textit{QuePR}}$$
QuePR
in retrieving relevant products on different product domains, we have conducted different performance analyses and compared $${\textit{QuePR}}$$
QuePR
with other ranking and retrieval systems. The empirical results verify that $${\textit{QuePR}}$$
QuePR
outperforms others while maintaining an optimal runtime speed.
Publisher
Springer Science and Business Media LLC
Reference51 articles.
1. Scrapehero. How many products does Amazon sell? (2021). https://www.scrapehero.com/how-many-products-does-amazon-sell-march-2021/.
2. Statista. Number of digital buyers in the United States from 2017 to 2025. https://www-statista-com.erl.lib.byu.edu/statistics/273957/number-of-digital-buyers-in-the-unitedstates/.
3. Wu J. A design methodology for form-based knowledge reuse and representation. Inf Manag. 2009;46(7):365–75.
4. Nambiar U, Kambhampati S. Answering imprecise queries over autonomous web databases. In: Proceedings of the 22nd international conference on data engineering (ICDE’06). IEEE; 2006. p. 45.
5. Sugiki K, Matsubara S. Product retrieval based on semantic similarity of consumer reviews to natural language query. Int J Knowl Web Intell. 2010;1(3–4):209–26.