Author:
Zhang Jin,Ren Ming,Xiao Xian,Zhang Jilong
Abstract
Purpose
The purpose of this paper is to find a representative subset from large-scale online reviews for consumers. The subset is significantly small in size, but covers the majority amount of information in the original reviews and contains little redundant information.
Design/methodology/approach
A heuristic approach named RewSel is proposed to successively select representatives until the number of representatives meets the requirement. To reveal the advantages of the approach, extensive data experiments and a user study are conducted on real data.
Findings
The proposed approach has the advantage over the benchmarks in terms of coverage and redundancy. People show preference to the representative subsets provided by RewSel. The proposed approach also has good scalability, and is more adaptive to big data applications.
Research limitations/implications
The paper contributes to the literature of review selection, by proposing a heuristic approach which achieves both high coverage and low redundancy. This study can be applied as the basis for conducting further analysis of large-scale online reviews.
Practical implications
The proposed approach offers a novel way to select a representative subset of online reviews to facilitate consumer decision making. It can also enhance the existing information retrieval system to provide representative information to users rather than a large amount of results.
Originality/value
The proposed approach finds the representative subset by adopting the concept of relative entropy and sentiment analysis methods. Compared with state-of-the-art approaches, it offers a more effective and efficient way for users to handle a large amount of online information.
Subject
Library and Information Sciences,Computer Science Applications,Information Systems
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. An orthogonal-space-learning-based method for selecting semantically helpful reviews;Electronic Commerce Research and Applications;2022-05
2. Personalized Review Selection;Procedia Computer Science;2019
3. Information Discovery in E-commerce;The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval;2018-06-27
4. What to Read Next? Challenges and Preliminary Results in Selecting Representative Documents;Communications in Computer and Information Science;2018