Affiliation:
1. Arizona State University
Abstract
Studies show that about 50% of Web search is for
information exploration
purposes, where a user would like to investigate, compare, evaluate, and synthesize multiple relevant results. Due to the absence of general tools that can effectively analyze and differentiate multiple results, a user has to manually read and comprehend potential large results in an exploratory search. Such a process is time consuming, labor intensive and error prone. Interestingly, we find that the metadata information embedded in structured data provides a potential for automating or semi-automating the comparison of multiple results.
In this article we present an approach for structured data search result differentiation. We define the differentiability of query results and quantify the degree of difference. Then we define the problem of identifying a limited number of valid features in a result that can maximally differentiate this result from the others, which is proved NP-hard. We propose two local optimality conditions, namely single-swap and multi-swap, and design efficient algorithms to achieve local optimality. We then present a feature type-based approach, which further improves the quality of the features identified for result differentiation. To show the usefulness of our approach, we implemented a system CompareIt, which can be used to compare structured search results as well as any objects. Our empirical evaluation verifies the effectiveness and efficiency of the proposed approach.
Funder
Division of Information and Intelligent Systems
Publisher
Association for Computing Machinery (ACM)
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Weighted Aggregating Stochastic Gradient Descent for Parallel Deep Learning;IEEE Transactions on Knowledge and Data Engineering;2020
2. Fast and Practical Snippet Generation for RDF Datasets;ACM Transactions on the Web;2019-12-20
3. Fine-grained queue measurement in the data plane;Proceedings of the 15th International Conference on Emerging Networking Experiments And Technologies;2019-12-03
4. Generating Illustrative Snippets for Open Data on the Web;Proceedings of the Tenth ACM International Conference on Web Search and Data Mining;2017-02-02
5. A Randomized Trial Comparing Vaginal and Laparoscopic Hysterectomy vs Robot-Assisted Hysterectomy;Journal of Minimally Invasive Gynecology;2015-01