Improving XML search by generating and utilizing informative result snippets-Reference-Cited by-同舟云学术

Improving XML search by generating and utilizing informative result snippets

Published:2010-07 Issue:3 Volume:35 Page:1-45
ISSN:0362-5915
Container-title:ACM Transactions on Database Systems
language:en
Short-container-title:ACM Trans. Database Syst.

Author:

Liu Ziyang¹,Huang Yu¹,Chen Yi¹

Affiliation:

1. Arizona State University, Tempe, AZ

Abstract

Snippets are used by almost every text search engine to complement the ranking scheme in order to effectively handle user searches, which are inherently ambiguous and whose relevance semantics are difficult to assess. Despite the fact that XML is a standard representation format of Web data, research on generating result snippets for XML search remains limited. To tackle this important yet open problem, in this article, we present a system eXtract which generates snippets for XML search results. We identify that a good XML result snippet should be a meaningful information unit of a small size that effectively summarizes this query result and differentiates it from others, according to which users can quickly assess the relevance of the query result. We have designed and implemented a novel algorithm to satisfy these requirements. Furthermore, we propose to cluster the query results based on their snippets. Since XML result clustering can only be done at query time, snippet-based clustering significantly improves the efficiency while compromising little clustering accuracy. We verified the efficiency and effectiveness of our approach through experiments.

Funder

Division of Information and Intelligent Systems

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/1806907.1806911

Reference51 articles.

1. Xproj

2. DescribeX: Interacting with AxPRE Summaries

3. Effective XML Keyword Search with Relevance Oriented Ranking

4. Structural proximity searching for large collections of semi-structured data

5. The use of MMR, diversity-based reranking for reordering documents and producing summaries

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. XSnippets: Exploring semi-structured data via snippets;Data & Knowledge Engineering;2019-11

2. Towards improving XML search by using structure clustering technique;Journal of Information Science;2014-12-12

3. Exploiting and Maintaining Materialized Views for XML Keyword Queries;ACM Transactions on Internet Technology;2012-12

4. LAF: a new XML encoding and indexing strategy for keyword-based XML search;Concurrency and Computation: Practice and Experience;2012-07-24

5. Differentiating search results on structured data;ACM Transactions on Database Systems;2012-02