Affiliation:
1. University of Winnipeg, Canada
Abstract
In this chapter, the authors discuss an efficient and effective index mechanism for search engines to support both conjunctive and disjunctive queries. The main idea behind it is to decompose an inverted list into a collection of disjoint sub-lists. The authors associate each word with an interval sequence, which is created by applying a kind of tree coding to a trie structure constructed over all the word sequences in a database. Then, attach each interval, instead of a word, with an inverted sub-list. In this way, both set intersection and union can be conducted by performing a series of simple interval containment checks. Experiments have been conducted, which shows that the new index is promising. Also, how to maintain indices, when inserting or deleting documents, is discussed in great detail.
Reference22 articles.
1. Inverted Index Compression Using Word-Aligned Binary Codes
2. Ao, N., Zhang, F., Wu, D., Stones, D., Wang, G., Liu, X., …, Lin, S. (2011). Efficient Parallel Lists Intersection and Index Compression Algorithms using Graphics Processing Units. Proceedings of the PVLDB ‘11, Seattle, USA.
3. Approximate Encoding for Direct Access and Query Processing over Compressed Bitmaps.;T.Apaydin;VLDB,2006
4. Searching the Web
5. Inverted indexes vs. bitmap indexes in decision support systems