Affiliation:
1. Univ. of Bologna, Bologna, Italy
2. Technical Univ. of Brno, Brno, Czech Republic
Abstract
Access methods based on signature files can largely benefit from possibilities offered by parallel environments. To this end, an effective
declustering
strategy that would distribute signatures over a set of parallel independent disks has to be combined with a synergic
clustering
which is employed to avoid searching the whole signature file while executing a query. This article proposes two parallel signature file organizations, Hamming Filter (
HF
) and Hamming
+
Filter (
H
+
F
), whose common declustering strategy is based on
error correcting codes
, and where clustering is achieved by organizing signatures into fixed-size buckets, each containing signatures sharing the same
key
value.
HF
allocates signatures on disks in a static way and works well if a correct relationship holds between the parameters of the code and the size of the file.
H
+
F
is a generalization of
HF
suitable to manage highly dynamic files. It uses a dynamic declustering, obtained through a
sequence
of codes, and organizes a smooth migration of signatures between disks so that high performance levels are retained regardless of current file size. Theoretical analysis characterizes the best-case, expected, and worst-case behaviors of these organizations. Analytical results are verified by experiments on prototype systems.
Publisher
Association for Computing Machinery (ACM)
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Study on Content‐Based Image Retrieval;Integrating Metaheuristics in Computer Vision for Real‐World Optimization Problems;2024-07-31
2. Inverted files for text search engines;ACM Computing Surveys;2006-07-25
3. On the cost of searching signature trees;Information Processing Letters;2006-07
4. Iterative-improvement-based declustering heuristics for multi-disk databases;Information Systems;2005-03
5. On the General Signature Trees;Lecture Notes in Computer Science;2005