Declustering of key-based partitioned signature files

Author:

Ciaccia Paolo1,Tiberio Paolo1,Zezula Pavel2

Affiliation:

1. Univ. of Bologna, Bologna, Italy

2. Technical Univ. of Brno, Brno, Czech Republic

Abstract

Access methods based on signature files can largely benefit from possibilities offered by parallel environments. To this end, an effective declustering strategy that would distribute signatures over a set of parallel independent disks has to be combined with a synergic clustering which is employed to avoid searching the whole signature file while executing a query. This article proposes two parallel signature file organizations, Hamming Filter ( HF ) and Hamming + Filter ( H + F ), whose common declustering strategy is based on error correcting codes , and where clustering is achieved by organizing signatures into fixed-size buckets, each containing signatures sharing the same key value. HF allocates signatures on disks in a static way and works well if a correct relationship holds between the parameters of the code and the size of the file. H + F is a generalization of HF suitable to manage highly dynamic files. It uses a dynamic declustering, obtained through a sequence of codes, and organizes a smooth migration of signatures between disks so that high performance levels are retained regardless of current file size. Theoretical analysis characterizes the best-case, expected, and worst-case behaviors of these organizations. Analytical results are verified by experiments on prototype systems.

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems

Reference27 articles.

1. Optimal disk allocation for partial match queries

2. An associative/parallel processor for partial match retrieval using superimposed codes

3. Estimating accesses in partitioned signature file organizations

4. DFpPIS~~ U. 1986. S-tree: A dynamic ba}anced signature index for office retrieval. In Proceedingx of thc Nlnth ACM SIG1R International Conference on Research and Developmets! in ln/brmatio Retrieval fPisa Ita}y Sept.) 77-87. 10.1145/253168.253189 DFpPIS~~ U. 1986. S-tree: A dynamic ba}anced signature index for office retrieval. In Proceedingx of thc Nlnth ACM SIG1R International Conference on Research and Developmets! in ln/brmatio Retrieval fPisa Ita}y Sept.) 77-87. 10.1145/253168.253189

5. D~:WITT D. AND GR~-W J. 1992. Para}le} database systems: The future of high performance database systems. (' nnmu~z. ACM 35 6 (dune) 85-98. 10.1145/129888.129894 D~:WITT D. AND GR~-W J. 1992. Para}le} database systems: The future of high performance database systems. (' nnmu~z. ACM 35 6 (dune) 85-98. 10.1145/129888.129894

Cited by 19 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Study on Content‐Based Image Retrieval;Integrating Metaheuristics in Computer Vision for Real‐World Optimization Problems;2024-07-31

2. Inverted files for text search engines;ACM Computing Surveys;2006-07-25

3. On the cost of searching signature trees;Information Processing Letters;2006-07

4. Iterative-improvement-based declustering heuristics for multi-disk databases;Information Systems;2005-03

5. On the General Signature Trees;Lecture Notes in Computer Science;2005

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3