From discrepancy to declustering

Author:

Chen Chung-Min1,Cheng Christine T.2

Affiliation:

1. Telcordia Technologies, Piscataway, New Jersey

2. University of Wisconsin-Milwaukee, Milwaukee, Wisconsin

Abstract

Declustering schemes allocate data blocks among multiple disks to enable parallel retrieval. Given a declustering scheme D , its response time with respect to a query Q , rt ( Q ), is defined to be the maximum number of data blocks of the query stored by the scheme in any one of the disks. If | Q | is the number of data blocks in Q and M is the number of disks, then rt ( Q ) is at least ⌈| Q |/ M ⌉. One way to evaluate the performance of D with respect to a set of range queries Q is to measure its additive error ---the maximum difference of rt ( Q ) from ⌈| Q |/ M ⌉ over all range queries Q ∈ Q.In this article, we consider the problem of designing declustering schemes for uniform multidimensional data arranged in a d -dimensional grid so that their additive errors with respect to range queries are as small as possible. It has been shown that for a fixed dimension d ≥ 2, any declustering scheme on an M d grid, a grid with length M on each dimension, will always incur an additive error with respect to range queries of Ω(log M ) when d = 2 and Ω(log d −1/2 M ) when d > 2.Asymptotically optimal declustering schemes exist for 2-dimensional data. However, the best general upper bound known so far for the worst-case additive errors of d -dimensional declustering schemes, d ≥ 3, is O ( M d −1 ), which is large when compared to the lower bound. In this article, we propose two declustering schemes based on low-discrepancy points in d -dimensions. When d is fixed, both schemes have an additive error of O (log d −1 M ) with respect to range queries, provided that certain conditions are satisfied: the first scheme requires that the side lengths of the grid grow at a rate polynomial in M , while the second scheme requires d ≥ 2 and M = p t where dpC , C a constant, and t is a positive integer such that t ( d − 1) ≥ 2. These are the first multidimensional declustering schemes with additive errors proven to be near optimal.

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence,Hardware and Architecture,Information Systems,Control and Systems Engineering,Software

Reference46 articles.

1. An economic method of computing lpr sequences (in Russian). Zh. Vychisl. Mat. i Mat;Antonov I.;Fiz.,1979

2. (Almost) optimal parallel block access to range queries

3. Fast parallel similarity search in multimedia databases

Cited by 7 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3