Affiliation:
1. Hong Kong University of Science and Technology, Hong Kong, China
2. City University of Hong Kong, Hong Kong, China
Abstract
Given two spatial datasets P (e.g., facilities) and Q (queries), an aggregate nearest neighbor (ANN) query retrieves the point(s) of P with the smallest aggregate distance(s) to points in Q. Assuming, for example, n users at locations q_1, …, q_n, an ANN query outputs the facility p ∈ P that minimizes the sum of distances |pq_i| for 1 ≤ i ≤ n that the users have to travel in order to meet there. Similarly, another ANN query may report the point p ∈ P that minimizes the maximum distance that any user has to travel, or the minimum distance from some user to his/her closest facility. If Q fits in memory and P is indexed by an R-tree, we develop algorithms for aggregate nearest neighbors that capture several versions of the problem, including weighted queries and incremental reporting of results. Then, we analyze their performance and propose cost models for query optimization. Finally, we extend our techniques for disk-resident queries and approximate ANN retrieval. The efficiency of the algorithms and the accuracy of the cost models are evaluated through extensive experiments with real and synthetic datasets.
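To make the three aggregate functions in the abstract concrete, the following is a minimal brute-force sketch in Python. It is not one of the paper's algorithms: it scans every point of P linearly instead of traversing an R-tree, and it assumes 2D Euclidean distance. The names `aggregate_distance` and `ann`, and the `weights` and `k` parameters, are hypothetical and chosen only for illustration.

```python
import math
from typing import Callable, Iterable, Optional, Sequence, Tuple

Point = Tuple[float, float]


def aggregate_distance(
    p: Point,
    queries: Sequence[Point],
    agg: Callable[[Iterable[float]], float] = sum,
    weights: Optional[Sequence[float]] = None,
) -> float:
    """Aggregate (sum, max, or min) of the weighted distances |p q_i|."""
    if weights is None:
        weights = [1.0] * len(queries)  # unweighted query (assumption)
    return agg(w * math.dist(p, q) for q, w in zip(queries, weights))


def ann(
    facilities: Sequence[Point],
    queries: Sequence[Point],
    agg: Callable[[Iterable[float]], float] = sum,
    k: int = 1,
    weights: Optional[Sequence[float]] = None,
) -> list:
    """Return the k facilities with the smallest aggregate distance.

    Linear scan over all facilities; an index such as an R-tree would
    be needed to avoid examining every point of P.
    """
    ranked = sorted(
        facilities,
        key=lambda p: aggregate_distance(p, queries, agg, weights),
    )
    return ranked[:k]


if __name__ == "__main__":
    P = [(1.0, 1.0), (5.0, 0.0), (10.0, 0.5)]   # facilities
    Q = [(0.0, 0.0), (1.0, 0.0), (10.0, 0.0)]   # user locations
    for name, agg in (("sum", sum), ("max", max), ("min", min)):
        print(name, ann(P, Q, agg))
```

On this toy input the three aggregates select three different facilities, which shows that the choice of aggregate function changes the answer even for the same P and Q.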
Publisher
Association for Computing Machinery (ACM)
Cited by
188 articles.