Connected k-Center and k-Diameter Clustering
-
Published:2024-09-02
Issue:
Volume:
Page:
-
ISSN:0178-4617
-
Container-title:Algorithmica
-
language:en
-
Short-container-title:Algorithmica
Author:
Drexler Lukas,Eube Jan,Luo Kelin,Reineccius Dorian,Röglin Heiko,Schmidt Melanie,Wargalla Julian
Abstract
AbstractMotivated by an application from geodesy, we study the connected k-center problem and the connected k-diameter problem. The former problem has been introduced by Ge et al. (ACM Trans Knowl Discov Data 2(2):1–35, 2008. https://doi.org/10.1145/1376815.1376816) to model clustering of data sets with both attribute and relationship data. These problems arise from the classical k-center and k-diameter problems by adding a side constraint. For the side constraint, we are given an undirected connectivity graphG on the input points, and a clustering is now only feasible if every cluster induces a connected subgraph in G. Usually in clustering problems one assumes that the clusters are pairwise disjoint. We study this case but additionally also the case that clusters are allowed to be non-disjoint. This can help to satisfy the connectivity constraints. Our main result is an $$O(\log ^2k)$$
O
(
log
2
k
)
-approximation algorithm for the disjoint connected k-center and k-diameter problem. For Euclidean spaces of constant dimension and for metrics with constant doubling dimension, the approximation factor improves to O(1). Our algorithm works by computing a non-disjoint connected clustering first and transforming it into a disjoint connected clustering. We complement these upper bounds by several upper and lower bounds for variations and special cases of the model.
Funder
Rheinische Friedrich-Wilhelms-Universität Bonn
Publisher
Springer Science and Business Media LLC
Reference43 articles.
1. Ge, R., Ester, M., Gao, B.J., Hu, Z., Bhattacharya, B.K., Ben-Moshe, B.: Joint cluster analysis of attribute data and relationship data: the connected k-center problem, algorithms and applications. ACM Trans. Knowl. Discov. Data 2(2), 1–35 (2008). https://doi.org/10.1145/1376815.1376816 2. Permanent Service for Mean Sea Level (PSMSL): Tide gauge data. http://www.psmsl.org/data/obtaining/. Accessed 03 Feb 2022 3. Holgate, S.J., Matthews, A., Woodworth, P.L., Rickards, L.J., Tamisiea, M.E., Bradshaw, E., Foden, P.R., Gordon, K.M., Jevrejeva, S., Pugh, J.: New data systems and products at the permanent service for mean sea level. J. Coastal Res. 29, 493–504 (2013) 4. Hsu, W., Nemhauser, G.L.: Easy and hard bottleneck location problems. Discret. Appl. Math. 1(3), 209–215 (1979) 5. Hochbaum, D.S.: When are NP-hard location problems easy? Ann. Oper. Res. 1(3), 201–214 (1984)
|
|