DDD: Distributed Dataset DNS
-
Published:2022-02-04
Issue:4
Volume:25
Page:2855-2872
-
ISSN:1386-7857
-
Container-title:Cluster Computing
-
language:en
-
Short-container-title:Cluster Comput
Author:
White-Swift JosephORCID, Fumagalli Andrea
Abstract
AbstractThe advent of inexpensive data storage has resulted in larger and larger datasets as the cost of pruning data becomes more expensive then storing it for future insights. This decreasing cost of storage has also led to the practice of storing data in multiple locations for redundancy. However, without any uniform method of determining link costs to different storage sites, a dataset is not always retrieved from the most cost effective site. Distributed dataset DNS, or DDD, solves this problem in two key ways. The first allows “local” servers to provide meaningful information to a user in order to ensure that they target the location that offers the most advantageous network connection. The second allows other trusted servers to easily gain access to this information in a distributed way. These combined approaches aim to both lower aggregate network bandwidth usage and prevent single points of failure when retrieving dataset pointers.
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Software
Reference46 articles.
1. Abdul-Rahman, A., Hailes, S.: A distributed trust model. In: Proceedings of the 1997 Workshop on New Security Paradigms, NSPW ’97, pp. 48–60. Association for Computing Machinery, New York, NY, USA (1998). https://doi.org/10.1145/283699.283739 2. Alimi, R., Penno, R., Yang, Y.R., Kiesel, S., Previdi, S., Roome, W., Shalunov, S., Woundy, R.: Application-layer traffic optimization (ALTO) protocol. RFC 7285, RFC Editor (2014). https://tools.ietf.org/html/rfc7285 3. Barisits, M., Beermann, T., Berghaus, F., Bockelman, B., Bogado, J., Cameron, D., Christidis, D., Ciangottini, D., Dimitrov, G., Elsing, M., Garonne, V., di Girolamo, A., Goossens, L., Guan, W., Guenther, J., Javurek, T., Kuhn, D., Lassnig, M., Lopez, F., Magini, N., Molfetas, A., Nairz, A., Ould-Saada, F., Prenner, S., Serfon, C., Stewart, G., Vaandering, E., Vasileva, P., Vigne, R., Wegner, T.: Rucio: Scientific data management. Comput. Softw. Big Sci. 3(1), 11 (2019). https://doi.org/10.1007/s41781-019-0026-3 4. Bartholomew, D.: Mariadb vs. mysql. Tech. rep., SkySQL AB (2012) 5. Bassi, A., Beck, M., Moore, T., Plank, J.S., Swany, M., Wolski, R., Fagg, G.: The internet backplane protocol: a study in resource sharing. Future Gen. Comput. Syst. 19(4), 551–561 (2003). Selected papers from the IEEE/ACM International Symposium on Cluster Computing and the Grid, Berlin-Brandenburg Academy of Sciences and Humanities, Berlin, Germany, 21–24 May 2002. https://doi.org/10.1016/S0167-739X(03)00033-5. https://www.sciencedirect.com/science/article/pii/S0167739X03000335
|
|