DDD: Distributed Dataset DNS-Reference-Cited by-同舟云学术

DDD: Distributed Dataset DNS

Published:2022-02-04 Issue:4 Volume:25 Page:2855-2872
ISSN:1386-7857
Container-title:Cluster Computing
language:en
Short-container-title:Cluster Comput

Author:

White-Swift Joseph^ORCID,Fumagalli Andrea

Abstract

AbstractThe advent of inexpensive data storage has resulted in larger and larger datasets as the cost of pruning data becomes more expensive then storing it for future insights. This decreasing cost of storage has also led to the practice of storing data in multiple locations for redundancy. However, without any uniform method of determining link costs to different storage sites, a dataset is not always retrieved from the most cost effective site. Distributed dataset DNS, or DDD, solves this problem in two key ways. The first allows “local” servers to provide meaningful information to a user in order to ensure that they target the location that offers the most advantageous network connection. The second allows other trusted servers to easily gain access to this information in a distributed way. These combined approaches aim to both lower aggregate network bandwidth usage and prevent single points of failure when retrieving dataset pointers.

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Software

Link

https://link.springer.com/content/pdf/10.1007/s10586-021-03515-w.pdf

Reference46 articles.

1. Abdul-Rahman, A., Hailes, S.: A distributed trust model. In: Proceedings of the 1997 Workshop on New Security Paradigms, NSPW ’97, pp. 48–60. Association for Computing Machinery, New York, NY, USA (1998). https://doi.org/10.1145/283699.283739

2. Alimi, R., Penno, R., Yang, Y.R., Kiesel, S., Previdi, S., Roome, W., Shalunov, S., Woundy, R.: Application-layer traffic optimization (ALTO) protocol. RFC 7285, RFC Editor (2014). https://tools.ietf.org/html/rfc7285

3. Barisits, M., Beermann, T., Berghaus, F., Bockelman, B., Bogado, J., Cameron, D., Christidis, D., Ciangottini, D., Dimitrov, G., Elsing, M., Garonne, V., di Girolamo, A., Goossens, L., Guan, W., Guenther, J., Javurek, T., Kuhn, D., Lassnig, M., Lopez, F., Magini, N., Molfetas, A., Nairz, A., Ould-Saada, F., Prenner, S., Serfon, C., Stewart, G., Vaandering, E., Vasileva, P., Vigne, R., Wegner, T.: Rucio: Scientific data management. Comput. Softw. Big Sci. 3(1), 11 (2019). https://doi.org/10.1007/s41781-019-0026-3

4. Bartholomew, D.: Mariadb vs. mysql. Tech. rep., SkySQL AB (2012)

5. Bassi, A., Beck, M., Moore, T., Plank, J.S., Swany, M., Wolski, R., Fagg, G.: The internet backplane protocol: a study in resource sharing. Future Gen. Comput. Syst. 19(4), 551–561 (2003). Selected papers from the IEEE/ACM International Symposium on Cluster Computing and the Grid, Berlin-Brandenburg Academy of Sciences and Humanities, Berlin, Germany, 21–24 May 2002. https://doi.org/10.1016/S0167-739X(03)00033-5. https://www.sciencedirect.com/science/article/pii/S0167739X03000335