Abstract
AbstractSpatialHadoop could handle spatial data operations in a low partitioning execution time compared to the traditional Hadoop. However, developing an efficient and an accurate partitioning algorithm is still a research field opened to many researchers. Confidently, this paper proposes a Minimum Boundary Rectangle-aware Priority R-Tree (MBR-aware PR-Tree) as an enhanced partitioning algorithm applicable at SpatialHadoop. Compared to state-of-art partitioning algorithms, our proposed algorithm outperforms them in terms of query execution time, file size, number of partitions, indexing time, and number of returned objects. The experimental results show superiority of our algorithm which have been confirmed for both spatial range query and k-nearest-neighbour query through evaluating the performance in different scenarios using a real dataset.
Publisher
Springer Science and Business Media LLC
Subject
Computational Mathematics,General Computer Science
Reference47 articles.
1. Haynes, D., Ray, S., Manson, S.: Terra populus: challenges and opportunities with heterogeneous big spatial data. In: Griffith, D.A., Chun, Y., Dean, D.J. (eds.) Advances in geocomputation: geocomputation 2015–The 13th International Conference, pp. 115–121. Springer International Publishing, Cham (2017)
2. Katzis, K., Efstathiades, C.: Resource management supporting big data for real-time applications in the 5G Era. In: Mavromoustakis, C.X., Mastorakis, G., Dobre, C. (eds.) Advances in mobile cloud computing and big data in the 5G era, pp. 289–307. Springer International Publishing, Cham (2017)
3. Rajaraman, V.: Toward a computing utility. Ann. Indian Natl. Acad. Eng. 3, 1–10 (2006)
4. Auradkar, P., et al.: Performance tuning analysis of spatial operations on Spatial Hadoop cluster with SSD. Procedia Comput. Sci. 167, 2253–2266 (2020)
5. Oussous, A., et al.: Big data technologies: a survey. J. King Saud Univ. Comput. Inform. Sci. 30, 431–448 (2017)