Distributed Subtrajectory Join on Massive Datasets

Published:2020-02-28 Issue:2 Volume:6 Page:1-29
ISSN:2374-0353
Container-title:ACM Transactions on Spatial Algorithms and Systems
language:en
Short-container-title:ACM Trans. Spatial Algorithms Syst.

Author:

Tampakis Panagiotis¹, Doulkeridis Christos¹, Pelekis Nikos¹, Theodoridis Yannis¹

Affiliation:

1. University of Piraeus, Piraeus, Greece

Abstract

Joining trajectory datasets is a significant operation in mobility data analytics and the cornerstone of various methods that aim to extract knowledge out of them. In the era of Big Data, the production of mobility data has become massive and, consequently, performing such an operation in a centralized way is not feasible. In this article, we address the problem of Distributed Subtrajectory Join processing by utilizing the MapReduce programming model. Compared to traditional trajectory join queries, this problem is even more challenging since the goal is to retrieve all the “maximal” portions of trajectories that are “similar.” We propose three solutions: (i) a well-designed basic solution, coined DTJb; (ii) a solution that uses a preprocessing step that repartitions the data, labeled DTJr; and (iii) a solution that, additionally, employs an indexing scheme, named DTJi. In our experimental study, we utilize a 56 GB dataset of real trajectories from the maritime domain, which, to the best of our knowledge, is the largest real dataset used for experimentation in the literature of trajectory data management. The results show that DTJi performs up to 16× faster compared with DTJb, 10× faster than DTJr, and 3× faster than the closest related state-of-the-art algorithm.

Funder

MASTER

Track&Know

Operational Program Competitiveness, Entrepreneurship, and Innovation

European Regional Development Fund of the European Union and Greek national funds

EU Horizon 2020 R&I Programme

datACRON

RESEARCH-CREATE-INNOVATE

Publisher

Association for Computing Machinery (ACM)

Subject

Discrete Mathematics and Combinatorics, Geometry and Topology, Computer Science Applications, Modelling and Simulation, Information Systems, Signal Processing

Link

https://dl.acm.org/doi/pdf/10.1145/3373642

References: 35 articles.

1. Pankaj K. Agarwal, Kyle Fox, Kamesh Munagala, Abhinandan Nath, Jiangwei Pan, and Erin Taylor. 2018. Subtrajectory clustering: Models and algorithms. In PODS. 75--87.

2. Hadoop GIS

3. Petko Bakalov and Vassilis J. Tsotras. 2006. Continuous spatiotemporal trajectory joins. In GSN. 109--128.

Cited by 18 articles.

1. Sub-trajectory clustering with deep reinforcement learning;The VLDB Journal;2024-01-25

2. Continuous frequent contact detection over moving objects;GeoInformatica;2023-07-17

3. A distributed framework for large-scale semantic trajectory similarity join;Multimedia Tools and Applications;2023-07-13

4. PMMTss: A Parallel Multi-Way Merging-Based Trajectory Similarity Search for a Million Metro Passengers;Applied Sciences;2023-07-07

5. Efficient Non-Learning Similar Subtrajectory Search;Proceedings of the VLDB Endowment;2023-07