Affiliation:
1. Institute for Infocomm Research, A*STAR, Singapore
2. GrabTaxi Holdings, Singapore
3. Northwestern Polytechnical University, China
4. Grab-NUS AI Lab, NUS, Singapore
Abstract
Automatic inference of missing road attributes (e.g., road type and speed limit) for enriching digital maps has attracted significant research attention in recent years. A number of machine learning-based approaches have been proposed to detect road attributes from GPS traces, dash-cam videos, or satellite images. However, existing solutions mostly focus on a single modality without modeling the correlations among multiple data sources. To bridge this gap, we present a multimodal road attribute detection method, which improves the robustness by performing pixel-level fusion of crowdsourced GPS traces and satellite images. A GPS trace is usually given by a sequence of location, bearing, and speed. To align it with satellite imagery in the spatial domain, we render GPS traces into a sequence of multi-channel images that simultaneously capture the global distribution of the GPS points, the local distribution of vehicles’ moving directions and speeds, and their temporal changes over time, at each pixel. Unlike previous GPS-based road feature extraction methods, our proposed GPS rendering does not require map matching in the data preprocessing step. Moreover, our multimodal solution addresses single-modal challenges such as occlusions in satellite images and data sparsity in GPS traces by learning the pixel-wise correspondences among different data sources. On top of this, we observe that geographic objects and their attributes in the map are not isolated but correlated with each other. Thus, if a road is partially labeled, then the existing information can be of great help on inferring the missing attributes. To fully use the existing information, we extend our model and discuss the possibilities for further performance improvement when partially labeled map data is available. Extensive experiments have been conducted on two real-world datasets in Singapore and Jakarta. Compared with previous work, our method is able to improve the detection accuracy on road attributes by a large margin.
Funder
Grab-NUS AI Lab
GrabTaxi Holdings Pte. Ltd.
National University of Singapore
Industrial Postgraduate Program
Economic Development Board of Singapore
Singapore Ministry of Education Academic Research Fund Tier 2
Publisher
Association for Computing Machinery (ACM)
Subject
Discrete Mathematics and Combinatorics,Geometry and Topology,Computer Science Applications,Modeling and Simulation,Information Systems,Signal Processing
Reference43 articles.
1. DiDi Chuxing. 2022. GAIA Open Dataset Initiative. Retrieved from https://outreach.didichuxing.com/research/opendata/en/
2. Mapbox. 2022. Static Images API. Retrieved from https://docs.mapbox.com/api/maps/static-images/
3. Kaggle. 2022. Uber Pickups in New York City. Retrieved from https://www.kaggle.com/fivethirtyeight/uber-pickups-in-new-york-city
4. Robust and ubiquitous smartphone-based lane detection
5. Heba Aly and Moustafa Youssef. 2015. semMatch: Road semantics-based accurate map matching for challenging positioning data. In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 5:1–5:10.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献