SIDELOADING – INGESTION OF LARGE POINT CLOUDS INTO THE APACHE SPARK BIG DATA ENGINE-Reference-Cited by-同舟云学术

SIDELOADING – INGESTION OF LARGE POINT CLOUDS INTO THE APACHE SPARK BIG DATA ENGINE

Published:2016-06-07 Issue: Volume:XLI-B2 Page:343-348
ISSN:2194-9034
Container-title:The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
language:en
Short-container-title:Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.

Author:

Boehm J.,Liu K.,Alis C.

Abstract

Abstract. In the geospatial domain we have now reached the point where data volumes we handle have clearly grown beyond the capacity of most desktop computers. This is particularly true in the area of point cloud processing. It is therefore naturally lucrative to explore established big data frameworks for big geospatial data. The very first hurdle is the import of geospatial data into big data frameworks, commonly referred to as data ingestion. Geospatial data is typically encoded in specialised binary file formats, which are not naturally supported by the existing big data frameworks. Instead such file formats are supported by software libraries that are restricted to single CPU execution. We present an approach that allows the use of existing point cloud file format libraries on the Apache Spark big data framework. We demonstrate the ingestion of large volumes of point cloud data into a compute cluster. The approach uses a map function to distribute the data ingestion across the nodes of a cluster. We test the capabilities of the proposed method to load billions of points into a commodity hardware compute cluster and we discuss the implications on scalability and performance. The performance is benchmarked against an existing native Apache Spark data import implementation.

Publisher

Copernicus GmbH

Link

https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLI-B2/343/2016/isprs-archives-XLI-B2-343-2016.pdf

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SPSLiDAR: towards a multi-purpose repository for large scale LiDAR datasets;International Journal of Geographical Information Science;2022-03-03

2. Big Data in Smart City: Management Challenges;Applied Sciences;2021-05-17

3. Future Location Prediction for Emergency Vehicles Using Big Data: A Case Study of Healthcare Engineering;Journal of Healthcare Engineering;2020-11-27