A Parallel Computing Approach to Spatial Neighboring Analysis of Large Amounts of Terrain Data Using Spark

Published: 2021-01-07 Issue: 2 Volume: 21 Page: 365
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Zhang Jianbo, Ye Zhuangzhuang, Zheng Kai

Abstract

Spatial neighboring analysis is an indispensable part of geo-raster spatial analysis. In the big data era, high-resolution raster data offer us abundant and valuable information, and also bring enormous computational challenges to the existing focal statistics algorithms. Simply employing the in-memory computing framework Spark to serve such applications might incur performance issues due to its lack of native support for spatial data. In this article, we present a Spark-based parallel computing approach for the focal algorithms of neighboring analysis. This approach implements efficient manipulation of large amounts of terrain data through three steps: (1) partitioning a raster digital elevation model (DEM) file into multiple square tile files by adopting a tile-based multifile storing strategy suitable for the Hadoop Distributed File System (HDFS), (2) performing the quintessential slope algorithm on these tile files using a dynamic calculation window (DCW) computing strategy, and (3) writing back and merging the calculation results into a whole raster file. Experiments with the digital elevation data of Australia show that the proposed computing approach can effectively improve the parallel performance of focal statistics algorithms. The results also show that the approach has almost the same calculation accuracy as that of ArcGIS. The proposed approach also exhibits good scalability when the number of Spark executors in clusters is increased.
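The three steps in the abstract (tile, compute slope per tile, merge) can be illustrated with a small single-machine sketch. It is not the authors' implementation. It assumes Horn's 3x3 finite-difference method for slope, and it copies a one-cell halo around each tile as a simple stand-in for the paper's dynamic calculation window strategy, whose details the abstract does not give. The function names (`split_into_tiles`, `slope_of_tile`, `merge_tiles`, `tiled_slope`) are hypothetical, and the plain loop over tiles marks where a framework such as Spark would distribute the work.

```python
import math

def horn_slope(window, cell_size):
    """Slope in degrees at the centre of a 3x3 elevation window (Horn's method, assumed)."""
    (a, b, c), (d, _, f), (g, h, i) = window
    dz_dx = ((c + 2 * f + i) - (a + 2 * d + g)) / (8 * cell_size)
    dz_dy = ((g + 2 * h + i) - (a + 2 * b + c)) / (8 * cell_size)
    return math.degrees(math.atan(math.hypot(dz_dx, dz_dy)))

def split_into_tiles(dem, tile_size):
    """Step 1: cut the DEM into square tiles, each padded with a one-cell halo."""
    rows, cols = len(dem), len(dem[0])

    def z(r, c):
        # Clamp indices so cells on the DEM border reuse the nearest elevation.
        return dem[min(max(r, 0), rows - 1)][min(max(c, 0), cols - 1)]

    tiles = []
    for r0 in range(0, rows, tile_size):
        for c0 in range(0, cols, tile_size):
            r1, c1 = min(r0 + tile_size, rows), min(c0 + tile_size, cols)
            padded = [[z(r, c) for c in range(c0 - 1, c1 + 1)]
                      for r in range(r0 - 1, r1 + 1)]
            tiles.append((r0, c0, padded))
    return tiles

def slope_of_tile(padded, cell_size):
    """Step 2: focal slope for every non-halo cell of one padded tile."""
    height, width = len(padded) - 2, len(padded[0]) - 2
    return [[horn_slope([row[c:c + 3] for row in padded[r:r + 3]], cell_size)
             for c in range(width)]
            for r in range(height)]

def merge_tiles(results, rows, cols):
    """Step 3: write each tile's result back into one full-size raster."""
    out = [[0.0] * cols for _ in range(rows)]
    for r0, c0, tile in results:
        for dr, row in enumerate(tile):
            out[r0 + dr][c0:c0 + len(row)] = row
    return out

def tiled_slope(dem, tile_size, cell_size):
    tiles = split_into_tiles(dem, tile_size)
    # Each tile is independent here, so this loop is what a cluster would parallelize.
    results = [(r0, c0, slope_of_tile(p, cell_size)) for r0, c0, p in tiles]
    return merge_tiles(results, len(dem), len(dem[0]))

# Synthetic 5x5 DEM: a plane rising 2 elevation units per cell in the x direction.
dem = [[2.0 * c for c in range(5)] for _ in range(5)]
tiled = tiled_slope(dem, tile_size=2, cell_size=1.0)
whole = tiled_slope(dem, tile_size=5, cell_size=1.0)
```

Because every tile carries the neighbouring cells it needs, the tiled result matches the single-tile result cell for cell. This is the property that lets a focal operation run on tiles independently. The halo costs some duplicated storage at tile boundaries.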

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/2/365/pdf

References (30 articles)

1. Comparison of Surface Water Volume Estimation Methodologies that Couple Surface Reflectance Data and Digital Terrain Models

2. LiDAR-based TWI and terrain attributes in improving parametric predictor for tree growth in southeast Finland

3. Reorienting with terrain slope and landmarks

4. Adaptive Slope Filtering of Airborne LiDAR Data in Urban Areas for Digital Terrain Model (DTM) Generation

Cited by 10 articles.

1. High-speed parallel segmentation algorithms of MeanShift for litchi canopies based on Spark and Hadoop; International Journal of Modeling, Simulation, and Scientific Computing; 2024-05-04

2. Development of a Low-Cost Distributed Computing Pipeline for High-Throughput Cotton Phenotyping; Sensors; 2024-02-02

3. Encryption Techniques for Hadoop Distributed File System (HDFS); 2023 5th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N); 2023-12-15

4. MultiscaleDTM: An open-source R package for multiscale geomorphometric analysis; Transactions in GIS; 2023-05-26

5. A Fast Large-Scale Path Planning Method on Lunar DEM Using Distributed Tile Pyramid Strategy; IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing; 2023