Generalized Sparse Convolutional Neural Networks for Semantic Segmentation of Point Clouds Derived from Tri-Stereo Satellite Imagery-Reference-Cited by-同舟云学术

Generalized Sparse Convolutional Neural Networks for Semantic Segmentation of Point Clouds Derived from Tri-Stereo Satellite Imagery

Published:2020-04-18 Issue:8 Volume:12 Page:1289
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Bachhofner Stefan^ORCID,Loghin Ana-Maria^ORCID,Otepka Johannes^ORCID,Pfeifer Norbert^ORCID,Hornacek Michael,Siposova Andrea^ORCID,Schmidinger Niklas,Hornik Kurt^ORCID,Schiller Nikolaus,Kähler Olaf,Hochreiter Ronald^ORCID

Abstract

We studied the applicability of point clouds derived from tri-stereo satellite imagery for semantic segmentation for generalized sparse convolutional neural networks by the example of an Austrian study area. We examined, in particular, if the distorted geometric information, in addition to color, influences the performance of segmenting clutter, roads, buildings, trees, and vehicles. In this regard, we trained a fully convolutional neural network that uses generalized sparse convolution one time solely on 3D geometric information (i.e., 3D point cloud derived by dense image matching), and twice on 3D geometric as well as color information. In the first experiment, we did not use class weights, whereas in the second we did. We compared the results with a fully convolutional neural network that was trained on a 2D orthophoto, and a decision tree that was once trained on hand-crafted 3D geometric features, and once trained on hand-crafted 3D geometric as well as color features. The decision tree using hand-crafted features has been successfully applied to aerial laser scanning data in the literature. Hence, we compared our main interest of study, a representation learning technique, with another representation learning technique, and a non-representation learning technique. Our study area is located in Waldviertel, a region in Lower Austria. The territory is a hilly region covered mainly by forests, agriculture, and grasslands. Our classes of interest are heavily unbalanced. However, we did not use any data augmentation techniques to counter overfitting. For our study area, we reported that geometric and color information only improves the performance of the Generalized Sparse Convolutional Neural Network (GSCNN) on the dominant class, which leads to a higher overall performance in our case. We also found that training the network with median class weighting partially reverts the effects of adding color. The network also started to learn the classes with lower occurrences. The fully convolutional neural network that was trained on the 2D orthophoto generally outperforms the other two with a kappa score of over 90% and an average per class accuracy of 61%. However, the decision tree trained on colors and hand-crafted geometric features has a 2% higher accuracy for roads.

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/12/8/1289/pdf

Reference161 articles.

1. Deep learning

2. Deep learning in neural networks: An overview

3. Machine learning: Trends, perspectives, and prospects

4. Advances in natural language processing

5. Representation Learning: A Review and New Perspectives

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multimodal Co-Learning for Building Change Detection: A Domain Adaptation Framework Using VHR Images and Digital Surface Models;IEEE Transactions on Geoscience and Remote Sensing;2024

2. A 2D/3D multimodal data simulation approach with applications on urban semantic segmentation, building extraction and change detection;ISPRS Journal of Photogrammetry and Remote Sensing;2023-11

3. Deep learning methods applied to digital elevation models: state of the art;Geocarto International;2023-09-06

4. Multimodal Co-learning: A Domain Adaptation Method for Building Extraction from Optical Remote Sensing Imagery;2023 Joint Urban Remote Sensing Event (JURSE);2023-05-17

5. A co-learning method to utilize optical images and photogrammetric point clouds for building extraction;International Journal of Applied Earth Observation and Geoinformation;2023-02