Affiliation:
1. China Aero Geophysical Survey and Remote Sensing Center for Natural Resources, Beijing 100083, China
2. School of Earth Sciences and Resources, China University of Geosciences (Beijing), Beijing 100083, China
3. School of Geosciences and Surveying Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China
Abstract
With the advancement of artificial intelligence, deep learning has become instrumental in land cover classification. While much work has focused on refining model structures to improve classification accuracy, data-driven optimization techniques play an equally pivotal role. This paper presents an in-depth investigation into optimizing multi-class land cover classification using high-resolution multispectral images from WorldView-3. We explore several optimization strategies, including refined sampling strategies, data band combinations, loss functions, and model enhancements. Together, these optimizations substantially increased the mean Intersection over Union (mIoU), from a baseline of 0.520 to a final value of 0.709, which represents a 35.2% enhancement. Specifically, by optimizing the classic semantic segmentation network in four key aspects, we improved the mIoU by 15.5%. Further improvements through changes in data combinations, sampling methods, and loss functions led to an overall 17.2% increase in mIoU. The proposed model optimization methods enabled the OUNet to outperform the baseline model, providing more precise edge detection and feature representation while reducing the number of model parameters. Experimental evidence shows that, in multi-class land cover classification, increasing the quantity and diversity of samples and avoiding data imbalance are as valuable for improving overall classification accuracy as enhancing the model itself.
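For readers unfamiliar with the metric, the following is a minimal sketch of how mIoU is commonly computed from a pixel-level confusion matrix. It is not the authors' evaluation code: the function names, the three-class toy labels, and the choice to skip classes that never occur are all assumptions made for illustration.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are reference classes, columns are predicted classes."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou(cm):
    """Average the per-class IoU = TP / (TP + FP + FN) over classes that occur."""
    n = len(cm)
    ious = []
    for c in range(n):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                         # reference pixels of class c that were missed
        fp = sum(cm[r][c] for r in range(n)) - tp    # pixels wrongly predicted as class c
        denom = tp + fp + fn
        if denom > 0:  # assumption: ignore classes absent from both maps
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three hypothetical classes (flattened label maps).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
miou = mean_iou(cm)  # per-class IoUs are 1/3, 2/3 and 1/2
```

Because every class contributes equally to the average regardless of its pixel count, rare classes weigh as much as common ones, which is one reason sample balance matters for this metric.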
Funder
National Key Research and Development Program of China