RiSSNet: Contrastive Learning Network with a Relaxed Identity Sampling Strategy for Remote Sensing Image Semantic Segmentation

Published:2023-07-06 Issue:13 Volume:15 Page:3427
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Li Haifeng¹², Jing Wenxuan¹, Wei Guo³, Wu Kai³, Su Mingming³, Liu Lu³, Wu Hao³, Li Penglong⁴, Qi Ji¹

Affiliation:

1. School of Geosciences and Info-Physics, Central South University, Changsha 410083, China

2. Xiangjiang Laboratory, Changsha 410205, China

3. Inner Mongolia Civil-Military Integration Development Research Center, Hohhot 010070, China

4. Chongqing Geomatic and Remote Sensing Center, Chongqing 401147, China

Abstract

Contrastive learning techniques make it possible to pretrain a general model in a self-supervised paradigm using a large number of unlabeled remote sensing images. The core idea is to pull positive samples defined by data augmentation techniques closer together while pushing apart randomly sampled negative samples, which serves as the supervisory signal. This strategy is based on the strict identity hypothesis, i.e., positive samples are strictly defined by each (anchor) sample’s own augmentation transformation. However, this leads to the over-instancing of the features learned by the model and the loss of the ability to fully identify ground objects. Therefore, we propose a relaxed identity hypothesis governing the feature distribution of different instances within the same class of features. Implementing the relaxed identity hypothesis requires the sampling and discrimination of relaxed identical samples. In this study, to sample relaxed identical samples under the unsupervised learning paradigm, we exploited the fact that nearby objects in remote sensing images are often strongly correlated: neighborhood sampling was carried out around the anchor sample, and the similarity between the sampled samples and the anchor samples was defined as the semantic similarity. To achieve sample discrimination under the relaxed identity hypothesis, the feature loss was calculated and reordered for the samples in the relaxed identical sample queue and the anchor samples, and the feature loss between the anchor samples and the sample queue was defined as the feature similarity. Through the sampling and discrimination of the relaxed identical samples, the leap from instance-level features to class-level features was achieved to a certain extent while enhancing the network’s invariant learning of features.
We validated the effectiveness of the proposed method on three datasets, and our method achieved the best experimental results on all three datasets compared to six self-supervised methods.
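
The two steps named in the abstract, neighborhood sampling around an anchor and reordering the sampled queue by feature similarity, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`sample_neighbors`, `toy_encoder`, `rank_queue`), the patch size, the offset range, and the use of cosine similarity in place of the paper's feature loss are all assumptions made for illustration, and `toy_encoder` is only a stand-in for a learned network.

```python
import numpy as np


def sample_neighbors(image, top, left, size, num_neighbors, max_offset, rng):
    """Crop patches whose top-left corner lies within `max_offset` pixels
    of the anchor patch at (top, left). Hypothetical helper."""
    height, width = image.shape[:2]
    patches = []
    for _ in range(num_neighbors):
        dy, dx = rng.integers(-max_offset, max_offset + 1, size=2)
        # Clamp so that every crop stays fully inside the image.
        y = int(np.clip(top + dy, 0, height - size))
        x = int(np.clip(left + dx, 0, width - size))
        patches.append(image[y:y + size, x:x + size])
    return np.stack(patches)


def toy_encoder(patches):
    """Stand-in for a learned encoder (assumption): per-channel mean and
    standard deviation of each patch."""
    flat = patches.reshape(patches.shape[0], -1, patches.shape[-1])
    return np.concatenate([flat.mean(axis=1), flat.std(axis=1)], axis=1)


def rank_queue(anchor_feat, queue_feats, eps=1e-12):
    """Order the sampled queue from most to least similar to the anchor.
    Cosine similarity is an assumed proxy for the paper's feature loss."""
    a = anchor_feat / (np.linalg.norm(anchor_feat) + eps)
    q = queue_feats / (np.linalg.norm(queue_feats, axis=1, keepdims=True) + eps)
    sims = q @ a
    order = np.argsort(-sims)
    return order, sims[order]


# Usage on a synthetic image (random values, not real remote sensing data).
rng = np.random.default_rng(0)
image = rng.random((256, 256, 3))
top, left, size = 100, 100, 32

anchor = image[top:top + size, left:left + size]
neighbors = sample_neighbors(image, top, left, size,
                             num_neighbors=8, max_offset=16, rng=rng)

anchor_feat = toy_encoder(anchor[None])[0]
order, sims = rank_queue(anchor_feat, toy_encoder(neighbors))
```

In this sketch, `order` lists the neighbor indices from most to least similar to the anchor. A training loop could then treat the top-ranked neighbors as relaxed positives, although how the ranking enters the loss is specified in the paper itself and is not reproduced here.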

Funder

Chongqing Natural Science Foundation Project

Chongqing Talent Plan “Contract System” Project

Major Special Project of High-Resolution Earth Observation System

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/13/3427/pdf

Cited by 1 article.

1. CDEST: Class Distinguishability-Enhanced Self-Training Method for Adopting Pre-Trained Models to Downstream Remote Sensing Image Semantic Segmentation;Remote Sensing;2024-04-06