Alignment Integration Network for Salient Object Detection and Its Application for Optical Remote Sensing Images
Author:
Zhang Xiaoning12ORCID, Yu Yi1, Wang Yuqing1, Chen Xiaolin1, Wang Chenglong1
Affiliation:
1. Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China 2. University of Chinese Academy of Sciences, Beijing 100049, China
Abstract
Salient object detection has made substantial progress due to the exploitation of multi-level convolutional features. The key point is how to combine these convolutional features effectively and efficiently. Due to the step by step down-sampling operations in almost all CNNs, multi-level features usually have different scales. Methods based on fully convolutional networks directly apply bilinear up-sampling to low-resolution deep features and then combine them with high-resolution shallow features by addition or concatenation, which neglects the compatibility of features, resulting in misalignment problems. In this paper, to solve the problem, we propose an alignment integration network (ALNet), which aligns adjacent level features progressively to generate powerful combinations. To capture long-range dependencies for high-level integrated features as well as maintain high computational efficiency, a strip attention module (SAM) is introduced into the alignment integration procedures. Benefiting from SAM, multi-level semantics can be selectively propagated to predict precise salient objects. Furthermore, although integrating multi-level convolutional features can alleviate the blur boundary problem to a certain extent, it is still unsatisfactory for the restoration of a real object boundary. Therefore, we design a simple but effective boundary enhancement module (BEM) to guide the network focus on boundaries and other error-prone parts. Based on BEM, an attention weighted loss is proposed to boost the network to generate sharper object boundaries. Experimental results on five benchmark datasets demonstrate that the proposed method can achieve state-of-the-art performance on salient object detection. Moreover, we extend the experiments on the remote sensing datasets, and the results further prove the universality and scalability of ALNet.
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference71 articles.
1. Fan, D., Gong, C., Cao, Y., Ren, B., Cheng, M.-M., and Borji, A. (2018, January 13–19). Enhanced-alignment measure for binary foreground map evaluation. Proceedings of the International Joint Conference on Artificial Intelligence, Stockholm, Sweden. 2. Scalable mobile image retrieval by exploring contextual saliency;Yang;IEEE Trans. Image Process.,2015 3. Biologically inspired object tracking using center-surround saliency mechanisms;Mahadevan;IEEE Trans. Pattern Anal. Mach. Intell.,2013 4. Borji, A., Frintrop, S., Sihite, D.N., and Itti, L. (2012, January 16–21). Adaptive object tracking by learning background context. Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Providence, RI, USA. 5. Chen, F., Liu, H., Zeng, Z., Zhou, X., and Tan, X. (2022). Bes-net: Boundary enhancing semantic context network for high-resolution image semantic segmentation. Remote Sens., 14.
|
|