CF‐Net: Cross fusion network for semantic segmentation

Published: 2024-08-08
ISSN: 1751-9659
Container-title: IET Image Processing
Language: en
Short-container-title: IET Image Processing

Author:

Wang Baoyu¹, Shen Aihong¹, Dong Xu¹, Cao Pingping¹

Affiliation:

1. College of Basic Education and Research, Criminal Investigation Police University of China, Shenyang, China

Abstract

Semantic segmentation is a fundamental computer vision task, and deep learning methods have been successfully applied to it. However, predictions of target morphology remain incomplete, which is attributable to low feature utilisation and insufficient spatial location information. This paper proposes a novel cross fusion network with a unit attention mechanism (CF‐Net) for semantic segmentation. The two hallmarks of the framework are a multi‐scale fusion module and the unit attention mechanism. The multi‐scale fusion module integrates multi‐branch outputs with different receptive fields, capturing both fine‐grained target details and visual contextual information. The cross fusion network is optimised with the unit attention mechanism to fuse intermediate features, which yields more accurate and effective spatial location information while maintaining consistency in feature space. Experimental results show that the proposed CF‐Net compares favourably with existing methods on the CamVid, Cityscapes, and PASCAL VOC 2012 datasets, which also verifies the effectiveness and reliability of the method.
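
The abstract does not give the internals of the multi‐scale fusion module, so the following is only a toy sketch of the general idea of fusing branches with different receptive fields, not the CF‐Net implementation. It assumes 1-D box filters of increasing width as stand-ins for convolutional branches and a plain average as a stand-in for the learned fusion. The names `box_filter` and `multi_scale_fuse` are hypothetical, and the unit attention mechanism is not modelled.

```python
# Toy illustration only: this is NOT the CF-Net architecture.
# Assumption: each "branch" is a 1-D box filter; a larger radius
# means a larger receptive field (more context, less detail).


def box_filter(signal, radius):
    """Average each sample with its neighbours within `radius`.

    Windows are clipped at the edges of the signal.
    """
    n = len(signal)
    out = []
    for i in range(n):
        lo = max(0, i - radius)
        hi = min(n, i + radius + 1)
        window = signal[lo:hi]
        out.append(sum(window) / len(window))
    return out


def multi_scale_fuse(signal, radii=(0, 1, 2)):
    """Run one branch per radius, then fuse the branches by averaging.

    Assumption: a plain mean replaces whatever learned fusion the
    paper actually uses.
    """
    branches = [box_filter(signal, r) for r in radii]
    return [sum(vals) / len(vals) for vals in zip(*branches)]


# A single sharp peak: radius 0 preserves it, wider radii spread it out.
features = [0.0, 0.0, 6.0, 0.0, 0.0]
fused = multi_scale_fuse(features)
print(fused)
```

In this sketch the fused signal keeps its maximum at the original peak position, contributed mostly by the narrow branch, while the neighbouring positions pick up non-zero values from the wider branches. That is the detail-plus-context trade-off the abstract attributes to multi-branch fusion.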

Publisher

Institution of Engineering and Technology (IET)

References: 69 articles.
