Refined Division Features Based on Transformer for Semantic Image Segmentation

Published: 2023-08-19 Volume: 2023 Page: 1-15
ISSN: 1098-111X
Container-title: International Journal of Intelligent Systems
Language: en

Author:

Li Tianping¹, Wei Yanjun¹, Liu Meilin¹, Yang Xiaolong¹, Zhang Zhenyi¹, Du Jun¹

Affiliation:

1. School of Physics and Electronics, Shandong Normal University, Jinan, Shandong, China

Abstract

Transformers can build global relationships between pixels and enhance pixel representations. Existing methods establish context relationships only at the level of the whole image, which weakens the representation between category regions. In addition, existing methods based on transformer self-attention do not combine the advantages of convolution and transformers, which results in a larger number of model parameters. To address these two problems, this paper proposes to improve segmentation accuracy and performance by strengthening both the relationships between image-level regions and the relationships between semantic-level pixels. First, we design a refined division feature (RDF) module that enhances the channel representation and thereby the representation of pixels within the same region. Second, we design a convolution-based transformer (CTrans), which first computes the relationships between similar pixels to enhance the pixel representation. The feature map is then compressed and enriched to reduce the computational load of CTrans, and finally the relationships between pixels are established from a global perspective. Combining the RDF and CTrans modules, we build the RFT model (refined division features based on transformer for semantic image segmentation). Experimental results show that our method achieves an mIoU of 81.9% on the Cityscapes test set with 64.6M model parameters, which compares favorably with other methods on these measures. In addition, visualization experiments on the Cityscapes and PASCAL VOC 2012 datasets show that our method outperforms the other methods compared.
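
For readers unfamiliar with the mIoU figure quoted in the abstract, the following is a minimal sketch of how mean intersection-over-union is commonly computed from predicted and ground-truth class labels. It is not taken from the paper: the function name `mean_iou` and the toy labels are made up for illustration, and the sketch omits details of the actual benchmark protocol, such as ignored "void" pixels and accumulation of the confusion matrix over a whole dataset.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union over classes present in pred or target.

    pred and target are flat sequences of integer class labels, one per pixel.
    (Hypothetical helper for illustration, not the paper's evaluation code.)
    """
    # Confusion matrix: rows are ground-truth classes, columns are predictions.
    conf = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        conf[t][p] += 1

    ious = []
    for c in range(num_classes):
        tp = conf[c][c]                                        # correctly labeled pixels
        fn = sum(conf[c]) - tp                                 # class-c pixels that were missed
        fp = sum(conf[r][c] for r in range(num_classes)) - tp  # pixels wrongly labeled c
        union = tp + fp + fn
        if union > 0:  # skip classes absent from both prediction and ground truth
            ious.append(tp / union)
    return sum(ious) / len(ious)


# Toy, made-up labels for a flattened 2x3 image with 3 classes.
target = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")
```

Each class contributes its own IoU with equal weight, so small classes influence the score as much as large ones; this is one reason mIoU is preferred over plain pixel accuracy for segmentation.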

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

Artificial Intelligence, Human-Computer Interaction, Theoretical Computer Science, Software

Link

http://downloads.hindawi.com/journals/ijis/2023/6358162.pdf

Reference: 58 articles.

1. Fully Convolutional Networks for Semantic Segmentation

2. Research on image inpainting algorithm of improved total variation minimization method

3. FFTI: Image inpainting algorithm via features fusion and two-steps inpainting

4. Improved anti-occlusion object tracking algorithm using Unscented Rauch-Tung-Striebel smoother and kernel correlation filter

5. Image super-resolution reconstruction based on feature map attention mechanism

Cited by 3 articles.

1. Semantic segmentation feature fusion network based on transformer;2024-06-27

2. Combining transformer global and local feature extraction for object detection;Complex & Intelligent Systems;2024-04-15

3. Research progress and challenges in real-time semantic segmentation for deep learning;Journal of Image and Graphics;2024