Semantic Segmentation in Autonomous Driving using Multi-scale Feature Fusion and Graph Network
Author:
Li Yajing1, Zhang Yong1, Zhang Yuqing1, Piao Xinglin1, Pei Huimin2, Hu Yongli1
Affiliation:
1. Beijing University of Technology 2. Jiangsu Normal University
Abstract
Abstract
Semantic segmentation in urban scenes is an important task in computer vision. However, there are many challenges in urban road scenes, such as category imbalance and complex backgrounds. These problems lead to unclear edge segmentation and fuzzy categorization of occluded objects in existing semantic segmentation methods in urban scenes, which limits the accuracy and robustness in practical applications. In this paper, we propose a model that recursively enhances edge feature representation combined with local spatial context. We address the problem of unclear edge segmentation by introducing Multi-scale Central Difference Convolution (MS-CDC) to fuse multi-scale edge features. The FeedBack Connection (FBC) module based on feature pyramid fuses the multi-scale features while recursively augmenting the original network to make the occluded objects more robust. Meanwhile, we design a Local Feature Extraction (LFE) module to capture pixel-wise relationships by constructing local pixel graph and center pixel graph. It can learn local contextual information to extract finer pixel features. Experimental results on the Cityscapes and Mapillary Vista dataset validate the effectiveness of our proposed model. Our model can achieve new results by 80.67 \(%\) and 45.5$%$ mIoU on the Cityscapes and Mapillary Vista val set. We open-source our code at \href{https://github.com/sanmanaa/segmentation-autodriving-graph-centralconv}{https://github.com/sanmanaa/segmentation-autodriving-graph-centralconv}
Publisher
Springer Science and Business Media LLC
Reference54 articles.
1. Szegedy, Christian and Liu, Wei and Jia, Yangqing and Sermanet, Pierre and Reed, Scott and Anguelov, Dragomir and Erhan, Dumitru and Vanhoucke, Vincent and Rabinovich, Andrew (2015) Going deeper with convolutions. 1--9, Proceedings of the IEEE conference on computer vision and pattern recognition 2. Long, Jonathan and Shelhamer, Evan and Darrell, Trevor (2015) Fully convolutional networks for semantic segmentation. 3431--3440, Proceedings of the IEEE conference on computer vision and pattern recognition 3. Fang, Leyuan and Zhou, Peng and Liu, Xinxin and Ghamisi, Pedram and Chen, Siwei (2022) Context Enhancing Representation for Semantic Segmentation in Remote Sensing Images. IEEE Transactions on Neural Networks and Learning Systems IEEE 4. Zhang, Dong and Zhang, Hanwang and Tang, Jinhui and Wang, Meng and Hua, Xiansheng and Sun, Qianru (2020) Feature pyramid transformer. Springer, 323--339, Computer Vision--ECCV 2020: 16th European Conference, Glasgow, UK, August 23--28, 2020, Proceedings, Part XXVIII 16 5. Zhang, Zhenli and Zhang, Xiangyu and Peng, Chao and Xue, Xiangyang and Sun, Jian (2018) Exfuse: Enhancing feature fusion for semantic segmentation. 269--284, Proceedings of the European conference on computer vision (ECCV)
|
|