A Multi-Task Road Feature Extraction Network with Grouped Convolution and Attention Mechanisms
Author:
Zhu Wenjie1, Li Hongwei2, Cheng Xianglong1, Jiang Yirui1
Affiliation:
1. School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
2. School of Geo-Science & Technology, Zhengzhou University, Zhengzhou 450052, China
Abstract
To cope with the challenges of autonomous driving in complex road environments, collaborative multi-task perception has been proposed; this research direction explores new solutions at the application level and has become a topic of great interest. In natural language processing and recommendation systems, multi-task learning networks have been shown to reduce time, computing power, and storage across various task-coupling scenarios. Owing to these characteristics, multi-task learning has in recent years also been applied to visual road feature extraction. This article proposes a multi-task road feature extraction network that combines grouped convolution with Transformer and squeeze-and-excitation attention mechanisms. The network simultaneously performs drivable-area segmentation, lane-line segmentation, and traffic-object detection. Experimental results on the BDD-100K dataset show that the proposed method performs well across the different tasks and achieves higher accuracy than comparable algorithms. The proposed method offers new ideas and methods for autonomous road perception and the generation of high-accuracy maps in vision-based autonomous driving.
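The squeeze-and-excitation attention named in the abstract recalibrates channel responses: a global average pool "squeezes" each channel to a scalar, a small bottleneck MLP "excites" a per-channel gate, and the feature map is rescaled channel-wise. A minimal NumPy sketch of that mechanism follows; the function name, weight shapes, and reduction ratio `r` are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def squeeze_excitation(feature_map, w1, w2):
    """Hypothetical squeeze-and-excitation channel attention sketch.

    feature_map: (C, H, W) array; w1: (C//r, C) and w2: (C, C//r)
    are the bottleneck-MLP weights (biases omitted for brevity).
    """
    # Squeeze: global average pooling over the spatial dims -> (C,)
    z = feature_map.mean(axis=(1, 2))
    # Excitation: ReLU bottleneck, then a sigmoid gate in (0, 1)
    s = np.maximum(w1 @ z, 0.0)
    gate = 1.0 / (1.0 + np.exp(-(w2 @ s)))  # (C,)
    # Scale: reweight every channel by its learned gate
    return feature_map * gate[:, None, None]

rng = np.random.default_rng(0)
x = rng.standard_normal((8, 4, 4))   # toy feature map, C=8
r = 2                                # assumed reduction ratio
w1 = rng.standard_normal((8 // r, 8))
w2 = rng.standard_normal((8, 8 // r))
y = squeeze_excitation(x, w1, w2)
print(y.shape)  # (8, 4, 4) — same shape, channels rescaled
```

Because the gate lies strictly in (0, 1), each output channel is a shrunken copy of its input channel; in a trained network the gates learn to emphasize informative channels and suppress the rest.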
Funder
National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering, Biochemistry, Instrumentation, Atomic and Molecular Physics, and Optics, Analytical Chemistry
Cited by: 5 articles.