Affiliation:
1. Project Construction Management Company of Jiangxi Transportation Investment Group Co., Ltd., Nanchang 330108, China
2. School of Computer and Control Engineering, Yantai University, Yantai 264005, China
3. School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
Abstract
Unmanned aerial vehicle (UAV) aerial sensors are an important means of collecting ground image data. Road segmentation of drivable areas and vehicle detection in UAV aerial images can be applied to road monitoring, traffic flow detection, and traffic management, and can be integrated with intelligent transportation systems to support the work of transportation departments. Existing algorithms realize only a single task, whereas intelligent transportation requires the simultaneous processing of multiple tasks, so they cannot meet complex practical needs. Moreover, UAV aerial images are characterized by variable road scenes, numerous small targets, and dense vehicles, all of which make these tasks difficult. In response to these issues, we propose to implement road segmentation and on-road vehicle detection in the same framework for UAV aerial images, and we conduct experiments on a dataset we construct from the DroneVehicle dataset. For road segmentation, we propose a new algorithm, C-DeepLabV3+. It introduces the coordinate attention (CA) module, which yields more accurate location information for segmentation targets and makes target edges more continuous, as well as a cascade feature fusion module, which prevents the loss of detail information and achieves better segmentation performance. For vehicle detection, we propose an improved algorithm, S-YOLOv5, which adds the parameter-free lightweight attention module SimAM. Finally, the proposed road segmentation–vehicle detection framework unites the C-DeepLabV3+ and S-YOLOv5 algorithms to perform the two tasks in series. Experimental results on the constructed ViDroneVehicle dataset show that C-DeepLabV3+ achieves an mPA of 98.75% and an mIoU of 97.53%, segmenting the road area more accurately and better resolving occlusion. S-YOLOv5 achieves an mAP of 97.40%, exceeding YOLOv5's 96.95%, and effectively reduces vehicle omission and false detection rates. Both algorithms outperform multiple state-of-the-art methods. The overall framework proposed in this paper has superior performance and is capable of realizing high-quality, high-precision road segmentation and vehicle detection from UAV aerial images.
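The SimAM module used in S-YOLOv5 reweights every activation with a closed-form, energy-based sigmoid weight and adds no learnable parameters. A minimal NumPy sketch of that per-neuron weighting, following the published SimAM formulation (the regularizer value `e_lambda` and the single-image `(C, H, W)` layout are assumptions for illustration):

```python
import numpy as np

def simam(x, e_lambda=1e-4):
    """Parameter-free SimAM attention over a feature map x of shape (C, H, W)."""
    c, h, w = x.shape
    n = h * w - 1  # number of "other" neurons per channel
    mu = x.mean(axis=(1, 2), keepdims=True)          # per-channel mean
    d = (x - mu) ** 2                                # squared deviation of each neuron
    v = d.sum(axis=(1, 2), keepdims=True) / n        # per-channel variance estimate
    e_inv = d / (4 * (v + e_lambda)) + 0.5           # inverse energy: higher = more distinctive
    return x * (1.0 / (1.0 + np.exp(-e_inv)))        # sigmoid-weighted features
```

Because the weight is a fixed function of the channel statistics, the module can be dropped into a backbone (here, between YOLOv5 convolution stages) without increasing the parameter count.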
Funder
Natural Science Foundation of Shandong Province