Real-Time Object Detection from UAV Inspection Videos by Combining YOLOv5s and DeepStream

Author:

Xie Shidun1,Deng Guanghong1,Lin Baihao1,Jing Wenlong2ORCID,Li Yong2,Zhao Xiaodan1

Affiliation:

1. Guangdong Engineering Technology Research Center of UAV Remote Sensing Network, Guangzhou iMapCloud Intelligent Technology Co., Ltd., Guangzhou 510095, China

2. Guangdong Province Engineering Laboratory for Geographic Spatiotemporal Big Data, Key Laboratory of Guangdong for Utilization of Remote Sensing and Geographical Information System, Guangdong Open Laboratory of Geospatial Information Technology and Application, Guangzhou Institute of Geography, Guangdong Academy of Sciences, Guangzhou 510070, China

Abstract

The high-altitude real-time inspection of unmanned aerial vehicles (UAVs) has always been a very challenging task. Because high-altitude inspections are susceptible to interference from different weather conditions, interference from communication signals and a larger field of view result in a smaller object area to be identified. We adopted a method that combines a UAV system scheduling platform with artificial intelligence object detection to implement the UAV automatic inspection technology. We trained the YOLOv5s model on five different categories of vehicle data sets, in which mAP50 and mAP50-95 reached 93.2% and 71.7%, respectively. The YOLOv5s model size is only 13.76 MB, and the detection speed of a single inspection photo reaches 11.26 ms. It is a relatively lightweight model and is suitable for deployment on edge devices for real-time detection. In the original DeepStream framework, we set up the http communication protocol to start quickly to enable different users to call and use it at the same time. In addition, asynchronous sending of alarm frame interception function was added and the auxiliary services were set up to quickly resume video streaming after interruption. We deployed the trained YOLOv5s model on the improved DeepStream framework to implement automatic UAV inspection.

Funder

GDAS’ Project of Science and Technology Development

National Key R&D Program of China

National Natural Science Foundation of China

Science and Technology Program of Guangdong

Science and Technology Program of Guangzhou

National Key Research and Development Program of China

Publisher

MDPI AG

Reference44 articles.

1. FAIRMOT: On the fairness of detection and reidentification in multiple object tracking;Zhang;Int. J. Comput. Vis.,2021

2. Deep multi-modal object detection and semantic segmentation for autonomous driving: Datasets, methods, and challenges;Feng;IEEE Trans. Intell. Transp. Syst.,2020

3. Jaeger, P.F., Kohl, S.A.A., Bickelhaupt, S., Isensee, F., Kuder, T.A., Schlemmer, H.-P., and Maier-Hein, K.H. (2020). Retina U-Net: Embarrassingly simple exploitation of segmentation supervision for medical object detection. arXiv.

4. Design of Fruit-Carrying Monitoring System for Monorail Transporter in Mountain Orchard;Li;J. Circuits Syst. Comput.,2023

5. MCUNet: Tiny deep learning on IoT devices;Lin;Adv. Neural Inf. Process. Syst. (NeurIPS),2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3