Improving the model of object detection on aerial photographs and video in unmanned aerial systems-Reference-Cited by-同舟云学术

Improving the model of object detection on aerial photographs and video in unmanned aerial systems

Published:2022-02-28 Issue:9(115) Volume:1 Page:24-34
ISSN:1729-4061
Container-title:Eastern-European Journal of Enterprise Technologies
language:
Short-container-title:EEJET

Author:

Slyusar Vadym^ORCID,Protsenko Mykhailo^ORCID,Chernukha Anton^ORCID,Melkin Vasyl^ORCID,Biloborodov Oleh^ORCID,Samoilenko Mykola^ORCID,Kravchenko Olena^ORCID,Kalynychenko Halyna^ORCID,Rohovyi Anton^ORCID,Soloshchuk Mykhaylo^ORCID

Abstract

This paper considers a model of object detection on aerial photographs and video using a neural network in unmanned aerial systems. The development of artificial intelligence and computer vision systems for unmanned systems (drones, robots) requires the improvement of models for detecting and recognizing objects in images and video streams. The results of video and aerial photography in unmanned aircraft systems are processed by the operator manually but there are objective difficulties associated with the operator’s processing of a large number of videos and aerial photographs, so it is advisable to automate this process. Analysis of neural network models has revealed that the YOLOv5x model (USA) is most suitable, as a basic model, for performing the task of object detection on aerial photographs and video. The Microsoft COCO suite (USA) is used to train this model. This set contains more than 200,000 images across 80 categories. To improve the YOLOv5x model, the neural network was trained with a set of VisDrone 2021 images (China) with the choice of such optimal training parameters as the optimization algorithm SGD; the initial learning rate (step) of 0.0005; the number of epochs of 25. As a result, a new model of object detection on aerial photographs and videos with the proposed name VisDroneYOLOv5x was obtained. The effectiveness of the improved model was studied using aerial photographs and videos from the VisDrone 2021 set. To assess the effectiveness of the model, the following indicators were chosen as the main indicators: accuracy, sensitivity, the estimation of average accuracy. Using a convolutional neural network has made it possible to automate the process of object detection on aerial photographs and video in unmanned aerial systems.

Publisher

Private Company Technology Center

Subject

Applied Mathematics,Electrical and Electronic Engineering,Management of Technology and Innovation,Industrial and Manufacturing Engineering,Computer Science Applications,Mechanical Engineering,Energy Engineering and Power Technology,Control and Systems Engineering

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Modern Materials for Fire Protection of Reinforced Concrete Agro-Industrial Structures;Key Engineering Materials;2023-08-18

2. DEEP LEARNING BASED HUMAN ROBOT INTERACTION WITH 5G COMMUNICATION;Konya Journal of Engineering Sciences;2023-06-01

3. Application of Neural Network Technologies for Underwater Munitions Detection;Radioelectronics and Communications Systems;2022-12

4. Methodology for Armaments Identification Using a Neural Network;2022 IEEE 9th International Conference on Problems of Infocommunications, Science and Technology (PIC S&T);2022-10-10

5. Face Mask Wearing Detection Based on YOLOv5;International Journal of Advanced Network, Monitoring and Controls;2022-01-01