Improvement of the model of object recognition in aero photographs using deep convolutional neural networks

Published:2021-10-31 Issue:2 (113) Volume:5 Page:6-21
ISSN:1729-4061
Container-title:Eastern-European Journal of Enterprise Technologies
Short-container-title:EEJET

Author:

Slyusar Vadym, Protsenko Mykhailo, Chernukha Anton, Kovalov Pavlo, Borodych Pavlo, Shevchenko Serhii, Chernikov Oleksandr, Vazhynskyi Serhii, Bogatov Oleg, Khrustalev Kirill

Abstract

Detecting and recognizing objects in images is the main problem that computer vision systems must solve. As part of solving this problem, the model of object recognition in aerial photographs taken from unmanned aerial vehicles has been improved. A study of object recognition in aerial photographs using deep convolutional neural networks has been carried out. Analysis of possible implementations showed that the AlexNet 2012 model (Canada) trained on the ImageNet image set (USA) is the most suitable for solving this problem. This model was used as the base model. Its object recognition error on the ImageNet test set was 15 %. To improve the effectiveness of object recognition in aerial photographs for 10 classes of images, the final fully connected layer was reduced from 1,000 to 10 neurons, and the resulting model underwent additional two-stage training. The additional training used a set of images prepared from aerial photographs at stage 1 and the VisDrone 2021 (China) image set at stage 2. The training parameters selected as optimal were a learning rate (step) of 0.0001 and 100 epochs. The result is a new model, for which the name AlexVisDrone is proposed. The effectiveness of the proposed model was checked on a test set of 100 images per class (10 classes in total). Accuracy and sensitivity were chosen as the main indicators of model effectiveness. Recognition accuracy increased by 7 % (for images from aerial photographs) to 9 % (for the VisDrone 2021 set), which indicates that the choice of neural network architecture and training parameters was correct. The proposed model makes it possible to automate object recognition in aerial photographs.
In the future, it is advisable to use this model at ground control stations of unmanned aerial vehicle complexes when processing aerial photographs taken from unmanned aerial vehicles, in robotic systems, in video surveillance complexes, and when designing unmanned vehicle systems.
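
The abstract names accuracy and sensitivity as the evaluation indicators but does not give their formulas. The sketch below assumes the standard one-vs-rest definitions computed from a confusion matrix, which may differ from the ones the authors used. The function names and the 3-class toy matrix (100 test images per class, mirroring the per-class test set size) are hypothetical illustrations, not data from the paper.

```python
from typing import Dict, List


def per_class_metrics(confusion: List[List[int]]) -> Dict[int, Dict[str, float]]:
    """Per-class accuracy and sensitivity (one-vs-rest).

    confusion[i][j] is the number of images whose true class is i
    and whose predicted class is j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    metrics: Dict[int, Dict[str, float]] = {}
    for k in range(n):
        tp = confusion[k][k]                               # class k, predicted k
        fn = sum(confusion[k]) - tp                        # class k, predicted other
        fp = sum(confusion[i][k] for i in range(n)) - tp   # other class, predicted k
        tn = total - tp - fn - fp                          # everything else
        metrics[k] = {
            "accuracy": (tp + tn) / total,
            "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        }
    return metrics


def overall_accuracy(confusion: List[List[int]]) -> float:
    """Share of all test images that were assigned their true class."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[k][k] for k in range(len(confusion)))
    return correct / total


# Hypothetical toy data: 3 classes, 100 test images each (not from the paper).
toy_confusion = [
    [90, 5, 5],
    [10, 80, 10],
    [0, 20, 80],
]

for cls, values in per_class_metrics(toy_confusion).items():
    print(cls, round(values["accuracy"], 4), round(values["sensitivity"], 4))
print("overall accuracy:", round(overall_accuracy(toy_confusion), 4))
```

With a 10-class test set as described in the abstract, the same functions would take a 10 x 10 matrix whose rows each sum to 100.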

Publisher

Private Company Technology Center

Subject

Applied Mathematics, Electrical and Electronic Engineering, Management of Technology and Innovation, Industrial and Manufacturing Engineering, Computer Science Applications, Mechanical Engineering, Energy Engineering and Power Technology, Control and Systems Engineering

Cited by 7 articles.

1. Identification of varieties in Camellia oleifera leaf based on deep learning technology;Industrial Crops and Products;2024-09

2. Experimental Investigation of the Pyrolysis of Synthetic Materials Exposed to External and Internal Fires;Key Engineering Materials;2023-08-18

3. Modern Materials for Fire Protection of Reinforced Concrete Agro-Industrial Structures;Key Engineering Materials;2023-08-18

4. A Conceptual Model for Increasing the Speed of Decision-Making Based on Images Obtained from UAVs;Mathematical Modeling and Simulation of Systems;2023

5. Improved PSP and U-Net Architectures for Forest Segmentation in Remote Sensing Pictures;2022 IEEE 2nd Ukrainian Microwave Week (UkrMW);2022-11-14