On-Board Small-Scale Object Detection for Unmanned Aerial Vehicles (UAVs)-Reference-Cited by-同舟云学术

On-Board Small-Scale Object Detection for Unmanned Aerial Vehicles (UAVs)

Published:2023-05-06 Issue:5 Volume:7 Page:310
ISSN:2504-446X
Container-title:Drones
language:en
Short-container-title:Drones

Author:

Saeed Zubair¹,Yousaf Muhammad Haroon¹²^ORCID,Ahmed Rehan³,Velastin Sergio A.⁴⁵^ORCID,Viriri Serestina⁶^ORCID

Affiliation:

1. Swarm Robotics Lab (SRL), National Center of Robotics and Automation (NCRA), University of Engineering and Technology (UET), Taxila 47080, Pakistan

2. Department of Computer Engineering, University of Engineering and Technology (UET), Taxila 47080, Pakistan

3. School of Electrical Engineering and Computer Science, National University of Sciences and Technology, Islamabad 24090, Pakistan

4. School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, UK

5. Department of Computer Science and Engineering, University Carlos III Madrid, 28911 Leganés, Spain

6. School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban 4041, South Africa

Abstract

Object detection is a critical task that becomes difficult when dealing with onboard detection using aerial images and computer vision technique. The main challenges with aerial images are small target sizes, low resolution, occlusion, attitude, and scale variations, which affect the performance of many object detectors. The accuracy of the detection and the efficiency of the inference are always trade-offs. We modified the architecture of CenterNet and used different CNN-based backbones of ResNet18, ResNet34, ResNet50, ResNet101, ResNet152, Res2Net50, Res2Net101, DLA-34, and hourglass14. A comparison of the modified CenterNet with nine CNN-based backbones is conducted and validated using three challenging datasets, i.e., VisDrone, Stanford Drone dataset (SSD), and AU-AIR. We also implemented well-known off-the-shelf object detectors, i.e., YoloV1 to YoloV7, SSD-MobileNet-V2, and Faster RCNN. The proposed approach and state-of-the-art object detectors are optimized and then implemented on cross-edge platforms, i.e., NVIDIA Jetson Xavier, NVIDIA Jetson Nano, and Neuro Compute Stick 2 (NCS2). A detailed comparison of performance between edge platforms is provided. Our modified CenterNet combination with hourglass as a backbone achieved 91.62%, 75.61%, and 34.82% mAP using the validation sets of AU-AIR, SSD, and VisDrone datasets, respectively. An FPS of 40.02 was achieved using the ResNet18 backbone. We also compared our approach with the latest cutting-edge research and found promising results for both discrete GPU and edge platforms.

Funder

Higher Education Commission of Pakistan funding

Publisher

MDPI AG

Subject

Artificial Intelligence,Computer Science Applications,Aerospace Engineering,Information Systems,Control and Systems Engineering

Link

https://www.mdpi.com/2504-446X/7/5/310/pdf

Reference40 articles.

1. Abbas, S.M., and Singh, S.N. (2018, January 9–10). Region-based object detection and classification using faster R-CNN. Proceedings of the 2018 4th International Conference on Computational Intelligence Communication Technology (CICT), Ghaziabad, India.

2. Enhanced semantic feature pyramid network for small object detection;Chen;Signal Process. Image Commun.,2023

3. Jung, H.-K., and Choi, G.-S. (2022). Improved YOLOv5: Efficient Object Detection Using Drone Images under Various Conditions. Appl. Sci., 12.