On-Board Small-Scale Object Detection for Unmanned Aerial Vehicles (UAVs)
Author:
Saeed Zubair1, Yousaf Muhammad Haroon12ORCID, Ahmed Rehan3, Velastin Sergio A.45ORCID, Viriri Serestina6ORCID
Affiliation:
1. Swarm Robotics Lab (SRL), National Center of Robotics and Automation (NCRA), University of Engineering and Technology (UET), Taxila 47080, Pakistan 2. Department of Computer Engineering, University of Engineering and Technology (UET), Taxila 47080, Pakistan 3. School of Electrical Engineering and Computer Science, National University of Sciences and Technology, Islamabad 24090, Pakistan 4. School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, UK 5. Department of Computer Science and Engineering, University Carlos III Madrid, 28911 Leganés, Spain 6. School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban 4041, South Africa
Abstract
Object detection is a critical task that becomes difficult when dealing with onboard detection using aerial images and computer vision technique. The main challenges with aerial images are small target sizes, low resolution, occlusion, attitude, and scale variations, which affect the performance of many object detectors. The accuracy of the detection and the efficiency of the inference are always trade-offs. We modified the architecture of CenterNet and used different CNN-based backbones of ResNet18, ResNet34, ResNet50, ResNet101, ResNet152, Res2Net50, Res2Net101, DLA-34, and hourglass14. A comparison of the modified CenterNet with nine CNN-based backbones is conducted and validated using three challenging datasets, i.e., VisDrone, Stanford Drone dataset (SSD), and AU-AIR. We also implemented well-known off-the-shelf object detectors, i.e., YoloV1 to YoloV7, SSD-MobileNet-V2, and Faster RCNN. The proposed approach and state-of-the-art object detectors are optimized and then implemented on cross-edge platforms, i.e., NVIDIA Jetson Xavier, NVIDIA Jetson Nano, and Neuro Compute Stick 2 (NCS2). A detailed comparison of performance between edge platforms is provided. Our modified CenterNet combination with hourglass as a backbone achieved 91.62%, 75.61%, and 34.82% mAP using the validation sets of AU-AIR, SSD, and VisDrone datasets, respectively. An FPS of 40.02 was achieved using the ResNet18 backbone. We also compared our approach with the latest cutting-edge research and found promising results for both discrete GPU and edge platforms.
Funder
Higher Education Commission of Pakistan funding
Subject
Artificial Intelligence,Computer Science Applications,Aerospace Engineering,Information Systems,Control and Systems Engineering
Reference40 articles.
1. Abbas, S.M., and Singh, S.N. (2018, January 9–10). Region-based object detection and classification using faster R-CNN. Proceedings of the 2018 4th International Conference on Computational Intelligence Communication Technology (CICT), Ghaziabad, India. 2. Enhanced semantic feature pyramid network for small object detection;Chen;Signal Process. Image Commun.,2023 3. Jung, H.-K., and Choi, G.-S. (2022). Improved YOLOv5: Efficient Object Detection Using Drone Images under Various Conditions. Appl. Sci., 12. 4. Saeed, Z., Awan, M.N.M., and Yousaf, M.H. (December, January 30). A Robust Approach for Small-Scale Object Detection From Aerial-View. Proceedings of the 2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA), Sydney, Australia. 5. Jawaharlalnehru, A., Sambandham, T., Sekar, V., Ravikumar, D., Loganathan, V., Kannadasan, R., Khan, A.A., Wechtaisong, C., Haq, M.A., and Alhussen, A. (2022). Target Object Detection from Unmanned Aerial Vehicle (UAV) Images Based on Improved YOLO Algorithm. Electronics, 11.
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|