Vision-Guided Object Recognition and 6D Pose Estimation System Based on Deep Neural Network for Unmanned Aerial Vehicles towards Intelligent Logistics
-
Published:2022-12-22
Issue:1
Volume:13
Page:115
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Luo Sijin, Liang Yu, Luo Zhehao, Liang GuoyuanORCID, Wang CanORCID, Wu Xinyu
Abstract
Unmanned aerial vehicle (UAV) express delivery is facing a period of rapid development and continues to promote the aviation logistics industry due to its advantages of elevated delivery efficiency and low labor costs. Automatic detection, localization, and estimation of 6D poses of targets in dynamic environments are key prerequisites for UAV intelligent logistics. In this study, we proposed a novel vision system based on deep neural networks to locate targets and estimate their 6D pose parameters from 2D color images and 3D point clouds captured by an RGB-D sensor mounted on a UAV. The workflow of this system can be summarized as follows: detect the targets and locate them, separate the object region from the background using a segmentation network, and estimate the 6D pose parameters from a regression network. The proposed system provides a solid foundation for various complex operations for UAVs. To better verify the performance of the proposed system, we built a small dataset called SIAT comprising some household staff. Comparative experiments with several state-of-the-art networks on the YCB-Video dataset and SIAT dataset verified the effectiveness, robustness, and superior performance of the proposed method, indicating its promising applications in UAV-based delivery tasks.
Funder
National Natural Science Foundation of China Shenzhen Science and Technology Innovation Committee Shenzhen Key Fundamental Research Project
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference51 articles.
1. Estimation of leaf area index of sugarcane using crop surface model based on UAV image;Yang;Trans. Chin. Soc. Agric. Eng.,2017 2. Viguier, R., Lin, C.C., Aliakbarpour, H., Bunyak, F., Pankanti, S., Seetharaman, G., and Palaniappan, K. (2015, January 14–16). Automatic Video Content Summarization Using Geospatial Mosaics of Aerial Imagery. Proceedings of the 2015 IEEE International Symposium on Multimedia (ISM), Miami, FL, USA. 3. Thomas, J., Loianno, G., Daniilidis, K., and Kumar, V. (2016, January 17–21). The role of vision in perching and grasping for MAVs. Proceedings of the Micro- & Nanotechnology Sensors, Systems, & Applications VIII, Baltimore, MD, USA. 4. Visual Servoing of Quadrotors for Perching by Hanging from Cylindrical Objects;Thomas;IEEE Robot. Autom. Lett.,2016 5. Kehl, W., Manhardt, F., Tombari, F., Ilic, S., and Navab, N. (2017, January 22–29). SSD-6D: Making RGB-based 3D detection and 6D pose estimation great again. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|