Affiliation:
1. Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei 106, Taiwan
2. Department of Mechanical Engineering, National Taiwan University, Taipei 106, Taiwan
Abstract
Obstacle avoidance is essential for the effective operation of autonomous mobile robots, enabling them to detect and navigate around obstacles in their environment. While deep learning provides significant benefits for autonomous navigation, it typically requires large, accurately labeled datasets, making the data’s preparation and processing time-consuming and labor-intensive. To address this challenge, this study introduces a transfer learning (TL)-based automatic labeling segmentation (ALS) framework. This framework utilizes a pretrained attention-based network, DifferNet, to efficiently perform semantic segmentation tasks on new, unlabeled datasets. DifferNet leverages prior knowledge from the Cityscapes dataset to identify high-entropy areas as road obstacles by analyzing differences between the input and resynthesized images. The resulting road anomaly map was refined using depth information to produce a robust drivable area and map of road anomalies. Several off-the-shelf RGB-D semantic segmentation neural networks were trained using pseudo-labels generated by the ALS framework, with validation conducted on the GMRPD dataset. Experimental results demonstrated that the proposed ALS framework achieved mean precision, mean recall, and mean intersection over union (IoU) rates of 80.31%, 84.42%, and 71.99%, respectively. The ALS framework, through the use of transfer learning and the DifferNet network, offers an efficient solution for semantic segmentation of new, unlabeled datasets, underscoring its potential for improving obstacle avoidance in autonomous mobile robots.
Funder
Ministry of Science and Technology, Taiwan
Reference51 articles.
1. Ozkan, Z., Bayhan, E., Namdar, M., and Basgumus, A. (2021, January 21). Object Detection and Recognition of Unmanned Aerial Vehicles Using Raspberry Pi Platform. Proceedings of the 2021 5th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT), Ankara, Türkiye.
2. ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates;Tao;IEEE Signal Process. Lett.,2024
3. Multi-Sensor Fusion Technology for 3D Object Detection in Autonomous Driving: A Review;Wang;IEEE Trans. Intell. Transp. Syst.,2023
4. AttentionTrack: Multiple Object Tracking in Traffic Scenarios Using Features Attention;Zhang;IEEE Trans. Intell. Transport. Syst.,2024
5. Xing, Y., Wang, J., Chen, X., and Zeng, G. (2019, January 22–25). Coupling Two-Stream RGB-D Semantic Segmentation Network by Idempotent Mappings. Proceedings of the 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan.