Semantic stereo visual SLAM toward outdoor dynamic environments based on ORB-SLAM2-Reference-Cited by-同舟云学术

Semantic stereo visual SLAM toward outdoor dynamic environments based on ORB-SLAM2

Published:2023-01-27 Issue:3 Volume:50 Page:542-554
ISSN:0143-991X
Container-title:Industrial Robot: the international journal of robotics research and application
language:en
Short-container-title:IR

Author:

Li Yawen,Song Guangming,Hao Shuang,Mao Juzheng,Song Aiguo

Abstract

Purpose The prerequisite for most traditional visual simultaneous localization and mapping (V-SLAM) algorithms is that most objects in the environment should be static or in low-speed locomotion. These algorithms rely on geometric information of the environment and restrict the application scenarios with dynamic objects. Semantic segmentation can be used to extract deep features from images to identify dynamic objects in the real world. Therefore, V-SLAM fused with semantic information can reduce the influence from dynamic objects and achieve higher accuracy. This paper aims to present a new semantic stereo V-SLAM method toward outdoor dynamic environments for more accurate pose estimation. Design/methodology/approach First, the Deeplabv3+ semantic segmentation model is adopted to recognize semantic information about dynamic objects in the outdoor scenes. Second, an approach that combines prior knowledge to determine the dynamic hierarchy of moveable objects is proposed, which depends on the pixel movement between frames. Finally, a semantic stereo V-SLAM based on ORB-SLAM2 to calculate accurate trajectory in dynamic environments is presented, which selects corresponding feature points on static regions and eliminates useless feature points on dynamic regions. Findings The proposed method is successfully verified on the public data set KITTI and ZED2 self-collected data set in the real world. The proposed V-SLAM system can extract the semantic information and track feature points steadily in dynamic environments. Absolute pose error and relative pose error are used to evaluate the feasibility of the proposed method. Experimental results show significant improvements in root mean square error and standard deviation error on both the KITTI data set and an unmanned aerial vehicle. That indicates this method can be effectively applied to outdoor environments. Originality/value The main contribution of this study is that a new semantic stereo V-SLAM method is proposed with greater robustness and stability, which reduces the impact of moving objects in dynamic scenes.

Publisher

Emerald

Subject

Industrial and Manufacturing Engineering,Computer Science Applications,Control and Systems Engineering

Reference24 articles.

1. Stereo camera visual SLAM with hierarchical masking and motion-state classification at outdoor construction sites containing large dynamic objects;Advanced Robotics,2021

2. Utilization of semantic planes: improved localization and dense semantic map for monocular SLAM in urban environment;IEEE Robotics and Automation Letters,2021

3. DynaSLAM: tracking, mapping and inpainting in dynamic scenes;IEEE Robotics and Automation Letters,2018

4. Visual-inertial SLAM method based on optical flow in a GPS-denied environment;Industrial Robot: An International Journal,2018

5. Encoder-decoder with atrous separable convolution for semantic image segmentation,2018

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dynamic Object Detection and Tracking in Vision SLAM;Applied Mathematics and Nonlinear Sciences;2024-01-01