Affiliation:
1. School of Automotive Studies, Tongji University, Shanghai 201800, China
2. Nanchang Automotive Institute of Intelligence & New Energy, Tongji University, Nanchang 330052, China
Abstract
Most visual simultaneous localization and mapping (SLAM) systems are based on the assumption of a static environment in autonomous vehicles. However, when dynamic objects, particularly vehicles, occupy a large portion of the image, the localization accuracy of the system decreases significantly. To mitigate this challenge, this paper unveils DOT-SLAM, a novel stereo visual SLAM system that integrates dynamic object tracking through graph optimization. By integrating dynamic object pose estimation into the SLAM system, the system can effectively utilize both foreground and background points for ego vehicle localization and obtain a static feature points map. To rectify the inaccuracies in depth estimation from stereo disparity directly on the foreground points of dynamic objects due to their self-similarity characteristics, a coarse-to-fine depth estimation method based on camera–road plane geometry is presented. This method uses rough depth to guide fine stereo matching, thereby obtaining the 3 dimensions (3D)spatial positions of feature points on dynamic objects. Subsequently, by establishing constraints on the dynamic object’s pose using the road plane and non-holonomic constraints (NHCs) of the vehicle, reducing the initial pose uncertainty of dynamic objects leads to more accurate dynamic object initialization. Finally, by considering foreground points, background points, the local road plane, the ego vehicle pose, and dynamic object poses as optimization nodes, through the establishment and joint optimization of a nonlinear model based on graph optimization, accurate six degrees of freedom (DoFs) pose estimations are obtained for both the ego vehicle and dynamic objects. Experimental validation on the KITTI-360 dataset demonstrates that DOT-SLAM effectively utilizes features from the background and dynamic objects in the environment, resulting in more accurate vehicle trajectory estimation and a static environment map. Results obtained from a real-world dataset test reinforce the effectiveness.
Funder
Perspective Study Funding of Nanchang Automotive 708 Institute of Intelligence and New Energy, Tongji University
Reference57 articles.
1. Bala, J.A., Adeshina, S.A., and Aibinu, A.M. (2022). Advances in Visual Simultaneous Localisation and Mapping Techniques for Autonomous Vehicles: A Review. Sensors, 22.
2. A Review of Visual SLAM Methods for Autonomous Driving Vehicles;Cheng;Eng. Appl. Artif. Intell.,2022
3. Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age;Cadena;IEEE Trans. Robot.,2016
4. ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras;IEEE Trans. Robot.,2017
5. ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual–Inertial, and Multimap SLAM;Campos;IEEE Trans. Robot.,2021