Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features
Author:
Xu Meng1, Zhang Zhihuang2, Gong Yuanhao3ORCID, Poslad Stefan1ORCID
Affiliation:
1. School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, UK 2. School of Vehicle and Mobility, Tsinghua University, Beijing 100084, China 3. College of Electronics and Information Engineering, Shenzhen University, Shenzhen 518061, China
Abstract
Accurate and robust camera pose estimation is essential for high-level applications such as augmented reality and autonomous driving. Despite the development of global feature-based camera pose regression methods and local feature-based matching guided pose estimation methods, challenging conditions, such as illumination changes and viewpoint changes, as well as inaccurate keypoint localization, continue to affect the performance of camera pose estimation. In this paper, we propose a novel relative camera pose regression framework that uses global features with rotation consistency and local features with rotation invariance. First, we apply a multi-level deformable network to detect and describe local features, which can learn appearances and gradient information sensitive to rotation variants. Second, we process the detection and description processes using the results from pixel correspondences of the input image pairs. Finally, we propose a novel loss that combines relative regression loss and absolute regression loss, incorporating global features with geometric constraints to optimize the pose estimation model. Our extensive experiments report satisfactory accuracy on the 7Scenes dataset with an average mean translation error of 0.18 m and a rotation error of 7.44° using image pairs as input. Ablation studies were also conducted to verify the effectiveness of the proposed method in the tasks of pose estimation and image matching using the 7Scenes and HPatches datasets.
Funder
China Scholarship Council Queen Mary University of London
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference59 articles.
1. Garcia, P.P., Santos, T.G., Machado, M.A., and Mendes, N. (2023). Deep Learning Framework for Controlling Work Sequence in Collaborative Human–Robot Assembly Processes. Sensors, 23. 2. Mundt, M., Born, Z., Goldacre, M., and Alderson, J. (2023). Estimating Ground Reaction Forces from Two-Dimensional Pose Data: A Biomechanics-Based Comparison of AlphaPose, BlazePose, and OpenPose. Sensors, 23. 3. Xu, M., Wang, Y., Xu, B., Zhang, J., Ren, J., Poslad, S., and Xu, P. (2022). A critical analysis of image-based camera pose estimation techniques. arXiv. 4. Zhang, Z., Xu, M., Zhou, W., Peng, T., Li, L., and Poslad, S. (2022). BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images. arXiv. 5. Yan, G., Luo, Z., Liu, Z., and Li, Y. (2023). SensorX2car: Sensors-to-car calibration for autonomous driving in road scenarios. arXiv.
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. EGGS: Edge Guided Gaussian Splatting for Radiance Fields;Proceedings of the 29th International ACM Conference on 3D Web Technology;2024-09-25 2. Accurate Body Pose Matching for Individuals with Stroke using Siamese Networks;2024 IEEE/ACM Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE);2024-06-19 3. Axis-Aligned Gaussian Splatting for Radiance Fields;Proceedings of the 2024 8th International Conference on High Performance Compilation, Computing and Communications;2024-06-07 4. DCSPose: A Dual-Channel Siamese Framework for Unseen Textureless Object Pose Estimation;Applied Sciences;2024-01-15 5. 基于视觉定位引导的相机偏移像素坐标矫正方法;Laser & Optoelectronics Progress;2024
|
|