Affiliation:
1. School of Computer Science and Cyber Engineering Guangzhou University Guangzhou China
2. School of Information Technology Deakin University Waurn Ponds Campus Geelong Australia
3. School of Cyberspace Security Harbin Institute of Technology Harbin China
Abstract
AbstractExisting 3D single object tracking methods primarily extract features from the global coordinates of point clouds, overlooking the potential exploitation of their positional information. However, due to the unordered, sparse, and irregular nature of point clouds, effectively exploring their positional information presents a significant challenge. In this letter, the network is explicitly reformulated by introducing a point position embedding module in conjunction with a self‐attention coding module, replacing the use of global coordinate inputs. The proposed reformulation is further integrated into a top‐notch model M2‐Track, called Point Position Embedding (PPE) in this letter. Comprehensive empirical analysis are performed on the KITTI and NuScenes datasets. Experimental results show that the PPE surpasses M2‐Track by a large margin in overall performance. Especially for the challenging NuScenes dataset, the method attains the highest precision and success in all classes compared to state‐of‐the‐art methods. The code is available at https://github.com/GZHU‐DVL/PPE.
Funder
National Natural Science Foundation of China
Fundamental Research Funds for the Central Universities
Publisher
Institution of Engineering and Technology (IET)
Subject
Electrical and Electronic Engineering