Author:
Xu Yuanlu,Liu Xiaobai,Qin Lei,Zhu Song-Chun
Abstract
In this paper, we propose a Spatio-temporal Attributed Parse Graph (ST-APG) to integrate semantic attributes with trajectories for cross-view people tracking. Given videos from multiple cameras with overlapping field of view (FOV), our goal is to parse the videos and organize the trajectories of all targets into a scene-centered representation. We leverage rich semantic attributes of human, e.g., facing directions, postures and actions, to enhance cross-view tracklet associations, besides frequently used appearance and geometry features in the literature.In particular, the facing direction of a human in 3D, once detected, often coincides with his/her moving direction or trajectory. Similarly, the actions of humans, once recognized, provide strong cues for distinguishing one subject from the others. The inference is solved by iteratively grouping tracklets with cluster sampling and estimating people semantic attributes by dynamic programming.In experiments, we validate our method on one public dataset and create another new dataset that records people's daily life in public, e.g., food court, office reception and plaza, each of which includes 3-4 cameras. We evaluate the proposed method on these challenging videos and achieve promising multi-view tracking results.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
17 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Track initialization and re-identification for 3D multi-view multi-object tracking;Information Fusion;2024-11
2. Complementing Vehicle Trajectories Using Two Camera Viewpoints;2024 IEEE International Conference on Consumer Electronics (ICCE);2024-01-06
3. Blockchain-Empowered Distributed Multicamera Multitarget Tracking in Edge Computing;IEEE Transactions on Industrial Informatics;2024-01
4. Learning to Track With Dynamic Message Passing Neural Network for Multi-Camera Multi-Object Tracking;IEEE Access;2024
5. Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering;2023 IEEE International Conferences on Internet of Things (iThings) and IEEE Green Computing & Communications (GreenCom) and IEEE Cyber, Physical & Social Computing (CPSCom) and IEEE Smart Data (SmartData) and IEEE Congress on Cybermatics (Cybermatics);2023-12-17