MHDNet: A Multi-Scale Hybrid Deep Learning Model for Person Re-Identification-Reference-Cited by-同舟云学术

MHDNet: A Multi-Scale Hybrid Deep Learning Model for Person Re-Identification

Published:2024-04-10 Issue:8 Volume:13 Page:1435
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Wang Jinghui¹,Wang Jun¹

Affiliation:

1. School of Mathematical Sciences, Jiangsu University, Zhenjiang 212013, China

Abstract

The primary objective of person re-identification is to identify individuals from surveillance videos across various scenarios. Conventional pedestrian recognition models typically employ convolutional neural network (CNN) and vision transformer (ViT) networks to extract features, and while CNNs are adept at extracting local features through convolution operations, capturing global information can be challenging, especially when dealing with high-resolution images. In contrast, ViT rely on cascaded self-attention modules to capture long-range feature dependencies, sacrificing local feature details. In light of these limitations, this paper presents the MHDNet, a hybrid network structure for pedestrian recognition that combines convolutional operations and self-attention mechanisms to enhance representation learning. The MHDNet is built around the Feature Fusion Module (FFM), which harmonizes global and local features at different resolutions. With a parallel structure, the MHDNet model maximizes the preservation of local features and global representations. Experiments on two person re-identification datasets demonstrate the superiority of the MHDNet over other state-of-the-art methods.

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/8/1435/pdf

Reference75 articles.

1. A Survey on Deep Learning-Based Person Re-Identification Systems;Almasawa;IEEE Access,2019

2. Person re-identification: A retrospective on domain specific open challenges and future trends;Zahra;Pattern Recognit.,2023

3. Huang, H., Li, D., Zhang, Z., Chen, X., and Huang, K. (2018, January 18–23). Adversarially Occluded Samples for Person Re-identification. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.

4. Hou, R., Ma, B., Chang, H., Gu, X., Shan, S., and Chen, X. (2019). VRSTC: Occlusion-Free Video Person Re-Identification. arXiv.

5. Zhao, H., Tian, M., Sun, S., Shao, J., Yan, J., Yi, S., Wang, X., and Tang, X. (2017, January 21–26). Spindle Net: Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Efficient Multi-Branch Attention Network for Person Re-Identification;Electronics;2024-08-12