A three-stream fusion and self-differential attention network for multi-modal crowd counting
Published:2024-07
Issue:
Volume:183
Page:35-41
ISSN:0167-8655
Container-title:Pattern Recognition Letters
language:en
Short-container-title:Pattern Recognition Letters
Author:
Tang Haihan,
Wang Yi,
Lin Zhiping,
Chau Lap-Pui,
Zhuang Huiping
Reference: 32 articles.
1. A survey of recent advances in cnn-based single image crowd counting and density estimation;Sindagi;Pattern Recognit. Lett.,2018
2. An image is worth 16x16 words: Transformers for image recognition at scale;Dosovitskiy,2021
3. Lw-count: an effective lightweight encoding-decoding crowd counting network;Liu;IEEE Trans. Circuits Syst. Video Technol.,2022
4. Csrnet: Dilated convolutional neural networks for understanding the highly congested scenes;Li;IEEE Conference on Computer Vision and Pattern Recognition,2018
5. Semantic-refined spatial pyramid network for crowd counting;Zhou;Pattern Recognit. Lett.,2022
Cited by
1 article.