Multimodal semantic enhanced representation network for micro-video event detection
-
Published:2024-10
Issue:
Volume:301
Page:112255
-
ISSN:0950-7051
-
Container-title:Knowledge-Based Systems
-
language:en
-
Short-container-title:Knowledge-Based Systems
Author:
Li Yun,
Liu Xianyi,
Zhang Lijuan,
Tian Haoyu,
Jing PeiguangORCID
Reference77 articles.
1. Multi-scale modeling temporal hierarchical attention for sequential recommendation;Huang;Inform. Sci.,2023
2. Attention based consistent semantic learning for micro-video scene recognition;Guo;Inform. Sci.,2021
3. Y. Du, Y. Wei, W. Ji, F. Liu, X. Luo, L. Nie, Multi-queue Momentum Contrast for Microvideo-Product Retrieval, in: Proceedings of ACM International Conference on Web Search and Data Mining, 2023, pp. 1003–1011.
4. L. Nie, L. Qu, D. Meng, M. Zhang, Q. Tian, A.D. Bimbo, Search-oriented Micro-video Captioning, in: Proceedings of ACM International Conference on Multimedia, 2022, pp. 3234–3243.
5. LCEMH: Label correlation enhanced multi-modal hashing for efficient multi-modal retrieval;Zheng;Inform. Sci.,2024