Author:
Zhang Mengmi,Ma Keng Teck,Lim Joo Hwee,Zhao Qi,Feng Jiashi
Cited by
68 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Multi-modal transformer with language modality distillation for early pedestrian action anticipation;Computer Vision and Image Understanding;2024-09
2. A Transformer-Based Model for the Prediction of Human Gaze Behavior on Videos;Proceedings of the 2024 Symposium on Eye Tracking Research and Applications;2024-06-04
3. An Outlook into the Future of Egocentric Vision;International Journal of Computer Vision;2024-05-28
4. Self-Motion As Supervision For Egocentric Audiovisual Localization;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
5. SwinGaze: Egocentric Gaze Estimation with Video Swin Transformer;2023 IEEE 16th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC);2023-12-18