Ultrafast Video Attention Prediction with Coupled Knowledge Distillation-Reference-Cited by-同舟云学术

Ultrafast Video Attention Prediction with Coupled Knowledge Distillation

Published:2020-04-03 Issue:07 Volume:34 Page:10802-10809
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Fu Kui,Shi Peipei,Song Yafei,Ge Shiming,Lu Xiangju,Li Jia

Abstract

Large convolutional neural network models have recently demonstrated impressive performance on video attention prediction. Conventionally, these models are with intensive computation and large memory. To address these issues, we design an extremely light-weight network with ultrafast speed, named UVA-Net. The network is constructed based on depth-wise convolutions and takes low-resolution images as input. However, this straight-forward acceleration method will decrease performance dramatically. To this end, we propose a coupled knowledge distillation strategy to augment and train the network effectively. With this strategy, the model can further automatically discover and emphasize implicit useful cues contained in the data. Both spatial and temporal knowledge learned by the high-resolution complex teacher networks also can be distilled and transferred into the proposed low-resolution light-weight spatiotemporal network. Experimental results show that the performance of our model is comparable to 11 state-of-the-art models in video attention prediction, while it costs only 0.68 MB memory footprint, runs about 10,106 FPS on GPU and 404 FPS on CPU, which is 206 times faster than previous models.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Package Arrival Time Prediction via Knowledge Distillation Graph Neural Network;ACM Transactions on Knowledge Discovery from Data;2024-02-28

2. OFF-ViNet: Optical Flow-Based Feature Warping ViNet for Video Saliency Prediction Considering Future Prediction;IEEE Access;2024

3. Prediction of Driver's Visual Attention in Critical Moment Using Optical Flow;IEICE Transactions on Information and Systems;2023-05-01

4. TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation;2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2023-01

5. Dynamic Gesture Recognition Based on Three-Stream Coordinate Attention Network and Knowledge Distillation;IEEE Access;2023