Author:
Xia Kaiguo,Pan Zhisong,Mao Pengqiang
Abstract
Video compression sensing can use a few measurements to obtain the original video by reconstruction algorithms. There is a natural correlation between video frames, and how to exploit this feature becomes the key to improving the reconstruction quality. More and more deep learning-based video compression sensing (VCS) methods are proposed. Some methods overlook interframe information, so they fail to achieve satisfactory reconstruction quality. Some use complex network structures to exploit the interframe information, but it increases the parameters and makes the training process more complicated. To overcome the limitations of existing VCS methods, we propose an efficient end-to-end VCS network, which integrates the measurement and reconstruction into one whole framework. In the measurement part, we train a measurement matrix rather than a pre-prepared random matrix, which fits the video reconstruction task better. An unfolded LSTM network is utilized in the reconstruction part, deeply fusing the intra- and interframe spatial–temporal information. The proposed method has higher reconstruction accuracy than existing video compression sensing networks and even performs well at measurement ratios as low as 0.01.
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference27 articles.
1. Modeling the impact of frame rate on perceptual quality of video;Ou;Proceedings of the 15th IEEE International Conference on Image Processing (ICIP 2008),2008
2. An Introduction To Compressive Sampling
3. Single-pixel imaging via compressive sampling
4. Video from a Single Coded Exposure Photograph using a Learned Over-Complete Dictionary;Hitomi;Proceedings of the IEEE International Conference on Computer Vision (ICCV),2011
5. Coded Strobing Photography: Compressive Sensing of High Speed Periodic Videos
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献