1. Goodfellow, I., Bengio., Y., and Courville, A., Deep Learning, Adaptive Computation and Machine Learning series, Cambridge, MA: MIT Press, 2016.
2. LeCun, Y., Bengio, Y., and Hinton, G., Deep learning, Nature, 2015, vol. 521, no. 7553, pp. 436–444.
3. Miech, A., Laptev, I., and Sivic, J., Learnable pooling with Context Gating for video classification, 2017 https://arxiv.org/abs/1706.06905.
4. Yang, J., Ren, P., Chen, D., Wen, F., Li, H., and Hua, G., Neural aggregation network for video face recognition, Proc. 29th IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 2017, pp. 4362–4371.
5. Savchenko, A.V., Deep neural networks and maximum likelihood search for approximate nearest neighbor in video-based image recognition, Opt. Mem. Neural Networks, 2017, vol. 26, no. 2, pp. 129–136.