1. Yeung, S., Russakovsky, O., Mori, G., and Lei, F.-F. End-to-end learning of action detection from frame glimpses in videos. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
2. Yuan, J., Ni, B., Yang, X., and Kassim, A.A. Temporal action localization with pyramid of score distribution features. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
3. Shou, Z., Wang, D., and Chang, S.F. Temporal action localization in untrimmed videos via multi-stage cnns. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
4. Zhu, Y., and Newsam, S. Efficient action detection in untrimmed videos via multi-task learning. Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).
5. Chao, Y.W., Vijayanarasimhan, S., Seybold, B., Ross, D.A., Deng, J., and Sukthankar, R. Rethinking the Faster R-CNN Architecture for Temporal Action Localization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.