1. Alwassel, H., Heilbron, F. C., Escorcia, V., & Ghanem, B. (2018). Diagnosing error in temporal action detectors. In Proceedings of the European conference on computer vision (pp. 256–272).
2. Bengio (2013). Generalized denoising auto-encoders as generative models.
3. Caba Heilbron (2015). ActivityNet: A large-scale video benchmark for human activity understanding.
4. Carreira (2017). Quo vadis, action recognition? A new model and the Kinetics dataset.
5. Chao (2018). Rethinking the Faster R-CNN architecture for temporal action localization.