1. Video jigsaw: Unsupervised learning of spatiotemporal context for video action recognition;Ahsan,2019
2. Autoencoders, unsupervised learning, and deep architectures;Baldi,2012
3. Benaim, S., Ephrat, A., Lang, O., Mosseri, I., Freeman, W.T., Rubinstein, M., Irani, M., Dekel, T., 2020. Speednet: Learning the speediness in videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. CVPR, pp. 9922–9931.
4. Bishay, M., Zoumpourlis, G., Patras, I., 2019. Tarn: Temporal attentive relation network for few-shot and zero-shot action recognition. In: British Machine Vision Conference. BMVC.
5. Self-supervision & meta-learning for one-shot unsupervised cross-domain detection;Borlino;Comput. Vis. Image Underst. (CVIU),2022