1. Bambach, S.: A survey on recent advances of computer vision algorithms for egocentric video. CoRR (2015). abs/1501.02825
2. Bergstrom, T., Shi, H.: Human-object interaction detection: A quick survey and examination of methods. In: Proceedings of the 1st International Workshop on Human-Centric Multimedia Analysis, pp. 63–71 (2020)
3. Bochkovskiy, A., Wang, C.-Y., Mark Liao, H.-Y.: Yolov4: Optimal speed and accuracy of object detection (2020). arXiv:2004.10934
4. Dai, J., Li, Y., He, K., Sun, J.: R-FCN: object detection via region-based fully convolutional networks. In: Lee, D.D., Sugiyama, M., von Luxburg, U., Guyon, I., Garnett, R., (eds.) Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, pp. 379–387. Barcelona, Spain (2016)
5. Damen, D., Doughty, H., Farinella, G.M., Fidler, S., Furnari, A., Kazakos, E., Moltisanti, D., Munro, J., Perrett, T., Price, W., et al.: Scaling egocentric vision: The epic-kitchens dataset. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 720–736 (2018)