1. Lu, C., Krishna, R., Bernstein, M., & Li, F. (2016). Visual relationship detection with language priors. In B. Leibe, J. Matas, N. Sebe, et al. (Eds.), Proceedings of the 14th European conference on computer vision (pp. 852–869). Cham: Springer.
2. Tang, K., Niu, Y., Huang, J., Shi, J., & Zhang, H. (2020). Unbiased scene graph generation from biased training. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 3716–3725). Piscataway: IEEE.
3. Lv, J., Liu, W., Zhou, L., Wu, B., & Ma, H. (2018). Multi-stream fusion model for social relation recognition from videos. In K. Schoeffmann, T. H. Chalidabhongse, C.-W. Ngo, et al. (Eds.), Proceedings of the 24th international conference on multimedia modeling (pp. 355–368). Cham: Springer.
4. Liu, X., Liu, W., Zhang, M., Chen, J., Gao, L., Yan, C., et al. (2019). Social relation recognition from videos via multi-scale spatial-temporal reasoning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 3566–3574). Piscataway: IEEE.
5. Kukleva, A., Tapaswi, M., & Laptev, I. (2020). Learning interactions and relationships between movie characters. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 9846–9855). Piscataway: IEEE.