Funder
Zhejiang Province Natural Science Foundation
National Natural Science Foundation of China