1. Semantic-Disentangled Transformer with Noun-Verb Embedding for Compositional Action Recognition;Huang;IEEE Trans. Image Process.,2023
2. Huang, P., Shu, X., Yan, R., Tu, Z., and Tang, J. (2024). Appearance-Agnostic Representation Learning for Compositional Action Recognition. IEEE Trans. Circuits Syst. Video Technol.
3. A unified multimodal de-and re-coupling framework for rgb-d motion recognition;Zhou;IEEE Trans. Pattern Anal. Mach. Intell.,2023
4. Wang, J., Liu, Z., Wu, Y., and Yuan, J. (2012, January 21–26). Mining actionlet ensemble for action recognition with depth cameras. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.
5. Hussein, M.E., Torki, M., Gowayyed, M.A., and El-Saban, M. (2013, January 3–9). Human action recognition using a temporal hierarchy of covariance descriptors on 3d joint locations. Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China.