1. Yan et al., Video captioning using global-local representation, IEEE Trans. Circuits Syst. Video Technol., 2022.
2. D. Liu, Y. Cui, W. Tan, Y. Chen, SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation, in: Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2021, pp. 9816–9825.
3. Qin et al., Coarse-to-fine video instance segmentation with factorized conditional appearance flows, IEEE/CAA J. Autom. Sin., 2023.
4. I. Cohen, G. Medioni, Detecting and tracking moving objects for video surveillance, in: Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., 1999, pp. 319–325.
5. Z. Zhang, S. Fidler, R. Urtasun, Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs, in: Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, pp. 669–677.