Funder
Xi’an Jiaotong University
Reference34 articles.
1. Deep neural networks in machine translation: An overview.;Zhang;IEEE Intell. Syst.,2015
2. O. Vinyals, A. Toshev, S. Bengio, D. Erhan, Show and tell: A neural image caption generator, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp. 3156–3164.
3. R. Girshick, Fast r-cnn, in: Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 1440–1448.
4. Grit: Faster and better image captioning transformer using dual visual features;Nguyen,2022
5. Deformable detr: Deformable transformers for end-to-end object detection;Zhu,2020