1. Shen, X., Liu, B., Zhou, Y., Zhao, J., & Liu, M. (2020). Remote sensing image captioning via Variational Autoencoder and Reinforcement Learning. Knowledge-Based Systems, 203, 105920.
2. Zeng, X., Wen, L., Liu, B., & Qi, X. (2019). Deep learning for ultrasound image caption generation based on object detection. Neurocomputing, 392, 132–141.
3. Zhang, J., Li, K., & Wang, Z. (2021). Parallel-fusion LSTM with synchronous semantic and visual information for image captioning. Journal of Visual Communication and Image Representation, 75, 103044.
4. Yan, S., Xie, Y., Wu, F., Smith, J. S., Lu, W., & Zhang, B. (2019). Image captioning via hierarchical attention mechanism and policy gradient optimization. Signal Processing, 167, 107329.
5. Wang, S., Lan, L., Zhang, X., & Luo, Z. (2020). GateCap: Gated spatial and semantic attention model for image captioning. Multimedia Tools and Applications, 79(17), 11531–11549.