Author:
Shui Jianan,Ding Shuaipeng,Li Mingyong,Ma Yan
Publisher
Springer Nature Singapore
Reference28 articles.
1. Abdullah, T., Bazi, Y., Al Rahhal, M.M., Mekhalfi, M.L., Rangarajan, L., Zuair, M.: TextRS: deep bidirectional triplet network for matching text to remote sensing images. Remote Sens. 12(3), 405 (2020)
2. Cheng, Q., Zhou, Y., Fu, P., Xu, Y., Zhang, L.: A deep semantic alignment network for the cross-modal image-text retrieval in remote sensing. IEEE J. Sel. Top. Appl. Earth Observations Remote Sens. 14, 4284–4297 (2021)
3. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
4. Faghri, F., Fleet, D.J., Kiros, J.R., Fidler, S.: VSE++: improving visual-semantic embeddings with hard negatives. arXiv preprint arXiv:1707.05612 (2017)
5. Feng, D., He, X., Peng, Y.: MKVSE: multimodal knowledge enhanced visual-semantic embedding for image-text retrieval. ACM Trans. Multimed. Comput. Commun. Appl. 19(5), 1–21 (2023)