M3PT: A Multi-Modal Model for POI Tagging

Published: 2023-08-04
Container-title: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Author:

Yang Jingsong¹, Han Guanzhou², Yang Deqing¹, Liu Jingping³, Xiao Yanghua¹, Xu Xiang², Wu Baohua², Ni Shenghua²

Affiliation:

1. Fudan University, Shanghai, China

2. Alibaba Group, Hangzhou, China

3. East China University of Science and Technology, Shanghai, China

Funder

Shanghai Science and Technology Development Foundation

National Natural Science Foundation of China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3580305.3599862

References: 52 articles.

1. Emanuel Ben-Baruch, Tal Ridnik, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, and Lihi Zelnik-Manor. 2020. Asymmetric loss for multi-label classification. arXiv preprint arXiv:2009.14119 (2020).

2. Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, and Jingjing Liu. 2020. UNITER: UNiversal Image-TExt Representation Learning. In Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XXX (Lecture Notes in Computer Science, Vol. 12375), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer, 104-120. https://doi.org/10.1007/978-3-030-58577-8_7

3. A survey and analysis on automatic image annotation

4. Karan Desai and Justin Johnson. 2020. VirTex: Learning Visual Representations from Textual Annotations. CoRR, Vol. abs/2006.06666 (2020). arXiv:2006.06666. https://arxiv.org/abs/2006.06666