Enhancing Dynamic Image Advertising with Vision-Language Pre-training-Reference-Cited by-同舟云学术

Enhancing Dynamic Image Advertising with Vision-Language Pre-training

Published:2023-07-18 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
language:
Short-container-title:

Author:

Wen Zhoufutu¹^ORCID,Zhao Xinyu²^ORCID,Jin Zhipeng¹^ORCID,Yang Yi¹^ORCID,Jia Wei¹^ORCID,Chen Xiaodong¹^ORCID,Li Shuanglong¹^ORCID,Liu Lin¹^ORCID

Affiliation:

1. Baidu Inc., Beijing, China

2. Peking University, Beijing, China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3539618.3591844

Reference31 articles.

1. Hangbo Bao , Wenhui Wang , Li Dong , Qiang Liu , Owais Khan Mohammed , Kriti Aggarwal, Subhojit Som, Songhao Piao, and Furu Wei. 2022 . VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts. In Advances in Neural Information Processing Systems (NeurIPS) . Hangbo Bao, Wenhui Wang, Li Dong, Qiang Liu, Owais Khan Mohammed, Kriti Aggarwal, Subhojit Som, Songhao Piao, and Furu Wei. 2022. VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts. In Advances in Neural Information Processing Systems (NeurIPS).

2. Yen-Chun Chen , Linjie Li , Licheng Yu , Ahmed El Kholy , Faisal Ahmed , Zhe Gan , Yu Cheng , and Jingjing Liu . 2019 . UNITER: UNiversal Image-TExt Representation Learning . In Proceedings of the European Conference on Computer Vision (ECCV). Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, and Jingjing Liu. 2019. UNITER: UNiversal Image-TExt Representation Learning. In Proceedings of the European Conference on Computer Vision (ECCV).

3. Ekin Dogus Cubuk , Barret Zoph , Jonathon Shlens , and Quoc V. Le . 2020. Randaugment: Practical automated data augmentation with a reduced search space . In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Ekin Dogus Cubuk, Barret Zoph, Jonathon Shlens, and Quoc V. Le. 2020. Randaugment: Practical automated data augmentation with a reduced search space. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

4. Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding . In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, (NAACL-HLT). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, (NAACL-HLT).

5. Alexey Dosovitskiy , Lucas Beyer , Alexander Kolesnikov , Dirk Weissenborn , Xiaohua Zhai , Thomas Unterthiner , Mostafa Dehghani , Matthias Minderer , Georg Heigold , Sylvain Gelly , Jakob Uszkoreit , and Neil Houlsby . 2021 . An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale . In Proceedings of the 9th International Conference on Learning Representations (ICLR). Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2021. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In Proceedings of the 9th International Conference on Learning Representations (ICLR).

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

2. Enhancing Baidu Multimodal Advertisement with Chinese Text-to-Image Generation via Bilingual Alignment and Caption Synthesis;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

3. Automatic Image Aesthetic Assessment for Human-designed Digital Images;Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice;2023-10-29