Online Distillation-enhanced Multi-modal Transformer for Sequential Recommendation-Reference-Cited by-同舟云学术

Online Distillation-enhanced Multi-modal Transformer for Sequential Recommendation

Published:2023-10-26 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 31st ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Ji Wei¹^ORCID,Liu Xiangyan¹^ORCID,Zhang An¹^ORCID,Wei Yinwei²^ORCID,Ni Yongxin¹^ORCID,Wang Xiang³^ORCID

Affiliation:

1. National University of Singapore, Singapore, Singapore

2. Monash University, Melboune, VIC, Austria

3. University of Science and Technology of China, Hefei, China

Funder

This work is fully supported by the Advanced Research and Technology Innovation Centre (ARTIC), the National University of Singapore under Grant (project number: A-8000969-00-00). This research is also supported by the National Natural Science Foundation of China (9227010114) and the University Synergy Innovation Program of Anhui Province (GXXT-2022-040).

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3581783.3612091

Reference58 articles.

1. Rohan Anil , Gabriel Pereyra , Alexandre Passos , Robert Ormandi , George E Dahl , and Geoffrey E Hinton . 2018. Large scale distributed neural network training through online distillation. arXiv preprint arXiv:1804.03235 ( 2018 ). Rohan Anil, Gabriel Pereyra, Alexandre Passos, Robert Ormandi, George E Dahl, and Geoffrey E Hinton. 2018. Large scale distributed neural network training through online distillation. arXiv preprint arXiv:1804.03235 (2018).

2. ItemSage: Learning Product Embeddings for Shopping Recommendations at Pinterest

3. Online Knowledge Distillation with Diverse Peers

4. Jin Chen , Defu Lian , Yucheng Li , Baoyun Wang , Kai Zheng , and Enhong Chen . 2022. Cache-Augmented Inbatch Importance Resampling for Training Recommender Retriever. arXiv preprint arXiv:2205.14859 ( 2022 ). Jin Chen, Defu Lian, Yucheng Li, Baoyun Wang, Kai Zheng, and Enhong Chen. 2022. Cache-Augmented Inbatch Importance Resampling for Training Recommender Retriever. arXiv preprint arXiv:2205.14859 (2022).

5. Revisiting Pre-Trained Models for Chinese Natural Language Processing

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dynamic Network for Language-based Fashion Retrieval;Proceedings of the 1st International Workshop on Deep Multimodal Learning for Information Retrieval;2023-10-29