Embedding Compression in Recommender Systems: A Survey

Authors:

Shiwei Li¹, Huifeng Guo², Xing Tang³, Ruiming Tang², Lu Hou², Ruixuan Li¹, Rui Zhang⁴

Affiliations:

1. Huazhong University of Science and Technology, China

2. Huawei Noah’s Ark Lab, China

3. Tencent, China

4. ruizhang.info, China

Abstract

To alleviate the problem of information explosion, recommender systems are widely deployed to provide personalized information filtering services. These systems usually employ embedding tables to transform high-dimensional sparse one-hot vectors into dense real-valued embeddings. However, the embedding tables are huge and account for most of the parameters in industrial-scale recommender systems. To reduce memory costs and improve efficiency, various approaches have been proposed to compress the embedding tables. In this survey, we provide a comprehensive review of embedding compression approaches in recommender systems. We first introduce deep learning recommendation models and the basic concepts of embedding compression. We then systematically organize existing approaches into three categories: low precision, mixed dimension, and weight sharing. Finally, we conclude the survey with some general suggestions and discuss future prospects for this field.
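For readers unfamiliar with the setting, the following minimal PyTorch sketch illustrates the embedding lookup described above, together with toy versions of the three compression families (low precision, mixed dimension, and weight sharing). All sizes and names (num_ids, dim, num_buckets, etc.) and the specific schemes shown (per-table int8 quantization, up-projection of small embeddings, the hashing trick) are illustrative assumptions, not the particular methods reviewed in the survey.

```python
import torch
import torch.nn as nn

# An embedding table in a deep learning recommendation model: each categorical
# feature value (e.g. a user or item ID) indexes one row of a dense matrix,
# which is equivalent to multiplying a one-hot vector by the table.
num_ids = 100_000                 # vocabulary size of one categorical field (hypothetical)
dim = 64                          # embedding dimension (hypothetical)

table = nn.Embedding(num_ids, dim)      # full-precision table: num_ids x dim floats
ids = torch.tensor([3, 42, 99_999])     # a mini-batch of feature IDs
dense = table(ids)                      # lookup -> shape (3, dim) real-valued embeddings

# 1) Low precision: store the table in fewer bits, e.g. a simple post-training
#    int8 quantization with one scale per table (illustrative only).
w = table.weight.detach()
scale = w.abs().max() / 127.0
w_int8 = torch.clamp((w / scale).round(), -127, 127).to(torch.int8)
dense_lp = w_int8[ids].float() * scale  # dequantized lookup

# 2) Mixed dimension: store rare IDs at a smaller dimension and project the
#    small embeddings up so downstream layers still see a uniform size.
small_dim = 16
rare_table = nn.Embedding(num_ids, small_dim)
project_up = nn.Linear(small_dim, dim, bias=False)
dense_md = project_up(rare_table(ids))  # (3, dim), but stored at 16 floats per ID

# 3) Weight sharing: many IDs share rows via hashing (the "hashing trick"),
#    so the table has far fewer rows than the vocabulary.
num_buckets = 10_000
shared_table = nn.Embedding(num_buckets, dim)
dense_ws = shared_table(ids % num_buckets)  # colliding IDs share the same row
```

Each variant trades some representation fidelity for memory: fewer bits per weight, fewer floats per rare ID, or fewer distinct rows overall.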

Funder

National Natural Science Foundation of China

Science and Technology Support Program of Hubei Province

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science, Theoretical Computer Science
