Revisiting Negative Sampling vs. Non-sampling in Implicit Recommendation-Reference-Cited by-同舟云学术

Revisiting Negative Sampling vs. Non-sampling in Implicit Recommendation

Published:2023-01-31 Issue:1 Volume:41 Page:1-25
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Chen Chong¹,Ma Weizhi¹,Zhang Min¹,Wang Chenyang¹,Liu Yiqun¹,Ma Shaoping¹

Affiliation:

1. Tsinghua University, Beijing, China

Abstract

Recommendation systems play an important role in alleviating the information overload issue. Generally, a recommendation model is trained to discern between positive (liked) and negative (disliked) instances for each user. However, under the open-world assumption, there are only positive instances but no negative instances from users’ implicit feedback, which poses the imbalanced learning challenge of lacking negative samples. To address this, two types of learning strategies have been proposed before, the negative sampling strategy and non-sampling strategy. The first strategy samples negative instances from missing data (i.e., unlabeled data), while the non-sampling strategy regards all the missing data as negative. Although learning strategies are known to be essential for algorithm performance, the in-depth comparison of negative sampling and non-sampling has not been sufficiently explored by far. To bridge this gap, we systematically analyze the role of negative sampling and non-sampling for implicit recommendation in this work. Specifically, we first theoretically revisit the objection of negative sampling and non-sampling. Then, with a careful setup of various representative recommendation methods, we explore the performance of negative sampling and non-sampling in different scenarios. Our results empirically show that although negative sampling has been widely applied to recent recommendation models, it is non-trivial for uniform sampling methods to show comparable performance to non-sampling learning methods. Finally, we discuss the scalability and complexity of negative sampling and non-sampling and present some open problems and future research topics that are worth being further explored.

Funder

Natural Science Foundation of China

Tsinghua University Guoqiang Research Institute

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/3522672

Reference82 articles.

1. Immanuel Bayer, Xiangnan He, Bhargav Kanagal, and Steffen Rendle. 2017. A generic coordinate descent framework for learning from implicit feedback. In Proceedings of the 26th International Conference on World Wide Web. 1341–1350.

2. Adversarial contrastive estimation;Bose Avishek Joey;arXiv preprint arXiv:1805.03642,2018

3. KBGAN: Adversarial learning for knowledge graph embeddings;Cai Liwei;arXiv preprint arXiv:1711.04071,2017

4. Chong Chen, Fei Sun, Min Zhang, and Bolin Ding. 2022. Recommendation unlearning. In Proceedings of the Web Conference 2022.

5. Chong Chen, Min Zhang, Yiqun Liu, and Shaoping Ma. 2018. Neural attentional rating regression with review-level explanations. In Proceedings of the Web Conference.

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems;Proceedings of the ACM Web Conference 2024;2024-05-13

2. LLMRec: Large Language Models with Graph Augmentation for Recommendation;Proceedings of the 17th ACM International Conference on Web Search and Data Mining;2024-03-04

3. Predicting potential target genes in molecular biology experiments using machine learning and multifaceted data sources;iScience;2024-03

4. False Negative Sample Aware Negative Sampling for Recommendation;Lecture Notes in Computer Science;2024

5. Safe Collaborative Filtering;SSRN Electronic Journal;2024