Common Pitfalls in Training and Evaluating Recommender Systems-Reference-Cited by-同舟云学术

Common Pitfalls in Training and Evaluating Recommender Systems

Published:2017-09 Issue:1 Volume:19 Page:37-45
ISSN:1931-0145
Container-title:ACM SIGKDD Explorations Newsletter
language:en
Short-container-title:SIGKDD Explor. Newsl.

Author:

Chen Hung-Hsuan¹,Chung Chu-An²,Huang Hsin-Chien²,Tsui Wen²

Affiliation:

1. National Central University

2. Industrial Technology Research Institute

Abstract

This paper formally presents four common pitfalls in training and evaluating recommendation algorithms for information systems. Specifically, we show that it could be problematic to separate the server logs into training and test data for model generation and model evaluation if the training and the test data are selected improperly. In addition, we show that click through rate { a common metric to measure and compare the performance of different recommendation algorithms -- may not be a good measurement of profitability { the income a recommendation module brings to a website. Moreover, we demonstrate that evaluating recommendation revenue may not be a straightforward task as it first looks. Unfortunately, these pitfalls appeared in many previous studies on recommender systems and information systems. We explicitly explain these problems and propose methods to address them. We conducted experiments to support our claims. Finally, we review previous papers and competitions that may suffer from these problems.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3137597.3137601

Reference27 articles.

1. ACM RecSys Challenge 2017. http://2017. recsyschallenge.com/. Accessed: 2017-07-14. ACM RecSys Challenge 2017. http://2017. recsyschallenge.com/. Accessed: 2017-07-14.

2. Click-through rate prediction. https://www.kaggle. com/c/avazu-ctr-prediction. Accessed: 2017-07-14. Click-through rate prediction. https://www.kaggle. com/c/avazu-ctr-prediction. Accessed: 2017-07-14.

3. Display advertising challenge. https://www.kaggle.com/c/criteo-display-ad-challenge. Accessed: 2017-07-14. Display advertising challenge. https://www.kaggle.com/c/criteo-display-ad-challenge. Accessed: 2017-07-14.

4. Outbrain click prediction. https://www.kaggle.com/c/outbrain-click-prediction. Accessed: 2017-07-14. Outbrain click prediction. https://www.kaggle.com/c/outbrain-click-prediction. Accessed: 2017-07-14.

5. Diversifying search results

Cited by 18 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Our Model Achieves Excellent Performance on MovieLens: What Does It Mean?;ACM Transactions on Information Systems;2024-07

2. Explaining Neural News Recommendation with Attributions onto Reading Histories;ACM Transactions on Intelligent Systems and Technology;2024-06-18

3. On Challenges of Evaluating Recommender Systems in an Offline Setting;Proceedings of the 17th ACM Conference on Recommender Systems;2023-09-14

4. Detecting Inaccurate Sensors on a Large-Scale Sensor Network Using Centralized and Localized Graph Neural Networks;IEEE Sensors Journal;2023-08-01

5. Take a Fresh Look at Recommender Systems from an Evaluation Standpoint;Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval;2023-07-18