Author:
Cai Qingpeng,Filos-Ratsikas Aris,Tang Pingzhong,Zhang Yiwei
Abstract
In large e-commerce websites, sellers have been observed to engage in fraudulent behaviour, faking historical transactions in order to receive favourable treatment from the platforms, specifically through the allocation of additional buyer impressions which results in higher revenue for them, but not for the system as a whole. This emergent phenomenon has attracted considerable attention, with previous approaches focusing on trying to detect illicit practices and to punish the miscreants. In this paper, we employ the principles of reinforcement mechanism design, a framework that combines the fundamental goals of classical mechanism design, i.e. the consideration of agents' incentives and their alignment with the objectives of the designer, with deep reinforcement learning for optimizing the performance based on these incentives. In particular, first we set up a deep-learning framework for predicting the sellers' rationality, based on real data from any allocation algorithm. We use data from one of largest e-commerce platforms worldwide and train a neural network model to predict the extent to which the sellers will engage in fraudulent behaviour. Using this rationality model, we employ an algorithm based on deep reinforcement learning to optimize the objectives and compare its performance against several natural heuristics, including the platform's implementation and incentive-based mechanisms from the related literature.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Personalized Prediction of Bounded-Rational Bargaining Behavior in Network Resource Sharing;IEEE INFOCOM 2024 - IEEE Conference on Computer Communications;2024-05-20
2. A multiview clustering framework for detecting deceptive reviews;Journal of Computer Security;2024-02-02
3. Reinforcement Re-ranking with 2D Grid-based Recommendation Panels;Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region;2023-11-26
4. Investigating Fraud and Misconduct in Legitimate Internet Economy based on Customer Complaints;2023 IEEE 22nd International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom);2023-11-01
5. A Thompson Sampling Algorithm With Logarithmic Regret for Unimodal Gaussian Bandit;IEEE Transactions on Neural Networks and Learning Systems;2023-09