Adversarial Constrained Bidding via Minimax Regret Optimization with Causality-Aware Reinforcement Learning

Author:

Wang Haozhe1ORCID,Du Chao1ORCID,Fang Panyan1ORCID,He LI1ORCID,Wang Liang1ORCID,Zheng Bo1ORCID

Affiliation:

1. Alibaba Group, Beijing, China

Publisher

ACM

Reference48 articles.

1. Jonas Adler and Sebastian Lunz . 2018. Banach wasserstein gan. Advances in neural information processing systems 31 ( 2018 ). Jonas Adler and Sebastian Lunz. 2018. Banach wasserstein gan. Advances in neural information processing systems 31 (2018).

2. Alexander A Alemi , Ian Fischer , Joshua V Dillon , and Kevin Murphy . 2016. Deep variational information bottleneck. arXiv preprint arXiv:1612.00410 ( 2016 ). Alexander A Alemi, Ian Fischer, Joshua V Dillon, and Kevin Murphy. 2016. Deep variational information bottleneck. arXiv preprint arXiv:1612.00410 (2016).

3. Alimama 2022. Alimama. Retrieved 2022 from https://www.alimama.com/ Alimama 2022. Alimama. Retrieved 2022 from https://www.alimama.com/

4. David Balduzzi , Sebastien Racaniere , James Martens , Jakob Foerster , Karl Tuyls , and Thore Graepel . 2018 . The mechanics of n-player differentiable games . In International Conference on Machine Learning. PMLR, 354--363 . David Balduzzi, Sebastien Racaniere, James Martens, Jakob Foerster, Karl Tuyls, and Thore Graepel. 2018. The mechanics of n-player differentiable games. In International Conference on Machine Learning. PMLR, 354--363.

5. S. Balseiro A. Kim M. Mahdian and V. Mirrokni. 2021. Budget-Management Strategies in Repeated Auctions. Operations Research 69 3 (2021). S. Balseiro A. Kim M. Mahdian and V. Mirrokni. 2021. Budget-Management Strategies in Repeated Auctions. Operations Research 69 3 (2021).

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Spending Programmed Bidding: Privacy-friendly Bid Optimization with ROI Constraint in Online Advertising;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

2. FreQuant: A Reinforcement-Learning based Adaptive Portfolio Optimization with Multi-frequency Decomposition;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

3. Bayesian reinforcement learning for navigation planning in unknown environments;Frontiers in Artificial Intelligence;2024-07-04

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3