Multi-armed bandits for bid shading in first-price real-time bidding auctions

Author:

Tilli Tuomo12,Espinosa-Leal Leonardo2

Affiliation:

1. ReadPeak Oy, Helsinki, Finland

2. Department of Bussiness Management and Analytics, Arcada University of Applied Sciences, Helsinki, Finland

Abstract

Online advertisements are bought through a mechanism called real-time bidding (RTB). In RTB, the ads are auctioned in real-time on every webpage load. The ad auctions can be of two types: second-price or first-price auctions. In second-price auctions, the bidder with the highest bid wins the auction, but they only pay the second-highest bid. This paper focuses on first-price auctions, where the buyer pays the amount that they bid. This research evaluates how multi-armed bandit strategies optimize the bid size in a commercial demand-side platform (DSP) that buys inventory through ad exchanges. First, we analyze seven multi-armed bandit algorithms on two different offline real datasets gathered from real second-price auctions. Then, we test and compare the performance of three algorithms in a production environment. Our results show that real data from second-price auctions can be used successfully to model first-price auctions. Moreover, we found that the trained multi-armed bandit algorithms reduce the bidding costs considerably compared to the baseline (naïve approach) on average 29%and optimize the whole budget by slightly reducing the win rate (on average 7.7%). Our findings, tested in a real scenario, show a clear and substantial economic benefit for ad buyers using DSPs.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference14 articles.

1. Operating anadvertising programmatic buying platform: A case study;Gonzalvez-Cabañas;IJIMAI,2016

2. AdX: a model for ad exchanges;Muthukrishnan;ACM SIGecomExchanges,2009

3. Internet advertising andthe generalized second-price auction: Selling billions of dollars worthof keywords;Edelman;American Economic Review,2007

4. Lattimore T. and Szepesvári C. , Bandit algorithms, Cambridge University Press (2020).

5. Nonparametric estimation from incompleteobservations;Kaplan;Journal of the American Statistical Association,1958

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3