Author:
Belyavsky Grigory,Danilova Natalya,Ougolnitsky Guennady
Abstract
This paper considers resource allocation among producers (agents) in the case where the Principal knows nothing about their cost functions while the agents have Markovian awareness about his/her strategies. We use a dynamic setup of the stochastic inverse Stackelberg game as the model. We suggest an algorithm for solving this game based on Q-learning. The associated Bellman equations contain functions of one variable for the Principal and also for the agents. The new results are illustrated by numerical examples.
Funder
Russian Science Foundation
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference38 articles.
1. Marktform und Gleichgewicht;Von Stackelberg,1934
2. Phenomena in Inverse Stackelberg Games, Part 1: Static Problems
3. Phenomena in Inverse Stackelberg Games, Part 2: Dynamic Problems
4. The Theory of Incentives: The Principal-Agent Model;Laffont,2002
5. Theory of Control in Organizations;Novikov,2013
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献