Author:
Rieder Ulrich,Wagner Hartmut
Publisher
Springer Science and Business Media LLC
Subject
Management Science and Operations Research,General Decision Sciences
Reference12 articles.
1. H. Benzing, K. Hinderer and M. Kolonko, On thek-armed Bernoulli bandit: Monotonicity of the total reward under an arbitrary prior distribution, Math. Operationsforschung Statistik, Ser. Optimization 15(1984)583–595.
2. H. Benzing and M. Kolonko, Structured policies for a sequential design problem with general distributions, Math. Oper. Res. 12(1987)60–71.
3. D.A. Berry and B. Fristedt,Bandit Problems (Chapman and Hall, London, 1985).
4. D.P. Bertsekas,Dynamic Programming: Deterministic and Stochastic Models (Prentice Hall, Englewood Cliffs, NJ, 1987).
5. K. Hinderer,Foundations of Non-Stationary Dynamic Programming with Discrete Time, Parameter (Springer, Berlin, 1970).
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献