1. A (revised) survey of approximate methods for solving partially observable Markov decision processes;Aberdeen,2003
2. Analysis of Thompson sampling for the multi-armed bandit problem;Agrawal,2012
3. Online algorithms: A survey;Albers;Mathematical Programming,2003
4. A review of simulation optimization techniques;Andradóttir,1998
5. Simulation optimimzation;Andradóttir,1998