1. Bertsekas D.P.: Dynamic Programming and Optimal Control, vol 1 and 2. Athena Scientific, Belmont (1995)
2. Chepuri K., Homem De Mello T.: Solving the vehicle routing problem with stochastic demands using the cross-entropy method. Ann. Oper. Res. 134, 153–181 (2005)
3. Cicirello, V., Smith, S.F.: The max k-armed bandit: a new model for exploration applied to search heuristic selection. In: Proceedings of the 20th National Conference on Artificial Intelligence (AAAI-05) (2005)
4. Dorigo M., Gambardella L.M.: Ant colony system: a cooperative learning approach to the traveling salesman problem. IEEE Trans. Evol. Comput. 1, 53–66 (1997)
5. Fu, M.C., Hu, J., Marcus, S. I.: Model-based randomized methods for global optimization. In: Proceedings of the 17th international symposium on mathematical theory of networks and systems. Kyoto, Japan (2006)