1. Neuro-dynamic programming;Bertsekas,1996
2. Relative value function approximation for the capacitated re-entrant line scheduling problem;Choi;IEEE Transactions on Automation Science and Engineering,2005
3. Csáji, B. Cs. (2008). Adaptive resource control: Machine learning approaches to resource allocation in uncertain and changing environments. Dissertation for the Ph.D. Degree, Eötvös Loránd University, Budapest, Hungary, p. 104.
4. Value function based reinforcement learning in changing Markovian environments;Csáji;Journal of Machine Learning Research,2008
5. Adaptive stochastic resource control: A machine learning approach;Csáji;Journal of Artificial Intelligence Research,2008