1. Reinforcement learning for true adaptive traffic signal control;Abdulhai;Journal of Transportation Engineering,2003
2. Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control;Al-Tamimi;Automatica,2007
3. Adaptive critic designs for discrete-time zero-sum games with application to H-Infinity control;Al-Tamimi;IEEE Transactions on Systems Man Cybernetics-Part B,2006
4. Natural gradient works efficiently in learning;Amari;Neural Computation,1998
5. A. Antos, R. Munos, C. Szepesvari, Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems, in: 2009 American Control Conference, Hyatt Regency Riverfront, St. Louis, MO, USA, June 10–12, pp. 725–730.