1. Fitted Q-iteration in continuous action-space MDPs;Antos,2008
2. A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters;Berenji;IEEE Transactions on Fuzzy Systems,2003
3. Dynamic programming and optimal control, Vol. 2;Bertsekas,2007
4. Neuro-dynamic programming;Bertsekas,1996
5. Neurofuzzy adaptive modeling and control;Brown,1994