1. Dynamic programming;Bellman,1957
2. Neuro-dynamic programming;Bertsekas,1996
3. Neural network toolbox user's guide (MATLAB);Demuth,2002
4. Control of nonlinear systems using polynomial ARMA models;Hernández;AIChE Journal,1993
5. Kaisare, N. S., Lee, J. M., & Lee, J. H. (2002). Comparison of policy iteration, value iteration and temporal difference learning. In AIChE Annual Meeting, Indianapolis, IN.