1. Barto, A.G. and Sutton, R. (1996). Reinforcement Learning: An introduction. Adaptive Computation and Machine Learning.
2. Bertsekas, D.P. and Tsitsiklis, J.N. (1996). Neuro-Dynamic programming. Belmont: Athena Scientific.
3. Bosq, D. (1996). Algorithms for minimization without derivatives. Nonparametric statistics for stochastic processes. Estimation and prediction.Lecture notes in statistics. New York: Springer-Verlag.
4. Brent, R.P. (1996). Algorithms for minimization without derivatives. Englewood Cliffs: Prentice-Hall.
5. Gold, C. (2003). FX trading via recurrent Reinforcement Learning. Proc of IEEE Intl Conf on Computat Intel in Financial Engin, 1(1), 363-370.