1. Adaptive control.;Åström,1989
2. Infinite-horizon policy-gradient estimation;Baxter;Journal of Artificial Intelligence Research,2001
3. Experiments with infinite-horizon policy-gradient estimation;Baxter;Journal of Artificial Intelligence Research,2001
4. Dynamic programming and optimal control, Vols. I and II;Bertsekas,1995
5. Dynamic programming and optimal control, Vols. I and II;Bertsekas,2001