1. Policy-gradient algorithms for partially observable Markov decision processes;Aberdeen,2003
2. Scaling internal-state policy-gradient methods for POMDPs;Aberdeen,2002
3. Dynamic optimization of long-term growth rate for a portfolio with transaction costs and logarithmic utility;Akian;Mathematical Finance,2001
4. Infinite-horizon policy-gradient estimation;Baxter;Journal of Artificial Intelligence Research,2001
5. Dynamic Programming and Stochastic Control;Bertsekas,1976