1. On the theory of policy gradient methods: Optimality, approximation, and distribution shift;Agarwal;Journal of Machine Learning Research,2021
2. Reinforcement learning in decentralized stochastic control systems with partial history sharing;Arabneydi,2015
3. Reinforcement learning of POMDPs using spectral methods;Azizzadenesheli,2016
4. Policy iteration for decentralized control of Markov decision processes;Bernstein;Journal of Artificial Intelligence Research,2009
5. The complexity of decentralized control of Markov decision processes;Bernstein;Mathematics of Operations Research,2002