1. Efficient average reward reinforcement learning using constant shifting values;Yang,2016
2. Reinforcement learning: An introduction;Sutton,1998
3. A Concise Introduction to Decentralized POMDPs;Oliehoek,2016
4. Robust optimal control scheme for unknown constrained-input nonlinear systems via a plug-n-play event-sampled critic-only algorithm;Zhang;IEEE Trans. Syst. Man Cybern.: Syst.,2019
5. Reasoning About Uncertainty;Halpern,2003