1. Double Q-learning;hasselt;Advances in neural information processing systems,2010
2. Reinforcement learning for control: Performance, stability, and deep approximators;buşoniu;Annual Reviews in Control,2018
3. Safe model-based reinforcement learning with stability guarantees;berkenkamp;ArXiv Preprint,2017
4. The interplay between stability and regret in online learning;saha;ArXiv Preprint,2012
5. A deeper look at experience replay;zhang;ArXiv Preprint,2017