Author:
Bozkurt Alper Kamil,Wang Yu,Pajic Miroslav
Reference24 articles.
1. Safe reinforcement learning via shielding;alshiekh;AAAI Conference on Artificial Intelligence,2018
2. Run-Time Optimization for Learned Controllers Through Quantitative Games
3. Probably Approximately Correct MDP Learning and Control With Temporal Logic Constraints
4. Using reward machines for high-level task specification and decomposition in reinforcement learning;icarte;International Conference on Machine Learning (ICML),2018
5. Omega-Regular Objectives in Model-Free Reinforcement Learning