1. Rewarding behaviors;Bacchus,1996
2. Optimal and dynamic planning for Markov decision processes with co-safe LTL specifications;Lacerda,2014
3. Deep reinforcement learning with temporal logics;Hasanbeig,2020
4. Learning and planning for temporally extended tasks in unknown environments;Bradley,2021