1. Offline reinforcement learning: Tutorial, review, and perspectives on open problems;levine;Proc Neural Inf Process Syst,0
2. Benchmarks for deep off-policy evaluation;fu;Proc Int Conf Learn Representations,0
3. Incentivizing exploration in reinforcement learning with deep predictive models;stadie,2015
4. Conservative Q-learning for offline reinforcement learning;kumar;Proc 34th Int Conf Neural Inf Process Syst,2020