1. An, G., Moon, S., Kim, J.H., Song, H.O.: Uncertainty-based offline reinforcement learning with diversified Q-ensemble. Adv. Neural Inf. Process. Syst. 34, 7436–7447 (2021)
2. Andrychowicz, O.M., et al.: Learning dexterous in-hand manipulation. Int. J. Robot. Res. 39(1), 3–20 (2020)
3. Bai, C., et al.: Pessimistic bootstrapping for uncertainty-driven offline reinforcement learning. In: International Conference on Learning Representations (2022). https://openreview.net/forum?id=Y4cs1Z3HnqL
4. Brandfonbrener, D., Whitney, W., Ranganath, R., Bruna, J.: Offline RL without off-policy evaluation. Adv. Neural Inf. Process. Syst. 34, 4933–4946 (2021)
5. Chen, L., et al.: Decision transformer: reinforcement learning via sequence modeling. Adv. Neural Inf. Process. Syst. 34, 15084–15097 (2021)