1. Barto, A.G., Sutton, R.S., Anderson, C.W.: Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Trans. Syst. Man Cybern. 13(5), 834–846 (1983). https://doi.org/10.1109/TSMC.1983.6313077
2. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Routledge, London (1984)
3. Brockman, G., et al.: OpenAI Gym (2016). https://arxiv.org/abs/1606.01540
4. Coppens, Y., Efthymiadis, K., et al.: Distilling deep reinforcement learning policies in soft decision trees. In: Proceedings of IJCAI 2019 Workshop on Explainable Artificial Intelligence, pp. 1–6 (2019)
5. Fujimoto, S., van Hoof, H., Meger, D.: Addressing function approximation error in actor-critic methods. In: Dy, J., Krause, A. (eds.) Proceedings of 35th ICML, pp. 1587–1596. PMLR (2018)