1. Abadi M, Agarwal A, Barham P, et al. (2015) TensorFlow: large-scale machine learning on heterogeneous systems. Software available from tensorflow.org. URL: https://www.tensorflow.org/
2. Achiam J, Knight E, Abbeel P (2019) Towards characterizing divergence in deep Q-learning. arXiv preprint arXiv:1903.08894.
3. Amiranashvili A, Dosovitskiy A, Koltun V, et al. (2018) Analyzing the role of temporal differencing in deep reinforcement learning. In: Proceedings of the International Conference on Learning Representations. URL: https://openreview.net/forum?id=HyiAuyb0b
4. Hybrid impedance control of robotic manipulators
5. Anschel O, Baram N, Shimkin N (2017) Averaged-DQN: variance reduction and stabilization for deep reinforcement learning. In: Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017, PMLR 70, pp. 176–185.