1. https://osf.io/r24mu/?view_only=b44cec578cce44e5920f150940f68230
2. Amin, S., Gomrokchi, M., Satija, H., van Hoof, H., Precup, D.: A survey of exploration methods in reinforcement learning (2021)
3. Anderson, J.R.: Learning and Memory: An Integrated Approach, 2nd edn. Wiley, Hoboken (2000)
4. Ashok, P.: Lecture Notes in Computer Science (2019)
5. Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Mach. Learn. 47, 235–256 (2002)