1. Pallapa V Challenges in ubiquitous network management, 2013. http://pet.ece.iisc.ernet/pallapa/Ubi-Management
2. Sutton R, Barto A (2018) Reinforcement learning: an introduction. MIT Press, Cambridge
3. Wenwen X, Chong D, Haonan G, Shenghong L (2019) Reinforcement learning based stochastic shortest path finding in wireless sensor networks. IEEE Access 7(2019):15780–15817
4. Almuthanna N, Yasin Y (2019) Reinforcement learning-based resource allocation in fog RAN for IoT with heterogeneous latency requirements. In: Proceedings of IEEE international conference on communications, Shanghai, China, 2(2019):1–11
5. Dapeng Z, Zhiwei G (2020) Fault tolerant control using reinforcement learning and particle swarm optimization. IEEE Access 8(2020):16802–16811