Reinforcement learning algorithms: A brief survey-Reference-Cited by-同舟云学术

Reinforcement learning algorithms: A brief survey

Published:2023-11 Issue: Volume:231 Page:120495
ISSN:0957-4174
Container-title:Expert Systems with Applications
language:en
Short-container-title:Expert Systems with Applications

Author:

Shakya Ashish Kumar,Pillai Gopinatha,Chakrabarty Sohom^ORCID

Publisher

Elsevier BV

Subject

Artificial Intelligence,Computer Science Applications,General Engineering

Reference365 articles.

1. Abdoos, M., Mozayani, N., & Bazzan, A. L. C. (2011). Traffic light control in non-stationary environments based on multi agent Q-learning. In Proceedings of the 14th International IEEE Conference on Intelligent Transportation Systems (ITSC) (pp. 1580-1585).

2. Achiam, J., Held, D., Tamar, A., & Abbeel, P. (2017). Constrained policy optimization. In Proceedings of the 34th International Conference on Machine Learning (ICML'17) (pp. 22–31).

3. Cyber-security and reinforcement learning — A brief survey;Adawadkar;Engineering Applications of Artificial Intelligence,2022

4. Reinforcement learning based recommender systems: A survey;Afsar;ACM Computing Surveys,2022

5. Agrawal, S. & Jia, R. (2017). Optimistic posterior sampling for reinforcement learning: worst-case regret bounds. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017) (pp. 1184-1194).

Cited by 66 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in mixed cooperative and competitive environments;Expert Systems with Applications;2024-12

2. Collaborative optimization of multi-energy multi-microgrid system: A hierarchical trust-region multi-agent reinforcement learning approach;Applied Energy;2024-12

3. Flexible recommendation for optimizing the debt collection process based on customer risk using deep reinforcement learning;Expert Systems with Applications;2024-12

4. Collaborative promotion: Achieving safety and task performance by integrating imitation reinforcement learning;Expert Systems with Applications;2024-12

5. Predictive coding for the actions and emotions of others and its deficits in autism spectrum disorders;Neuroscience & Biobehavioral Reviews;2024-12