Distributed Multi-Agent Reinforcement Learning by Actor-Critic Method

Author:

Heredia Paulo C.,Mou Shaoshuai

Publisher

Elsevier BV

Subject

Control and Systems Engineering

Reference19 articles.

1. Natural actor–critic algorithms;Bhatnagar;Automatica,2009

2. Stochastic approximation: a dynamical systems view;Borkar,2008

3. Dai, B., Shaw, A., Li, L., Xiao, L., He, N., Liu, Z., Chen, J., and Song, L. (2018). Sbeed: Convergent reinforcement learning with nonlinear function approximation. In ICML 2018.

4. Qd-learning: A collaborative distributed strategy for multi-agent reinforcement learning through consensus+innovations;Kar;IEEE Transactions on Signal Processing,2013

5. Konda, V.R. and Tsitsiklis, J.N. (2000). Actor-critic algorithms. In Advances in neural information processing systems, 1008–1014.

Cited by 11 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Cooperative Multiagent Reinforcement Learning With Partial Observations;IEEE Transactions on Automatic Control;2024-02

2. Intelligent Control of Robots with Minimal Power Consumption in Pick-and-Place Operations;Energies;2023-11-03

3. Management of Braking Energy in Electric Vehicles using Reinforcement Learning;2023 International Conference on Clean Electrical Power (ICCEP);2023-06-27

4. Simulation Study of Processes in Electric Vehicles under Braking Control Based on Reinforcement Learning;2023 IEEE 17th International Conference on Compatibility, Power Electronics and Power Engineering (CPE-POWERENG);2023-06-14

5. Distributed Offline Reinforcement Learning;2022 IEEE 61st Conference on Decision and Control (CDC);2022-12-06

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3