Distributed Consensus-Based Multi-Agent Off-Policy Temporal-Difference Learning

Published: 2021-12-14
Container-title: 2021 60th IEEE Conference on Decision and Control (CDC)

Author:

Stankovic Milos S., Beko Marko, Stankovic Srdjan S.

Funder

Science Fund of the Republic of Serbia

Fundação para a Ciência e a Tecnologia

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/9682670/9682776/09683607.pdf?arnumber=9683607

Reference: 34 articles.

1. Decentralized Parameter Estimation by Consensus Based Stochastic Approximation

2. Preface

3. Weak convergence properties of constrained emphatic temporal-difference learning with constant and slowly diminishing stepsize; Yu; Journal of Machine Learning Research, 2016

4. Cooperative off-policy prediction of Markov decision processes in adaptive networks

Cited by 5 articles.

1. Multi-agent off-policy actor-critic algorithm for distributed multi-task reinforcement learning; European Journal of Control; 2023-11

2. Distributed consensus-based multi-agent temporal-difference learning; Automatica; 2023-05

3. Multi-Agent Actor-Critic Multitask Reinforcement Learning based on GTD(1) with Consensus; 2022 IEEE 61st Conference on Decision and Control (CDC); 2022-12-06

4. Convergent Distributed Actor-Critic Algorithm Based on Gradient Temporal Difference; 2022 30th European Signal Processing Conference (EUSIPCO); 2022-08-29

5. Distributed Actor-Critic Learning Using Emphatic Weightings; 2022 8th International Conference on Control, Decision and Information Technologies (CoDIT); 2022-05-17