Author:
Stankovic Milos S.,Beko Marko,Stankovic Srdjan S.
Funder
Science Fund of the Republic of Serbia
Fundação para a Ciência e a Tecnologia
Reference34 articles.
1. Decentralized Parameter Estimation by Consensus Based Stochastic Approximation
2. Preface
3. Weak convergence properties of constrained emphatic temporal-difference learning with constant and slowly diminishing stepsize;yu;Journal of Machine Learning Research,2016
4. Cooperative off-policy prediction of Markov decision processes in adaptive networks
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献