Publisher
Springer Science and Business Media LLC
Reference50 articles.
1. Dhiman G, Kumar AV, Nirmalan R et al (2023) Multi-modal active learning with deep reinforcement learning for target feature extraction in multi-media image processing applications. Multimed Tools Appl 82:5343–5367. https://doi.org/10.1007/s11042-022-12178-7
2. Salimans T, Ho J, Chen X, Sidor S, Sutskever I (2017) Evolution strategies as a scalable alternative to reinforcement learning. arXiv preprint arXiv: 1703.03864. https://doi.org/10.48550/arXiv.1703.03864
3. Todorov E, Erez T, Tassa Y (2012) MuJoCo: A physics engine for model-based control. IEEE/RSJ Int Conf Intell Robots Syste, Vilamoura-Algarve, Portugal 5026–5033. https://doi.org/10.1109/IROS.2012.6386109
4. Baba N (1981) Convergence of a random optimization method for constrained optimization problems. J Optim Theory Appl 33:451–461. https://doi.org/10.1007/BF00935752
5. Pattathil S, Zhang K, Ozdaglar A (2023) Symmetric (Optimistic) Natural Policy Gradient for Multi-Agent Learning with Parameter Convergence. Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 206:5641–5685. https://proceedings.mlr.press/v206/pattathil23a.html