Authors
Antony Vijesh Villavarayan, Sumithra Rudresha Shreyas, Abdulla Mohammed Shahid
Funder
Council of Scientific and Industrial Research, India
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics, Management Science and Operations Research, Control and Optimization