Author:
Jiang Kun,Kong Lingyue,Dong Lu
Publisher
Springer Nature Singapore
Reference29 articles.
1. Wang, Y., Liu, H., Zheng, W., et al.: Multi-objective workflow scheduling with deep-Q-network-based multi-agent reinforcement learning. IEEE Access 7, 39974–39982 (2019)
2. Silver, D., Lever, G., Heess, N., et al.: Deterministic policy gradient algorithms. In: International Conference on Machine Learning, pp. 387–395. PMLR (2014)
3. Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529 (2015)
4. Eshkevari, S.S., Eshkevari, S.S., Sen, D., et al.: Active structural control framework using policy-gradient reinforcement learning. Eng. Struct. 274, 115122 (2023)
5. Hong, M., Wai, H.T., Wang, Z., et al.: A two-timescale stochastic algorithm framework for bilevel optimization: complexity analysis and application to actor-critic. SIAM J. Optim. 33(1), 147–180 (2023)