Optimization of Single-user Task Migration based on Improved DDPG-Reference-Cited by-同舟云学术

Optimization of Single-user Task Migration based on Improved DDPG

Published:2024-07-17 Issue: Volume:21 Page:308-317
ISSN:2224-3402
Container-title:WSEAS TRANSACTIONS ON INFORMATION SCIENCE AND APPLICATIONS
language:en
Short-container-title:

Author:

Ning Cao¹,Yang He²,Can Hu¹

Affiliation:

1. College of Computer Science and Software Engineering, Hohai University, Nanjing, CHINA

2. College of Information Science and Engineering, Hohai University, Nanjing, CHINA

Abstract

Aiming at the problems of slow convergence and unstable convergence of traditional reinforcement learning algorithms in minimizing computational cost on edge servers with random task arrivals and time-varying wireless channels, an improved DDPG algorithm (IDDPG) was proposed. The Critic network structure of DDPG was replaced by the Dueling structure, which converged faster by splitting the state value function into an advantage function and a value function. The update frequency of the Critic network was adjusted to be higher than that of the Actor-network to make the overall training more stable. The Ornstein- Uhlenbeck noise was added to the actions selected through the Actor-network to improve the algorithm exploration ability, and the action noise size was set in segments to ensure the stability of convergence. Experimental results show that, compared with other algorithms, the IDDPG algorithm can better minimize the computational cost and has a certain improvement in the convergence speed and convergence stability.

Publisher

World Scientific and Engineering Academy and Society (WSEAS)

Reference25 articles.

1. SHI Weisong, ZHANG Xingzhou, WANG Yifan, ZHANG Qingyang. Edge computing: State-of-the-art and future directions. Journal of Computer Research and Development, 2019, 56 (1):69-89 (in Chinese).

2. W. Shi, J. Cao, Q. Zhang, Y. Li and L. Xu, "Edge Computing: Vision and Challenges," in IEEE Internet of Things Journal, vol. 3, no. 5, pp. 637-646, Oct. 2016, doi: 10.1109/JIOT.2016.257919 8

3. K. Zhang, Y. Mao, S. Leng, Y. He and Y. ZHANG, "Mobile-Edge Computing for Vehicular Networks: A Promising Network Paradigm with Predictive Off-Loading," in IEEE Vehicular Technology Magazine, vol. 12, no. 2, pp. 36-44, June 2017, doi: 10.1109/MVT. 2017.2668838

4. Y. Mao, C. You, J. Zhang, K. Huang and K. B. Letaief, "A Survey on Mobile Edge Computing: The Communication Perspective," in IEEE Communications Surveys & Tutorials, vol. 19, no. 4, pp. 2322- 2358, Fourthquarter 2017, doi: 10.1109/COMST.2017.2745201

5. Maria J.P. Peixoto and Akramul Azim. 2021. Using time-correlated noise to encourage exploration and improve autonomous agents performance in Reinforcement Learning. Procedia Comput. Sci., 191, C (2021), 85-92. https://doi.org/10.1016/j. procs.2021.07.014.