Computational task transfer scheme based on multi-agent generative adversarial imitation learning-Reference-Cited by-同舟云学术

Computational task transfer scheme based on multi-agent generative adversarial imitation learning

Published:2024-09-02 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Huang Haojing¹,Li Jiajun²,Lu Fei¹,Li Jianxin²

Affiliation:

1. Open University of Guangdong (Guandong Polytechnic Institute)

2. Dongguan Polytechnic

Abstract

The multi-agent computing task scheduling problem can be transformed into a task assignment optimization problem under the premise of minimum system cost and maximum sample utilization. For this problem, this paper proposes a computational task migration scheme based on multi-agent body generation adversarial imitation learning. In the battlefield, this scheme uses the idle computing power of tanks, fighting vehicles, drones or infantry equipment to form a Ad Hoc cloud. Based on the characteristics of distributed training of reinforcement learning algorithm, it makes full use of equipment resources to solve the transfer scheme of computing tasks. By using behavioral cloning to construct the data set and build the initial strategy model, the reward function is simplified, and the network structure, parameters and training hyperparameters are set. Driven by the training set, the generative adversarial inverse reinforcement learning method is used for network training. The simulation results show that the MAGAIL-MCT algorithm can mimic the expert trajectory behavior and successfully train the neural network. The experimental results prove that the scheme is improved in reducing system overhead, improving sample utilization, reducing migration delay, migration energy consumption and reducing the impact of mobility.

Publisher

Springer Science and Business Media LLC

Reference18 articles.

1. Survey of wireless manet application in battlefield operations [J];Rajabhushanam C;Int J Adv Comput Sci Appl,2011

2. Burer S, Saxena A The milp road to miqcp [C]//Lee J, Leyffer S.The IMA Volumes in Mathematics and Its Applications: Mixed Integer Nonlinear Programming.2012: 373–405

3. Skianis C.A survey on context-aware mobile and wireless networking: On networking and computing environments’ integration [J];Makris P;IEEE Commun Surv Tutorials,2013

4. Learning for a robot: deep reinforcement learning, imitation learning;HUA J;Transf learning[J] Sens,2021

5. Survey of imitation learning for robotic manipulation[J];[5] FANGB;Int J Intell Rob Appl,2019