Affiliation:
1. College of Information Science and Technology, Jinan University, Guangzhou 510632, China
2. Guangdong Key Laboratory of Data Security and Privacy Preserving, Guangzhou 511443, China
Abstract
The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, vehicle mobility and the limited coverage of roadside units (RSUs) make it difficult for vehicles to receive the complete popular content from RSUs. Collaboration among vehicles via vehicle-to-vehicle (V2V) communications is an effective way to help more vehicles obtain the entire popular content at a lower time cost. To this end, we propose a multi-agent deep reinforcement learning (MADRL)-based popular content distribution scheme for vehicular networks, in which each vehicle deploys an MADRL agent that learns to choose an appropriate data transmission policy. To reduce the complexity of the MADRL-based algorithm, a vehicle clustering algorithm based on spectral clustering divides all vehicles in the V2V phase into groups, so that only vehicles within the same group exchange data. The multi-agent proximal policy optimization (MAPPO) algorithm is then used to train the agents. We introduce a self-attention mechanism when constructing the neural network for the MADRL agent to help it represent the environment accurately and make decisions. Furthermore, invalid action masking is used to prevent the agent from taking invalid actions, which accelerates training. Finally, experimental results and a comprehensive comparison demonstrate that our MADRL-PCD scheme outperforms both the coalition game-based scheme and the greedy strategy-based scheme, achieving higher PCD efficiency and lower transmission delay.
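As an illustration of the invalid action masking mentioned in the abstract, the following is a minimal sketch of the general technique, not the authors' implementation. It assumes a discrete action space in which each action has a raw policy score (logit) and a boolean validity flag; the function name `masked_softmax` and the example values are hypothetical. Invalid actions have their logits replaced with negative infinity before the softmax, so they receive zero probability and can never be sampled.

```python
import math


def masked_softmax(logits, valid_mask):
    """Return action probabilities with invalid actions forced to 0.

    logits     -- list of raw policy scores, one per action (hypothetical)
    valid_mask -- list of booleans, True where the action is allowed
    """
    if len(logits) != len(valid_mask):
        raise ValueError("logits and valid_mask must have the same length")
    if not any(valid_mask):
        raise ValueError("at least one action must be valid")

    # Replace the logit of every invalid action with -inf so that
    # exp(-inf) == 0 and the action gets zero probability.
    masked = [x if ok else -math.inf for x, ok in zip(logits, valid_mask)]

    # Subtract the largest remaining logit for numerical stability.
    peak = max(masked)
    exps = [math.exp(x - peak) for x in masked]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical example: three candidate actions, the second one is invalid
# (for instance, a transmission target that is currently unreachable).
probs = masked_softmax([1.0, 2.0, 3.0], [True, False, True])
print(probs)
```

Because the masked action has zero probability, it also contributes nothing to the policy gradient, which is one commonly cited reason why masking can speed up training.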
Funder
Science and Technology Planning Project of Guangdong
Guangdong Provincial NSF
Science and Technology Planning Project of Guangzhou
Key Laboratory of Smart Education of Guangdong Higher Education Institutes, Jinan University
Jinan University
Opening Project of Key Laboratory of Safety of Intelligent Robots for State Market Regulation
NSFC
Subject
General Physics and Astronomy