Federated Reinforcement Learning for Collaborative Intelligence in UAV-Assisted C-V2X Communications-Reference-Cited by-同舟云学术

Federated Reinforcement Learning for Collaborative Intelligence in UAV-Assisted C-V2X Communications

Published:2024-07-12 Issue:7 Volume:8 Page:321
ISSN:2504-446X
Container-title:Drones
language:en
Short-container-title:Drones

Author:

Gupta Abhishek¹^ORCID,Fernando Xavier¹^ORCID

Affiliation:

1. Department of Electrical, Computer and Biomedical Engineering, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada

Abstract

This paper applies federated reinforcement learning (FRL) in cellular vehicle-to-everything (C-V2X) communication to enable vehicles to learn communication parameters in collaboration with a parameter server that is embedded in an unmanned aerial vehicle (UAV). Different sensors in vehicles capture different types of data, contributing to data heterogeneity. C-V2X communication networks impose additional communication overhead in order to converge to a global model when the sensor data are not independent-and-identically-distributed (non-i.i.d.). Consequently, the training time for local model updates also varies considerably. Using FRL, we accelerated this convergence by minimizing communication rounds, and we delayed it by exploring the correlation between the data captured by various vehicles in subsequent time steps. Additionally, as UAVs have limited battery power, processing of the collected information locally at the vehicles and then transmitting the model hyper-parameters to the UAVs can optimize the available power consumption pattern. The proposed FRL algorithm updates the global model through adaptive weighing of Q-values at each training round. By measuring the local gradients at the vehicle and the global gradient at the UAV, the contribution of the local models is determined. We quantify these Q-values using nonlinear mappings to reinforce positive rewards such that the contribution of local models is dynamically measured. Moreover, minimizing the number of communication rounds between the UAVs and vehicles is investigated as a viable approach for minimizing delay. A performance evaluation revealed that the FRL approach can yield up to a 40% reduction in the number of communication rounds between vehicles and UAVs when compared to gross data offloading.

Funder

Natural Sciences and Engineering Research Council (NSERC) of Canada

Publisher

MDPI AG

Link

https://www.mdpi.com/2504-446X/8/7/321/pdf

Reference54 articles.

1. Shah, G., Saifuddin, M., Fallah, Y.P., and Gupta, S.D. (2020, January 16–18). RVE-CV2X: A Scalable Emulation Framework for Real-Time Evaluation of C-V2X based Connected Vehicle Applications. Proceedings of the 2020 IEEE Vehicular Networking Conference (VNC), New York, NY, USA.

2. Amadeo, M., Campolo, C., Molinaro, A., Harri, J., Rothenberg, C.E., and Vinel, A. (2019). Enhancing the 3GPP V2X Architecture with Information-Centric Networking. Future Internet, 11.

3. Park, H., and Lim, Y. (2021). Deep Reinforcement Learning Based Resource Allocation with Radio Remote Head Grouping and Vehicle Clustering in 5G Vehicular Networks. Electronics, 10.

4. Making a Case for Federated Learning in the Internet of Vehicles and Intelligent Transportation Systems;Manias;IEEE Netw.,2021

5. Zang, J., and Shikh-Bahaei, M. (April, January 29). Full Duplex-Based Scheduling Protocol for Latency Enhancement in 5G C-V2X VANETs. Proceedings of the 2021 IEEE Wireless Communications and Networking Conference (WCNC), Nanjing, China.