Double Q-learning based routing protocol for opportunistic networks-Reference-Cited by-同舟云学术

Double Q-learning based routing protocol for opportunistic networks

Published:2023-01-03 Issue:1 Volume:29 Page:1-14
ISSN:1875-8940
Container-title:Journal of High Speed Networks
language:
Short-container-title:JHS

Author:

Singh Jagdeep¹,Dhurandher Sanjay Kumar²,Woungang Isaac³,Barolli Leonard⁴

Affiliation:

1. Department of Computer Science and Engineering, Sant Longowal Institute of Engineering and Technology, Longowal, India

2. Department of Information Technology, Netaji Subhas University of Technology, New Delhi, India

3. Department of Computer Science, Toronto Metropolitan University, Toronto, Ontario, Canada

4. Department of Information and Communication Engineering, Faculty of Information Engineering, Fukuoka Institute of Technology, Fukuoka, Japan

Abstract

Opportunistic Delay Tolerant Networks also referred to as Opportunistic Networks (OppNets) are a subset of wireless networks having mobile nodes with discontinuous opportunistic connections. As such, developing a performant routing protocol in such an environment remains a challenge. Most research in the literature have shown that reinforcement learning-based routing algorithms can achieve a good routing performance, but these algorithms suffer from under-estimations and/or over-estimations. Toward addressing these shortcomings, in this paper, a Double Q-learning based routing protocol for Opportunistic Networks framework named Off-Policy Reinforcement-based Adaptive Learning (ORAL) is proposed, which selects the most suitable next-hop node to transmit the message toward its destination without any bias by using a weighted double Q-estimator. In the next-hop selection process, a probability-based reward mechanism is involved, which considers the node’s delivery probability and the frequency of encounters among the nodes to boost the protocol’s efficiency. Simulation results convey that the proposed ORAL protocol improves the message delivery ratio by maintaining a trade-off between underestimation and overestimation. Simulations are conducted using the HAGGLE INFOCOM 2006 real mobility data trace and synthetic model, showing that when time-to-live is varied, (1) the proposed ORAL scheme outperforms DQLR by 14.05%, 9.4%, 5.81% respectively in terms of delivery probability, overhead ratio and average delay; (2) it also outperforms RLPRoPHET by 16.17%, 9.2%, 6.85%, respectively in terms of delivery ratio, overhead ratio and average delay.

Publisher

IOS Press

Subject

Computer Networks and Communications,Hardware and Architecture,Information Systems

Reference19 articles.

1. Q-learning based energy-efficient and void avoidance routing protocol for underwater acoustic sensor networks

2. DTN routing as a resource allocation problem;Balasubramanian;ACM SIGCOMM Computer Comm. Review,2007

3. Optimising message broadcasting in opportunistic networks;Chancay García;Computer Communications,2020

4. ARBR: Adaptive reinforcement-based routing for DTN

5. QELAR: A machine-learning-based adaptive routing protocol for energy-efficient and lifetime-extended underwater sensor networks;Hu;IEEE Trans. on Mobile Computing,2010

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A comprehensive survey on Machine Learning techniques in opportunistic networks: Advances, challenges and future directions;Pervasive and Mobile Computing;2024-05

2. Exploring the potential of pH-sensitive polymers in targeted drug delivery;Journal of Biomaterials Science, Polymer Edition;2023-11-15

3. Multi-Decision Dynamic Intelligent Routing Protocol for Delay-Tolerant Networks;Electronics;2023-11-03

4. Seed Node Selection Algorithm Based on Node Influence in Opportunistic Offloading;2023 IEEE Symposium on Computers and Communications (ISCC);2023-07-09

5. Keeping Up with Technology: Socioemotional and Equity Challenges with Children and Schools;Children & Schools;2023-05-27