Deep Reinforcement Learning-Based RMSA Policy Distillation for Elastic Optical Networks

Author:

Tang BixiaORCID,Huang Yue-CaiORCID,Xue YunORCID,Zhou Weixing

Abstract

The reinforcement learning-based routing, modulation, and spectrum assignment has been regarded as an emerging paradigm for resource allocation in the elastic optical networks. One limitation is that the learning process is highly dependent on the training environment, such as the traffic pattern or the optical network topology. Therefore, re-training is required in case of network topology or traffic pattern variations, which consumes a great amount of computation power and time. To ease the requirement of re-training, we propose a policy distillation scheme, which distills knowledge from a well-trained teacher model and then transfers the knowledge to the to-be-trained student model, so that the training of the latter can be accelerated. Specifically, the teacher model is trained for one training environment (e.g., the topology and traffic pattern) and the student model is for another training environment. The simulation results indicate that our proposed method can effectively speed up the training process of the student model, and it even leads to a lower blocking probability, compared with the case that the student model is trained without knowledge distillation.

Funder

National Natural Science Foundation of China

Basic and Applied Basic Research Foundation of Guangdong Province

Guangdong Science and Technology Department

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Reference37 articles.

1. Cisco Visual Networking Index: Forecast and Trends, 2017–2022https://www.cisco.com/c/en_in/index.html

2. Spectrum-efficient and scalable elastic optical path network: architecture, benefits, and enabling technologies

3. Elastic optical networking: a new dawn for the optical layer?

4. A review of routing and wavelength assignment approaches for wavelength-routed optical WDM networks;Zang;Opt. Netw. Mag.,2000

5. Routing and spectrum assignment: A metaheuristic for hybrid ordering selection in elastic optical networks

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3