Fast adaptation of multi-task meta-learning for optical performance monitoring-Reference-Cited by-同舟云学术

Fast adaptation of multi-task meta-learning for optical performance monitoring

Published:2023-06-27 Issue:14 Volume:31 Page:23183
ISSN:1094-4087
Container-title:Optics Express
language:en
Short-container-title:Opt. Express

Author:

Zhang Yu¹,Zhou Peng¹,Liu Yan¹,Wang Jixiang¹,Li Chuanqi²,Lu Ye¹

Affiliation:

1. Education Department of Guangxi

2. Nanning Normal University

Abstract

An algorithm is proposed for few-shot-learning (FSL) jointing modulation format identification (MFI) and optical signal-to-noise ratio (OSNR) estimation. The constellation diagrams of six widely-used modulation formats over a wide range of OSNR (10-40 dB) are obtained by a dual-polarization (DP) coherent detection system at 32 GBaud. We introduce auxiliary task to model-agnostic meta-learning (MAML) which makes the gradient of meta tasks decline faster in the direction of optimal target. Ablation experiments including multi-task model-agnostic meta-learning (MT-MAML), single-task model-agnostic meta-learning (ST-MAML) and adaptive multi-task learning (AMTL) are executed to train a data set with only 20 examples for each class. First, we discuss the impact from the number of shots and gradient descent steps for support set on the meta-learning based schemes to determine the best hyper parameters and conclude that the proposed method better captures the similarity between new and previous knowledge at 4 shot and 1 step. Withdrawn fine-tuning, the model achieves the lowest error ∼0.37 dB initially. Then, we simulate two other schemes (AMTL and ST-MAML), and the numerical results shows that mean square error (MSE) are ∼0.6 dB, ∼0.3 dB and ∼0.18 dB, respectively, proposed method has faster adaption to main task. For low order modulation formats, the proposed method almost reduces the error to 0. Meanwhile, we reveal the degree of deviation between the prediction and target and find that the deviation is mainly concentrated in the high OSNR range of 25-40 dB. Specifically, we investigate the variation curve of adaptive weights during pretraining and conclude that after 30 epoch, the model's attention was almost entirely focused on estimating OSNR. In addition, we study the generalization ability of the model by varying the transmission distance. Importantly, excellent generalization is also experimentally verified. In this paper, the method proposed will greatly reduce the cost for repetitively collecting data and the training resources required for fine-tuning models when OPM devices need to be deployed at massive nodes in dynamic optical networks.

Funder

Education Department of Guangxi Zhuang Autonomous Region

Publisher

Optica Publishing Group

Subject

Atomic and Molecular Physics, and Optics

Reference24 articles.

1. Low Complexity OSNR Monitoring and Modulation Format Identification Based on Binarized Neural Networks

2. Optical Performance Monitoring: A Review of Current and Future Technologies

3. Optical performance monitoring for the next generation optical communication networks

4. Controllable Asymmetry Attack on Two-Way Fiber Time Synchronization System

5. Machine Learning Techniques for Optical Performance Monitoring From Directly Detected PDM-QAM Signals

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Covert fault detection with imbalanced data using an improved autoencoder for optical networks;Journal of Optical Communications and Networking;2023-10-30