Affiliation:
1. Education Department of Guangxi
2. Nanning Normal University
Abstract
An algorithm is proposed for few-shot-learning (FSL) jointing modulation format identification (MFI) and optical signal-to-noise ratio (OSNR) estimation. The constellation diagrams of six widely-used modulation formats over a wide range of OSNR (10-40 dB) are obtained by a dual-polarization (DP) coherent detection system at 32 GBaud. We introduce auxiliary task to model-agnostic meta-learning (MAML) which makes the gradient of meta tasks decline faster in the direction of optimal target. Ablation experiments including multi-task model-agnostic meta-learning (MT-MAML), single-task model-agnostic meta-learning (ST-MAML) and adaptive multi-task learning (AMTL) are executed to train a data set with only 20 examples for each class. First, we discuss the impact from the number of shots and gradient descent steps for support set on the meta-learning based schemes to determine the best hyper parameters and conclude that the proposed method better captures the similarity between new and previous knowledge at 4 shot and 1 step. Withdrawn fine-tuning, the model achieves the lowest error ∼0.37 dB initially. Then, we simulate two other schemes (AMTL and ST-MAML), and the numerical results shows that mean square error (MSE) are ∼0.6 dB, ∼0.3 dB and ∼0.18 dB, respectively, proposed method has faster adaption to main task. For low order modulation formats, the proposed method almost reduces the error to 0. Meanwhile, we reveal the degree of deviation between the prediction and target and find that the deviation is mainly concentrated in the high OSNR range of 25-40 dB. Specifically, we investigate the variation curve of adaptive weights during pretraining and conclude that after 30 epoch, the model's attention was almost entirely focused on estimating OSNR. In addition, we study the generalization ability of the model by varying the transmission distance. Importantly, excellent generalization is also experimentally verified. In this paper, the method proposed will greatly reduce the cost for repetitively collecting data and the training resources required for fine-tuning models when OPM devices need to be deployed at massive nodes in dynamic optical networks.
Funder
Education Department of Guangxi Zhuang Autonomous Region
Subject
Atomic and Molecular Physics, and Optics
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献