Research on automatic pilot repetition generation method based on deep reinforcement learning

Published:2023-10-11 Volume:17
ISSN:1662-5218
Container-title:Frontiers in Neurorobotics
Short-container-title:Front. Neurorobot.

Author:

Pan Weijun, Jiang Peiyuan, Li Yukun, Wang Zhuang, Huang Junxiang

Abstract

Using computers to replace pilot seats in air traffic control (ATC) simulators is an effective way to improve controller training efficiency and reduce training costs. To achieve this, we propose a deep reinforcement learning model, RoBERTa-RL (RoBERTa with Reinforcement Learning), for generating pilot repetitions. RoBERTa-RL is based on the pre-trained language model RoBERTa and is optimized through transfer learning and reinforcement learning. Transfer learning addresses the scarcity of data in the ATC domain, while reinforcement learning is used to optimize the RoBERTa model and overcome the limits on generalization caused by transfer learning. We selected a real-world area control dataset as the training and testing dataset for the target task, and a tower control dataset generated from civil aviation radio land-air communication rules as the test dataset for evaluating model generalization. On the area control dataset, RoBERTa-RL achieved ROUGE-1, ROUGE-2, and ROUGE-L scores of 0.9962, 0.992, and 0.996, respectively; on the tower control dataset, the scores were 0.982, 0.954, and 0.982, respectively. To overcome the limitations of ROUGE in this field, we also evaluated the proposed model architecture using keyword-based criteria for the generated repetition instructions, which compute keyword-based metrics from the segmented text of each repetition instruction. Under these criteria, the model achieved an overall accuracy of 98.8% on the area control dataset and 81.8% on the tower control dataset. In terms of generalization, RoBERTa-RL improved accuracy by 56% compared to the model before improvement and by 47.5% compared to various comparative models. These results indicate that employing reinforcement learning strategies to enhance deep learning algorithms can effectively mitigate the issue of poor generalization in text generation tasks, and this approach holds promise for future application in other related domains.
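
The ROUGE scores reported in the abstract can be illustrated with a minimal sketch. It assumes whitespace tokenization and the F1 variant of ROUGE-N and ROUGE-L; the paper's exact tokenizer and ROUGE configuration are not stated here, and the instruction strings below are hypothetical examples, not taken from the paper's datasets.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the multiset of n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1(overlap, cand_total, ref_total):
    """Harmonic mean of precision and recall; 0.0 when nothing overlaps."""
    if overlap == 0:
        return 0.0
    precision = overlap / cand_total
    recall = overlap / ref_total
    return 2 * precision * recall / (precision + recall)


def rouge_n(candidate, reference, n):
    """ROUGE-N F1 based on clipped n-gram overlap."""
    cand = ngrams(candidate.split(), n)
    ref = ngrams(reference.split(), n)
    overlap = sum((cand & ref).values())  # Counter intersection clips counts
    return f1(overlap, sum(cand.values()), sum(ref.values()))


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference):
    """ROUGE-L F1 based on the longest common subsequence."""
    c, r = candidate.split(), reference.split()
    return f1(lcs_length(c, r), len(c), len(r))


# Hypothetical readback pair: the candidate gets one digit of the flight level wrong.
reference = "climb to flight level three two zero air china four one two"
candidate = "climb to flight level three zero zero air china four one two"

print(round(rouge_n(candidate, reference, 1), 3))
print(round(rouge_n(candidate, reference, 2), 3))
print(round(rouge_l(candidate, reference), 3))
```

In this made-up pair, ROUGE-1 and ROUGE-L both come out at 11/12 even though the flight level in the candidate is wrong. Overlap metrics can stay high when a single safety-critical keyword is incorrect, which is consistent with the abstract's motivation for a keyword-based evaluation.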

Publisher

Frontiers Media SA

Subject

Artificial Intelligence, Biomedical Engineering

Reference: 39 articles.

1. Fine-tuning GPT-3 for Russian text summarization; Alexandr, 2021

2. Generating e-commerce product titles and predicting their quality; de Souza, 2018

3. BERT: pre-training of deep bidirectional transformers for language understanding; Devlin; arXiv preprint arXiv:1810.04805, 2018

4. The development, evaluation and application of an aviation radiotelephony specialised technical vocabulary list; Drayton; English for Specific Purposes, 2023

5. BERT fine-tuning for Arabic text summarization; Elmadani; arXiv preprint arXiv:2004.14135, 2020

Cited by 3 articles.

1. Instruction Fine-tuning and LoRA Combined Approach for Optimizing Large Language Models; Journal of Society of Korea Industrial and Systems Engineering; 2024-06-30

2. SLKIR: A framework for extracting key information from air traffic control instructions using small sample learning; Scientific Reports; 2024-04-29

3. Assessment and analysis of accents in air traffic control speech: a fusion of deep learning and information theory; Frontiers in Neurorobotics; 2024-03-05