Real-Time Automatic Translation Algorithm for Chinese Subtitles in Media Playback Using Knowledge Base-Reference-Cited by-同舟云学术

Real-Time Automatic Translation Algorithm for Chinese Subtitles in Media Playback Using Knowledge Base

Published:2022-06-18 Issue: Volume:2022 Page:1-11
ISSN:1875-905X
Container-title:Mobile Information Systems
language:en
Short-container-title:Mobile Information Systems

Author:

Yan Li¹^ORCID

Affiliation:

1. Foreign Languages Department, Xinjiang Teacher’s College, Urumqi 830000, China

Abstract

Currently, speech technology allows for simultaneous subtitling of live television programs using speech recognition and the respeaking approach. Although many previous studies on the quality of live subtitling utilizing voice recognition have been proposed, little attention has been paid to the quantitative elements of subtitles. Due to the high performance of neural machine translation (NMT), it has become the standard machine translation method. A data-driven translation approach requires high-quality, large-scale training data and powerful computing resources to achieve good performance. However, data-driven translation will face challenges when translating languages with limited resources. This paper’s research work focuses on how to integrate linguistic knowledge into the NMT model to improve the translation performance and quality of the NMT system. A method of integrating semantic concept information in the NMT system is proposed to address the problem of out-of-set words and low-frequency terms in the NMT system. This research also provides an NMT-centered read modeling and decoding approach integrating an external knowledge base. The experimental results show that the proposed strategy can effectively increase the MT system’s translation performance.

Publisher

Hindawi Limited

Subject

Computer Networks and Communications,Computer Science Applications

Link

http://downloads.hindawi.com/journals/misy/2022/5245035.pdf

Reference30 articles.

1. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups

2. ImageNet classification with deep convolutional neural networks

3. Scaling Conditional Random Field with Application to Chinese Word Segmentation

4. Syntax augmented machine translation via chart parsing

5. Moses: open source toolkit for statistical machine translation;P. Koehn