Spoken‐to‐written text conversion for enhancement of Korean–English readability and machine translation-Reference-Cited by-同舟云学术

Spoken‐to‐written text conversion for enhancement of Korean–English readability and machine translation

Published:2024-02 Issue:1 Volume:46 Page:127-136
ISSN:1225-6463
Container-title:ETRI Journal
language:en
Short-container-title:ETRI Journal

Author:

Choi HyunJung¹^ORCID,Choi Muyeol²,Kim Seonhui¹,Lim Yohan¹^ORCID,Lee Minkyu²,Yun Seung²,Kim Donghyun²^ORCID,Kim Sang Hun²

Affiliation:

1. Department of Artificial Intelligence University of Science and Technology Daejeon Republic of Korea

2. Integrated Intelligence Research Section Electronics and Telecommunications Research Institute Daejeon Republic of Korea

Abstract

AbstractThe Korean language has written (formal) and spoken (phonetic) forms that differ in their application, which can lead to confusion, especially when dealing with numbers and embedded Western words and phrases. This fact makes it difficult to automate Korean speech recognition models due to the need for a complete transcription training dataset. Because such datasets are frequently constructed using broadcast audio and their accompanying transcriptions, they do not follow a discrete rule‐based matching pattern. Furthermore, these mismatches are exacerbated over time due to changing tacit policies. To mitigate this problem, we introduce a data‐driven Korean spoken‐to‐written transcription conversion technique that enhances the automatic conversion of numbers and Western phrases to improve automatic translation model performance.

Funder

Electronics and Telecommunications Research Institute

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.4218/etrij.2023-0354

Reference23 articles.

1. Automatic Construction of a Large-Scale Speech Recognition Database Using Multi-Genre Broadcast Data with Inaccurate Subtitle Timestamps

2. English–Korean speech translation corpus (EnKoST‐C): Construction procedure and evaluation results

3. Number Normalization in Korean Using the Transformer Model

4. An end-to-end synthesis method for Korean text-to-speech systems

5. M.Sunkara C.Shivade S.Bodapati andK.Kirchhoff Neural inverse text normalization (ICASSP 2021‐2021 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) Toronto Canada) 2021 pp.7573–7577.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Special issue on speech and language AI technologies;ETRI Journal;2024-02