Robust Data Augmentation for Neural Machine Translation through EVALNET

Published:2022-12-27 Issue:1 Volume:11 Page:123
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Park Yo-Han, Choi Yong-Seok, Yun Seung, Kim Sang-Hun, Lee Kong-Joo

Abstract

Since building Neural Machine Translation (NMT) systems requires a large parallel corpus, various data augmentation techniques have been adopted, especially for low-resource languages. In order to achieve the best performance through data augmentation, the NMT systems should be able to evaluate the quality of augmented data. Several studies have addressed data weighting techniques to assess data quality. The basic idea of data weighting adopted in previous studies is the loss value that a system calculates when learning from training data. The weight derived from the loss value of the data, through simple heuristic rules or neural models, can adjust the loss used in the next step of the learning process. In this study, we propose EvalNet, a data evaluation network, to assess parallel data of NMT. EvalNet exploits a loss value, a cross-attention map, and a semantic similarity between parallel data as its features. The cross-attention map is an encoded representation of cross-attention layers of Transformer, which is a base architecture of an NMT system. The semantic similarity is a cosine distance between two semantic embeddings of a source sentence and a target sentence. Owing to the parallelism of data, the combination of the cross-attention map and the semantic similarity proved to be effective features for data quality evaluation, besides the loss value. EvalNet is the first NMT data evaluator network that introduces the cross-attention map and the semantic similarity as its features. Through various experiments, we conclude that EvalNet is simple yet beneficial for robust training of an NMT system and outperforms the previous studies as a data evaluator.
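Two of the quantities named in the abstract can be illustrated with a small sketch: the cosine-based semantic similarity between a source and a target sentence embedding, and the use of per-example weights to adjust a training loss. This is not the EvalNet architecture. The function names, the toy embeddings, and the plain weighted mean are assumptions made for illustration; in the paper the weights come from a learned network that also uses the cross-attention map.

```python
import math
from typing import Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two embedding vectors of equal length."""
    if len(u) != len(v):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors (assumption)
    return dot / (norm_u * norm_v)


def cosine_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine distance, defined here as 1 minus cosine similarity."""
    return 1.0 - cosine_similarity(u, v)


def weighted_loss(losses: Sequence[float], weights: Sequence[float]) -> float:
    """Mean of per-example losses, each scaled by its weight.

    Hypothetical aggregation: a low weight shrinks the contribution of a
    sentence pair that an evaluator judged to be low quality.
    """
    if len(losses) != len(weights):
        raise ValueError("need exactly one weight per loss value")
    if not losses:
        raise ValueError("empty batch")
    return sum(w * l for w, l in zip(weights, losses)) / len(losses)


if __name__ == "__main__":
    # Toy sentence embeddings (made-up values, not from any real encoder).
    source_embedding = [0.2, 0.7, 0.1]
    target_embedding = [0.25, 0.65, 0.05]
    print("similarity:", cosine_similarity(source_embedding, target_embedding))

    # Per-example losses and evaluator weights for a batch of three pairs.
    print("weighted loss:", weighted_loss([2.1, 0.8, 5.3], [0.9, 1.0, 0.1]))
```

In this sketch the third sentence pair has a high loss but a weight of 0.1, so it contributes little to the batch loss, which is the general effect data weighting aims for.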

Funder

Electronics and Telecommunications Research Institute

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/1/123/pdf

References (18 articles)

1. Wei, J., and Zou, K. (2019, November 3–7). EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Hong Kong, China.

2. Sennrich, R., Haddow, B., and Birch, A. (2016, August 7–12). Improving Neural Machine Translation Models with Monolingual Data. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany.

3. Shu, J., Xie, Q., Yi, L., Zhao, Q., Zhou, S., Xu, Z., and Meng, D. (2019, December 8–14). Meta-Weight-Net: Learning an Explicit Mapping for Sample Weighting. Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada.

4. Jiang, L., Zhou, Z., Leung, T., Li, L.-J., and Fei-Fei, L. (2018, July 10–15). MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels. Proceedings of the 35th International Conference on Machine Learning (PMLR), Stockholm, Sweden.

5. Ren, M., Zeng, W., Yang, B., and Urtasun, R. (2018, July 10–15). Learning to Reweight Examples for Robust Deep Learning. Proceedings of the 35th International Conference on Machine Learning (PMLR), Stockholm, Sweden.

Cited by 3 articles.

1. Neural Machine Translation of Electrical Engineering with Fusion of Memory Information;Applied Sciences;2023-09-13

2. Neural Machine Translation of Electrical Engineering Based on Integrated Convolutional Neural Networks;Electronics;2023-08-25

3. Low-Resource Neural Machine Translation: A Systematic Literature Review;IEEE Access;2023