Agreement on Target-Bidirectional Recurrent Neural Networks for Sequence-to-Sequence Learning-Reference-Cited by-同舟云学术

Agreement on Target-Bidirectional Recurrent Neural Networks for Sequence-to-Sequence Learning

Published:2020-03-19 Issue: Volume:67 Page:581-606
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Liu Lemao,Finch Andrew,Utiyama Masao,Sumita Eiichiro

Abstract

Recurrent neural networks are extremely appealing for sequence-to-sequence learning tasks. Despite their great success, they typically suffer from a shortcoming: they are prone to generate unbalanced targets with good prefixes but bad suffixes, and thus performance suffers when dealing with long sequences. We propose a simple yet effective approach to overcome this shortcoming. Our approach relies on the agreement between a pair of target-directional RNNs, which generates more balanced targets. In addition, we develop two efficient approximate search methods for agreement that are empirically shown to be almost optimal in terms of either sequence level or non-sequence level metrics. Extensive experiments were performed on three standard sequence-to-sequence transduction tasks: machine transliteration, grapheme-to-phoneme transformation and machine translation. The results show that the proposed approach achieves consistent and substantial improvements, compared to many state-of-the-art systems.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Intelligent Parking Control Method Based on Multi-Source Sensory Information Fusion and End-to-End Deep Learning;Applied Sciences;2023-04-16

2. Harmonic Noise-Tolerant ZNN for Dynamic Matrix Pseudoinversion and Its Application to Robot Manipulator;Frontiers in Neurorobotics;2022-06-13

3. Recurrent Neural Network Techniques: Emphasis on Use in Neural Machine Translation;Informatica;2021-12-23

4. Electricity Theft Detection in Power Consumption Data Based on Adaptive Tuning Recurrent Neural Network;Frontiers in Energy Research;2021-11-10

5. Attending From Foresight: A Novel Attention Mechanism for Neural Machine Translation;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2021