Neural Machine Translation with Gumbel-Greedy Decoding-Reference-Cited by-同舟云学术

Neural Machine Translation with Gumbel-Greedy Decoding

Published:2018-04-27 Issue:1 Volume:32 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Gu Jiatao,Im Daniel Jiwoong,Li Victor O.K.

Abstract

Previous neural machine translation models used some heuristic search algorithms (e.g., beam search) in order to avoid solving the maximum a posteriori problem over translation sentences at test phase. In this paper, we propose the \textit{Gumbel-Greedy Decoding} which trains a generative network to predict translation under a trained model. We solve such a problem using the Gumbel-Softmax reparameterization, which makes our generative network differentiable and trainable through standard stochastic gradient methods. We empirically demonstrate that our proposed model is effective for generating sequences of discrete words.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimal design of frame structures with mixed categorical and continuous design variables using the Gumbel–Softmax method;Structural and Multidisciplinary Optimization;2024-02-24

2. Sequential visual and semantic consistency for semi-supervised text recognition;Pattern Recognition Letters;2024-02

3. A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-02-01

4. When Pairs Meet Triplets: Improving Low-Resource Captioning via Multi-Objective Optimization;ACM Transactions on Multimedia Computing, Communications, and Applications;2022-03-04

5. Cross-modal Representation Learning for Understanding Manufacturing Procedure;Lecture Notes in Computer Science;2022