NAT4AT: Using Non-Autoregressive Translation Makes Autoregressive Translation Faster and Better

Published: 2024-05-13
Page: 4181-4192
Container-title: Proceedings of the ACM Web Conference 2024

Author:

Zheng Huanran¹, Zhu Wei¹, Wang Xiaoling¹

Affiliation:

1. East China Normal University, Shanghai, China

Funder

NSFC grant

Shanghai Trusted Industry Internet Software Collaborative Innovation Center

National Key R&D Program of China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3589334.3645527

Reference: 44 articles.

1. Yu Bao, Hao Zhou, Shujian Huang, Dongqi Wang, Lihua Qian, Xinyu Dai, Jiajun Chen, and Lei Li. 2022. GLAT: Glancing at Latent Variables for Parallel Text Generation. In ACL.

2. Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, T. J. Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeff Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. arXiv preprint arXiv:2005.14165 (2020).

3. Cunxiao Du, Zhaopeng Tu, and Jing Jiang. 2021. Order-agnostic cross entropy for non-autoregressive machine translation. arXiv preprint arXiv:2106.05093 (2021).

4. Learning to Rewrite for Non-Autoregressive Neural Machine Translation

5. Marjan Ghazvininejad, Omer Levy, Yinhan Liu, and Luke Zettlemoyer. 2019. Mask-predict: Parallel decoding of conditional masked language models. arXiv preprint arXiv:1904.09324 (2019).