1. Zhu, J.-Y., Park, T., Isola, P. & Efros, A.A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In 2017 IEEE Int. Conference on Computer Vision (ICCV) 2223–2232 (IEEE, 2017).
2. Oord, A. V. D. et al. Wavenet: A generative model for raw audio. Preprint at https://arxiv.org/abs/1609.03499 (2016).
3. Wu, Y. et al. Google’s neural machine translation system: Bridging the gap between human and machine translation. Preprint at https://arxiv.org/abs/1609.08144 (2016).
4. Johnson, J., Alahi, A. & Li, F.-F. Perceptual losses for real-time style transfer and super-resolution. In Proc. European Conference on Computer Vision 694–711 (Springer, 2016).
5. He, Y. et al. Streaming end-to-end speech recognition for mobile devices. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6381–6385 (IEEE, 2019).