1. Alex Graves, Santiago Fernández, Faustino J. Gomez, and Jürgen Schmidhuber. 2006. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In ICML Vol. 148. 369–376.
2. Anmol Gulati, James Qin, Chung-Cheng Chiu, Niki Parmar, Yu Zhang, Jiahui Yu, Wei Han, Shibo Wang, Zhengdong Zhang, Yonghui Wu, and Ruoming Pang. 2020. Conformer: Convolution-augmented Transformer for Speech Recognition. In Interspeech. 5036–5040.
3. Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, and Yuekai Zhang. 2021. Recent Developments on ESPnet Toolkit Boosted by Conformer. In ICASSP. 5874–5878.
4. W. Hu, Y. Luo, J. Meng, Z. Qian, and Q. Huo. 2020. A Study of BPE-based Language Modeling for Open Vocabulary Latin Language OCR. In ICFHR. 133–138.
5. Jaehyeon Kim, Sungwon Kim, Jungil Kong, and Sungroh Yoon. 2020. Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search. In NeurIPS Vol. 33. 8067–8077.