1. Hierarchical generative modeling for controllable speech synthesis;hsu,2018
2. Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition;zhang,2020
3. SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
4. Conformer: Convolution-augmented Transformer for Speech Recognition
5. Group normalization;wu;Proceedings of the European Conference on Computer Vision (ECCV),2018