1. Oliver Adams, Trevor Cohn, Graham Neubig, and Alexis Michaud. 2017. Phonemic Transcription of Low-Resource Tonal Languages. In Proceedings of the Australasian Language Technology Association Workshop 2017. Brisbane, Australia, 53–60. https://aclanthology.org/U17-1006
2. Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, and Michael Auli. 2020. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 12449–12460. https://proceedings.neurips.cc/paper_files/paper/2020/file/92d1e1eb1cd6f9fba3227870bb6d7f07-Paper.pdf
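3. Hui Bu, Jiayu Du, Xingyu Na, Bengu Wu, and Hao Zheng. 2017. AISHELL-1: An Open-Source Mandarin Speech Corpus and a Speech Recognition Baseline. In 2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA).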
4. Malgorzata Ćavar, Damir Ćavar, and Hilaria Cruz. 2016. Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16). European Language Resources Association (ELRA), Portorož, Slovenia, 4004–4011. https://aclanthology.org/L16-1632
5. Charles Chen, Razvan C. Bunescu, Li Xu, and Chang Liu. 2016. Tone Classification in Mandarin Chinese Using Convolutional Neural Networks. In Interspeech.