1. PaddlePaddle Authors. 2021. PaddleSpeech a toolkit for audio processing based on PaddlePaddle. https://github.com/PaddlePaddle/DeepSpeech. PaddlePaddle Authors. 2021. PaddleSpeech a toolkit for audio processing based on PaddlePaddle. https://github.com/PaddlePaddle/DeepSpeech.
2. Yoshua Bengio , Jé rô me Louradour, Ronan Collobert, and Jason Weston. 2009. Curriculum learning . In ICML 2009, Montreal, Quebec, Canada (ACM International Conference Proceeding Series , Vol. 382), , Andrea Pohoreckyj Danyluk, Lé on Bottou, and Michael L . Littman (Eds.). ACM, 41--48. https://doi.org/10.1145/1553374.1553380 Yoshua Bengio, Jé rô me Louradour, Ronan Collobert, and Jason Weston. 2009. Curriculum learning. In ICML 2009, Montreal, Quebec, Canada (ACM International Conference Proceeding Series, Vol. 382), , Andrea Pohoreckyj Danyluk, Lé on Bottou, and Michael L. Littman (Eds.). ACM, 41--48. https://doi.org/10.1145/1553374.1553380
3. Angie W. Boggust , Kartik Audhkhasi , Dhiraj Joshi , David Harwath , Samuel Thomas , Rogé rio Schmidt Feris , Danny Gutfreund , Yang Zhang , Antonio Torralba , Michael Picheny , and James R. Glass . 2019 . Grounding Spoken Words in Unlabeled Video. In CVPR Workshops 2019 , Long Beach, CA, USA. Computer Vision Foundation / IEEE, 29--32. Angie W. Boggust, Kartik Audhkhasi, Dhiraj Joshi, David Harwath, Samuel Thomas, Rogé rio Schmidt Feris, Danny Gutfreund, Yang Zhang, Antonio Torralba, Michael Picheny, and James R. Glass. 2019. Grounding Spoken Words in Unlabeled Video. In CVPR Workshops 2019, Long Beach, CA, USA. Computer Vision Foundation / IEEE, 29--32.
4. Jingyuan Chen , Lin Ma , Xinpeng Chen , Zequn Jie , and Jiebo Luo . 2019. Localizing Natural Language in Videos . In AAAI. AAAI Press , 8175--8182. Jingyuan Chen, Lin Ma, Xinpeng Chen, Zequn Jie, and Jiebo Luo. 2019. Localizing Natural Language in Videos. In AAAI. AAAI Press, 8175--8182.
5. Nuo Chen , Chenyu You , and Yuexian Zou . 2021. Self-supervised dialogue learning for spoken conversational question answering. arXiv preprint arXiv:2106.02182 ( 2021 ). Nuo Chen, Chenyu You, and Yuexian Zou. 2021. Self-supervised dialogue learning for spoken conversational question answering. arXiv preprint arXiv:2106.02182 (2021).