Adaptive data augmentation for mandarin automatic speech recognition-Reference-Cited by-同舟云学术

Adaptive data augmentation for mandarin automatic speech recognition

Published:2024-04 Issue:7 Volume:54 Page:5674-5687
ISSN:0924-669X
Container-title:Applied Intelligence
language:en
Short-container-title:Appl Intell

Author:

Ding Kai,Li Ruixuan,Xu Yuelin,Du Xingyue,Deng Bin

Funder

the foundation of Science and Technology on Near-Surface Detection Laboratory

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s10489-024-05381-6.pdf

Reference50 articles.

1. Tran VN, Huang C-E, Liu S-H, Aslam MS, Yang K-L, Li Y-H, Wang J-C (2023) Multi-view and multi-augmentation for self-supervised visual representation learning. Appl Intell 1–28. https://doi.org/10.1007/s10489-023-05163-6

2. Aydogan-Kilic D, Selcuk-Kestel AS (2023) Modification of hybrid rnn-hmm model in asset pricing: univariate and multivariate cases. Appl Intell 1–22. https://doi.org/10.1007/s10489-023-04762-7

3. Wu X, Tang B, Zhao M, Wang J, Guo Y (2023) Str transformer: a cross-domain transformer for scene text recognition. Appl Intell 53(3):3444–3458. https://doi.org/10.1007/s10489-022-03728-5

4. Amodei D, Ananthanarayanan S, Anubhai R, Bai J, Battenberg E, Case C, Casper J, Catanzaro B, Cheng Q, Chen G, Chen J, Chen J, Chen Z, Chrzanowski M, Coates A, Diamos G, Ding K, Du N, Elsen E, Engel J, Fang W, Fan L, Fougner C, Gao L, Gong C, Hannun A, Han T, Johannes L, Jiang B, Ju C, Jun B, LeGresley P, Lin L, Liu J, Liu Y, Li W, Li X, Ma D, Narang S, Ng A, Ozair S, Peng Y, Prenger R, Qian S, Quan Z, Raiman J, Rao V, Satheesh S, Seetapun D, Sengupta S, Srinet K, Sriram A, Tang H, Tang L, Wang C, Wang J, Wang K, Wang Y, Wang Z, Wang Z, Wu S, Wei L, Xiao B, Xie W, Xie Y, Yogatama D, Yuan B, Zhan J, Zhu Z (2016) Deep speech 2 : end-to-end speech recognition in english and mandarin. In: Proceedings of the 33rd international conference on machine learning, vol 48, pp 173–182. http://proceedings.mlr.press/v48/amodei16.pdf, https://proceedings.mlr.press/v48/amodei16.html

5. Park DS, Zhang Y, Chiu C-C, Chen Y, Li B, Chan W, Le QV, Wu Y (2020) Specaugment on large scale datasets. In: ICASSP 2020 - 2020 IEEE International conference on acoustics, speech and signal processing (ICASSP), pp 6879–6883. https://doi.org/10.1109/ICASSP40776.2020.9053205