1. Tran VN, Huang C-E, Liu S-H, Aslam MS, Yang K-L, Li Y-H, Wang J-C (2023) Multi-view and multi-augmentation for self-supervised visual representation learning. Appl Intell 1–28. https://doi.org/10.1007/s10489-023-05163-6
2. Aydogan-Kilic D, Selcuk-Kestel AS (2023) Modification of hybrid rnn-hmm model in asset pricing: univariate and multivariate cases. Appl Intell 1–22. https://doi.org/10.1007/s10489-023-04762-7
3. Wu X, Tang B, Zhao M, Wang J, Guo Y (2023) Str transformer: a cross-domain transformer for scene text recognition. Appl Intell 53(3):3444–3458. https://doi.org/10.1007/s10489-022-03728-5
4. Amodei D, Ananthanarayanan S, Anubhai R, Bai J, Battenberg E, Case C, Casper J, Catanzaro B, Cheng Q, Chen G, Chen J, Chen J, Chen Z, Chrzanowski M, Coates A, Diamos G, Ding K, Du N, Elsen E, Engel J, Fang W, Fan L, Fougner C, Gao L, Gong C, Hannun A, Han T, Johannes L, Jiang B, Ju C, Jun B, LeGresley P, Lin L, Liu J, Liu Y, Li W, Li X, Ma D, Narang S, Ng A, Ozair S, Peng Y, Prenger R, Qian S, Quan Z, Raiman J, Rao V, Satheesh S, Seetapun D, Sengupta S, Srinet K, Sriram A, Tang H, Tang L, Wang C, Wang J, Wang K, Wang Y, Wang Z, Wang Z, Wu S, Wei L, Xiao B, Xie W, Xie Y, Yogatama D, Yuan B, Zhan J, Zhu Z (2016) Deep speech 2 : end-to-end speech recognition in english and mandarin. In: Proceedings of the 33rd international conference on machine learning, vol 48, pp 173–182. http://proceedings.mlr.press/v48/amodei16.pdf, https://proceedings.mlr.press/v48/amodei16.html
5. Park DS, Zhang Y, Chiu C-C, Chen Y, Li B, Chan W, Le QV, Wu Y (2020) Specaugment on large scale datasets. In: ICASSP 2020 - 2020 IEEE International conference on acoustics, speech and signal processing (ICASSP), pp 6879–6883. https://doi.org/10.1109/ICASSP40776.2020.9053205