Cross-Lingual Self-training to Learn Multilingual Representation for Low-Resource Speech Recognition-Reference-Cited by-同舟云学术

Cross-Lingual Self-training to Learn Multilingual Representation for Low-Resource Speech Recognition

Published:2022-07-23 Issue:12 Volume:41 Page:6827-6843
ISSN:0278-081X
Container-title:Circuits, Systems, and Signal Processing
language:en
Short-container-title:Circuits Syst Signal Process

Author:

Zhang Zi-Qiang,Song Yan,Wu Ming-Hui,Fang Xin,McLoughlin Ian,Dai Li-Rong

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Signal Processing

Link

https://link.springer.com/content/pdf/10.1007/s00034-022-02075-7.pdf

Reference54 articles.

1. R. Ardila, M. Branson, K. Davis, M. Kohler, J. Meyer, M. Henretty, R. Morais, L. Saunders, F. Tyers, G. Weber, Common voice: a massively-multilingual speech corpus, in Proceedings of the 12th Language Resources and Evaluation Conference, European Language Resources Association (2020), pp. 4218–4222

2. A. Baevski, S. Schneider, M. Auli, vq-wav2vec: Self-supervised learning of discrete speech representations, in International Conference on Learning Representations (2020)

3. A. Baevski, Y. Zhou, A. Mohamed, M. Auli, wav2vec 2.0: a framework for self-supervised learning of speech representations. Adv. Neural Inf. Process. Syst. 33, 12449–12460 (2020)

4. L. Besacier, E. Barnard, A. Karpov, T. Schultz, Automatic speech recognition for under-resourced languages: a survey. Speech Commun. 56, 85–100 (2014). https://doi.org/10.1016/j.specom.2013.07.008

5. T. Chen, S. Kornblith, M. Norouzi, G. Hinton, A simple framework for contrastive learning of visual representations, in Proceedings of the 37th International Conference on Machine Learning, ed. by H.D.A. Singh III, vol. 119 (PMLR, 2020), pp. 1597–1607

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. CAM: A cross-lingual adaptation framework for low-resource language speech recognition;Information Fusion;2024-11

2. Forensics Method of Dialects Combine with ASR Post-Processing Correction;2023 IEEE 6th International Conference on Pattern Recognition and Artificial Intelligence (PRAI);2023-08-18

3. Robust Data2VEC: Noise-Robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04

4. A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers;Aerospace;2023-05-22

5. M2ASR-KIRGHIZ: A Free Kirghiz Speech Database and Accompanied Baselines;Information;2023-01-16