Author:
Saon George,Tuske Zoltan,Audhkhasi Kartik
Cited by
41 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
2. Chunked Attention-Based Encoder-Decoder Model for Streaming Speech Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. End-to-End Speech Recognition: A Survey;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
4. A Token-Wise Beam Search Algorithm for RNN-T;2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU);2023-12-16
5. A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition;2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD);2023-10-25