Funder
Fondazione Cassa Di Risparmio Di Trento E Rovereto
NTT Communication Science Laboratories
Subject
Human-Computer Interaction,Theoretical Computer Science,Software
Reference111 articles.
1. Speaker diarization: A review of recent research;Anguera;IEEE/ACM Trans. Audio Speech Lang. Process.,2012
2. Speaker recognition based on deep learning: An overview;Bai;Neural Netw.,2021
3. Boeddeker, C., Heitkaemper, J., Schmalenstroeer, J., Drude, L., Heymann, J., Haeb-Umbach, R., 2018. Front-end processing for the CHiME-5 dinner party scenario. In: Proc. of CHiME-5 Workshop on Speech Processing in Everyday Environments. pp. 35–40.
4. TristouNet: triplet loss for speaker turn embedding;Bredin,2017
5. Bredin, H., Laurent, A., 2021. End-to-end speaker segmentation for overlap-aware resegmentation. In: Proc. of Interspeech. pp. 3111–3115.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献