Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models-Reference-Cited by-同舟云学术

Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models

Published:2021 Issue: Volume: Page:572-585
ISSN:0302-9743
Container-title:Intelligent Systems
language:
Short-container-title:

Author:

Casanova Edresson^ORCID,Candido Junior Arnaldo^ORCID,Shulby Christopher^ORCID,de Oliveira Frederico Santos^ORCID,Gris Lucas Rafael Stefanel^ORCID,da Silva Hamilton Pereira^ORCID,Aluísio Sandra Maria^ORCID,Ponti Moacir Antonelli^ORCID

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-030-91699-2_39

Reference37 articles.

1. Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467 (2016)

2. Ardila, R., et al.: Common voice: a massively-multilingual speech corpus. In: Proceedings of the 12th Language Resources and Evaluation Conference, pp. 4218–4222 (2020)

3. Arik, S., Chen, J., Peng, K., Ping, W., Zhou, Y.: Neural voice cloning with a few samples. In: Advances in Neural Information Processing Systems, pp. 10019–10029 (2018)

4. Bowater, R.J., Porter, L.L.: Voice recognition of telephone conversations. US Patent 6,278,772 (21 August 2001)

5. Bredin, H.: TristouNet: triplet loss for speaker turn embedding. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5430–5434. IEEE (2017)

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. End-to-End Thai Text-to-Speech with Linguistic Unit;Proceedings of the 2024 International Conference on Multimedia Retrieval;2024-05-30