1. 1. AlBadawy, E.A., Gibiansky, A., He, Q., Wu, J., Chang, M.C., Lyu, S.: Vocbench: A neural vocoder benchmark for speech synthesis. In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp. 881-885. IEEE (2022)
2. 2. Ardila, R., Branson, M., Davis, K., Kohler, M., Meyer, J., Henretty, M., Morais, R., Saunders, L., Tyers, F., Weber, G.: Common voice: A massively-multilingual speech corpus. In: Proceedings of the Twelfth Language Resources and Evaluation Conference. pp. 4218-4222 (2020)
3. 3. Barker, J., Watanabe, S., Vincent, E., Trmal, J.: The fifth'chime'speech separation and recognition challenge: Dataset, task and baselines. In: Interspeech 2018-19th Annual Conference of the International Speech Communication Association (2018)
4. 4. Cosentino, J., Pariente, M., Cornell, S., Deleforge, A., Vincent, E.: Librimix: An opensource dataset for generalizable speech separation (2020)
5. 5. Fonseca, E., Favory, X., Pons, J., Font, F., Serra, X.: Fsd50k: an open dataset of humanlabeled sound events. IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, pp. 829-852 (2021)