1. Snyder, D., Garcia-Romero, D., Sell, G., Povey, D., Khudanpur, S.: X-vectors: robust DNN embeddings for speaker recognition. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), pp. 5329–5333 (2018)
2. Jin, M., Yoo, C.D.: Speaker verification and identification. In: Behavioral Biometrics for Human Identification: Intelligent Applications, pp. 264–289 (2009). https://doi.org/10.4018/978-1-60566-725-6.ch013
3. Karaali, O., Corrigan, G., Gerson, I., Massey, N.: Text-to-speech conversion with neural networks: a recurrent TDNN approach. In: Proceedings of Eurospeech 1997, Rhodes, Greece, pp. 561–564 (1997)
4. Peddinti, V., Povey, D., Khudanpur, S.: A time delay neural network architecture for efficient modeling of long temporal contexts. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, vol. 2015-Janua, pp. 2–6 (2015)
5. Liu, B., Zhang, W., Xu, X., Chen, D.: Time delay recurrent neural network for speech recognition. J. Phys. Conf. Ser. 1229(1) (2019). https://doi.org/10.1088/1742-6596/1229/1/012078