1. Adami, A. G., & Hermansky, H. (2003). Segmentation of speech for speaker and language recognition. In Proceedings of Eurospeech, Geneva (pp. 841–844).
2. Adami, A. G., Mihaescu, R., Reynolds, D. A., & Godfrey, J. J. (2003). Modeling prosodic dynamics for speaker recognition. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, Hong Kong, China (Vol. 4, pp. 788–791).
3. Ann, T.-G., & Hutchins, S. E. (1996). On using prosodic cues in automatic language identification. In Proceedings of International Conference on Spoken Language Processing, Philadelphia, PA, USA (Vol. 3, pp. 1768–1772).
4. Bates, R. A., & Ostendorfy, M. (2002). Modeling pronunciation variation in conversational speech using prosody. In Proceedings of ISCA Tutorial and Research Workshop on Pronunciation Modeling and Lexical Access (pp. 42–47).
5. Busso, C., Lee, S., & Narayanan, S. (2009). Analysis of emotionally salient aspects of fundamental frequency for emotion detection. IEEE Transactions on Audio, Speech, and Language Processing, 17(4), 582–596.