1. Noulas, A., Englebienne, G., Kröse, B.J.A.: Multimodal Speaker Diarization. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(1), 79–93 (2012)
2. Sinha, R., Tranter, S.E., Gales, M.J.F., Woodland, P.C.: The Cambridge March 2005 University speaker diarisation system. In: Proceedings of the European Conference on Speech Communication and Technology, pp. 2437–2440 (2005)
3. Tsiaras, V., Panagiotakis, C., Stylianou, Y.: Video and audio based detection of filled hesitation pauses in classroom lectures. In: Proceedings of the 17th European Signal Processing Conference (EUSIPCO 2009), Glasgow, Scotland, pp. 834–838 (2009)
4. Garau, G., Dielmann, A., Bourlard, H.: Audio and Visual Synchronisation for Speaker Diarisation. In: Proceedings of International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, pp. 2654–2657 (2010)
5. Friedland, G., Hung, H., Yeo, C.: Multi-Modal Speaker Diarization of Real-World Meetings using Compressed Domain Video Features. In: Proceedings ICASSP, pp. 4069–4072 (2009)