Multimodal speaker clustering in full length movies

Published: 2016-01-11 Issue: 2 Volume: 76 Page: 2223-2242
ISSN: 1380-7501
Container-title: Multimedia Tools and Applications
Language: en
Short-container-title: Multimed Tools Appl

Authors

Kapsouras I., Tefas A., Nikolaidis N., Peeters G., Benaroya L., Pitas I.

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications, Hardware and Architecture, Media Technology, Software

Link

http://link.springer.com/article/10.1007/s11042-015-3181-5/fulltext.html

References: 33 articles.

1. Alameda-Pineda X, Yan Y, Ricci E, Lanz O, Sebe N (2015) Analyzing free-standing conversational groups: a multimodal approach. In: Proceedings of the 23rd ACM international conference on multimedia, MM ’15. ACM, New York, pp 5–14

2. Asthana A, Zafeiriou S, Cheng S, Pantic M (2013) Robust discriminative response map fitting with constrained local models. In: Proceedings of 2013 IEEE conference on computer vision and pattern recognition (CVPR), pp 3444–3451

3. Baltzakis H, Argyros A, Lourakis M, Trahanias P (2008) Tracking of human hands and faces through probabilistic fusion of multiple visual cues. In: Proceedings of the 6th international conference on computer vision systems, ICVS’08. Springer, Berlin, Heidelberg, pp 33–42

4. Calic J, Campbell N, Dasiopoulou S, Kompatsiaris Y (2005) A survey on multimodal video representation for semantic retrieval. In: The international conference on computer as a tool, 2005. EUROCON 2005, vol 1, pp 135–138

5. Carletta J (2006) Announcing the AMI meeting corpus. The ELRA Newsletter 1(1):3–5

Cited by 11 articles.

1. AVA-AVD: Audio-visual Speaker Diarization in the Wild;Proceedings of the 30th ACM International Conference on Multimedia;2022-10-10

2. DyViSE: Dynamic Vision-Guided Speaker Embedding for Audio-Visual Speaker Diarization;2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP);2022-09-26

3. End-to-End Audio-Visual Neural Speaker Diarization;Interspeech 2022;2022-09-18

4. Robust Character Labeling in Movie Videos: Data Resources and Self-supervised Feature Adaptation;IEEE Transactions on Multimedia;2021

5. End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors;Interspeech 2020;2020-10-25