Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research

Published: 2022-12
Container-title: European Conference on Visual Media Production

Author:

Berghi Davide¹, Volino Marco¹, Jackson Philip J. B.¹

Affiliation:

1. CVSSP, University of Surrey, UK

Funder

Innovate UK

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3565516.3565522

References (40 articles)

1. Self-supervised object detection from audio-visual correspondence

2. RAVEL: an annotated corpus for training robots with audiovisual abilities

3. R. Arandjelovic and A. Zisserman. 2017. Look, Listen and Learn. In IEEE/CVF Inter. Conf. on Computer Vision. 609–617.

4. The CAVA corpus

5. Davide Berghi, Adrian Hilton, and Philip J. B. Jackson. 2021. Visually Supervised Speaker Detection and Localization via Microphone Array. In IEEE 23rd Inter. Workshop on Multimedia Signal Processing.

Cited by 2 articles.

1. Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2024.

2. Audio Inputs for Active Speaker Detection and Localization Via Microphone Array. 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). 2023-10-22.