Look Once to Hear: Target Speech Hearing with Noisy Examples-Reference-Cited by-同舟云学术

Look Once to Hear: Target Speech Hearing with Noisy Examples

Published:2024-05-11 Issue: Volume: Page:1-16
ISSN:
Container-title:Proceedings of the CHI Conference on Human Factors in Computing Systems
language:
Short-container-title:

Author:

Veluri Bandhav¹^ORCID,Itani Malek²^ORCID,Chen Tuochao²^ORCID,Yoshioka Takuya³^ORCID,Gollakota Shyamnath²^ORCID

Affiliation:

1. Paul G. Allen Center for Computer Science & Engineering, University of Washington, United States

2. Paul G. Allen Center for Computer Science and Engineering, University of Washington, United States

3. AssemblyAI, United States and Microsoft, United States

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3613904.3642057

Reference72 articles.

1. Triantafyllos Afouras, Joon Son Chung, and Andrew Zisserman. 2018. The Conversation: Deep Audio-Visual Speech Enhancement. (2018). arxiv:cs.CV/1804.04121

2. Triantafyllos Afouras Joon Son Chung and Andrew Zisserman. 2019. My lips are concealed: Audio-visual speech enhancement through obstructions. (2019). arxiv:cs.CV/1907.04975

3. V.R. Algazi R.O. Duda D.M. Thompson and C. Avendano. 2001. The CIPIC HRTF database. (2001) 99-102 pages. https://doi.org/10.1109/ASPAA.2001.969552

4. Winko W. An Barbara Shinn-Cunningham Hannes Gamper Dimitra Emmanouilidou David Johnston Mihai Jalobeanu Edward Cutrell Andrew Wilson Kuan-Jung Chiang and Ivan Tashev. 2021. Decoding Music Attention from “EEG Headphones”: A User-Friendly Auditory Brain-Computer Interface. (2021) 985-989 pages.

5. Taichi Asami Ryo Masumura Yoshikazu Yamaguchi Hirokazu Masataki and Yushi Aono. 2017. Domain adaptation of DNN acoustic models using knowledge distillation. (2017) 5185-5189 pages.