3D Spatial Features for Multi-Channel Target Speech Separation-Reference-Cited by-同舟云学术

3D Spatial Features for Multi-Channel Target Speech Separation

Published:2021-12-13 Issue: Volume: Page:
ISSN:
Container-title:2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
language:
Short-container-title:

Author:

Gu Rongzhi¹,Zhang Shi-Xiong²,Yu Meng²,Yu Dong²

Affiliation:

1. Peking University,Shenzhen,China

2. Tencent AI Lab,Bellevue,USA

Publisher

IEEE

Link

Reference18 articles.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Unified Geometry-Aware Source Localization and Separation Framework for AD-HOC Microphone Array;2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2024-04-14

2. A Study of Multichannel Spatiotemporal Features and Knowledge Distillation on Robust Target Speaker Extraction;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

3. USAT: A Universal Speaker-Adaptive Text-to-Speech Approach;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

4. ReZero: Region-Customizable Sound Extraction;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

5. Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction;2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC);2023-10-31