WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions-Reference-Cited by-同舟云学术

WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions

Published:2023-04-19 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems
language:
Short-container-title:

Author:

Rekimoto Jun¹^ORCID

Affiliation:

1. The University of Tokyo, Japan and Sony CSL Kyoto, Japan

Funder

JST Moonshot

JST CREST

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3544548.3580706

Reference77 articles.

1. A. Al-Nasheri G. Muhammad M. Alsulaiman and Z. Ali. 2017. Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions. Journal of Voice (2017). https://doi.org/10.1016/j.jvoice.2016.01.014 10.1016/j.jvoice.2016.01.014

2. A. Al-Nasheri G. Muhammad M. Alsulaiman and Z. Ali. 2017. Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions. Journal of Voice (2017). https://doi.org/10.1016/j.jvoice.2016.01.014

3. Alexei Baevski , Henry Zhou , Abdelrahman Mohamed , and Michael Auli . 2020. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. arXiv [cs.CL] (June 2020 ). Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, and Michael Auli. 2020. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. arXiv [cs.CL] (June 2020).

4. Toward Silent-Speech Control of Consumer Wearables

5. Fadi Biadsy , Ron J. Weiss , Pedro J. Moreno , Dimitri Kanevsky , and Ye Jia . 2019 . Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. https://doi.org/10.48550/ARXIV.1904.04169 10.48550/ARXIV.1904.04169 Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanevsky, and Ye Jia. 2019. Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. https://doi.org/10.48550/ARXIV.1904.04169

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Streaming ASR Encoder for Whisper-to-Speech Online Voice Conversion;IEEE Open Journal of Signal Processing;2024