DualVoice: Speech Interaction that Discriminates between Normal and Whispered Voice Input

Published: 2022-10-28
Container-title: The 35th Annual ACM Symposium on User Interface Software and Technology

Author:

Rekimoto Jun¹

Affiliation:

1. The University of Tokyo, Japan and Sony CSL Kyoto, Japan

Funder

JST Moonshot

Publisher

ACM

References: 43 articles.

1. M. S. Arun Sankar, M. Aiswariya, Dominic Anna Rose, Bhat Anushree, D. Bhagya Shree, P. Mohan Lakshmipriya, and P. S. Sathidevi. 2018. Speech Sound Classification and Estimation of Optimal Order of LPC Using Neural Network. In Proceedings of the 2nd International Conference on Vision, Image and Signal Processing (Las Vegas, NV, USA) (ICVISP 2018). Association for Computing Machinery, New York, NY, USA, Article 35, 5 pages. https://doi.org/10.1145/3271553.3271611

2. Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, and Michael Auli. 2020. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. arXiv [cs.CL] (June 2020).

3. Toward Silent-Speech Control of Consumer Wearables

4. Heng-Jui Chang, Alexander H Liu, Hung-Yi Lee, and Lin-Shan Lee. 2020. End-to-end Whispered Speech Recognition with Frequency-weighted Approaches and Pseudo Whisper Pre-training. (May 2020). arxiv:2005.01972 [cs.CL]

5. Marius Cotescu, Thomas Drugman, Goeric Huybrechts, Jaime Lorenzo-Trueba, and Alexis Moinet. 2019. Voice Conversion for Whispered Speech Synthesis. (Dec. 2019). arxiv:1912.05289 [cs.SD]

Cited by 4 articles.

1. Alaryngeal Speech Enhancement for Noisy Environments Using a Pareto Denoising Gated LSTM;Journal of Voice;2024-08

2. ReHEarSSE: Recognizing Hidden-in-the-Ear Silently Spelled Expressions;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11

3. From Natural to Non-Natural Interaction: Embracing Interaction Design Beyond the Accepted Convention of Natural;International Conference on Multimodal Interaction;2023-10-09

4. WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions;Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems;2023-04-19