Visual Onoma-to-Wave: Environmental Sound Synthesis from Visual Onomatopoeias and Sound-Source Images

Published: 2023-06-04
Container-title: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Author:

Ohnaka Hien¹, Takamichi Shinnosuke², Imoto Keisuke³, Okamoto Yuki⁴, Fujii Kazuki², Saruwatari Hiroshi²

Affiliation:

1. National Institute of Technology, Tokuyama College, Japan

2. The University of Tokyo, Japan

3. Doshisha University, Japan

4. Ritsumeikan University, Japan

Publisher

IEEE

References: 27 articles.

2. Vector-based representation and clustering of audio using onomatopoeia words; Sundaram; AAAI Fall Symposium Aurally Informed Performance, 2006

3. vTTS: Visual-text to speech; Nakano; Proc. SLT, 2022

Cited by 3 articles.

1. Environmental Sound Synthesis from Vocal Imitations and Sound Event Labels; ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2024-04-14