Funder
National Key R&D Program of China
Key Research and Development Program of Shaanxi
China Scholarship Council
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Subject
Electrical and Electronic Engineering,Acoustics and Ultrasonics,Computer Science (miscellaneous),Computational Mathematics
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Visio-Voice Transforming Images into Sound for the Visually Impaired;2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems (ICITEICS);2024-06-28
2. Recent Advances in Synthesis and Interaction of Speech, Text, and Vision;Electronics;2024-04-30
3. Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-Training and Multi-Modal Tokens;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
4. Generating Captions from Images using Unsupervised MO-CNN in Deep Learning;2024 3rd International Conference for Innovation in Technology (INOCON);2024-03-01
5. Direct speech-reply generation from text-dialogue context;2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC);2022-11-07