Affiliation:
1. Key Laboratory of Universal Wireless Communications, Beijing University of Posts and Telecommunications, Beijing 100876, China
2. Department of Broadband Communication, Peng Cheng Laboratory, Shenzhen 518066, China
Abstract
We consider the problem of learned speech transmission. Existing methods have exploited joint source–channel coding (JSCC) to encode speech directly to transmitted symbols to improve the robustness over noisy channels. However, the fundamental limit of these methods is the failure of identification of content diversity across speech frames, leading to inefficient transmission. In this paper, we propose a novel neural speech transmission framework named NST. It can be optimized for superior rate–distortion–perception (RDP) performance toward the goal of high-fidelity semantic communication. Particularly, a learned entropy model assesses latent speech features to quantify the semantic content complexity, which facilitates the adaptive transmission rate allocation. NST enables a seamless integration of the source content with channel state information through variable-length joint source–channel coding, which maximizes the coding gain. Furthermore, we present a streaming variant of NST, which adopts causal coding based on sliding windows. Experimental results verify that NST outperforms existing speech transmission methods including separation-based and JSCC solutions in terms of RDP performance. Streaming NST achieves low-latency transmission with a slight quality degradation, which is tailored for real-time speech communication.
Funder
National Natural Science Foundation of China
BUPT Excellent Ph.D. Students Foundation
Reference23 articles.
1. Semantic communication systems for speech transmission;Weng;IEEE J. Sel. Areas Commun.,2021
2. Semantic-preserved communication system for highly efficient speech transmission;Han;IEEE J. Sel. Areas Commun.,2022
3. Guo, J., Zhang, Y., Liu, C., Xu, W., and Bie, Z. (2023, January 5–8). SNR-Adaptive Multi-Layer Semantic Communication for Speech. Proceedings of the 2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Toronto, ON, Canada.
4. Qin, Z., Tao, X., Lu, J., Tong, W., and Li, G.Y. (2021). Semantic communications: Principles and challenges. arXiv.
5. Communication beyond transmitting bits: Semantics-guided source and channel coding;Dai;IEEE Wirel. Commun.,2023