Multimode Tree-Coding of Speech with Pre-/Post-Weighting-Reference-Cited by-同舟云学术

Multimode Tree-Coding of Speech with Pre-/Post-Weighting

Published:2022-02-15 Issue:4 Volume:12 Page:2026
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Li Ying-Yi,Ramadas Pravin,Gibson Jerry^ORCID

Abstract

As speech-coding standards have improved over the years, so complexity has increased, and less emphasis been placed on low encoding/decoding delay. We present a low-complexity, low-delay speech codec based on tree-coding with sample-by-sample adaptive long- and short-code generators that incorporates pre- and post-filtering for perceptual weighting and multimode speech classification with comfort noise generation (CNG). The pre-/post-weighting filters adapt based on the code generator parameters available at both the encoder and decoder rather than the usual method that uses the input speech. The coding of the multiple speech modes and comfort noise generation is accomplished using the code generator adaptation algorithms, again, rather than using the input speech. Codec complexity comparisons are presented and operational rate distortion curves for several standardized speech codecs and the new codec are given. Finally, codec performance is shown in relation to theoretical rate distortion bounds.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/4/2026/pdf

Reference47 articles.

1. Adaptive Predictive Coding of Speech Signals

2. Digital Coding of Waveforms: Principles and Applications to Speech and Video;Jayant,1984

3. Rate Distortion Bounds for Voice and Video

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Reinforcement Learning Approach to Speech Coding;Information;2022-07-11