Affiliation:
1. PLA University of Science and Technology
2. Unit 92985 of PLA Ruian
Abstract
This paper presents a low bit rate speech coder based on predictive lattice vector quantization (PLVQ) and time-scale modification (TSM). The coding model of proposed vocoder is built on the MELP, in which bit rate reduction is achieved by taking advantage of PLVQ and TSM techniques. PLVQ is used to encode the speech line spectrum pair (LSP) parameters, which has the advantage of lower implementation complexity than multi-stage vector quantization (MSVQ), moreover, it does not require memory for codebook storage. With our speech data base, PLVQ can save up to 4 bits/frame compared to unstructured codebook MSVQ. TSM can change the speed of speech signal with its perceptual characteristics remained. Through appending TSM as previous and post process, speech coding at bit rate about 1.1 kbps could be easily achieved without modifying the vocoder structure.
Publisher
Trans Tech Publications, Ltd.
Reference12 articles.
1. L.M. Supplee, R.P. Chon, A. McCree, et al., MELP: the new Federal standard at 2400bps, IEEE Processing of ICASSP, 1997, pp.1591-1594.
2. T. Wang, K. Koishida, V. Cuperman, et al., A 1200/2400bps coding suite based on MELP, IEEE workshop on speech coding, Tsukuba, Japan, 2002, pp.90-92.
3. W.J. Han, E.K. Kim, Y.H. Oh, Multicodebook split vector quantization of LSF parameters, IEEE Signal Processing Letters, 2002, 9(12): 418-421.
4. F. Lahouti, A.K. Khandani, Reconstruction of multi-stage vector quantized sources over noisy channels-application to MELP codec, IEEE Processing of ICASSP, 2004, pp.613-616.
5. T.R. Fischer, A pyramid vector quantizer, IEEE Trans. on information theory, 1986, IT-32 (4): 568-583.