BIMODAL QUANTIZATION OF WIDEBAND SPEECH SPECTRAL INFORMATION

Driss Guerchi

2008

Abstract

In this work we introduce an efficient method to reduce the coding rate of the spectral information in an algebraic code-excited linear prediction (ACELP) wideband codec. The Bimodal Vector Quantization (BMVQ) exploits the interframe correlation in spectral information to reduce the coding rate while maintaining high coded speech quality. In the BMVQ training phase, two codebooks are separately designed for voiced and unvoiced speech. For each speech frame, the optimal codebook for the search procedure is selected according to the interframe correlation of the spectral information. The BMVQ was successfully implemented in an ACELP wideband coder. The objective and subjective performance were found to be comparable to that of the combination of the split vector quantization and multistage vector quantization at 2.3 kbit/s.

References

  1. Y. Agiomyrgiannakis and Y. Stylianou, “Conditional Vector Quantization for Speech Coding”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no 2, pp. 377-386, February 2007.
  2. R. A. Salami, et al., “Design and description of CS-ACELP: A toll quality 8 kb/s speech coder”, IEEE Transactions on Speech and Audio Processing, vol. 6, no 2, pp. 116- 130, March 1998.
  3. ITU-T G.722.2, Wideband coding of speech at around 16 kbit/s using Adaptive Multi-Rate Wideband (AMRWB), July 2003.
  4. B. Bessete, et al., “The adaptive multirate wideband speech codec (AMR-WB)”, IEEE Transactions on Speech and Audio Processing, vol. 10, no 8, pp. 620-636, November 2002.
  5. M. Tamni, M. Jelinek, and V. T. Ruoppila, “Signal modification method for variable bit rate wideband speech coding”, IEEE Transactions on Speech and Audio Processing, vol. 13, no 5, pp. 620-636, September 2005.
  6. M. Jelinek and R. Salami, “Wideband speech coding Advances in VMR-WB standard”, IEEE Transactions on Speech and Audio Processing, vol. 15, no 4, pp. 1167- 1179, May 2007.
  7. D. Guerchi, T. Rabie, and A. Louzi, “Voicing-based codebook in low-rate wideband CELP coding”, in Proc. of the tenth European Conference on Speech Communication and Technology (Interspeech 2007- Eurospeech), Antwerp, Belgium, August 2007, pp. 2505-2508.
  8. K. K. Paliwal and B. S. Atal, “Efficient vector quantization of LPC parameters at 24 bits/frame”, IEEE Transactions on Speech, and Audio Processing, vol. 1, no. 1, January 1993.
Download


Paper Citation


in Harvard Style

Guerchi D. (2008). BIMODAL QUANTIZATION OF WIDEBAND SPEECH SPECTRAL INFORMATION . In Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008) ISBN 978-989-8111-60-9, pages 151-155. DOI: 10.5220/0001939201510155


in Bibtex Style

@conference{sigmap08,
author={Driss Guerchi},
title={BIMODAL QUANTIZATION OF WIDEBAND SPEECH SPECTRAL INFORMATION},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008)},
year={2008},
pages={151-155},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001939201510155},
isbn={978-989-8111-60-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008)
TI - BIMODAL QUANTIZATION OF WIDEBAND SPEECH SPECTRAL INFORMATION
SN - 978-989-8111-60-9
AU - Guerchi D.
PY - 2008
SP - 151
EP - 155
DO - 10.5220/0001939201510155