Table 2: Performance of the global G.722.2 with
steganographic SBS-CP implementation.
Embedding rate
(Bits/frame)
G.722.2 with
SBS-CP by BCP
G.722.2 with
SBS-CP by UCP
WB-PESQ WB-PESQ
0 3.790 3.790
1 3.798 3.705
2 3.823 3.744
3 3.687 3.814
4 3.719 3.680
5 3.756 3.747
6 3.766 3.720
7 3.676 3.720
For all embedding rates, these simulation results
show that the overall quality of stego-speech is
almost identical to quality of cover public speech;
which means that our proposed steganographic
techniques are practically imperceptibles. Most WB-
PESQ scores of the stego-signals are between 3.67
and 3.82. Hence, a good speech quality was obtained
and no degradation was caused by the embedding
process. On the other hand, steganographic SBS-CP
systems designed by the UCP yields slight
improvement to the G.722.2 WB-PESQ performance
compared to SBS-CP with balanced partitioning.
5 CONCLUSION
In this paper, we proposed two variants of VQ-based
speech steganography binning schemes for G.722.2
secure speech communication system. The
simulation results showed that the two
steganographic SBS-CP methods by UCP and BCP
can generate stego-speech signals with similar
quality to cover speech signals; which means that
the resulting stego-speech is indistinguishable from
the original cover speech. Hence, the two proposed
variants of SBS-CP method can ensure a high
transparency with a maximal embedding rate of 7
bits/frame (350 bits/s).
Robustness against intentional and non-
intentional attacks has not been investigated in this
work; it will be studied in future research.
REFERENCES
Bessette, B., Salami, R., Lefebvre, R., Jelínek, M., Rotola-
Pukkila, J., Vainio, J., Mikkola, H., Järvinen, K., 2002.
The adaptive multirate wideband speech codec (AMR-
WB), IEEE Transactions on Speech and Audio
Processing, vol. 10, no. 8, pp. 620-636.
Boudraa, M., Boudraa, B., Guerin, B., 1992. Mise en place
de phrases arabes phonetiquement equilibrées. In
JEP'92, XIXèmes Journées d'Etude sur la Parole.
Bruxelles.
Cheraitia, S., Bouzid, M., 2014. Robust coding of
wideband speech immittance spectral frequencies.
Speech Communication, Elsevier, vol. 65, pp. 94-108.
Cox, I. J., Miller, M. L., Bloom, J. A., Fridrich, J., Kalker,
T., 2008. Digital Watermarking and Steganography,
Second Edition, Morgan Kaufmann Publishers, USA.
Djebbar, F., Ayad, B., Meraim, K. A., Hamam, H., 2012.
Comparative study of digital audio steganography
techniques, EURASIP Journal on Audio, Speech, and
Music Processing, Springer, vol. 25, pp. 1-16.
Garofolo J. S., et al., DARPA TIMIT Acoustic-phonetic
Continuous Speech Database. National Institute of
Standards and Technology (NIST), Gaithersburg,
October 1988.
Geiser, B., Vary, P., 2008. High rate data hiding in
ACELP speech codecs. In ICASSP’2008, IEEE
International Conference on Acoustics, Speech and
Signal Processing. pp. 4005-4008. USA.
Gersho, A., Gray, R. M., 1992. Vector quantization and
Signal compression, Kluwer Acad. Publishers, USA.
ITU-T Recommendation G.722.2. Wideband coding of
speech at around 16 kb/s using Adaptive Multi-rate
Wideband (AMR-WB), 2003.
ITU-T Recommendation P.862.2. Wideband Extension to
Recommendation P.862 for the Assessment of
Wideband Telephone Networks and Speech Codecs,
Geneva, 2005.
Kim, Jo. M., 2002. A digital image watermarking scheme
based on vector quantisation. IEICE Trans. on Inf. and
Systems, vol. E85-D, pp. 1054-1056.
McCree, A., Truong, K., George, E. B., Barnwell, T. P.,
Viswanathan, V., 1996. A 2.4 kbits/s MELP Coder
Candidate for the New U.S. Federal Standard. In
ICASSP'96, IEEE International Conference on
Acoustics, Speech and Signal Processing. pp. 200-203.
Moulin, P., Koetter, R., 2005. Data-Hiding Codes.
Proceedings of The IEEE, Vol. 93, pp. 2083-2126.
Paliwal, K. K., Atal, B. S., 1993. Efficient vector
quantization of LPC parameters at 24 bits/frame. IEEE
Transactions on Speech and Audio Processing, vol. 1,
no. 1, pp. 3-14.
Wang, F. H., Jain, L. C., Pan, J. S., 2007. A novel VQ-
based watermarking scheme with genetic codebook
partition. Journal of Network and Computation
Applications (JNCA), Elsevier, vol. 30, no. 1, pp. 4-23.
Yargıcoglu, A. U., Ilk, H. G., 2010. Hidden data
transmission in mixed excitation linear prediction
coded speech using quantisation index modulation,
IET Information Security, vol. 4, Issue 3, pp. 158–166.