Complexity Analysis of Video Frames by Corresponding Audio Features

SeungHo Shin, TaeYong Kim



In this paper, we propose a method to estimate video complexity using audio features, based on human synesthesia factors. By analyzing the features of the audio segments that correspond to video frames, we obtain an initial estimate of the complexity of those frames, which can improve video compression performance. The effectiveness of the proposed method is verified by applying it to H.264/AVC rate control.
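The idea of pairing each video frame with its audio segment can be illustrated with a minimal sketch. The abstract does not name the audio features or the mapping to complexity, so everything here is an assumption made for illustration: short-time energy stands in as the audio feature, the normalized energy stands in as the initial complexity score, and the function names are hypothetical. It is not the authors' method.

```python
import math


def frame_audio_segments(samples, sample_rate, fps):
    """Split mono audio samples into one segment per video frame."""
    per_frame = sample_rate / fps  # audio samples spanned by one video frame
    n_frames = int(len(samples) / per_frame)
    return [
        samples[int(i * per_frame):int((i + 1) * per_frame)]
        for i in range(n_frames)
    ]


def short_time_energy(segment):
    """Mean squared amplitude of a segment (assumed feature, not from the paper)."""
    return sum(x * x for x in segment) / len(segment) if segment else 0.0


def initial_complexity(samples, sample_rate, fps):
    """Hypothetical per-frame complexity score in [0, 1]: energy / peak energy."""
    energies = [
        short_time_energy(seg)
        for seg in frame_audio_segments(samples, sample_rate, fps)
    ]
    peak = max(energies, default=0.0)
    return [e / peak if peak > 0 else 0.0 for e in energies]


# Synthetic example: 1 s of audio at 8 kHz, quiet first half, loud second half.
sample_rate, fps = 8000, 4
samples = [
    (0.1 if n < sample_rate // 2 else 1.0)
    * math.sin(2 * math.pi * 440 * n / sample_rate)
    for n in range(sample_rate)
]
scores = initial_complexity(samples, sample_rate, fps)
```

In a rate-control setting, a score like this could serve only as a starting guess before any frame has been encoded; how the paper actually turns audio features into a complexity estimate is not stated in the abstract.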



Paper Citation

in Harvard Style

Shin S. and Kim T. (2012). Complexity Analysis of Video Frames by Corresponding Audio Features. In Proceedings of the International Conference on Signal Processing and Multimedia Applications and Wireless Information Networks and Systems - Volume 1: SIGMAP, (ICETE 2012), ISBN 978-989-8565-25-9, pages 111-114. DOI: 10.5220/0003982001110114
