Hybrid Person Detection and Tracking in H.264/AVC Video Streams

Philipp Wojaczek, Marcus Laumer, Peter Amon, Andreas Hutter, André Kaup

2015

Abstract

In this paper we present a new hybrid framework for detecting and tracking persons in surveillance video streams compressed according to the H.264/AVC video coding standard. The framework consists of three stages and operates in both the compressed and the pixel domain of the video. The combination of compressed and pixel domain represents the hybrid character. Its main objective is to significantly reduce the amount of computation required, in particular for frames and image regions with few people present. In its first stage the proposed framework evaluates the header information for each compressed frame in the video sequence, namely the macroblock type information. This results in a coarse binary mask segmenting the frame into foreground and background. Only the foreground regions are processed further in the second stage that searches for persons in the image pixel domain by applying a person detector based on the Implicit Shape Model. The third stage segments each detected person further with a newly developed method that fuses information from the first two stages. This helps obtaining a finer segmentation for calculating a color histogram suitable for tracking the person using the mean shift algorithm. The proposed framework was experimentally evaluated on a publicly available test set. The results demonstrate that the proposed framework reliably separates frames with and without persons such that the computational load is significantly reduced while the detection performance is kept.

References

  1. Andriluka, M., Roth, S., and Schiele, B. (2008). People-Tracking-by-Detection and People-Detectionby-Tracking. In Proc. 2008 IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 1-8.
  2. Berclaz, J., Fleuret, F., Turetken, E., and Fua, P. (2011). Multiple Object Tracking Using K-Shortest Paths Optimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(9):1806-1819.
  3. Comaniciu, D., Ramesh, V., and Meer, P. (2003). Kernelbased object tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(5):564-577.
  4. Corrales, J., Gil, P., Candelas, F., and Torres, F. (2009). Tracking based on Hue-Saturation Features with a Miniaturized Active Vision System. In Proc. 40th Int. Symposium on Robotics, pages 107-112.
  5. Eiselein, V., Fradi, H., Keller, I., Sikora, T., and Dugelay, J.- L. (2013). Enhancing Human Detection Using Crowd Density Measures and an Adaptive Correction Filter. In Proc. 2013 10th IEEE Int. Conf. on Advanced Video and Signal Based Surveillance (AVSS), pages 19-24.
  6. Evans, M., Osborne, C., and Ferryman, J. (2013). Multicamera Object Detection and Tracking with Object Size Estimation. In Proc. 2013 10th IEEE Int. Conf. on Advanced Video and Signal Based Surveillance (AVSS), pages 177-182.
  7. Kailath, T. (1967). The Divergence and Bhattacharyya Distance Measures in Signal Selection. IEEE Transactions on Communication Technology, 15(1):52-60.
  8. Laumer, M., Amon, P., Hutter, A., and Kaup, A. (2013). Compressed Domain Moving Object Detection Based on H.264/AVC Macroblock Types. In Proc. of the International Conference on Computer Vision Theory and Applications (VISAPP), pages 219-228.
  9. Leibe, B., Leonardis, A., and Bernt, S. (2004). Combined Object Categorization and Segmentation With an Implicit Shape Model. In Proc. Workshop on Statistical Learning in Computer Vision (ECCV workshop), pages 17-32.
  10. Lowe, D. G. (2004). Distinctive image features from scaleinvariant keypoints. International Journal of Computer Vision, 60:91-110.
  11. MPEG (2010). ISO/IEC 14496-10:2010 - Coding of AudioVisual Objects - Part 10: Advanced Video Coding.
  12. Nawaz, T., Poiesi, F., and Cavallaro, A. (2014). Measures of Effective Video Tracking. IEEE Transactions on Image Processing, 23(1):376-388.
  13. Poppe, C., De Bruyne, S., Paridaens, T., Lambert, P., and Van de Walle, R. (2009). Moving object detection in the H.264/AVC compressed domain for video surveillance applications. Journal of Visual Communication and Image Representation, 20(6):428-437.
  14. Senst, T., Eiselein, V., and Sikora, T. (2012). Robust Local Optical Flow for Feature Tracking. IEEE Transactions on Circuits and Systems for Video Technology, 22(9):1377-1387.
  15. Yilmaz, A., Javed, O., and Shah, M. (2006). Object Tracking: A Survey. ACM Computing Surveys, 38(4).
Download


Paper Citation


in Harvard Style

Wojaczek P., Laumer M., Amon P., Hutter A. and Kaup A. (2015). Hybrid Person Detection and Tracking in H.264/AVC Video Streams . In Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2015) ISBN 978-989-758-089-5, pages 478-485. DOI: 10.5220/0005296704780485


in Bibtex Style

@conference{visapp15,
author={Philipp Wojaczek and Marcus Laumer and Peter Amon and Andreas Hutter and André Kaup},
title={Hybrid Person Detection and Tracking in H.264/AVC Video Streams},
booktitle={Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2015)},
year={2015},
pages={478-485},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005296704780485},
isbn={978-989-758-089-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2015)
TI - Hybrid Person Detection and Tracking in H.264/AVC Video Streams
SN - 978-989-758-089-5
AU - Wojaczek P.
AU - Laumer M.
AU - Amon P.
AU - Hutter A.
AU - Kaup A.
PY - 2015
SP - 478
EP - 485
DO - 10.5220/0005296704780485