HEAD DETECTION IN STEREO DATA FOR PEOPLE COUNTING AND SEGMENTATION

Tim van Oosterhout, Sander Bakkes, Ben Kröse

Abstract

In this paper we propose a head detection method using range data from a stereo camera. The method is based on a technique that has been introduced in the domain of voxel data. For application in stereo cameras, the technique is extended (1) to be applicable to stereo data, and (2) to be robust with regard to noise and variation in environmental settings. The method consists of foreground selection, head detection, and blob separation, and, to improve results in case of misdetections, incorporates a means for people tracking. It is tested in experiments with actual stereo data, gathered from three distinct real-life scenarios. Experimental results show that the proposed method performs well in terms of both precision and recall. In addition, the method was shown to perform well in highly crowded situations. From our results, we may conclude that the proposed method provides a strong basis for head detection in applications that utilise stereo cameras.

References

  1. Beymer, D. (2000). Person counting using stereo. In Workshop on Human Motion, pages 127-133.
  2. Darrell, T., Gordon, G., Harville, M., and Woodfill, J. (2000). Integrated person tracking using stereo, color, and pattern detection. International Journal of Computer Vision, 37(2):175-185.
  3. Fu, H. C., Chen, J. R., and Pao, H. T. (2007). Remote head counting and tracking in crowded scene via WWW/Internet. In Proceedings of the IADIS International Conference WWW/Internet 2007.
  4. Hayashi, K., Hashimoto, M., Sumi, K., Sasakawa, K., Center, A. T., Co, M. E., and Hyogo, J. (2004). Multipleperson tracker with a fixed slanting stereo camera. In Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings, pages 681-686.
  5. Heisele, B. and Woehler, C. (1998). Motion-based recognition of pedestrians. In Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on, volume 2.
  6. Horn, B. (1986). Robot vision. McGraw-Hill Higher Education.
  7. Horprasert, T., Harwood, D., and Davis, L. S. (1999). A statistical approach for real-time robust background subtraction and shadow detection. In IEEE ICCV, volume 99.
  8. Hoshino, T. and Izumi, T. (2006). Improvement of head extraction for height measurement by combination of sphere matching and optical flow. In SICE-ICASE, 2006. International Joint Conference, pages 1607- 1612.
  9. Huang, X., Li, L., and Sim, T. (2004). Stereo-based human head detection from crowd scenes. In Proceedings of International Conference on Image Processing, pages 1353-1356.
  10. Ishii, Y., Hongo, H., Yamamoto, K., and Niwa, Y. (2004). Face and head detection for a real-time surveillance system. In Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on, volume 3, pages 298-301. IEEE.
  11. Kelly, P., O'Connor, N. E., and Smeaton, A. F. (2009). Robust pedestrian detection and tracking in crowded scenes. Image and Vision Computing, 27(10):1445- 1458.
  12. Luo, R. and Guo, Y. (2001). Real-time stereo tracking of multiple moving heads. In IEEE ICCV Workshop RATFG-RTS01, pages 55-59.
  13. Mikic, I., Trivedi, M., Hunter, E., and Cosman, P. (2003). Human body model acquisition and tracking using voxel data. International Journal of Computer Vision, 53(3):199-223.
  14. Park, S. and Aggarwal, J. K. (2000). Head segmentation and head orientation in 3d space for pose estimation of multiple people. In IEEE Southwest Symposium on Image Analysis and Interpretation.
  15. Scharstein, D. and Szeliski, R. (2002). A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. International Journal of Computer Vision, 47(1):7-42.
  16. Stauffer, C. and Grimson, W. E. L. (1999). Adaptive background mixture models for real-time tracking. In Computer Vision and Pattern Recognition, 1999. IEEE Computer Society Conference on., volume 2.
  17. Zhao, T. and Nevatia, R. (2003). Bayesian human segmentation in crowded situations. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, volume 2.
  18. Zivkovic, Z. (2004). Improved adaptive gaussian mixture model for background subtraction. In Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on, volume 2.
Download


Paper Citation


in Harvard Style

van Oosterhout T., Bakkes S. and Kröse B. (2011). HEAD DETECTION IN STEREO DATA FOR PEOPLE COUNTING AND SEGMENTATION . In Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011) ISBN 978-989-8425-47-8, pages 620-625. DOI: 10.5220/0003362806200625


in Bibtex Style

@conference{visapp11,
author={Tim van Oosterhout and Sander Bakkes and Ben Kröse},
title={HEAD DETECTION IN STEREO DATA FOR PEOPLE COUNTING AND SEGMENTATION},
booktitle={Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011)},
year={2011},
pages={620-625},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003362806200625},
isbn={978-989-8425-47-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011)
TI - HEAD DETECTION IN STEREO DATA FOR PEOPLE COUNTING AND SEGMENTATION
SN - 978-989-8425-47-8
AU - van Oosterhout T.
AU - Bakkes S.
AU - Kröse B.
PY - 2011
SP - 620
EP - 625
DO - 10.5220/0003362806200625