A Multi-configuration Part-based Person Detector

Alvaro Garcia-Martin, Ruben Heras Evangelio, Thomas Sikora

2014

Abstract

People detection is a task that has generated a great interest in the computer vision and specially in the surveillance community. One of the main problems of this task in crowded scenarios is the high number of occlusions deriving from persons appearing in groups. In this paper, we address this problem by combining individual body part detectors in a statistical driven way in order to be able to detect persons even in case of failure of any detection of the body parts, i.e., we propose a generic scheme to deal with partial occlusions. We demonstrate the validity of our approach and compare it with other state of the art approaches on several public datasets. In our experiments we consider sequences with different complexities in terms of occupation and therefore with different number of people present in the scene, in order to highlight the benefits and difficulties of the approaches considered for evaluation. The results show that our approach improves the results provided by state of the art approaches specially in the case of crowded scenes.

References

  1. Ali, I. and Dailey, M. N. (2012). Multiple human tracking in high-density crowds. Image and Vision Computing, 30(12):966 - 977.
  2. Andriluka, M., Roth, S., and Schiele, B. (2008). People-tracking-by-detection and people-detectionby-tracking. In Proc. of CVPR, pages 1-8.
  3. Andriluka, M., Roth, S., and Schiele, B. (2010). Monocular 3d pose estimation and tracking by detection. In Proc. of CVPR, pages 623-630.
  4. Dalal, N. and Triggs, B. (2005). Histograms of oriented gradients for human detection. In Proc. of CVPR, pages 886-893.
  5. Dollár, P., Appel, R., and Kienzle, W. (2012a). Crosstalk cascades for frame-rate pedestrian detection. In Proc. of ECCV, number 645-659.
  6. Dollár, P., Wojek, C., Schiele, B., and Perona, P. (2012b). Pedestrian detection: An evaluation of the state of the art. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(4):743-761.
  7. Enzweiler, M. and Gavrila, D. M. (2009). Monocular pedestrian detection: Survey and experiments. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(12):2179-2195.
  8. Felzenszwalb, P. F., Girshick, R. B., McAllester, D., and Ramanan, D. (2010). Object detection with discriminatively trained part-based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9):1627-1645.
  9. Garcia-Martin, A. and Martinez, J. M. (2012). On collaborative people detection and tracking in complex scenarios. Image and Vision Computing, 30(4):345-354.
  10. Gerónimo, D., L ópez, A. M., Sappa, A. D., and Graf, T. (2010). Survey of pedestrian detection for advanced driver assistance systems. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(7):1239- 1258.
  11. Girshick, R. B., Felzenszwalb, P. F., and McAllester, D. Discriminatively trained deformable part models, release 4. http://people.cs.uchicago.edu/ rbg/latentrelease4/.
  12. Girshick, R. B., Felzenszwalb, P. F., and Mcallester, D. (2011). Object detection with grammar models. In Proc. of NIPS.
  13. Kullback, S. and Leibler, R. A. (1951). On information and sufficiency. The Annals of Mathematical Statistics, 22(1):79-86.
  14. Leibe, B., Seemann, E., and Schiele, B. (2005). Pedestrian detection in crowded scenes. In Proc. of CVPR, pages 878-885.
  15. Milan, A., Roth, S., and Schindler, K. (2014). Continuous energy minimization for multitarget tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(1):58-72.
  16. Patzold, M., Evangelio, R. H., and Sikora, T. (2010). Counting people in crowded environments by fusion of shape and motion information. In Proceedings of the 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 7810, pages 157-164, Washington, DC, USA. IEEE Computer Society.
  17. PETS. International workshop on performance evaluation of tracking and surveillance, http://www.cvg.rdg.ac.uk/pets2009/index.html.
  18. Rodriguez, M., Laptev, I., Sivic, J., and Audibert, J.-Y. (2011). Density-aware person detection and tracking in crowds. In Proc. of ICCV, pages 2423-2430.
  19. Seemann, E., Fritz, M., and Schiele, B. (2007). Towards robust pedestrian detection in crowded image sequences. In Proc. of CVPR, pages 1-8.
  20. Tang, S., Andriluka, M., and Schiele, B. (2014). Detection and tracking of occluded people. International Journal of Computer Vision.
  21. Zeng, C. and Ma, H. (2010). Robust head-shoulder detection by pca-based multilevel hog-lbp detector for people counting. In Proc. of ICPR, pages 2069-2072.
Download


Paper Citation


in Harvard Style

Garcia-Martin A., Heras Evangelio R. and Sikora T. (2014). A Multi-configuration Part-based Person Detector . In Proceedings of the 11th International Conference on Signal Processing and Multimedia Applications - Volume 1: MUSESUAN, (ICETE 2014) ISBN 978-989-758-046-8, pages 321-328. DOI: 10.5220/0005126703210328


in Bibtex Style

@conference{musesuan14,
author={Alvaro Garcia-Martin and Ruben Heras Evangelio and Thomas Sikora},
title={A Multi-configuration Part-based Person Detector},
booktitle={Proceedings of the 11th International Conference on Signal Processing and Multimedia Applications - Volume 1: MUSESUAN, (ICETE 2014)},
year={2014},
pages={321-328},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005126703210328},
isbn={978-989-758-046-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th International Conference on Signal Processing and Multimedia Applications - Volume 1: MUSESUAN, (ICETE 2014)
TI - A Multi-configuration Part-based Person Detector
SN - 978-989-758-046-8
AU - Garcia-Martin A.
AU - Heras Evangelio R.
AU - Sikora T.
PY - 2014
SP - 321
EP - 328
DO - 10.5220/0005126703210328