6 CONCLUSIONS AND FUTURE
WORK
Experimental results show that the proposed method
performs well in terms of both precision and recall.
In addition, the method was shown to perform well
in highly crowded situations. From our results, we
may conclude that the proposed method provides a
strong basis for head detection in applications that
utilise stereo cameras and that it works well both on
its own and in combination with tracking.
For future work, we will investigate how the blob
splitting method can be enhanced for a better segmen-
tation of people in stereo images. The detected head
locations may serve as a basis for detecting other body
parts and, ultimately, recognition of complete poses.
Finally, given the blob splitting that the method en-
ables, we can build rich profiles based on colour and
height that can be used for cross camera correlation in
a sparse camera setup.
ACKNOWLEDGEMENTS
The research reported in this paper was supported
by the Foundation Innovation Alliance (SIA - Sticht-
ing Innovatie Alliantie) with funding from the Dutch
Ministry of Education, Culture and Science (OCW),
in the framework of the ‘Mens voor de Lens’ project.
REFERENCES
Beymer, D. (2000). Person counting using stereo. In Work-
shop on Human Motion, pages 127–133.
Darrell, T., Gordon, G., Harville, M., and Woodfill, J.
(2000). Integrated person tracking using stereo, color,
and pattern detection. International Journal of Com-
puter Vision, 37(2):175–185.
Fu, H. C., Chen, J. R., and Pao, H. T. (2007). Re-
mote head counting and tracking in crowded scene via
WWW/Internet. In Proceedings of the IADIS Interna-
tional Conference WWW/Internet 2007.
Hayashi, K., Hashimoto, M., Sumi, K., Sasakawa, K., Cen-
ter, A. T., Co, M. E., and Hyogo, J. (2004). Multiple-
person tracker with a fixed slanting stereo camera.
In Sixth IEEE International Conference on Automatic
Face and Gesture Recognition, 2004. Proceedings,
pages 681–686.
Heisele, B. and Woehler, C. (1998). Motion-based recog-
nition of pedestrians. In Pattern Recognition, 1998.
Proceedings. Fourteenth International Conference on,
volume 2.
Horn, B. (1986). Robot vision. McGraw-Hill Higher Edu-
cation.
Horprasert, T., Harwood, D., and Davis, L. S. (1999). A sta-
tistical approach for real-time robust background sub-
traction and shadow detection. In IEEE ICCV, vol-
ume 99.
Hoshino, T. and Izumi, T. (2006). Improvement of head
extraction for height measurement by combination of
sphere matching and optical flow. In SICE-ICASE,
2006. International Joint Conference, pages 1607–
1612.
Huang, X., Li, L., and Sim, T. (2004). Stereo-based human
head detection from crowd scenes. In Proceedings of
International Conference on Image Processing, pages
1353–1356.
Ishii, Y., Hongo, H., Yamamoto, K., and Niwa, Y. (2004).
Face and head detection for a real-time surveillance
system. In Pattern Recognition, 2004. ICPR 2004.
Proceedings of the 17th International Conference on,
volume 3, pages 298–301. IEEE.
Kelly, P., O’Connor, N. E., and Smeaton, A. F. (2009).
Robust pedestrian detection and tracking in crowded
scenes. Image and Vision Computing, 27(10):1445–
1458.
Luo, R. and Guo, Y. (2001). Real-time stereo tracking
of multiple moving heads. In IEEE ICCV Workshop
RATFG-RTS01, pages 55–59.
Miki
´
c, I., Trivedi, M., Hunter, E., and Cosman, P. (2003).
Human body model acquisition and tracking using
voxel data. International Journal of Computer Vision,
53(3):199–223.
Park, S. and Aggarwal, J. K. (2000). Head segmentation
and head orientation in 3d space for pose estimation
of multiple people. In IEEE Southwest Symposium on
Image Analysis and Interpretation.
Scharstein, D. and Szeliski, R. (2002). A taxonomy and
evaluation of dense two-frame stereo correspondence
algorithms. International journal of computer vision,
47(1):7–42.
Stauffer, C. and Grimson, W. E. L. (1999). Adaptive
background mixture models for real-time tracking.
In Computer Vision and Pattern Recognition, 1999.
IEEE Computer Society Conference on., volume 2.
Zhao, T. and Nevatia, R. (2003). Bayesian human seg-
mentation in crowded situations. In IEEE Computer
Society Conference on Computer Vision and Pattern
Recognition, volume 2.
Zivkovic, Z. (2004). Improved adaptive gaussian mixture
model for background subtraction. In Pattern Recog-
nition, 2004. ICPR 2004. Proceedings of the 17th In-
ternational Conference on, volume 2.
HEAD DETECTION IN STEREO DATA FOR PEOPLE COUNTING AND SEGMENTATION
625