is no dependent of the morphological features of the
heads. The experimentation shows that the system is
able to detect the persons present in its vicinity, track
their motions and give a value of possible interest on
the interaction of the persons with the robot.
The proposed method can be easily updated in
future works to analyze other types of input data as
sounds or laser range finder. Also, the degree of in-
terest will be useful to plan the actions of the robot
towards the persons in order to allow a more natural
human-robot interaction.
ACKNOWLEDGEMENTS
This work has been partially supported by the Span-
ish MEC project TIN2006-05565 and Andalusian Re-
gional Government project TIC1670.
REFERENCES
Aguirre, E. and Gonz
´
alez, A. (2000). Fuzzy behaviors for
mobile robot navigation: Design, coordination and fu-
sion. International Journal of Approximate Reason-
ing, 25:255–289.
Bennewitz, M., Faber, F., Joho, D., Schreiber, M., and
Behnke, S. (2005). Integrating vision and speech
for conversations with multiple persons. In IROS’05:
Proceedings of the IEEE/RSJ Int. Conf. on Intelligent
Robots and Systems, pages 2523 – 2528.
Bien, Z. and Song, W. (2003). Blend of soft computing
techniques for effective human-machine interaction
in service robotic systems. Fuzzy Sets and Systems,
134(1):5–25.
Birchfield, S. (1998). Elliptical Head Tracking Using Inten-
sity Gradients and Color Histograms. In IEEE Con-
ference on Computer Vision and Pattern Recognition,
pages 232–237.
Chang, C. and Lin, C. (2006). Libsvm.
a library for support vector machines.
http://www.csie.ntu.edu.tw/ cjlin/libsvm/.
Comaniciu, D., Ramesh, V., and Meer, P. (2000). Real-Time
Tracking of Non-Rigid Objects using Mean Shift. In
IEEE Conference on Computer Vision and Pattern
Recognition, volume 2, pages 142–149.
Cristianini, N. and Shawe-Taylor, J. (2000). An Introduction
To Support Vector Machines (and other Kernel Based
Methods). Cambridge University Press.
Darrell, T., Demirdjian, D., Checka, N., and Felzenszwalb,
P. (2001). Plan-view trajectory estimation with dense
stereo background models. In Eighth IEEE Interna-
tional Conference on Computer Vision (ICCV 2001),
volume 2, pages 628 – 635.
Fritsch, J., Kleinehagenbrock, M., Lang, S., Pl
¨
otz, T., Fink,
G. A., and Sagerer, G. (2003). Multi-modal anchor-
ing for human-robot interaction. Robotics and Au-
tonomous Systems, 43(2-3):133–147.
Ghidary, S. S., Nakata, Y., Saito, H., Hattori, M., and
Takamori, T. (2002). Multi-modal interaction of hu-
man and home robot in the context of room map gen-
eration. Autonomous Robots, 13(2):169–184.
Haritaoglu, I., Beymer, D., and Flickner, M. (2002). Ghost
3d: detecting body posture and parts using stereo. In
Workshop on Motion and Video Computing, pages 175
– 180.
Harville, M. (2004). Stereo person tracking with adaptive
plan-view templates of height and occupancy statis-
tics. Image and Vision Computing, 2:127–142.
Hayashi, K., Hashimoto, M., Sumi, K., and Sasakawa, K.
(2004). Multiple-person tracker with a fixed slanting
stereo camera. In 6th IEEE International Conference
on Automatic Face and Gesture Recognition, pages
681–686.
Henry, G. and Dunteman (1989). Principal Components
Analysis. SAGE Publications.
Intel (2005). OpenCV: Open source Computer Vision li-
brary.
Kuhn, H. W. (1955). The hungarian method for the assign-
ment problem. Naval Research Logistics Quarterly,
2:83–97.
Kulic, D. and Croft, E. (2003). Estimating intent for hu-
man robot interaction. In International Conference on
Advanced Robotics, pages 810–815.
Lienhart, R. and Maydt, J. (2002). An Extended Set of
Haar-Like Features for rapid Object detection. In
IEEE Conf. on Image Processing, pages 900–903.
Mu
˜
noz-Salinas, R., Aguirre, E., and Garc
´
ıa-Silvente, M.
(2006). People detection and tracking using stereo vi-
sion and color. To appear in Image and Vision Com-
puting. Available online at www.sciencedirect.com.
PtGrey (2005). Bumblebee. Binocu-
lar stereo vision camera system.
http://www.ptgrey.com/products/bumblebee/index.html.
Saffiotti, A. (1997). The uses of fuzzy logic in autonomous
robot navigation. Soft Computing, 1:180–197.
Snidaro, L., Micheloni, C., and Chiavedale, C. (2005).
Video security for ambient intelligence. IEEE Trans-
actions on Systems, Man and Cybernetics, Part A,
35:133 – 144.
Song, W., Kim, D., Kim, J., and Bien, Z. (2001). Visual
servoing for a user’s mouth with effective intention
reading in a wheelchair-based robotic arm. In ICRA,
pages 3662–3667.
Viola, P. and Jones, M. (2001). Rapid object detection using
a boosted cascade of simple features. In IEEE Conf.
on Computer Vision and Pattern Recognition, pages
511–518.
ICINCO 2007 - International Conference on Informatics in Control, Automation and Robotics
188