ACTIVE OBJECT CATEGORIZATION ON A HUMANOID ROBOT

Vignesh Ramanathan, Axel Pinz

2011

Abstract

We present a Bag of Words-based active object categorization technique implemented and tested on a humanoid robot. The robot is trained to categorize objects that are handed to it by a human operator. The robot uses hand and head motions to actively acquire a number of different views. A view planning scheme using entropy minimization reduces the number of views needed to achieve a valid decision. Categorization results are significantly improved by active elimination of background features using robot arm motion. Our experiments cover both categorization when the object is handed to the robot in a fixed pose at training and testing, and pose-independent categorization. Results on a 4-class object database demonstrate the classification efficiency, a significant gain from multi-view compared to single-view classification, and the advantage of view planning. We conclude that humanoid robotic systems can be successfully applied to actively categorize objects - a task with many potential applications ranging from edutainment to active surveillance.
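
The abstract does not describe how the entropy-minimizing view planner works. The following is only a generic sketch of greedy view selection by expected posterior entropy, not the authors' implementation. It assumes a discrete belief over the object classes and, for each candidate view, a hypothetical table `view_models[view][c][o]` giving P(observation o | class c). All function names and numbers are illustrative.

```python
import math

def entropy(p):
    """Shannon entropy (in bits) of a discrete distribution."""
    return -sum(x * math.log2(x) for x in p if x > 0)

def fuse(belief, likelihood):
    """Bayesian update: multiply the belief by per-class likelihoods, renormalize.

    Assumes the observation has non-zero probability under the belief.
    """
    unnorm = [b * l for b, l in zip(belief, likelihood)]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def expected_entropy(belief, view_model):
    """Expected entropy of the belief after observing from one view.

    view_model[c][o] = P(observation o | class c)  (hypothetical table).
    """
    n_obs = len(view_model[0])
    exp_h = 0.0
    for o in range(n_obs):
        lik = [view_model[c][o] for c in range(len(belief))]
        p_o = sum(b * l for b, l in zip(belief, lik))
        if p_o > 0:  # skip observations that cannot occur
            exp_h += p_o * entropy(fuse(belief, lik))
    return exp_h

def next_view(belief, view_models, visited=()):
    """Greedily pick the unvisited view with the lowest expected entropy."""
    candidates = [v for v in view_models if v not in visited]
    return min(candidates, key=lambda v: expected_entropy(belief, view_models[v]))

# Illustrative 4-class example with made-up observation tables.
belief = [0.25, 0.25, 0.25, 0.25]
view_models = {
    # "front" looks the same for every class, so it carries no information.
    "front": [[0.5, 0.5]] * 4,
    # "side" separates classes {0, 1} from classes {2, 3}.
    "side": [[0.9, 0.1], [0.9, 0.1], [0.1, 0.9], [0.1, 0.9]],
}
chosen = next_view(belief, view_models)
```

In this toy setup the planner prefers the "side" view, because the "front" view leaves the belief unchanged whatever is observed. A full system would repeat the select, observe, and fuse steps until the entropy of the belief falls below a chosen threshold.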

Paper Citation


in Harvard Style

Ramanathan V. and Pinz A. (2011). ACTIVE OBJECT CATEGORIZATION ON A HUMANOID ROBOT. In Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011) ISBN 978-989-8425-47-8, pages 235-241. DOI: 10.5220/0003312802350241


in Bibtex Style

@conference{visapp11,
author={Vignesh Ramanathan and Axel Pinz},
title={ACTIVE OBJECT CATEGORIZATION ON A HUMANOID ROBOT},
booktitle={Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011)},
year={2011},
pages={235-241},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003312802350241},
isbn={978-989-8425-47-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011)
TI - ACTIVE OBJECT CATEGORIZATION ON A HUMANOID ROBOT
SN - 978-989-8425-47-8
AU - Ramanathan V.
AU - Pinz A.
PY - 2011
SP - 235
EP - 241
DO - 10.5220/0003312802350241