ATTENTION MODELS FOR VERGENCE MOVEMENTS BASED ON THE JAMF FRAMEWORK AND THE POPEYE ROBOT

Niklas Wilming, Felix Wolfsteller, Peter König, Rui Caseiro, João Xavier, Helder Araújo

Abstract

In this work we describe a novel setup for implementation and development of stereo vision attention models in a realistic embodied setting. We introduce a stereo vision robot head, called POPEYE, that provides degrees of freedom comparable to a human head. We describe the geometry of the robot as well as the characteristics that make it a good candidate for studying models of visual attention. Attentional robot control is implemented with JAMF, a graphical modeling framework which allows to easily implement current state-of-the-art saliency models. We give a brief overview over JAMF and show implementations of four exemplary attention models that can control the robot head.

References

  1. Andersen, C. S., Andersen, C. S., Crowley, J. L., D, P. P., D, P. P., and Perram, J. (1996). A framework for control of a camera head. Technical report.
  2. Aryananda, L. and Weber, J. (2004). Mertz: a quest for a robust and scalable active vision humanoid head robot. Humanoid Robots, 2004 4th IEEE/RAS International Conference on, 2:513-532.
  3. Batista, J., Dias, J., Araújo, H., and Almeida, A. (1995). The isr multi-degrees-of-freedom active vision robot head: design and calibration. In M2VIP'95-Second International Conference on Mechatronics and Machine Vision in Practice, Hong-Kong.
  4. Batista, J., Peixoto, P., and Araújo, H. (2000). A focusingby-vergence system controlled by retinal motion disparity. In ICRA, pages 3209-3214.
  5. Bruce, N. and Tsotsos, J. (2005). An attentional framework for stereo vision. Computer and Robot Vision, 2005. Proceedings. The 2nd Canadian Conference on, pages 88-95.
  6. Carpenter, H. (1988). Movements of the Eyes. London Pion Limited, second edition edition.
  7. Choi, S.-B., Jung, B.-S., Ban, S.-W., Niitsuma, H., and Lee, M. (2006). Biologically motivated vergence control system using human-like selective attention model. Neurocomputing, 69(4-6):537-558.
  8. Conradt, J., Simon, P., Pescatore, M., and Verschure, P. (2002). Saliency maps operating on stereo images detect landmarks and their distance. In ICANN 7802: Proceedings of the International Conference on Artificial Neural Networks, pages 795-800, London, UK. Springer-Verlag.
  9. Einhäuser, W., Schumann, F., Bardins, S., Bartl, K., Böning, G., Schneider, E., and König, P. (2007). Human eyehead co-ordination in natural exploration. Network: Computation in Neural Systems, 18(3):267-297.
  10. Einhäuser, W., Schumann, F., Vockeroth, J., Bartl, K., M., C., J., H., Schneider, E., and König, P. (2008). Distinct Roles for Eye for Eye and Head Movements in Selecting Salient Image Parts During Natural Exploration (in press). Ann. N.Y. Acad. Sci.
  11. Fellenz, W. A. and Hartmann, G. (2002). A modular lowcost active vision head.
  12. Gasteratos, A. and Sandini, G. (2002). Factors affecting the accuracy of an active vision head. In SETN 7802: Proceedings of the Second Hellenic Conference on AI, pages 413-422, London, UK. Springer-Verlag.
  13. Grosso, E. and Tistarelli, M. (1995). Active/dynamic stereo vision. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(9):868-879.
  14. Hartley, R. and Zisserman, A. (2004). Multiple View Geometry in Computer Vision. Cambridge University Press.
  15. Helmoltz, H. (1925). Dover.
  16. Henkel, R. (1998). A Simple and Fast Neural Network Approach to Stereovision. Advances in Neural Information Processing Systems, pages 808-814.
  17. Itti, L. and Koch, C. (2001). Computational modelling of visual attention. Nature Reviews Neuroscience, 2(3):194-204.
  18. Jansen, L., Onat, S., and König, P. (2008). Free viewing of natural images: The influence of disparity. Journal of Vision (in press).
  19. Knight, J. and Reid, I. (2006). Automated alignment of robotic pan-tilt camera units using vision. International Journal of Computer Vision, 68(3):219-237.
  20. Knudsen, E. (2007). Fundamental Components of Attention. Annual Review of Neuroscience, 30:57.
  21. Ouerhani, N. and Hugli, H. (2000). Computing visual attention from scene depth. In ICPR 7800: Proceedings of the International Conference on Pattern Recognition, page 1375, Washington, DC, USA. IEEE Computer Society.
  22. Quigley, C., Onat, S., Harding, S., Cooke, M., and König, P. (2008). Audio-visual integration during overt visual attention. Journal of Vision (in press).
  23. Salthouse, T. and Ellis, C. (1980). Determinants of eyefixation duration. Am J Psychol, 93(2):207-34.
  24. Steger, J., Wilming, N., Wolfsteller, F., Höning, N., and König, P. (2008). The jamf attention modelling framework. In WAPCV 2008, Santorini, Greece.
  25. Truong, H., Abdallah, S., Rougeaux, S., and Zelinsky, E. (2000). A novel mechanism for stereo active vision. In In Proc. Australian Conference on Robotics and Automation. ARAA.
  26. Yamato, J. (1999). A layered control system for stereo vision head with vergence. Systems, Man, and Cybernetics, 1999. IEEE SMC 7899 Conference Proceedings. 1999 IEEE International Conference on, 2:836-841 vol.2.
  27. Zhang, Z. (1999). Flexible Camera Calibration By Viewing a Plane From Unknown Orientations. In International Conference on Computer Vision (ICCV'99), pages 666-673, Corfu, Greece.
  28. Zitnick, C. and Kanade, T. (1999). Cooperative algorithm for stereo matching and occlusion detection. Technical Report CMU-RI-TR-99-35, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
Download


Paper Citation


in Harvard Style

Wilming N., Wolfsteller F., König P., Caseiro R., Xavier J. and Araújo H. (2009). ATTENTION MODELS FOR VERGENCE MOVEMENTS BASED ON THE JAMF FRAMEWORK AND THE POPEYE ROBOT . In Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2009) ISBN 978-989-8111-69-2, pages 429-437. DOI: 10.5220/0001782204290437


in Bibtex Style

@conference{visapp09,
author={Niklas Wilming and Felix Wolfsteller and Peter König and Rui Caseiro and João Xavier and Helder Araújo},
title={ATTENTION MODELS FOR VERGENCE MOVEMENTS BASED ON THE JAMF FRAMEWORK AND THE POPEYE ROBOT},
booktitle={Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2009)},
year={2009},
pages={429-437},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001782204290437},
isbn={978-989-8111-69-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2009)
TI - ATTENTION MODELS FOR VERGENCE MOVEMENTS BASED ON THE JAMF FRAMEWORK AND THE POPEYE ROBOT
SN - 978-989-8111-69-2
AU - Wilming N.
AU - Wolfsteller F.
AU - König P.
AU - Caseiro R.
AU - Xavier J.
AU - Araújo H.
PY - 2009
SP - 429
EP - 437
DO - 10.5220/0001782204290437