6 CONCLUSIONS AND FUTURE
WORK
We have described a virtual reality tool, capable of
generating pairs of stereo images like the ones that
can be obtained by a verging pan-tilt robotic head and
the related ground truth data.
To obtain such a behavior the toe-in stereo-
scopic technique should be preferred to the off-axis
technique. By proper rototranslations of the view
volumes, we can create benchmark sequences for
vision systems with convergent axis. Moreover, by
using the precise 3D position of the objects these
vision systems can interact with the scene in a
proper way. A data set of stereo image pairs and
the related ground truth disparities are available for
the Computer Vision community at the web site
www.pspc.dibe.unige.it/Research/vr.html
.
Since the purpose of this work was not to create
a photo-realistic virtual reality tool but to obtain
sufficiently complex scenarios for benchmarking an
active vision system, we have not directly addressed
the problem of improving the photo-realistic quality
of the 3D scene, rather we focused on the definition
of a realistic model of the interactions between the
vision system and the observed scene. The creation
of even more complex and photo-realistic scenes will
be part of a future work.
Furthermore, we will integrate vergence/version
strategies in the system in order to have a really ac-
tive tool that interacts with the virtual environments.
It would also be interesting to modify the standard
pan-tilt behavior by including more biologically plau-
sible constraints on the camera movements (Schreiber
et al., 2001; Van Rijn and Van den Berg, 1993).
ACKNOWLEDGEMENTS
We wish to thank Luca Spallarossa for the helpful
comments.
REFERENCES
Aloimonos, Y., Weiss, I., and Bandyopadhyay, A. (1988).
Active vision. Int. J. of Computer Vision, 1:333–356.
Awaad, I., Hartanto, R., Le´on, B., and Pl¨oger, P. (2008). A
software system for robotic learning by experimenta-
tion. In Workshop on robot simulators (IROS08).
Bourke, P. and Morse, P. (2007). Stereoscopy: Theory and
practice. Workshop at 13th International Conference
on Virtual Systems and Multimedia.
Dankers, A. and Zelinsky, A. (2004). Cedar: A real-world
vision system: Mechanism, control and visual pro-
cessing. Machine Vision and Appl., 16(1):47–58.
Hearn, D. and Baker, M. P. (1997). Computer Graphics, C
Version. 2nd edition. Prentice Hall.
Huang, J., Lee, A. B., and Mumford, D. (2000). Statistics
of range images. In Proc. of the IEEE Conference on
Computer Vision and Pattern Recognition.
Jørgensen, J. and Petersen, H. (2008). Usage of simula-
tions to plan stable grasping of unknown objects with
a 3-fingered schunk hand. In Workshop on robot sim-
ulators (IROS08).
Ma, Y., Soatto, S., and Kosecka, J.and Sastry, S. (2004).
An Invitation to 3D Vision. From Images to Geometric
Models. Springer-Verlag.
Okada, K., Kino, Y., and Kanehiro, F. (2002). Rapid devel-
opment system for humanoid vision-based behaviors
with real-virtual common interface. In IEEE/RSJ Int.
Conf. on Intelligent Robots and Systems.
Rabie, T. F. and Terzopoulos, D. (2000). Active perception
in virtual humans. In Vision Interface 2000 (VI 2000).
Read, J. and Cumming, B. (2006). Does depth perception
require vertical disparity detectors? Journal of Vision,
6(12):1323–1355.
Scharstein, D. and Pal, C. (2007). Learning conditional ran-
dom fields for stereo. In Proc. of the IEEE Conference
on Computer Vision and Pattern Recognition.
Schreiber, K. M., Crawford, J. D., Fetter, M., and Tweed,
D. B. (2001). The motor side of depth vision. Nature,
410:819–822.
Theimer, W. and Mallot, H. (1994). Phase-based binocular
vergence control and depth reconstruction using active
vision. CVGIP: Image Understanding, 60(3):343–
358.
Trucco, E. and Verri, A. (1998). Introductory Techniques
for 3-D Computer Vision. Prentice Hall.
Ulusoy, I., Halici, U., and Leblebicioglu, K. (2004). 3d
cognitive map construction by active stereo vision in
a virtual world. Lecture notes in Computer Science,
3280:400–409.
Van Rijn, L. and Van den Berg, A. (1993). Binocular eye
orientation during fixations: Listing’s law extended to
include eye vergence. Vision Research, 33:691–708.
Volpel, B. and Theimer, W. (1995). Localization uncertainty
in area-based stereo algorithms. IEEE Transactions on
Systems, Man and Cybernetics, 25(12):1628–1634.
Wrigh, R., Lipchack, B., and Haemel, N. (2007). OpenGL
superbible, Fourth Edition, Comprehensive Tutorial
and Reference. Addison-Wesley.
A VIRTUAL REALITY SIMULATOR FOR ACTIVE STEREO VISION SYSTEMS
449