As a second way towards obstacle avoidance with
stereo cameras, we have – after noting that deep learn-
ing algorithms such as Muller et al. (2004) do not rely
on view disparity – proposed the disparity-sensitive
deep learning network NCC-DISP which has only dis-
parity data as input, and show that it performs as well
as an earlier system using raw stereo camera inputs.
The next generation of ToyCollect robots will ad-
ditionally feature an active sensing depth camera to
generate approximate depth maps which would al-
low a qualitative evaluation of stereo camera algo-
rithms. Adding bumper and acceleration sensors will
furthermore allow to automate the generation of large
amounts of obstacle detection training data, which is
currently still a manual process.
ACKNOWLEDGEMENTS
This project was partially funded by the Austrian Re-
search Promotion Agency (FFG) and by the Austrian
Federal Ministry for Transport, Innovation and Tech-
nology (BMVIT) within the Talente internship re-
search program 2014-2019. We would like to thank
Lukas D.-B. for all 3D chassis designs of robot R2X;
Vladimir T. for answering questions and explaining
his model; and all interns which have worked on this
project, notably Georg W., Julian F. and Miriam T.
REFERENCES
Becker, F. and Ebner, M. (2019). Collision detection for
a mobile robot using logistic regression. In Proceed-
ings of the 16th International Conference on Informat-
ics in Control, Automation and Robotics - Volume 2:
ICINCO,, pages 167–173. INSTICC, SciTePress.
Bojarski, M., Del Testa, D., Dworakowski, D., et al. (2016).
End to end learning for self-driving cars. Technical
Report 1604.07316, Cornell University.
Bradski, G. (2000). The OpenCV Library. Dr. Dobb’s Jour-
nal of Software Tools.
Cadena, C., Carlone, L., Carrillo, H., Latif, Y., Scaramuzza,
D., Neira, J., Reid, I., and Leonard, J. (2016). Past,
present, and future of simultaneous localization and
mapping: Towards the robust-perception age. IEEE
Transactions on Robotics, 32(6):1309–1332.
Hirschmuller, H. (2008). Stereo processing by semiglobal
matching and mutual information. PAMI, 30(2):328–
341.
Kandel, E. R., Schwartz, J. H., Jessell, T. M., of Biochem-
istry, D., Jessell, M. B. T., Siegelbaum, S., and Hud-
speth, A. (2000). Principles of neural science, 4th
Edition. McGraw-Hill New York.
Khan, M. and Parker, G. (2019). Vision based indoor obsta-
cle avoidance using a deep convolutional neural net-
work. In Proceedings of the 11th International Joint
Conference on Computational Intelligence - NCTA,
(IJCCI 2019), pages 403–411. INSTICC, SciTePress.
Kim, J., Kolmogorov, V., and Zabih, R. (2003). Visual cor-
respondence using energy minimization and mutual
information. In ICCV ’03: Proceedings of the Ninth
IEEE International Conference on Computer Vision,
page 1033, Washington, DC, USA. IEEE Computer
Society.
Konolige, K. (1998). Small vision systems: Hardware and
implementation. In Robotics research, pages 203–
212. Springer.
Luo, W., Xiao, Z., Ebel, H., and Eberhard, P. (2019). Stereo
vision-based autonomous target detection and track-
ing on an omnidirectional mobile robot. In Proceed-
ings of the 16th International Conference on Informat-
ics in Control, Automation and Robotics - Volume 2:
ICINCO,, pages 268–275. INSTICC, SciTePress.
Muller, U., Ben, J., Cosatto, E., Fleep, B., and LeCun, Y.
(2004). Autonomous off-road vehicle control using
end-to-end learning. Technical report, DARPA-IPTO,
Arlington, Virginia, USA. ARPA Order Q458, Pro-
gram 3D10, DARPA/CMO Contract #MDA972-03-
C-0111, V1.2, 2004/07/30.
Muller, U., Ben, J., Cosatto, E., Fleep, B., and LeCun, Y.
(2006). Off-road obstacle avoidance through end-to-
end learning. In Advances in neural information pro-
cessing systems, pages 739–746.
Pfeiffer, M., Schaeuble, M., Nieto, J. I., Siegwart, R.,
and Cadena, C. (2016). From perception to de-
cision: A data-driven approach to end-to-end mo-
tion planning for autonomous ground robots. CoRR,
abs/1609.07910.
Seewald, A. K. (2020). Revisiting End-to-end Deep Learn-
ing for Obstacle Avoidance: Replication and Open Is-
sues. In Proceedings of the 12th International Con-
ference on Agents and Artificial Intelligence (ICAART
2020), Valetta, Malta., pages 652–659.
Tankovich, V., Hane, C., Zhang, Y., Kowdle, A., Fanello, S.,
and Bouaziz, S. (2021). Hitnet: Hierarchical iterative
tile refinement network for real-time stereo matching.
In Proceedings of the IEEE/CVF Conference on Com-
puter Vision and Pattern Recognition, pages 14362–
14372.
Wang, Y., Dongfang, L., Jeon, H., Chu, Z., and Matson, ET.
(2019). End-to-end learning approach for autonomous
driving: A convolutional neural network model. In
Rocha, A., Steels, L., and van den Herik, J., editors,
Proc. of ICAART 2019, volume 2, pages 833–839.
ICAART 2022 - 14th International Conference on Agents and Artificial Intelligence
672