ACKNOWLEDGEMENTS
This work was funded by the UK Engineering and
Physical Sciences Research Council (EP/J012025/1).
The authors would like to thank Austin Gregg-Smith
for advice on hardware and graphics, and Dr David
Hanwell for help with maths and text.
REFERENCES
Angelaki, D. and Cullen, K. (2008). Vestibular system: The
many facets of a multimodal sense. Annual Review of
Neuroscience, 31:125–150.
Bi, Y., Guan, J., and Bell, D. (2008). The combination of
multiple classifiers using an evidential reasoning ap-
proach. Artificial Intelligence, 172(15):1731–1751.
Bishop, C. (2006). Pattern Recognition and Machine
Learning. Springer.
Brock, M. and Kristensson, P. (2013). Supporting blind
navigation using depth sensing and sonification. In
Proc. Conf. Pervasive and ubiquitous computing ad-
junct publication.
Dahlkamp, H., Kaehler, A., Stavens, D., Thrun, S., and
Bradski, G. (2006). Self-supervised monocular road
detection in desert terrain. In Proc. Robotics Science
and Systems. Philadelphia.
Dalal, N. and Triggs, B. (2005). Histograms of oriented
gradients for human detection. In Proc. IEEE Conf.
Computer Vision and Pattern Recognition.
Delong, A., Osokin, A., Isack, H. N., and Boykov, Y.
(2012). Fast approximate energy minimization with
label costs. International journal of computer vision,
96(1):1–27.
Deshpande, N. and Patla, A. (2007). Visual–vestibular in-
teraction during goal directed locomotion: effects of
aging and blurring vision. Experimental brain re-
search, 176(1):43–53.
DeSouza, G. and Kak, A. (2002). Vision for mobile robot
navigation: A survey. IEEE Trans. Pattern Analysis
and Machine Intelligence, 24(2):237–267.
Domke, J. (2013). Learning graphical model parame-
ters with approximate marginal inference. IEEE
Trans. Pattern Analysis and Machine Intelligence,
35(10):2454.
Gould, S., Fulton, R., and Koller, D. (2009). Decomposing
a scene into geometric and semantically consistent re-
gions. In Proc. IEEE Int. Conf. Computer Vision.
Gupta, S., Arbeláez, P., Girshick, R., and Malik, J.
(2014). Indoor scene understanding with rgb-d im-
ages: Bottom-up segmentation, object detection and
semantic segmentation. Int. Journal of Computer Vi-
sion, pages 1–17.
Haines, O. and Calway, A. (2012). Detecting planes and
estimating their orientation from a single image. In
Proc. British Machine Vision Conf.
Hoiem, D., Efros, A., and Hebert, M. (2007). Recovering
surface layout from an image. Int. Journal of Com-
puter Vision, 75(1):151–172.
Joshi, N., Kang, S., Zitnick, C., and Szeliski, R. (2010).
Image deblurring using inertial measurement sensors.
ACM Trans. Graphics, 29(4):30.
Kleiner, A. and Dornhege, C. (2007). Real-time localization
and elevation mapping within urban search and rescue
scenarios. Journal of Field Robotics, 24(8-9):723–
745.
Krähenbühl, P. and Koltun, V. (2011). Efficient inference in
fully connected crfs with gaussian edge potentials. In
Advances in Neural Information Processing Systems,
pages 109–117.
Kundu, A., Li, Y., Dellaert, F., Li, F., and Rehg, J. (2014).
Joint semantic segmentation and 3d reconstruction
from monocular video. In Proc. European Conf. Com-
puter Vision.
Li, S. (2009). Markov Random Field Modeling in Image
Analysis. Springer-Verlag.
Lorch, O., Albert, A., Denk, J., Gerecke, M., Cupec, R.,
Seara, J., Gerth, W., and Schmidt, G. (2002). Experi-
ments in vision-guided biped walking. In Proc. IEEE
Int. Conf. Intelligent Robots and Systems.
Maimone, M., Cheng, Y., and Matthies, L. (2007). Two
years of visual odometry on the mars exploration
rovers. Journal of Field Robotics, 24(3):169–186.
Nützi, G., Weiss, S., Scaramuzza, D., and Siegwart, R.
(2011). Fusion of imu and vision for absolute scale
estimation in monocular slam. Journal of Intelligent
and Robotic Systems, 61(1-4):287–299.
Patla, A. (1997). Understanding the roles of vision in
the control of human locomotion. Gait & Posture,
5(1):54–69.
Piniés, P., Lupton, T., Sukkarieh, S., and Tardós, J. (2007).
Inertial aiding of inverse depth slam using a monoc-
ular camera. In Proc. IEEE Int. Conf. Robotics and
Automation.
Sadhukhan, D., Moore, C., and Collins, E. (2004). Terrain
estimation using internal sensors. In Proc. Int. Conf. on
Robotics and Applications.
Sturgess, P., Alahari, K., Ladicky, L., and Torr, P. (2009).
Combining appearance and structure from motion fea-
tures for road scene understanding. In Proc. British
Machine Vision Conf.
Tapu, R., Mocanu, B., and Zaharia, T. (2013). A computer
vision system that ensure the autonomous navigation
of blind people. In Proc. Conf. E-Health and Bioengi-
neering.
Viirre, E. (1996). Virtual reality and the vestibular
apparatus. Engineering in Medicine and Biology Magazine,
15(2):41–43.
Von Gioi, R., Jakubowicz, J., Morel, J., and Randall, G.
(2010). Lsd: A fast line segment detector with a false
detection control. IEEE Trans. Pattern Analysis and
Machine Intelligence, 32(4):722–732.
VISAPP 2015 - International Conference on Computer Vision Theory and Applications