thru the end of sequence. A possible reason being
that, those feature points belonging to the moving ob-
jects are falsely triangulated and tracked. Arriving at
the end of the sequence, the regularised energy model
with α set to 0.75 achieved the lowest drift within
1.7% (3 metres), while the conventional re-projection
error minimisation approach presented the worst re-
sult, with a motion drift above 5%.
We also calculated segmented motion errors in
terms of translation and rotation components of the
estimated ego-motion, with respect to various travel
distances. The travel distance is not measured only
from the beginning of the sequence; any segment be-
gins from an arbitrary frame k thru frame k + n where
n > 0 having a length l is taken into account during
the error calculation of interval [l
p
, l
q
) if l
p
≤ l < l
q
.
We divide the length of the sequence into 10 equally
spaced segments for plotting. The results are de-
picted in Fig. 5. It shows that, in the translation com-
ponent, the damping parameter α = 0.75 yields the
best accuracy in all segments, while the conventional
model maintains a moderate accuracy in travel dis-
tances shorter than 100 metres. On the rotation part,
however, it presents the worst accuracy. The best
accuracy, achieved by α = 0.5, which equally relies
on both re-projection and epipolar constraints, is five
times better than the conventional model.
5 CONCLUSIONS
In this paper we reviewed the underlying mathemati-
cal models of the monocular ego-motion estimation
problem and formulated an enhanced minimisation
model to improve the stability and robustness of the
optimisation process.
The experimental findings support a positive ef-
fect of the proposed model on increasing the accu-
racy of the VO procedure. Remarkably, with monoc-
ular vision the presented implementation achieves an
overall motion drift within 2% over 200 metres, which
is comparative to the stereo VO implementations as
listed on the website of the KITTI visual odometry
benchmark in 2016.
REFERENCES
Badino, H., Yamamoto, A., Kanade, T.: Visual odometry by
multi-frame feature integration. Int. ICCV Workshop
Computer Vision Autonomous Driving (2013)
Engels, C., Stewenius, H., Nister, D.: Bundle adjust-
ment rules. In Proc. Photogrammetric Computer Vi-
sion (2006)
Fischler, M.A., Bolles, R.C.: Random sample consensus:
A paradigm for model fitting with applications to im-
age analysis and automated cartography. Comm. of
the ACM, vol. 24, no. 6, pp. 381–395 (1981)
Forster, C., Pizzoli, M., Scaramuzza, D.: SVO: Fast semi-
direct monocular visual odometry. In: Proc. IEEE Int.
Conf. Robotics Automation, pp. 15–22 (2014)
Geiger, A., Lenz, P., Stiller, C., and Urtasun, R.: Vision
meets robotics: The KITTI dataset. Int. J. Robotics
Research, vol. 32, no. 11, pp. 1231–1237 (2013)
Geng, H., Chien, H.-J., Nicolescu, R., Klette, R.: Egomo-
tion estimation and reconstruction with Kalman filters
and GPS integration. In: Proc. Computer Analysis of
Images and Patterns, vol. 9256, pp. 399–410 (2015)
Hartley, R. I., Zisserman, A. : Multiple View Geometry in
Computer Vision, second edition. Cambridge Univer-
sity Press, Cambridge (2004)
Hu, G., Huang, S., Zhao, L., Alempijevic A., Dissanayake,
G.: A robust RGB-D SLAM algorithm. In Proc.:
IEEE/RSJ Int. Conf. on Intelligent Robots and Sys-
tems, pp. 1714–1719 (2012)
Klette, R.: Concise Computer Vision. Springer, London
(2014)
Konolige, K., Agrawal, M.: FrameSLAM: From bundle
adjustment to real-time visual mapping. IEEE Trans.
Robotics, vol. 5, no. 24, pp. 1066–1077 (2008)
Lepetit, V, Moreno-Noguer, F., Fua. P.: EPnP: An accurate
O(n) solution to the PnP problem. Int. J. Computer
Vision, vol. 81, pp. 155–166 (2009)
Levenberg, K.A.: Method for the solution of certain non-
linear problems in least squares. The Quarterly Ap-
plied Math., vol. 2, pp. 164–168 (1944)
Morales, S., Klette, R.: Kalman-filter based spatio-temporal
disparity integration. Pattern Recognition Letters, vol.
34, no. 8, pp. 873–883 (2013)
Sampson, P.D.: Fitting conic sections to ‘very scattered’
data: An iterative refinement of the Bookstein algo-
rithm. Computer Graphics Image Processing, vol. 18,
no. 1, pp. 97–108 (1982)
Scaramuzza, D., Fraundorfer, F.: Visual odometry: Part I
- The first 30 years and fundamentals. IEEE Robotics
Automation Magazine, vol. 18, pp. 80–92 (2011)
Tomasi, C., Kanade, T.: Detection and tracking of point fea-
tures. Carnegie Mellon University Technical Report,
CMU-CS-91-132 (1991)
Vaudrey, T., Badino, H., Gehrig, S.: Integrating disparity
images by incorporating disparity rate. In. Proc. Robot
Vision, LNCS 4931, pp. 29–42 (2008)
Wu, F.C., Zhang, Q., Hu, Z.Y.: Efficient suboptimal solu-
tions to the optimal triangulation. Int. J. Computer Vi-
sion, vol. 91, no. 1, pp. 77–106 (2011)
Zhengyou, Z.: Determining the epipolar geometry and its
uncertainty: A review. Int. J. Computer Vision, vol. 2,
no. 27, pp. 161–198 (1998)
VISAPP 2017 - International Conference on Computer Vision Theory and Applications
368