ception pipeline so errors in segmentation propagates
throughout all the system. Better segmentation accu-
racy therefore improves other aspects of the system
such as tracking.
As future work, we plan to provide a detailed sta-
tistical analysis such as standard deviation to demon-
strate the significance of the improvement on segmen-
tation. Also, showing the effect of segmentation accu-
racy on object tracking would be useful to reveal how
the under-segmentation problem effects the whole ob-
ject recognition system of an autonomous vehicle.
The presented method does not benefit from the se-
quential nature of the problem. Adding a posterior
inference using prior knowledge from previous time
steps would speed up the overall system. The prior
knowledge obtained by the mean shift method could
also be used for this purpose.
Sub-sampling of 3D Lidar data by mapping indi-
vidual point measurements to an occupancy grid rep-
resentation and reduction of the motion estimation
into one dimension is sufficient to successfully dis-
criminate moving objects from their neighbors such
as buildings or parked cars. However, because of
stationary under-segmented objects and the group of
pedestrians moving in the same direction, the er-
ror rate stays around 9%. Exploiting an appearance
model together with the features of the grid represen-
tation would help to detect stationary nearby objects
and to separate each pedestrian in a group moving to-
wards the same direction. Also, this error rate en-
courages us to integrate a classification module as a
future work. Adding semantic cues would resolve the
under-segmentation problem of stationary nearby ob-
jects and, thus, improve the general segmentation ac-
curacy.
In addition, instead of estimating the motion of
the whole scene at each time step, the system might
decide to estimate only informative parts of the envi-
ronment by using semantic information. This could
further decrease the computational costs of the seg-
mentation and tracking components.
The detection of object classes would also be very
useful for the segmentation and tracking steps. To ob-
tain temporal information from the scene, applying
an iterative closest point approach would be interest-
ing instead of tracking each grid cell on an occupancy
grid. We intend to compare the performance of our
method with other novel algorithms proposed in the
literature.
ACKNOWLEDGEMENTS
We acknowledge the support by the EU’s Seventh
Framework Programme under grant agreement no.
607400 (TRAX, Training network on tRAcking in
compleX sensor systems) http://www.trax.utwente.nl/
REFERENCES
Azim, A. and Aycard, O. (2012). Detection, classification
and tracking of moving objects in a 3d environment.
In Intelligent Vehicles Symposium (IV), 2012 IEEE,
pages 802–807. IEEE.
Bar-Shalom, Y. (1987). Tracking and data association.
Academic Press Professional, Inc.
Blei, D. M. and Frazier, P. I. (2011). Distance dependent
chinese restaurant processes. The Journal of Machine
Learning Research, 12:2461–2488.
Choi, J., Ulbrich, S., Lichte, B., and Maurer, M. (2013).
Multi-target tracking using a 3d-lidar sensor for au-
tonomous vehicles. In Intelligent Transportation
Systems-(ITSC), 2013 16th International IEEE Con-
ference on, pages 881–886. IEEE.
Comaniciu, D. and Meer, P. (2002). Mean shift: A robust
approach toward feature space analysis. IEEE Trans-
actions on pattern analysis and machine intelligence,
24(5):603–619.
Douillard, B., Underwood, J., Kuntz, N., Vlaskine, V.,
Quadros, A., Morton, P., and Frenkel, A. (2011).
On the segmentation of 3d lidar point clouds. In
Robotics and Automation (ICRA), 2011 IEEE Inter-
national Conference on, pages 2798–2805.
Fritsch, J., Kuhnl, T., and Geiger, A. (2013). A new per-
formance measure and evaluation benchmark for road
detection algorithms. In Intelligent Transportation
Systems-(ITSC), 2013 16th International IEEE Con-
ference on, pages 1693–1700. IEEE.
Fukunaga, K. and Hostetler, L. (1975). The estimation of
the gradient of a density function, with applications in
pattern recognition. IEEE Transactions on informa-
tion theory, 21(1):32–40.
Geiger, A., Lenz, P., Stiller, C., and Urtasun, R. (2013).
Vision meets robotics: The kitti dataset. International
Journal of Robotics Research (IJRR).
Geiger, A., Lenz, P., and Urtasun, R. (2012). Are we
ready for autonomous driving? the kitti vision bench-
mark suite. In Computer Vision and Pattern Recogni-
tion (CVPR), 2012 IEEE Conference on, pages 3354–
3361. IEEE.
Gelman, A., Carlin, J. B., Stern, H. S., and Rubin, D. B.
(2003). Bayesian data analysis. Chapman and
Hall/CRC Texts in Statistical Science.
Geman, S. and Geman, D. (1984). Stochastic relaxation,
gibbs distributions, and the bayesian restoration of im-
ages. IEEE Transactions on pattern analysis and ma-
chine intelligence, (6):721–741.
Held, D., Guillory, D., Rebsamen, B., Thrun, S., and
Savarese, S. (2016). A probabilistic framework for
ICINCO 2017 - 14th International Conference on Informatics in Control, Automation and Robotics
170