shows the excellent usage of the LDA combination of
depth similarity and gradient features for human de-
tection in depth images.
6 CONCLUSIONS
To reduce the complexity of the RDSF calculation
and to keep the performance just by usage of mean
and variance features on the depth image, delivers not
sufficiently accurate results. The computational time
of the MV-RDSF is very fast, but the accuracy of the
original RDSF cannot be reached. Since, the feature
space of the MV-RDSF is not significantly enough to
separate the dataset, a LDA combination between the
classical HOG feature and the MV-RDSF shows very
good attributes to solve the problem. The time com-
plexity of the MV-HOG-RDSF is better than of the
RDSF (Ikemura and Fujiyoshi, 2010) and less fea-
tures are selected (needed) in the training to separate
the positive examples from the negative ones. Fur-
thermore, just the depth image, and not the intensity
or the RGB image, is needed, for a good classifica-
tion result. The complexity of computing the inte-
gral images and features add up to a fast classification.
Based on the LDA combination, just one classifier is
needed and not a decision fusion of different classi-
fiers, which are using just one of both feature types,
as it has been done in (Wang et al., 2012).
Further research could try to combine other depth
features with each other to reach more and more
the best possible classification result. Still on focus
should be the time complexity of the used features to
produce as fast as possible classifiers for real time ap-
plications, where real time means to ensure the com-
putation of all methods inside the frame rate of the
underlying sensor, in this case 33ms.
ACKNOWLEDGEMENT
The research of the project ”Move and See” leading
to these results has received funding from the Min-
istry of Health, Equalities, Care and Ageing of North
Rhine-Westphalia (MGEPA), Germany and the Euro-
pean Union.
REFERENCES
Dalal, N. and Triggs, B. (2005). Histograms of Oriented
Gradients for Human Detection. In Proceedings of
the IEEE Computer Society Conference on Computer
Vision and Pattern Recognition (CVPR’05), volume 1,
pages 886–893, San Diego,California,USA.
Doll
´
ar, P., Wojek, C., Schiele, B., and Perona, P. (2012).
Pedestrian Detection: An Evaluation of the State of
the Art. IEEE Transactions on Pattern Analysis and
Machine Learning, 34(4):743 – 761.
Enzweiler, M. and Gavrila, D. M. (2009). Monocular
Pedestrian Detection: Survey and Experiments. IEEE
Transactions on Pattern Analysis and Machine Intel-
ligence, 31(12):2179 – 2195.
Ikemura, S. and Fujiyoshi, H. (2010). Real-Time Hu-
man Detection using Relational Depth Similarity Fea-
tures. In Proceedings of the 10th Asian Conference on
Computer Vision (ACCV’10), pages 25 – 38, Queen-
stown,New Zealand.
Izenman, A. J. (2008). Modern Multivariate Statistical
Techniques - Regression, Classification, and Manifold
Learning. Springer New York.
Keller, C. G., Enzweiler, M., and Gavrila, D. M. (2011). A
New Benchmark for Stereo-Based Pedestrian Detec-
tion. In IEEE Intelligent Vehicles Symposium (IV’11),
pages 691 – 696, Baden-Baden,Germany.
Mattheij, R., Postma, E., van den Hurk, Y., and Spronck,
P. (2012). Depth-based detection using haar-like fea-
tures. In Proceedings of the 24th BENELUX Confer-
ence on Artificial Intelligence (BNAIC’12), pages 162
– 169, Maastricht,Netherlands.
Nunn, C., M
¨
uller, D., Meuter, M., M
¨
uller-Schneiders,
S., and Kummert, A. (2009). An Improved Ad-
aboost Learning Scheme using Lda Features for Ob-
ject Recognition. In Proceedings of the 12th In-
ternational IEEE Conference on Intelligent Trans-
portation Systems (ITSC’09), pages 486 – 491, St.
Louis,MO,USA.
Shafait, F., Keysers, D., and Breuel, T. M. (2008). Effi-
cient Implementation of Local Adaptive Threshold-
ing Techniques Using Integral Images. In Yanikoglu,
B. A. and Berkner, K., editors, Proceedings SPIE
6815,Document Recognition and Retrieval XV.
Spinello, L. and Arras, K. O. (2011). People detection
in RGB-D data. In Proceedings of the IEEE/RSJ
International Conference on Intelligent Robots and
Systems (IROS’11), pages 3838 – 3843, San Fran-
cisco,CA,USA.
Viola, P. and Jones, M. (2001a). Fast and Robust Classi-
fication using Asymmetric AdaBoost and a Detector
Cascade. In Advances in Neural Information Process-
ing System 14, pages 1311 – 1318. MIT Press.
Viola, P. and Jones, M. (2001b). Rapid Object Detection
Using a Boosted Cascade of Simple Features. In Pro-
ceedings of the IEEE Computer Society Conference on
Computer Vision and Pattern Recognition (CVPR’01),
volume 1, pages 511 – 518, Kauai,HI,USA.
Wang, N., Gong, X., and Liu, J. (2012). A New Depth
Descriptor for Pedestrian Detection in RGB-D Im-
ages. In Proceedings of the 21st International Confer-
ence on Pattern Recognition (ICPR’12), pages 3688 –
3691, Tsukuba,Japan.
Wu, S., Yu, S., and Chen, W. (2011). An attempt to pedes-
trian detection in depth images. In Proccedings of the
3rd Chinese Conference on Intelligent Visual Surveil-
lance (IVS’11), pages 97 – 100, Beijing,China.
VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications
356