easily extendable by a more advanced classifier cas-
cade (e.g. (Viola and Michael, 2001)) or a cascade of
increasingly more complex classifiers (e.g. (Heisele
et al., 2001)). These in turn could be combined with
efficient subwindow search techniques (e.g. (Lampert
et al., 2008; An et al., 2010)). We demonstrated the
combination of two feature types. However, in the
same way the two feature types have been combined,
it can easily be extended by an arbitrary number of ad-
ditional features. We think that the presented results
encourage further investigations in this direction and
we will investigate ways of incorporating additional
features based on, for example, motion, depth clues,
color, etc.
ACKNOWLEDGEMENTS
We thank the five reviewers for their comments to im-
prove the manuscript. T.B. is supported by a scholar-
ship from the Graduate School of Mathematical Anal-
ysis of Evolution, Information, and Complexity at
Ulm University. H.N. and T.B. are supported in part
by the Transregional Collaborative Research Centre
SFB/TRR 62 “Companion-Technology for Cognitive
Technical Systems” funded by the German Research
Foundation (DFG). We greatly appreciate the compu-
tational ressources provided by the (bwGRID, 2011).
REFERENCES
Agarwal, S., Awan, A., and Roth, D. (2004a). Learning
to Detect Objects in Images via a Sparse, Part-Based
Representation. TPAMI, 26(11):1475–1490.
Agarwal, S., Awan, A., and Roth, D. (2004b). UIUC
Image Database for Car Detection download page.
http://l2r.cs.uiuc.edu/∼cogcomp/Data/Car/. [Online;
accessed 27-Mar.-2010].
Amit, Y., Geman, D., and Fan, X. (2004). A Coarse-to-
Fine Strategy for Multiclass Shape Detection. TPAMI,
26(12):1606–21.
An, S., Peursum, P., Liu, W., Venkatesh, S., and Chen,
X. (2010). Exploiting Monge Structures in Optimum
Subwindow Search. In CVPR.
bwGRID (2011). member of the German D-Grid
initiative, funded by the Ministry for Education
and Research (Bundesministerium f
¨
ur Bildung und
Forschung) and the Ministry for Science, Research
and Arts Baden-Wuerttemberg (Ministerium f
¨
ur Wis-
senschaft, Forschung und Kunst Baden-W
¨
urttem-
berg). http://www.bw-grid.de. [Online; accessed 13-
Apr.-2011].
Dalal, N. and Triggs, B. (2005). Histograms of Oriented
Gradients for Human Detection. In CVPR, volume 1,
pages 886–893.
Enzweiler, M. and Gavrila, D. M. (2009). Monocu-
lar Pedestrian Detection: Survey and Experiments.
TPAMI, 31(12):2179–95.
Fan, R., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., and Lin,
C.-J. (2008). LIBLINEAR: A Library for Large Lin-
ear Classification. JMLR, 9:1871–1874.
Fradkin, D. and Muchnik, I. (2006). Support Vector Ma-
chines for Classification. DIMACS Series in Dis-
crete Mathematics and Theoretical Computer Science,
70:13–20.
Freund, Y. and Schapire, R. E. (1997). A Decision-
Theoretic Generalization of On-Line Learning and an
Application to Boosting. Journal of Computer and
System Sciences, 55:119–139.
Fritz, M., Leibe, B., Caputo, B., and Schiele, B. (2005).
Integrating Representative and Discriminative Models
for Object Category Detection. In ICCV.
Hamker, F. H. (2005). The Emergence of Attention by
Population-based Inference and its Role in Distributed
Processing and Cognitive Control of Vision. Com-
puter Vision and Image Understanding, 100:64–106.
Heisele, B., Serre, T., Mukherjee, S., and Poggio, T. (2001).
Feature Reduction and Hierarchy of Classifiers for
Fast Object Detection in Video Images. In CVPR.
Jiang, X., Rosen, E., Zeffiro, T., Vanmeter, J., Blanz, V., and
Riesenhuber, M. (2006). Evaluation of a Shape-Based
Model of Human Face Discrimination using FMRI
and Behavioral Techniques. Neuron, 50(1):159–72.
Lampert, C. H., Blaschko, M. B., and Hofmann, T. (2008).
Beyond Sliding Windows: Object Localization by Ef-
ficient Subwindow Search. In CVPR, pages 1–8.
Mutch, J. and Lowe, D. G. (2008). Object Class Recogni-
tion and Localization using sparse Features with lim-
ited Receptive Fields. IJCV, 80(1):45–57.
Pedersoli, M., Gonz
`
alez, J., Bagdanov, A. D., and Vil-
lanueva, J. J. (2010). Recursive Coarse-to-Fine Local-
ization for fast Object Detection. In ECCV, volume 6.
Riesenhuber, M. and Poggio, T. (1999a). Are Cortical Mod-
els Really Bound by the “Binding Problem”? Neuron,
24:87–93.
Riesenhuber, M. and Poggio, T. (1999b). Hierarchical Mod-
els of Object Recognition in Cortex. Nature Neuro-
science, 2(11):1019–1025.
Schyns, P. G. and Oliva, A. (1994). From Blobs To Bound-
ary Edges: Evidence for Time- and Spatial-Scale-
Dependent Scene Recognition. Psychological Sci-
ence, 5(4):195–200.
Serre, T., Wolf, L., and Poggio, T. (2005). Object Recog-
nition with Features inspired by Visual Cortex. In
CVPR, pages 994–1000.
Viola, P. and Michael, J. (2001). Rapid Object Detection us-
ing a Boosted Cascade of Simple Features. In CVPR.
Zhu, Q., Avidan, S., Yeh, M.-C., and Cheng, K.-T. (2006).
Fast Human Detection Using a Cascade of Histograms
of Oriented Gradients. In CVPR.
ICPRAM 2012 - International Conference on Pattern Recognition Applications and Methods
288