ing model evaluation. This proves that the optimised
evaluation process of this detector is very beneficial
for the total evaluation time. As we also could ob-
serve in the speed results of table 3, the calculation of
the rotation map is very computation intensive com-
pared to the other parts of the evaluation pipeline. An
improvement in its evaluation speed would be of great
impact on the speed-up it can attain.
Figure 16: The deviation of calculation time between rota-
tion approaches.
5 CONCLUSION
5.1 Conclusion
In this paper we propose a two-step approach to im-
prove the processing speed of object detection under
rotation. As a first step we trained multiple models to
cover all needed orientations. This allows to eliminate
the need to rotate the image, and allows to reuse the
feature pyramid for all models. Next to that we use
sample points to extract orientation information over
the image. This allows us to reduce the number of
models that have to be evaluated at each location. We
used the ACF-detection frameworkfor its fast training
and evaluation speed and aquired still a speed-up by
using a rotation map. We compare to the baseline of
evaluating multiple rotations of the image, and aquire
a speed-up of 8.2 times, while maintaining high accu-
racy.
5.2 Future Work
As a follow-up of this work, we plan to test our ap-
proach with other object detection algorithms, such as
DPM. Next to that we will try to eliminate the accu-
racy loss by improving the NMS-algorithm and work
on a better approximation technique for the rotation
map. We also plan to apply the rotation invariance in
multiple cases such as surveillance applications and
hand detection.
REFERENCES
Bay, H., Ess, A., Tuytelaars, T., and Van Gool, L. (2008).
Speeded-up robust features (surf). Comput. Vis. Image
Underst., 110(3):346–359.
Benenson, R., Mathias, M., Timofte, R., and Van Gool, L.
(2012). Pedestrian detection at 100 frames per second.
In Computer Vision and Pattern Recognition (CVPR),
2012 IEEE Conference on, pages 2903–2910. IEEE.
Benenson, R., Mathias, M., Tuytelaars, T., and Van Gool, L.
(2013). Seeking the strongest rigid detector. In Com-
puter Vision and Pattern Recognition (CVPR), 2013
IEEE Conference on, pages 3666–3673. IEEE.
CAVIAR (2003). Caviar: context aware vision using image-
based active recognition.
Dalal, N. and Triggs, B. (2005). Histograms of oriented gra-
dients for human detection. In Computer Vision and
Pattern Recognition, 2005. CVPR 2005. IEEE Com-
puter Society Conference on, volume 1, pages 886–
893. IEEE.
De Smedt, F., Van Beeck, K., Tuytelaars, T., and Goedem´e,
T. (2013). Pedestrian detection at warp speed: Ex-
ceeding 500 detections per second. In Computer Vi-
sion and Pattern Recognition Workshops (CVPRW),
2013 IEEE Conference on, pages 622–628. IEEE.
Doll´ar, P. (2013). Piotrs image and video matlab toolbox
(pmt). Software available at: http://vision. ucsd. edu/˜
pdollar/toolbox/doc/index. html.
Doll´ar, P., Appel, R., Belongie, S., and Perona, P. (2014).
Fast feature pyramids for object detection. IEEE
Trans. Pattern Anal. Mach. Intell., 36(8):1532–1545.
Doll´ar, P., Belongie, S., and Perona, P. (2010). The fastest
pedestrian detector in the west. In BMVC.
Doll´ar, P., Tu, Z., Perona, P., and Belongie, S. (2009). Inte-
gral channel features. In BMVC, volume 2, page 5.
Felzenszwalb, P., McAllester, D., and Ramanan, D. (2008).
A discriminatively trained, multiscale, deformable
part model. In Computer Vision and Pattern Recog-
nition, 2008. CVPR 2008. IEEE Conference on, pages
1–8. IEEE.
Felzenszwalb, P. F., Girshick, R. B., and McAllester, D.
(2010). Cascade object detection with deformable part
models. In Computer vision and pattern recognition
(CVPR), 2010 IEEE conference on, pages 2241–2248.
IEEE.
Lowe, D. G. (1999). Object recognition from local scale-
invariant features. In Computer vision, 1999. The pro-
ceedings of the seventh IEEE international conference
on, volume 2, pages 1150–1157. Ieee.
Mathias, M., Timofte, R., Benenson, R., and Van Gool, L.
(2013). Traffic sign recognitionhow far are we from
the solution? In Neural Networks (IJCNN), The 2013
International Joint Conference on, pages 1–8. IEEE.
Mittal, A., Zisserman, A., and Torr, P. H. (2011). Hand
detection using multiple proposals. In BMVC, pages
1–11. Citeseer.
Viola, P. and Jones, M. J. (2004). Robust real-time face
detection. International journal of computer vision,
57(2):137–154.
FastRotationInvariantObjectDetectionwithGradientbasedDetectionModels
407