Figure 3: It shows a sample of image for each class of the testing set. In the first line, it shows the original images and
in the second line, it shows the rotated image. From left to right column, the classes are: BACKGROUND Google, Faces,
Faces Easy, Leopards and Motorbikes.
The proposed approach got 76.8% of accuracy against
43.2% of accuracy using the BoW + spatial pyramids.
Almost 78% better when compared to the traditional
approach.
7 CONCLUSIONS
AND FUTURE WORK
In this work, we treat the rotation invariance in the
object categorization problem. The state-of-art meth-
ods used to deal the problem do not treat the pres-
ence of rotation objects in the scene. This property is
important for some kinds of objects and scenes. We
propose, for spatial quantization, a pyramid that uses
a collection of circular concentric areas and showed
to be more robust to rotation of objects in the scene.
When compared to the traditional spatial pyramids,
the results showed very promising results improving
the accuracy in almost 78% in a scenary of pres-
ence of rotation of objects. As future work, we are
planning to test some other parameters for the pro-
posed approach like a different number of levels for
the pyramid, different size of radius in each level of
the pyramid, different descriptors like SURF that is
reported to be more robust to rotation than SIFT and
PHOW and to test in a different and more challeng-
ing database. In a more realistic scenario, just some
objects were rotated while background remains un-
changed. Still, rotation can occur around different
points in 3D space.
ACKNOWLEDGEMENTS
We would like to thanks the reviewers for the valuable
contributions. The authors are partially supported by
CNPq, the Brazilian National Research Council.
REFERENCES
Bosch, A., Zisserman, A., and Munoz, X. (2007). Image
classification using random forests and ferns. In 11th
International Conference on Computer Vision, pages
1–8, Rio de Janeiro, Brazil.
Cortes, C. and Vapnik, V. (1995). Support-vector networks.
Machine Learning, 20(3):273.
Csurka, G., Dance, C., Fan, L., Willamowski, J., and Bray,
C. (2004). Visual categorization with bags of key-
points. In ECCV International Workshop on Statis-
tical Learning in Computer Vision, Prague, Czech Re-
public.
Cula, G. and Dana, J. (2001). Compact representation of
bidirectional texture functions. In IEEE Computer
Society Conference on Computer Vision and Pattern
Recognition, pages 1041–1047, Kauai, USA. IEEE
Computer Society.
Deng, J., Berg, A., Li, K., and Fei-Fei, L. (2010). What does
classifying more than 10,000 image categories tell us?
Computer Vision–ECCV 2010, pages 71–84.
Duda, R. O., Hart, P. E., and Stork, D. G. (2001). Pattern
Classification. John Wiley and Sons.
Fei-Fei, L., Fergus, R., and Perona, P. (2006). One-
shot learning of object categories. IEEE Transac-
tions On Pattern Analysis and Machine Intelligence,
28(4):594–611.
Lazebnik, S., Schmid, C., and Ponce, J. (2006). Beyond
bags of features: Spatial pyramid matching for rec-
ognizing natural scene categories. In Computer Vi-
sion and Pattern Recognition, 2006 IEEE Computer
Society Conference on, pages 2169–2178, New York,
USA.
Sivic, J., Russell, R., Efros, A., Zisserman, A., and Free-
man, W. (2005). Discovering objects and their loca-
tion in images. In Computer Vision, 2005. ICCV 2005.
Tenth IEEE International Conference on, pages 370–
377, San Diego, USA. IEEE Computer Society.
Szeliski, R. (2011). Computer Vision: Algorithms and Ap-
plications. Springer-Verlag.
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., and Gong, Y.
(2010). Locality-constrained linear coding for image
classification. In Computer Vision and Pattern Recog-
nition, IEEE Computer Society Conference on, pages
3360–3367, San Francisco, USA. IEEE Computer So-
ciety.
VISAPP2013-InternationalConferenceonComputerVisionTheoryandApplications
692