ometric Blur and Shape Context, consistently outper-
formed SIFT as well as the appearance based features.
This is not surprising since the appearance of a char-
acter in natural images can vary a lot but the shape
remains somewhat consistent.
We also presented preliminary results on recog-
nizing Kannada characters but the problem appears to
be extremely challenging and could perhaps benefit
from a compositional or hierarchical approach given
the large number of visually distinct classes.
ACKNOWLEDGEMENTS
We would like to acknowledge the help of several vol-
unteers who annotated the datasets presented in this
paper. In particular, we would like to thank Arun,
Kavya, Ranjeetha, Riaz and Yuvraj. We would also
like to thank Richa Singh and Gopal Srinivasa for de-
veloping tools for annotation.
REFERENCES
Belongie, S., Malik, J., and Puzicha, J. (2002). Shape
matching and object recognition using shape contexts.
IEEE Transactions on Pattern Analysis and Machine
Intelligence.
Berg, A. C., Berg, T. L., and Malik, J. (2005). Shape match-
ing and object recognition using low distortion corre-
spondence. In Proc IEEE Conf on Computer Vision
and Pattern Recognition, San Diego CA, June 20-25.
Brown, M. S., Sun, M., Yang, R., yun, L., and Seales, W. B.
(2007). Restoring 2d content from distorted docu-
ments. IEEE Transactions on Pattern Analysis and
Machine Intelligence.
Clark, P. and Mirmehdi, M. (2002). Recognising text in real
scenes. International Journal on Document Analysis
and Recognition, 4:243–257.
Jin, Y. and Geman, S. (2006). Context and hierarchy in
a probabilistic image model. In Proc IEEE Conf on
Computer Vision and Pattern Recognition, New York
NY, June 17-22.
Johnson, A. E. and Herbert, M. (1999). Using spin images
for efficient object recognition in cluttered 3d scenes.
IEEE Transactions on Pattern Analysis and Machine
Intelligence, 21(5):433–449.
Jurie, F. and Triggs, B. (2005). Creating efficient codebooks
for visual recognition. In Proceedings of the IEEE
International Conference on Computer Vision.
Kise, K. and Doermann, D. S., editors (2007). Proceedings
of the Second International Workshop on Camera-
based Document Analysis and Recognition CBDAR,
Curitiba, Brazil. http://www.imlab.jp/cbdar2007/.
Krempp, A., Geman, D., and Amit, Y. (2002). Sequential
learning of reusable parts for object detection. Techni-
cal report, Computer Science Department, Johns Hop-
kins University.
Kumar, S., Gupta, R., Khanna, N., Chaudhury, S., and
Joshi, S. (2007). Text extraction and document image
segmentation using matched wavelets and mrf model.
IEEE Transactions on Image Processing, 16(8):2117–
2128.
Lazebnik, S., Schmid, C., and Ponce, J. (2005). A sparse
texture representation using local affine regions. IEEE
Transactions on Pattern Analysis and Machine Intel-
ligence, 27(8):1265–1278.
le Cun, Y., Bottou, L., Bengio, Y., and Haffner, P. (1998).
Gradient-based learning applied to document recogni-
tion. Proceedings of the IEEE, 86(11):2278–2324.
Lowe, D. G. (1999). Object recognition from local scale-
invariante features. In Proc 7th Int Conf on Computer
Vision, Corfu, Greece.
Pal, U., Sharma, N., Wakabayashi, T., and Kimura, F.
(2007). Off-line handwritten character recognition of
devnagari script. In International Conference on Doc-
ument Analysis and Recognition (ICDAR), pages 496–
500, Curitiba, PR, Brazil. IEEE.
Plamondon, R. and Srihari, S. N. (2000). On-line and off-
line handwriting recognition: A comprehensive sur-
vey. IEEE Transactions on Pattern Analysis and Ma-
chine Intelligence, 22(1):63–84.
Tu, Z., Chen, X., Yuille, A. L., and Zhu, S. C. (2005). Im-
age parsing: Unifying segmentation, detection, and
recognition. International Journal of Computer Vi-
sion, Marr Prize Issue.
Varma, M. and Ray, D. (2007). Learning the discrimina-
tive power-invariance trade-off. In Proceedings of the
IEEE International Conference on Computer Vision,
Rio de Janeiro, Brazil.
Varma, M. and Zisserman, A. (2002). Classifying images of
materials: Achieving viewpoint and illumination inde-
pendence. In Proceedings of the 7th European Con-
ference on Computer Vision, Copenhagen, Denmark,
volume 3, pages 255–271. Springer-Verlag.
Varma, M. and Zisserman, A. (2003). Texture classifica-
tion: Are filter banks necessary? In Proceedings of
the IEEE Conference on Computer Vision and Pattern
Recognition.
Weinman, J. J. and Learned Miller, E. (2006). Improving
recognition of novel input with similarity. In Proc
IEEE Conf on Computer Vision and Pattern Recog-
nition, New York NY, June 17-22.
Zhang, H., Berg, A. C., Maire, M., and Malik, J. (2006).
SVM-KNN: Discriminative nearest neighbor classifi-
cation for visual category recognition. In Proc IEEE
Conf on Computer Vision and Pattern Recognition,
New York NY, June 17-22.
VISAPP 2009 - International Conference on Computer Vision Theory and Applications
280