5.3.2 Runtime Analysis
The runtime of our approach can be estimated from
the constituent processes namely, preprocessing and
feature extraction. The preprocessing phase depends
on Otsu binarization, and morphological
skeletonizing operators, which are in general
,
where is the size of the image. As regards the
feature extraction phase, we deal with contour
points laid out in a circle. Each movement of the
contour depends on computing external and internal
forces acting on each point. Luckily, the iterative
optimization method used in modelling snake
evolution uses just one computation (involving
matrix inversion) of internal force matrix whose size
depends on . External image forces involve
computation of image gradients in horizontal and
vertical directions and need
operations in one
pass. Finally, we iterate over evolution steps to get
the snake to its terminal shape. Since, and are
fixed prior to running the algorithm and is usually
very small compared with the size of image, the total
cost turns out to be
.
6 CONCLUSIONS
In this paper we put forth a novel feature to
holistically solve natural scene character recognition
problem that avoids dependency on specific features.
Through our results we showed the potential of using
our novel feature to better capture shape and font
variations in scene character images. We got better
results than several baseline methods and achieved
improved recognition performance on the datasets
using leave-random-one-out cross-validation,
showing the importance of feature-independency and
preservation of spatial correlations in recognition.
In future we hope to get state-of-the-art
performance using better image segmentation
methods and also optimizing other parameters of
contour evolution. We also look forward to using
contour evolution on grayscale images directly.
REFERENCES
Ali, M., and Foroosh, H., 2015. Natural Scene Character
Recognition without Dependency on Specific Features.
In Proceedings of the International Conference on
Computer Vision Theory and Applications (VISAPP),
Berlin, Germany, March 2015.
Chen, X., and Yuille, A., 2004. Detecting and reading text
in natural scenes. In Computer Vision and Pattern
Recognition, 2004. CVPR 2004. IEEE 2004. Vol. 2. pp.
II–366.
Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh,
B., Wang, T., Wu, D., and Ng, A., 2011. Text detection
and character recognition in scene images with
unsupervised feature learning. In International
Conference on Document Analysis and Recognition
(ICDAR), 2011. IEEE 2011, pp. 440–445.
Dalal, N., and Triggs, B., 2005. Histograms of oriented
gradients for human detection. In International
Conference on Computer Vision and Pattern
Recognition (CVPR) 2005. IEEE 2005, pp.886-893.
de Campos, T. E., Babu, B. R., and Varma, M., 2009.
Character recognition in natural images. In Proceedings
of the International Conference on Computer Vision
Theory and Applications (VISAPP), Lisbon, Portugal,
February 2009. pp. 273–280.
Donoser, M., Bischof, H., and Wagner, S., 2008. Using web
search engines to improve text recognition. In 19th
International Conference on Pattern Recognition,
ICPR 2008. Vol. no. 14, pp. 8-11.
Hazan, T., Polak, S., and Shashua, A., 2005. Sparse Image
Coding using a 3D Non-negative Tensor Factorization.
In International Conference on Computer Vision
(ICCV), 2005. IEEE 2005. Vol. 1, pp. 50–57.
Field, J., and Learned-Miller, E., 2013. Improving Open-
Vocabulary Scene Text Recognition. In International
Conference on Document Analysis and Recognition
(ICDAR) 2013. IEEE 2013, pp. 604–608.
Ivins, J., and Porrill J., 2000. Everything you always wanted
to know about snakes. AIVRU Technical Memo 86, July
1993 (Revised June 1995; March 2000).
Kass, M., Witkin, A., and Terzopoulos, D. 1987. Snakes:
Active contour models. International Journal of
Computer Vision. v. 1, n. 4, pp. 321-331.
Kita, K., and Wakahara, T., 2010. Binarization of color
characters in scene images using k-means clustering
and support vector machines. In International
Conference on Pattern Recognition (ICPR), 2010.
IEEE 2010, pp. 3183–3186.
Lucas, S. M., Panaretos, A., Sosa, L., Tang, A., Wong, S.,
and Young, R., 2003. ICDAR 2003 robust reading
competitions. In Proceedings of the Seventh
International Conference on Document Analysis and
Recognition 2003. IEEE 2003. Vol. 2, pp. 682–687.
Mishra, A., Alahari, K., and Jawahar, C., 2011. An MRF
model for binarization of natural scene text. In
International Conference on Document Analysis and
Recognition (ICDAR), 2011. IEEE 2011, pp. 11–16.
Nagy, R., Dicker, A., and Meyer-Wegener, K., 2011.
NEOCR: A Configurable Dataset for Natural Image
Text Recognition.
In CBDAR Workshop, ICDAR 2011,
pp. 53-58.
Neumann, L., and Matas, J., 2011. A method for text
localization and recognition in real-world images. In
Computer Vision–ACCV 2010, pp. 770–783.
Niblack, W., 1985. An introduction to digital image
processing. Strandberg Publishing Company.
Otsu, N., 1979. A Threshold Selection Method from Gray-
Level Histogram. In Trans. System, Man and