RECOGNITION OF TEXT WITH KNOWN GEOMETRIC AND GRAMMATICAL STRUCTURE

Jan Rathouský, Martin Urban, Vojtěch Franc

Abstract

The optical character recognition (OCR) module is a fundamental part of each automated text processing system. The OCR module translates an input image with a text line into a string of symbols. In many applications (e.g. license plate recognition) the text has some a priori known geometric and grammatical structure. This article proposes an OCR method exploiting this knowledge which restricts the set of possible strings to a limited set of feasible combinations. The recognition task is formulated as maximization of a similarity function which uses character templates as reference. These templates are estimated by a support vector machine method from a set of examples. In contrast to the common approach, the proposed method performs character segmentation and recognition simultaneously. The method was successfully evaluated in a car license plate recognition system.

References

  1. Franc, V. and Hlavác?, V. (2006). A novel algorithm for learning support vector machines with structured output spaces. Research Report K333-22/06, CTUCMP-2006-04, Department of Cybernetics, Faculty of Electrical Engineering Czech Technical University, Prague, Czech Republic.
  2. Ko, M.-A. and Kim, Y.-M. (2003). License plate surveillance system using weighted template matching. In Proceedings of the 32nd Applied Imagery Pattern Recognition Workshop (AIPR'03). IEEE Computer Society.
  3. LeCun, Y., Bottou, L., Bengio, Y., and Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324.
  4. Lee, H.-J., Chen, S.-Y., and Wang, S.-Z. (2004). Extraction and recognition of license plates of motorcycles and vehicles on highways. In Proceedings of the 17th International Conference on Pattern Recognition (ICPR'04). IEEE Computer Society.
  5. Rahman, C. A., Badawy, W., and Radmanesh, A. (2003). A real time vehicle's license plate recognition system. In Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance (AVSS'03). IEEE Computer Society.
  6. Savchynskyy, B. and Kamotskyy, O. (2006). Character templates learning for textual image recognition as an example of learning in structural recognition. In Proceedings of the Second International Conference on Document Image Analysis for Libraries (DIAL'06), pages 88-95. IEEE Computer Society.
  7. Shapiro, V. and Gluhchev, G. (2004). Multinational license plate recognition system: Segmentation and classification. In Proceedings of the 17th International Conference on Pattern Recognition, volume 4, pages 352- 355.
  8. Tsochantaridis, I., Joachims, T., Hofmann, T., and Altun, Y. (2005). Large margin methods for structured and interdependent output variables. Journal of Machine Learning Research, 6:1453-1484.
  9. Vapnik, V. (1998). Statistical Learning Theory. John Wiley & Sons, Inc.
Download


Paper Citation


in Harvard Style

Rathouský J., Urban M. and Franc V. (2008). RECOGNITION OF TEXT WITH KNOWN GEOMETRIC AND GRAMMATICAL STRUCTURE . In Proceedings of the Third International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2008) ISBN 978-989-8111-21-0, pages 194-199. DOI: 10.5220/0001086501940199


in Bibtex Style

@conference{visapp08,
author={Jan Rathouský and Martin Urban and Vojtěch Franc},
title={RECOGNITION OF TEXT WITH KNOWN GEOMETRIC AND GRAMMATICAL STRUCTURE},
booktitle={Proceedings of the Third International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2008)},
year={2008},
pages={194-199},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001086501940199},
isbn={978-989-8111-21-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2008)
TI - RECOGNITION OF TEXT WITH KNOWN GEOMETRIC AND GRAMMATICAL STRUCTURE
SN - 978-989-8111-21-0
AU - Rathouský J.
AU - Urban M.
AU - Franc V.
PY - 2008
SP - 194
EP - 199
DO - 10.5220/0001086501940199