A COMPLETE SYSTEM FOR DETECTION AND RECOGNITION OF TEXT IN GRAPHICAL DOCUMENTS USING BACKGROUND INFORMATION

Partha Pratim Roy, Josep Lladós, Umapada Pal

2009

Abstract

Automatic Text/symbols retrieval in graphical documents (map, engineering drawing) involves many challenges because they are not usually parallel to each other. They are multi-oriented and curve in nature to annotate the graphical curve lines and hence follow a curvi-linear way too. Sometimes, text and symbols frequently touch/overlap with graphical components (river, street, border line) which enhances the problem. For OCR of such documents we need to extract individual text lines and their corresponding words/characters. In this paper, we propose a methodology to extract individual text lines and an approach for recognition of the extracted text characters from such complex graphical documents. The methodology is based on the foreground and background information of the text components. To take care of background information, water reservoir concept and convex hull have been used. For recognition of multi-font, multi-scale and multi-oriented characters, Support Vector Machine (SVM) based classifier is applied. Circular ring and convex hull have been used along with angular information of the contour pixels of the characters to make the feature rotation and scale invariant.

References

  1. Adam, S., Ogier, J. M., Carlon, C., Mullot, R., Labiche, J., and Gardes, J. (2000). Symbol and character recognition: application to engineering drawing. In International Journal on Document Analysis and Recognition (IJDAR).
  2. Ahmed, M. and Ward, R. (2002). A rotation invariant rulebased thinning algorithm for character recognition. In IEEE Transactions on Pattern Analysis and Machine Intelligence.
  3. Cao, R. and Tan, C. (2001). Text/graphics separation in maps. In Proc. of International Workshop on Graphics Recognition (GREC).
  4. Fletcher, L. A. and Kasturi, R. (1988). A robust algorithm for text string separation from mixed text/graphics images. In IEEE Transactions on Pattern Analysis and Machine Intelligence.
  5. Goto, H. and Aso, H. (1999). Extracting curved lines using local linearity of the text line. In International Journal on Document Analysis and Recognition (IJDAR).
  6. Hase, H., Shinokawa, T., Yoneda, M., and Suen, C. Y. (2003). Recognition of rotated characters by eigenspace. In International Conference on Document Analysis and Recognition (ICDAR).
  7. Hones, F. and Litcher, J. (1994). Layout extraction of mixed mode documents. In Machine vision and applications.
  8. Loo, P. K. and C.L.Tan (2002). Word and sentence extraction using irregular pyramid. In Document Analysis Systems.
  9. Luo, H., Agam, G., and Dinstein, I. (1995). Directional mathematical morphology approach for line thinning and extraction of character strings from maps and line drawings. In International Conference on Document Analysis and Recognition (ICDAR).
  10. Pal, U., Belad, A., and Choisy, C. (2003). Touching numeral segmentation using water reservoir concept. In Pattern Recognition Letters.
  11. Pal, U., Kimura, F., Roy, K., and Pal, T. (2006). Recognition of english multi-oriented characters. In International Conference on Pattern Recognition.
  12. Pal, U. and Roy, P. P. (2004). Multi-oriented and curved text lines extraction from indian documents. In IEEE Transactions on SMC - Part B.
  13. Roy, K., Pal, U., and Chaudhuri, B. B. (2004). A system for joining and recognition of broken bangla numerals for indian postal automation. In In Proc. of 4th ICVGIP.
  14. Roy, P. P. (2007). An approach to text/graphics separation from color maps. In M.S. thesis, CVC, UAB, Barcelona.
  15. Roy, P. P., Pal, U., Llados, J., and Kimura, F. (2008a). Convex hull based approach for multi-oriented character recognition from graphical documents. In International Conference on Pattern Recognition.
  16. Roy, P. P., Pal, U., Llados, J., and Kimura, F. (2008b). Multi-oriented english text line extraction using background and foreground information. In Proc. of Document Analysis Systems.
  17. Roy, P. P., Vazquez, E., Llados, J., Baldrich, R., and Pal, U. (2007). A system to segment text and symbols from color maps. In Proc. of International Workshop on Graphics Recognition (GREC).
  18. Tombre, K., Tabbone, S., Peissier, L., Lamiroy, B., and Dosch, P. (2002). Text /graphics separation revisited. In Proc. of Document Analysis Systems.
  19. Vapnik, V. (1995). The nature of statistical learning theory. Springer Verlang.
  20. Xie, Q. and Kobayashi, A. (1991). A construction of pattern recognition system invariant of translation, scalechange and rotation transformation of pattern. In Trans. of the Society of Instrument and Control Engineers.
Download


Paper Citation


in Harvard Style

Pratim Roy P., Lladós J. and Pal U. (2009). A COMPLETE SYSTEM FOR DETECTION AND RECOGNITION OF TEXT IN GRAPHICAL DOCUMENTS USING BACKGROUND INFORMATION . In Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2009) ISBN 978-989-8111-69-2, pages 209-216. DOI: 10.5220/0001801902090216


in Bibtex Style

@conference{visapp09,
author={Partha Pratim Roy and Josep Lladós and Umapada Pal},
title={A COMPLETE SYSTEM FOR DETECTION AND RECOGNITION OF TEXT IN GRAPHICAL DOCUMENTS USING BACKGROUND INFORMATION},
booktitle={Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2009)},
year={2009},
pages={209-216},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001801902090216},
isbn={978-989-8111-69-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2009)
TI - A COMPLETE SYSTEM FOR DETECTION AND RECOGNITION OF TEXT IN GRAPHICAL DOCUMENTS USING BACKGROUND INFORMATION
SN - 978-989-8111-69-2
AU - Pratim Roy P.
AU - Lladós J.
AU - Pal U.
PY - 2009
SP - 209
EP - 216
DO - 10.5220/0001801902090216