Authors:
Partha Pratim Roy
1
;
Josep Lladós
1
and
Umapada Pal
2
Affiliations:
1
Universitat Autònoma de Barcelona, Spain
;
2
Indian Statistical Institute, India
Keyword(s):
Graphics Recognition, Optical Character Recognition, Convex Hull, Skeleton Analysis.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Biomedical Engineering
;
Biomedical Signal Processing
;
Computer Vision, Visualization and Computer Graphics
;
Data Manipulation
;
Feature Extraction
;
Features Extraction
;
Health Engineering and Technology Applications
;
Human-Computer Interaction
;
Image and Video Analysis
;
Informatics in Control, Automation and Robotics
;
Methodologies and Methods
;
Neurocomputing
;
Neurotechnology, Electronics and Informatics
;
Pattern Recognition
;
Physiological Computing Systems
;
Sensor Networks
;
Signal Processing, Sensors, Systems Modeling and Control
;
Soft Computing
Abstract:
Automatic Text/symbols retrieval in graphical documents (map, engineering drawing) involves many challenges because they are not usually parallel to each other. They are multi-oriented and curve in nature to annotate the graphical curve lines and hence follow a curvi-linear way too. Sometimes, text and symbols frequently touch/overlap with graphical components (river, street, border line) which enhances the problem. For OCR of such documents we need to extract individual text lines and their corresponding words/characters. In this paper, we propose a methodology to extract individual text lines and an approach for recognition of the extracted text characters from such complex graphical documents. The methodology is based on the foreground and background information of the text components. To take care of background information, water reservoir concept and convex hull have been used. For recognition of multi-font, multi-scale and multi-oriented characters, Support Vector Machine (SVM)
based classifier is applied. Circular ring and convex hull have been used along with angular information of the contour pixels of the characters to make the feature rotation and scale invariant.
(More)