Authors:
N. Nikolaou
and
N. Papamarkos
Affiliation:
Democritus University of Thrace, Greece
Keyword(s):
Color document segmentation, RGB color space, Mean shift, Edge preserving smoothing.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Biomedical Engineering
;
Biomedical Signal Processing
;
Computer Vision, Visualization and Computer Graphics
;
Data Manipulation
;
Health Engineering and Technology Applications
;
Human-Computer Interaction
;
Image and Video Analysis
;
Image Filtering
;
Image Formation and Preprocessing
;
Methodologies and Methods
;
Neurocomputing
;
Neurotechnology, Electronics and Informatics
;
Pattern Recognition
;
Physiological Computing Systems
;
Segmentation and Grouping
;
Sensor Networks
;
Soft Computing
Abstract:
In this paper we present a new method for color segmentation of complex document images which can be used as a preprocessing step of a text information extraction application. From the edge map of an image, we choose a representative set of samples of the input color image and built the 3D histogram of the RGB color space. These samples are used to locate a relatively large number of proper points in the 3D color space and use them in order to initially reduce the colors. From this step an oversegmented image is produced which usually has no more than 100 colors. To extract the final result, a mean shift procedure starts from the calculated points and locates the final color clusters of the RGB color distribution. Also, to overcome noise problems, a proposed edge preserving smoothing filter is used to enhance the quality of the image. Experimental results showed the method’s capability of producing correctly segmented complex color documents while removing background noise or low con
trast objects which is very desirable in text information extraction applications. Additionally, our method has the ability to cluster randomly shaped distributions.
(More)