A MULTISCALE OPERATOR FOR DOCUMENT IMAGE BINARIZATION

Neucimar Jerônimo Leite, Leyza Baldo Dorini

Abstract

Basically, document image binarization consists on the segmentation of scanned gray level images into text and background, and is a basic preprocessing stage in many image analysis systems. It is essential to threshold the document image reliably in order to extract useful information and make further processing such as character recognition and feature extraction. The main difficulties arise when dealing with poor quality document images, containing nonuniform illumination, shadows and smudge, for example. This paper presents an efficient morphological-based document image binarization technique that is able to cope with these problems. We evaluate the proposed approach for different classes of images, such as historical and machine-printed documents, obtaining promising results.

References

  1. Bosworth, J. and Acton, S. (2003). Morphological scalespace in image processing. Digital Signal Processing, 13:338-367.
  2. Dorini, L. E. B. and Leite, N. J. (2007). A scale-space toggle operator for morphological segmentation. In 8th International Symposium on Mathematical Morphology, pages 101-112.
  3. Dorini, L. E. B. and Leite, N. J. (2008). Multiscale image representation using scale-space theory. In XXXI Congresso Nacional de Matemtica Aplicada e Computacional, pages 130-137.
  4. Gatos, B., Pratikakis, I., and Perantonis, S. (2006). Adaptative degraded image binarization. Pattern Recognition, 39:317-327.
  5. Jackway, P. T. and Deriche, M. (1996). Scale-space properties of the multiscale morphological dilation-erosion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 18:38-51.
  6. Maragos, P. and Meyer, F. (2000). A pde approach to nonlinear image simplification via levelings andreconstruction filters. In International Conference on Image Processing, pages 938-941.
  7. Niblack, W. (1986). An Introduction to Digital Image Processing. Prentice Hall.
  8. Otsu, N. (1979). A threshold selection method from greylevel histograms. IEEE Transactions on Systems, Man and Cybernetics, 9(1):377-393.
  9. Parker, J. R. (1996). Algorithms for Image Processing and Computer Vision. Wiley.
  10. Sahoo, P., Soltani, S., and Wong, A. (1988). A survey of thresholding techniques. Comput. Vision, Graphics Image Processing, 41(2):233260.
  11. Sauvola, J. and Pietikainen, M. (2000). Adaptive document image binarization. Pattern Recognition, 33:225-236.
  12. Serra, J. and Vicent, L. (1992). An overview of morphological filtering. Circuits, Systems and Signal Processing, 11(1):47-108.
  13. Sezgin, M. and Sankur, B. (2004). Survey over image thresholding techniques and quantitative performance evaluation. J. Electron. Imaging, 13:146-165.
  14. Trier, O. and Jain, A. (1995). Goal-directed evaluation of binarization methods. IEEE Trans. Pattern Anal. Mach. Intell., 17:1191-1201.
  15. Witkin, A. P. (1984). Scale-space filtering: a new approach to multi-scale description. In Image Understanding, pages 79-95. Ablex.
Download


Paper Citation


in Harvard Style

Jerônimo Leite N. and Baldo Dorini L. (2009). A MULTISCALE OPERATOR FOR DOCUMENT IMAGE BINARIZATION . In Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2009) ISBN 978-989-8111-69-2, pages 34-39. DOI: 10.5220/0001779000340039


in Bibtex Style

@conference{visapp09,
author={Neucimar Jerônimo Leite and Leyza Baldo Dorini},
title={A MULTISCALE OPERATOR FOR DOCUMENT IMAGE BINARIZATION},
booktitle={Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2009)},
year={2009},
pages={34-39},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001779000340039},
isbn={978-989-8111-69-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2009)
TI - A MULTISCALE OPERATOR FOR DOCUMENT IMAGE BINARIZATION
SN - 978-989-8111-69-2
AU - Jerônimo Leite N.
AU - Baldo Dorini L.
PY - 2009
SP - 34
EP - 39
DO - 10.5220/0001779000340039