Baseline Estimation in Arabic Handwritten Text-Line - Evaluation on AHTID/MW Database

Anis Mezghani, Slim Kanoun, Souhir Bouaziz, Maher Khemakhem, Haikal El Abed

Abstract

Baseline extraction is one of the most important phases for handwriting recognition. Due to the complexity of the Arabic scripts, baseline detection of Arabic handwritten text-lines is a difficult task compared to other languages. In this work, a method which combines some baseline extraction techniques used in literature was presented to provide a fine estimation of baseline in Arabic handwritten text-lines. For evaluation purpose, the AHTID/MW database was extended by a baseline ground truth annotation. The database is freely available for researchers worldwide which enable other researchers to test their baseline detection systems.

References

  1. Al-Badr, B., Mahmoud, S., 1995. Survey and bibliography of Arabic optical text recognition. Signal Processing. Vol.41(1): 49-77.
  2. Al-Rashaideh, H., 2006. Preprocessing phase for Arabic Word Handwritten Recognition. Electronic Scientific Journal. Vol.6 (1): 11-19.
  3. Al-Shatnawi, A., Omar, K., 2008. Methods of Arabic Baseline Detection -The State of Art. International Journal of Computer Science and Network Security. Vol.8 (10):137-142.
  4. Amin, A., 1998. Off-line Arabic character recognition: the state of the art. Pattern Recognition. Vol.31(5): 517- 530.
  5. Boubaker, H., Kherallah, M., Alimi, M. A.,2009. New Algorithm of Straight or Curved Baseline Detection for Short Arabic Handwritten Writing. International Conference on Document Analysis and Recognition. 778-782.
  6. Burrow, P., 2004. Arabic handwriting recognition, Thesis. University of Edinburgh. England.
  7. Côté, M., Chériet, M., Suen, C., Lecolinet, E., 1996. Détection des Lignes de Base de Mots Cursifs à l'aide de l'Entropie. Colloque sur l'Intelligence Artificielle dans les Technologies de l'Information.
  8. Elgammal, A. M., Ismail, M. A., 2001. A Graph-Based Segmentation and Feature Extraction Framework for Arabic Text Recognition. International Conference on Document Analysis and Recognition. 622-626.
  9. El-Hajj, R., Likforman-Sulem, L., A., Mokbe, C., 2005. Arabic Handwriting Recognition Using Baseline Dependant Features and Hidden Markov Modeling. International Conference on Document Analysis and Recognition. 893-897.
  10. Farooq, F., Govindaraju, V., Perrone, M., 2005. Preprocessing Methods for Handwritten Arabic Documents. International Conference on Document Analysis and Recognition. 267-271.
  11. Lemaitre, A., Camillerapp, J., Coüasnon, B., 2009. Multiscript Baseline Detection Using Perceptive Vision. Biennial Conference of the International Graphonomics Society.
  12. Likforman-Sulem, L., Hanimyan, A., Faure, C., 1995. A Hough based algorithm for extracting text lines in handwritten documents. International Conference on Document Analysis and Recognition. 774-777.
  13. Menasri, F., Vincent, N., Augustin, E., Cheriet, M., 2008. Un système de reconnaissance de mots arabes manuscrits hors-ligne sans signes diacritiques. Conférence Internationale francophone sur l'écrit et le document.
  14. Mezghani, A., Kanoun, S., Khemakhem, M., El Abed, H., 2012. A Database for Arabic Handwritten Text Image Recognition and Writer Identification. International Conference on Frontiers in Handwriting Recognition. 397-400.
  15. Nagabhushan, P., Alaei, A., 2010. Tracing and Straightening the Baseline in Handwritten Persian/Arabic Text-line: A New Approach Based on Painting-technique. International Journal on Computer Science and Engineering. Vol.2 (4): 907- 916.
  16. Pechwitz, M., Märgner, V., 2003. HMM Based approach for handwritten Arabic Word Recognition Using the IFN/ENIT DataBase. International Conference on Document Analysis and Recognition. 890-894.
  17. Pechwitz, M., Märgner, V., 2002. Baseline Estimation For Arabic Handwritten Words. Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition. 479-484.
  18. Thomé, S., 1978. Prétraitement du chiffre manuscrit. Congrès AFCET, France. 568-576.
  19. Toumazet, J. J., 1990. Traitement de l'image par l'exemple, Sybex.
  20. Ziaratban, M., Faez, K., 2008. A Novel Two-Stage Algorithm for Baseline Estimation and Correction in Farsi and Arabic Handwritten Text line. International Conference on Pattern Recognition. 1-5.
Download


Paper Citation


in Harvard Style

Mezghani A., Kanoun S., Bouaziz S., Khemakhem M. and El Abed H. (2013). Baseline Estimation in Arabic Handwritten Text-Line - Evaluation on AHTID/MW Database . In Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-8565-41-9, pages 430-434. DOI: 10.5220/0004218704300434


in Bibtex Style

@conference{icpram13,
author={Anis Mezghani and Slim Kanoun and Souhir Bouaziz and Maher Khemakhem and Haikal El Abed},
title={Baseline Estimation in Arabic Handwritten Text-Line - Evaluation on AHTID/MW Database},
booktitle={Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2013},
pages={430-434},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004218704300434},
isbn={978-989-8565-41-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Baseline Estimation in Arabic Handwritten Text-Line - Evaluation on AHTID/MW Database
SN - 978-989-8565-41-9
AU - Mezghani A.
AU - Kanoun S.
AU - Bouaziz S.
AU - Khemakhem M.
AU - El Abed H.
PY - 2013
SP - 430
EP - 434
DO - 10.5220/0004218704300434