Lanna Dharma Printed Character Recognition using k-Nearest Neighbor and Conditional Random Fields
Chutima Chueaphun, Atcharin Klomsae, Sanparith Marukatat, Jeerayut Chaijaruwanich
2012
Abstract
For centuries, in the North of Thailand, many books of Lanna Dharma characters had been printed. These books are the important sources of the knowledge of ancient Lanna wisdom. At present, the books are found old and damaged. Most of characters are rough and not clear according to its early printing technology at that time. Moreover, some sets of characters are relatively very similar which cause the difficulty to recognize them. This paper proposes a Lanna Dharma printed character recognition technique using k-Nearest Neighbor and Conditional Random Fields. The accuracy of recognition rate is about 82.61 percent.
References
- Alkhateeb, J. H., Khelifi, F., Jiang, J. and Ipson, S. S. (2009). A new approach for off-line handwritten Arabic word recognition using KNN classifier. Proc. ICSIPA, 191 -194.
- Duda, R. O. and Hart, P. (1972). Use of the hough transformation to detect lines and curves in pictures. Comm. ACM, 11-15.
- Duda, R. O. and Hart, P. (1973). Pattern classification and scene analysis. Wiley, New York.
- Feng, S., Manmatha, R. and Mccallum, A. (2006). Exploring the use of conditional random field models and hmms for historical handwritten document recognition. Proc. 2nd DIAL, 30-37.
- Holambe, A. N., Holambe, S. N. and Thool, R. C. (2010). Comparative study of Devanagari handwritten and printed character & numerals recognition using Nearest-Neighbor classifiers. Proc. 3rd ICCSIT, 426- 430.
- Lafferty, J., Mccallum, A. and Pereira, F. (2001). Conditional random fields: Probabilistic models for segmenting and labeling sequential data. Proc. 18th ICML, 282-289.
- Otsu, N. (1979). A threshold selection method from graylevel histograms. IEEE Trans. on System, Man, and Cybernetics, 9, 62-66.
- Shetty, S., Srinivasan, H. and Srihari, S. (2007). Handwritten word recognition using conditional random fields. Proc. 9th ICDAR, 2, 1098 - 1102.
- Shoombuatong, W., Traisathit, P., Prasitwattanaseree, S., Tayapiwatana, C., Cutler, R. and Chaijaruwanich, J. (2011). Prediction of the disulphide bonding state of cysteines in proteins using conditional random fields. International Journal of Data Mining and Bioinformatics, 5, 449-464.
- Subpaiboonkit, S., Thammarongtham, C. and Chaijaruwanich, J. (2012). RNA Family Classification Using the Conditional Random Fields Model. Chiang Mai Journal of Science, 39.
- Sutton, C. and Mccallum, A. (2007). Introduction to Statistical Relational Learning. MIT Press.
- Zhou, X. D., Liu, C. L. and Nakagawa, M. (2009). Online Handwritten Japanese Character String Recognition Using Conditional Random Fields. Proc. 10th ICDAR, 521-525.
Paper Citation
in Harvard Style
Chueaphun C., Klomsae A., Marukatat S. and Chaijaruwanich J. (2012). Lanna Dharma Printed Character Recognition using k-Nearest Neighbor and Conditional Random Fields . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012) ISBN 978-989-8565-29-7, pages 169-174. DOI: 10.5220/0004112801690174
in Bibtex Style
@conference{kdir12,
author={Chutima Chueaphun and Atcharin Klomsae and Sanparith Marukatat and Jeerayut Chaijaruwanich},
title={Lanna Dharma Printed Character Recognition using k-Nearest Neighbor and Conditional Random Fields},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)},
year={2012},
pages={169-174},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004112801690174},
isbn={978-989-8565-29-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)
TI - Lanna Dharma Printed Character Recognition using k-Nearest Neighbor and Conditional Random Fields
SN - 978-989-8565-29-7
AU - Chueaphun C.
AU - Klomsae A.
AU - Marukatat S.
AU - Chaijaruwanich J.
PY - 2012
SP - 169
EP - 174
DO - 10.5220/0004112801690174