Text Recognition on Khmer Historical Documents using Glyph Class Map Generation with Encoder-Decoder Model

Dona Valy, Michel Verleysen, Sophea Chhun

Abstract

In this paper, we propose a handwritten text recognition approach on word image patches extracted from Khmer historical documents. The network consists of two main modules composing of deep convolutional and multi-dimensional recurrent blocks. We utilize the annotated information of glyph components in the word image to build a glyph class map which is to be predicted by the first module of the network call glyph class map generator. The second module of the network encodes the generated glyph class map and transform it into a context vector which is to be decoded to produce the final word transcription. We also adapt an attention mechanism to the decoder to take advantage of local contexts which are also provided by the encoder. Experiments on a publicly available dataset of digitized Khmer palm leaf manuscripts called SleukRith set are conducted.

Download


Paper Citation


in Harvard Style

Valy D., Verleysen M. and Chhun S. (2019). Text Recognition on Khmer Historical Documents using Glyph Class Map Generation with Encoder-Decoder Model.In Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-351-3, pages 749-756. DOI: 10.5220/0007555507490756


in Bibtex Style

@conference{icpram19,
author={Dona Valy and Michel Verleysen and Sophea Chhun},
title={Text Recognition on Khmer Historical Documents using Glyph Class Map Generation with Encoder-Decoder Model},
booktitle={Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2019},
pages={749-756},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007555507490756},
isbn={978-989-758-351-3},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Text Recognition on Khmer Historical Documents using Glyph Class Map Generation with Encoder-Decoder Model
SN - 978-989-758-351-3
AU - Valy D.
AU - Verleysen M.
AU - Chhun S.
PY - 2019
SP - 749
EP - 756
DO - 10.5220/0007555507490756