As seen in Table 1, finetuning the complete 
network by minimizing 
 improves the overall 
performance. Moreover, by enforcing the network to 
produce a good GCM (i.e. set a high value to ), the 
error rates of the predicted word transcription 
decrease even more.   
6  CONCLUSION 
In this paper, we present a robust approach to 
recognize handwritten texts on Khmer historical 
documents. The proposed approach utilizes the 
glyph class map (GCM) constructed using the glyph 
annotation which contains information about the 
structure, position, and identity of each glyph in the 
word image to be recognized. Two main modules, 
the GCM generator and the GCM encoder-decoder 
are developed to generate the GCM which is to be 
encoded into a context vector and also local contexts 
representing the input word image before being 
decoded into the final transcription. Our approach 
shows promising results evaluated on data extracted 
from SleukRith set, a publicly available dataset 
constructed on digitized Khmer palm leaf 
documents. 
ACKNOWLEDGEMENTS 
This research study is supported by ARES-CCD 
(program AI 2014-2019) under the funding of 
Belgian university cooperation. 
REFERENCES 
Bahdanau, D., Kyunghyun, C. & Yoshua, B., 2014. Neural 
machine translation by jointly learning to align and 
translate. arXiv preprint arXiv:1409.0473. 
Ding, H. et al., 2017. A Compact CNN-DBLSTM Based 
Character Model For Offline Handwriting Recognition 
with Tucker Decomposition. The 14th IAPR 
International Conference on Document Analysis and 
Recognition (ICDAR), pp. 507-512. 
Graves, A., Fernándex, S. & Schmidhuber, J., 2007. 
Multi-Dimensional Recurrent Neural Networks. The 
International Conference on Artificial Neural 
Networks. 
Graves, A., Fernández, S., Gomez, F. & Schmidhuber, J., 
2006. Connectionist temporal classification: labelling 
unsegmented sequence data with recurrent neural 
networks.  The 23rd international conference on 
Machine learning, pp. 369-376. 
Graves, A. & Schmidhuber, J., 2009. Offline handwriting 
recognition with multidimensional recurrent neural 
networks. Advances in neural information processing 
systems. 
Kesiman, M. W. A. et al., 2018. Benchmarking of 
Document Image Analysis Tasks for Palm Leaf 
Manuscripts from Southeast Asia. Journal of Imaging, 
4(2), p. 43. 
Kingma, D. P. & Ba, J., 2014. Adam: A method for 
stochastic optimization. arXiv preprint 
arXiv:1412.6980. 
Valy, D., Verleysen, M., Chhun, S. & Burie, J.-C., 2017. 
A New Khmer Palm Leaf Manuscript Dataset for 
Document Analysis and Recognition: SleukRith Set. 
The 4th International Workshop on Historical 
Document Imaging and Processing, pp. 1-6. 
Valy, D., Verleysen, M., Chhun, S. & Burie, J.-C., 2018. 
Character and Text Recognition of Khmer Historical 
Palm Leaf Manuscripts. The 16th International 
Conference on Frontiers in Handwritting Recognition. 
Voigtlaender, P., Doetsch, P. & Ney, H., 2016. 
Handwriting recognition with large multidimensional 
long short-term memory recurrent neural networks. 
The 15th International ConferenceIn Frontiers in 
Handwriting Recognition (ICFHR), pp. 228-233. 
Wang, W. et al., 2018. DenseRAN for Offline 
Handwritten Chinese Character Recognition. The 16th 
International Conference on Frontiers in 
Handwritting Recognition, pp. 104-109. 
Wu, Y. et al., 2016. Google’s Neural Machine Translation 
System: Bridging the Gap between Human and 
Machine Translation. arXiv preprint 
arXiv:1609.08144. 
Wu, Y.-C., Yin, F., Chen, Z. & Liu, C.-L., 2017. 
Handwritten Chinese Text Recognition Using 
Separable Multi-Dimensional Recurrent Neural 
Network. The 14th IAPR International Conference on 
Document Analysis and Recognition (ICDAR), pp. 79-
84.