As seen in Table 1, finetuning the complete
network by minimizing
improves the overall
performance. Moreover, by enforcing the network to
produce a good GCM (i.e. set a high value to ), the
error rates of the predicted word transcription
decrease even more.
6 CONCLUSION
In this paper, we present a robust approach to
recognize handwritten texts on Khmer historical
documents. The proposed approach utilizes the
glyph class map (GCM) constructed using the glyph
annotation which contains information about the
structure, position, and identity of each glyph in the
word image to be recognized. Two main modules,
the GCM generator and the GCM encoder-decoder
are developed to generate the GCM which is to be
encoded into a context vector and also local contexts
representing the input word image before being
decoded into the final transcription. Our approach
shows promising results evaluated on data extracted
from SleukRith set, a publicly available dataset
constructed on digitized Khmer palm leaf
documents.
ACKNOWLEDGEMENTS
This research study is supported by ARES-CCD
(program AI 2014-2019) under the funding of
Belgian university cooperation.
REFERENCES
Bahdanau, D., Kyunghyun, C. & Yoshua, B., 2014. Neural
machine translation by jointly learning to align and
translate. arXiv preprint arXiv:1409.0473.
Ding, H. et al., 2017. A Compact CNN-DBLSTM Based
Character Model For Offline Handwriting Recognition
with Tucker Decomposition. The 14th IAPR
International Conference on Document Analysis and
Recognition (ICDAR), pp. 507-512.
Graves, A., Fernándex, S. & Schmidhuber, J., 2007.
Multi-Dimensional Recurrent Neural Networks. The
International Conference on Artificial Neural
Networks.
Graves, A., Fernández, S., Gomez, F. & Schmidhuber, J.,
2006. Connectionist temporal classification: labelling
unsegmented sequence data with recurrent neural
networks. The 23rd international conference on
Machine learning, pp. 369-376.
Graves, A. & Schmidhuber, J., 2009. Offline handwriting
recognition with multidimensional recurrent neural
networks. Advances in neural information processing
systems.
Kesiman, M. W. A. et al., 2018. Benchmarking of
Document Image Analysis Tasks for Palm Leaf
Manuscripts from Southeast Asia. Journal of Imaging,
4(2), p. 43.
Kingma, D. P. & Ba, J., 2014. Adam: A method for
stochastic optimization. arXiv preprint
arXiv:1412.6980.
Valy, D., Verleysen, M., Chhun, S. & Burie, J.-C., 2017.
A New Khmer Palm Leaf Manuscript Dataset for
Document Analysis and Recognition: SleukRith Set.
The 4th International Workshop on Historical
Document Imaging and Processing, pp. 1-6.
Valy, D., Verleysen, M., Chhun, S. & Burie, J.-C., 2018.
Character and Text Recognition of Khmer Historical
Palm Leaf Manuscripts. The 16th International
Conference on Frontiers in Handwritting Recognition.
Voigtlaender, P., Doetsch, P. & Ney, H., 2016.
Handwriting recognition with large multidimensional
long short-term memory recurrent neural networks.
The 15th International ConferenceIn Frontiers in
Handwriting Recognition (ICFHR), pp. 228-233.
Wang, W. et al., 2018. DenseRAN for Offline
Handwritten Chinese Character Recognition. The 16th
International Conference on Frontiers in
Handwritting Recognition, pp. 104-109.
Wu, Y. et al., 2016. Google’s Neural Machine Translation
System: Bridging the Gap between Human and
Machine Translation. arXiv preprint
arXiv:1609.08144.
Wu, Y.-C., Yin, F., Chen, Z. & Liu, C.-L., 2017.
Handwritten Chinese Text Recognition Using
Separable Multi-Dimensional Recurrent Neural
Network. The 14th IAPR International Conference on
Document Analysis and Recognition (ICDAR), pp. 79-
84.