and fraktur using lstm networks. In Document Analy-
sis and Recognition (ICDAR), 2013 12th International
Conference on, pages 683–687. IEEE.
Chowdhury, A. and Vig, L. (2018). An efficient end-to-end
neural model for handwritten text recognition. arXiv
preprint arXiv:1807.07965.
Cowell, J. and Hussain, F. (2003). Amharic character recog-
nition using a fast signature based algorithm. In In-
formation Visualization, 2003. IV 2003. Proceedings.
Seventh International Conference on, pages 384–389.
IEEE.
Das, A., Li, J., Ye, G., Zhao, R., and Gong, Y. (2019).
Advancing acoustic-to-word ctc model with attention
and mixed-units. IEEE/ACM Transactions on Au-
dio, Speech, and Language Processing, 27(12):1880–
1892.
Ghader, H. and Monz, C. (2017). What does attention in
neural machine translation pay attention to? arXiv
preprint arXiv:1710.03348.
Gondere, M. S., Schmidt-Thieme, L., Boltena, A. S., and
Jomaa, H. S. (2019). Handwritten amharic charac-
ter recognition using a convolutional neural network.
arXiv preprint arXiv:1909.12943.
Graves, A. (2008). Supervised sequence labelling with re-
current neural networks [ph. d. dissertation]. Techni-
cal University of Munich, Germany.
Graves, A., Liwicki, M., Bunke, H., Schmidhuber, J., and
Fern
´
andez, S. (2008). Unconstrained on-line hand-
writing recognition with recurrent neural networks. In
Advances in neural information processing systems,
pages 577–584.
Huang, W., He, D., Yang, X., Zhou, Z., Kifer, D., and Giles,
C. L. (2016). Detecting arbitrary oriented text in the
wild with a visual attention model. In Proceedings of
the 24th ACM international conference on Multime-
dia, pages 551–555.
Huang, Y., Luo, C., Jin, L., Lin, Q., and Zhou, W. (2019).
Attention after attention: Reading text in the wild with
cross attention. In 2019 International Conference on
Document Analysis and Recognition (ICDAR), pages
274–280. IEEE.
Lee, C.-Y. and Osindero, S. (2016). Recursive recurrent
nets with attention modeling for ocr in the wild. In
Proceedings of the IEEE Conference on Computer Vi-
sion and Pattern Recognition, pages 2231–2239.
Li, Z., Jin, L., Lai, S., and Zhu, Y. (2020). Improv-
ing attention-based handwritten mathematical expres-
sion recognition with scale augmentation and drop at-
tention. In 2020 17th International Conference on
Frontiers in Handwriting Recognition (ICFHR), pages
175–180. IEEE.
Luong, M.-T., Pham, H., and Manning, C. D. (2015). Ef-
fective approaches to attention-based neural machine
translation. arXiv preprint arXiv:1508.04025.
Ly, N.-T., Nguyen, C.-T., Nguyen, K.-C., and Nakagawa,
M. (2017). Deep convolutional recurrent network for
segmentation-free offline handwritten japanese text
recognition. In 2017 14th IAPR International Con-
ference on Document Analysis and Recognition (IC-
DAR), volume 7, pages 5–9. IEEE.
Maitra, D. S., Bhattacharya, U., and Parui, S. K. (2015).
Cnn based common approach to handwritten character
recognition of multiple scripts. In Document Analy-
sis and Recognition (ICDAR), 2015 13th International
Conference on, pages 1021–1025. IEEE.
Martınek, J., Lenc, L., and Kr
´
al, P. (2020). Building an
efficient ocr system for historical documents with little
training data.
Mekuria, G. T. and Mekuria, G. T. (2018). Amharic text
document summarization using parser. International
Journal of Pure and Applied Mathematics, 118(24).
Meshesha, M. (2008). Recognition and retrieval from doc-
ument image collections. PhD thesis, IIIT Hyderabad,
India.
Meshesha, M. and Jawahar, C. (2007). Optical character
recognition of amharic documents. African Journal of
Information & Communication Technology, 3(2).
Messina, R. and Louradour, J. (2015). Segmentation-free
handwritten chinese text recognition with lstm-rnn.
In 2015 13th International Conference on Document
Analysis and Recognition (ICDAR), pages 171–175.
IEEE.
Meyer, R. (2006). Amharic as lingua franca in ethiopia.
Lissan: Journal of African Languages and Linguis-
tics, 20(1/2):117–132.
Mondal, M., Mondal, P., Saha, N., and Chattopadhyay, P.
(2017). Automatic number plate recognition using cnn
based self synthesized feature learning. In Calcutta
Conference (CALCON), 2017 IEEE, pages 378–381.
IEEE.
Poulos, J. and Valle, R. (2017). Character-based handwrit-
ten text transcription with attention networks. arXiv
preprint arXiv:1712.04046.
Reta, B. Y., Rana, D., and Bhalerao, G. V. (2018). Amharic
handwritten character recognition using combined
features and support vector machine. In 2018 2nd In-
ternational Conference on Trends in Electronics and
Informatics (ICOEI), pages 265–270. IEEE.
Teferi, D. (1999). Optical character recognition of typewrit-
ten amharic text. Master’s thesis, School of Informa-
tion studies for Africa, Addis Ababa.
Watanabe, S., Hori, T., Kim, S., Hershey, J. R., and Hayashi,
T. (2017). Hybrid ctc/attention architecture for end-
to-end speech recognition. IEEE Journal of Selected
Topics in Signal Processing, 11(8):1240–1253.
Wion, A. (2006). The national archives and library of
ethiopia: six years of ethio-french cooperation (2001-
2006).
Wu, Y.-C., Yin, F., Chen, Z., and Liu, C.-L. (2017). Hand-
written chinese text recognition using separable multi-
dimensional recurrent neural network. In 2017 14th
IAPR International Conference on Document Analy-
sis and Recognition (ICDAR), volume 1, pages 79–84.
IEEE.
Zhang, J., Du, J., and Dai, L. (2018). Track, attend,
and parse (tap): An end-to-end framework for on-
line handwritten mathematical expression recogni-
tion. IEEE Transactions on Multimedia, 21(1):221–
233.
A Blended Attention-CTC Network Architecture for Amharic Text-image Recognition
441