Japanese Scene Character Recognition using Random Image Feature
and Ensemble Scheme
Fuma Horie
1
and Hideaki Goto
2
1
Graduate School of Information Sciences, Tohoku University, Sendai, Japan
2
Cyberscience Center, Tohoku University, Sendai, Japan
Keywords:
Random Image Feature, Japanese Scene Character Recognition, Synthetic Scene Character Data, Ensemble
Voting Classifier, Multi-Layer Perceptron.
Abstract:
Scene character recognition is challenging and difficult owing to various environmental factors at image cap-
turing and complex design of characters. Japanese character recognition requires a large number of scene
character images for training since thousands of character classes exist in the language. In order to enhance
the Japanese scene character recognition, we utilized a data augmentation method and an ensemble scheme
in our previous work. In this paper, Random Image Feature (RI-Feature) method is newly proposed for im-
proving the ensemble learning. Experimental results show that the accuracy has been improved from 65.57%
to 78.50% by adding the RI-Feature method to the ensemble learning. It is also shown that HOG feature
outperforms CNN in the Japanese scene character recognition.
1 INTRODUCTION
Recognition of text information in the scene, which is
often referred to as scene character recognition, has
some important applications such as automatic driv-
ing system and automatic translation. Scene charac-
ter recognition is more difficult in comparison with
printed character recognition as there are various fac-
tors such as rotation, geometric distortion, uncon-
trolled lighting, blur, noise and complex design of
characters in the scene images. Japanese scene char-
acter recognition requires a large number of training
data since thousands of character classes exist in the
language. However, collecting a large number char-
acter image samples in real scenes is a hard task.
Some previous researches introduced a data aug-
mentation method using Synthetic Scene character
Data (SSD) which is randomly generated by some
particular algorithms such as filter processing, mor-
phology operation, color change, and geometric dis-
tortion from the font sets of printed characters (Jader-
berg et al., 2014)(Ren et al., 2016)(Jiang and Goto,
2017)(Horie and Goto, 2018). Jader et al. and Ren
et al. have shown that the accuracy of the deep neu-
ral network model can be improved by adding SSD
to the training data. It has been proved that the aug-
mentation methods are effective for improving the ac-
curacy of the scene character recognition. Figure 1
shows some examples of the Japanese characters in
natural scenes. In our previous work (Jiang and Goto,
2017)(Horie and Goto, 2018), we developed a train-
ing datasets consisting of both Real Scene character
Data (RSD) and SSD. The ensemble scheme is used
to improve the generalization ability of the classifier.
For further improvements of the generalization
ability, Random Image Feature (RI-Feature) method
is newly proposed in this paper. The RI-Feature
method is to randomly process an image before ex-
tracting character features and it is applied to each
classifier by different parameters. It is expected that
the RI-Feature method will make the generalization
ability higher. Moreover, we propose a new ensemble
scheme using Multi-Layer Perceptron (MLP) in this
paper. Experimental results show the effectiveness of
RI-Feature method and MLP.
Convolutional Neural Network (CNN) has
achieved a remarkable performance in various image
recognition tasks including also the scene character
recognition. However, the CNN needs a large number
of high-quality training data. It is thought that CNN
is not able to achieve high accuracy in the scene char-
acter recognition when it suffers from the shortage
of training data. Especially, Japanese scene character
datasets currently available are far from enough
to train the CNN. Some previous scene character
recognition systems use HOG feature since it has
414
Horie, F. and Goto, H.
Japanese Scene Character Recognition using Random Image Feature and Ensemble Scheme.
DOI: 10.5220/0007341904140420
In Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2019), pages 414-420
ISBN: 978-989-758-351-3
Copyright
c
2019 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved