A Multi Class Classiﬁcation to Detect Original Form of Kaomoji using

Neural Network

Noriyuki Okumura

1 a

and Rei Okumura

Faculty of Modern Social Studies, Otemae University, Japan

Advanced Course of Mechanical and Electronic System Engineering,

National Institute of Technology, Akashi College, Japan

Keywords:

Kaomoji, Original Form, Neural Network, Middle Layer.

Abstract:

In this paper, we propose a multi-class classiﬁcation method for Kaomoji using feed forward neural network.

Neural network has some units in each layer, but the suitable number of units is not clear. This research

investigated the relation between the number of units and the accuracy of multi-class classiﬁcation method.

1 INTRODUCTION

In this paper, we report on the estimation of the ori-

ginal form of Kaomoji, emoticons of Japanese style,

which is one of the classiﬁcation tasks of Kaomoji.

Kaomojis are represented not only by ASCII charac-

ters that are half-width characters but also by com-

binations of various characters, including full-width

characters such as Japanese characters (Kanji, Hira-

gana, and Katakana). At present, the total number of

Kaomoji has already exceeded over 100,000. Besides,

it is also possible to compose Kaomoji of shapes that

do not ﬁt on one line by composing Unicode. Accor-

ding to Kazama et al., it is possible to extract millions

of face Kaomoji by using Twitter logs (Kazama et al.,

2016). On the other hand, any researchers did not

establish a framework for comprehensively treating

Kaomoji, and each researcher has only a large-scale

dictionary of Kaomoji.

In this paper, in order to put together a wide va-

riety of Kaomoji, we aim to deﬁne the original form

of Kaomoji and estimate the original form from arbi-

trary Kaomoji. We have already proposed a method

using neural networks as a method for estimating Ka-

omoji, but we did not verify the validity of the number

of units to be used in the middle layer. Therefore, in

consideration of the possibility that certain Kaomoji,

especially theatrical type Kaomoji, have multiple ori-

ginal forms, the number of units in the middle layer

in multiclass classiﬁcation is investigated.

As a result of the research experiment, we conﬁr-

https://orcid.org/0000-0003-1149-1645

med that by preparing 6,500 units in the case of the

middle layer, we could obtain the best results in terms

of learning time and accuracy rate.

2 RELATED WORK

Bedrick et al. try to detect Kaomoji using PCFG

(Probabilistic context-free grammar). (Bedrick et al.,

2012) The target of extracting Kaomoji is tweets pos-

ted on Twitter. This method only uses the rules of

PCFG (Probabilistic context-free grammar) and does

not deﬁne the original form of Kaomoji.

Kazama et al. proposed Kaomoji detection algori-

thm. (Kazama et al., 2016)(in Japanese) their method

is to analyze articles posted on SNS such as Twitter

and extract Kaomoji-like strings. By the method of

Kazama et al., it is possible to extract character strings

that can be regarded as a large number of Kaomoji as

similar as Bedrick et al.. Their method, however, only

rules the composition of Kaomoji and is not suitable

for grouping Kaomoji as in this paper.

Ptaszynski et al. proposed CAO system as a fra-

mework for the comprehensive treatment of Kaomoji.

(Ptaszynski et al., 2010) The CAO system not only ex-

tracts the character string that can be regarded as Ka-

omoji but also aims to extract the emotion that the ex-

tracted Kaomoji expresses. In particular, this method

extracts Kaomoji using the eye-mouth-eye sequence

(triplet) as the basis of Kaomoji as a feature of the Ka-

omoji. This method is similar to this research in that

it deﬁnes a basic string representing Kaomoji. As we

Okumura, N. and Okumura, R.

A Multi Class Classiﬁcation to Detect Original Form of Kaomoji using Neural Network.

DOI: 10.5220/0008366203770382

In Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019), pages 377-382

ISBN: 978-989-758-382-7

377

have not made detailed deﬁnitions such as (Okumura,

2016) for considering symmetry, it is not suitable for

grouping.

The author’s team is developing a classiﬁcation

method for Kaomoji using neural networks and cosine

similality. (Okumura and Okumura, 2018)(Okumura,

2017) As a result, it is possible to extract the only ori-

ginal form inferred from any Kaomoji with about 70

% accuracy rate. On the other hand, we do not im-

plement a multiclass classiﬁcation for Kaomoji with

multiple primitives.

In this paper, we aim to construct a system that

can be estimated even if it belongs to multiple classes

by labeling a string that can be regarded as Kaomoji

as a class called an original form. Also, in this paper,

we examine the number of units in the middle layer

of the neural network, which is necessary to estimate

the original form from the character string that can be

regarded as a Kaomoji.

3 THE METHOD TO ESTIMATE

THE ORIGINAL FORM OF

KAOMOJI CORRESPONDING

TO MULTI-CLASS

CLASSIFICATION

Our conventional method aims at outputting one of

3,110 original form of Kaomoji. However, for exam-

ple,

( ) " o(* *)o ( )

in the case of such Kaomoji, the system cannot judge

which original form to extract because of fo including

three types of character sequences considered to re-

present a face. Therefore, extracting just one original

form is not enough as a grouping, and it is necessary

to identify all the faces included in the Kaomoji-like

character sequence and its original form.

In this paper, we implement multiclass classiﬁca-

tion using ﬁxed-length input feed-forward neural ne-

twork using the character Embedding. In the previous

example, we have to construct a model that is correct

if we can extract the three original forms ( ( )

, ( ) , ( ) , strictly, ( ) has

appeared twice, so our system have to extract two ty-

pes of original form of Kaomoji).

3.1 Multiclass Classiﬁcation of Kaomoji

The neural network used in this paper is a simple mo-

del with only one middle layer. On the other hand,

in the case of Kaomoji with linguistic features, it is

known that the information to be given to the input

layer is insufﬁcient with the One-hot vector. There-

fore, this system vectorizes the input (Kaomoji) using

the character Embedding. At the time of writing this

article, the longest of the emoticons registered in the

emoticon database collected is 65 characters, so the

input to the Embedding layer is 65 units. Each cha-

racter input to the Embedding layer is converted to

a 100-dimensional vector in order from the left side

of the emoticon and combined in order. For Kaomoji

less than 65 characters, generate a ﬁxed-length input

vector with zero paddings (NULL characters) for the

shortfall. The ﬁgure 1 shows the conﬁguration of the

neural network adopted in this paper.

In this paper, we change the number of units in the

middle layer in the ﬁgure 1, and we want to derive the

appropriate number of units based on both the evalu-

ation by the correct answer rate (described later) and

the learning time. Some researchers noted that altho-

ugh there is an argument that there is a standard of

(number of input units + number of output units) x

2/3 as a standard of the number of units, it is various

factors such as the complexity of the problem to be

solved and the size of learning data. Because of the

inﬂuence, it does not go beyond the range of heuris-

tics.

3.2 Evaluation Method

In our past studies, it was regarded as correct as long

as at least one correct original form is included in the

outputted original form group as an evaluation scale

in the estimation of the emoticon base form. Howe-

ver, in multiclass classiﬁcation, we have to evaluate

whether outputs of our system include all the original

forms. In this paper, if our system classiﬁed three ori-

ginal forms, then; the conventional evaluation method

(Easy) is corresponding to the correct if one of the ou-

tput is a correct answer; Normal evaluation method is

corresponding to the ratio between correct answers in

output and the number of correct answers (Normal);

Hard evaluation method is corresponding to the cor-

rect if the system’s output is corresponding to all of

the correct answers.

For example, if there are three types of correct

answers and the estimated results are these, For exam-

ple, if there are three types of correct answers and the

estimated results are these, then the evaluation me-

thod for Easy answers correct, the evaluation method

for Normal calculates a ratio as a correct rate of 2/3,

and the evaluation method for Hard answers incorrect.

Evaluation is performed by the following formula 1,

2, 3.

KMIS 2019 - 11th International Conference on Knowledge Management and Information Systems

378

(・_・) (__)

Figura 1: The Model to estimate the original form of Kaomoji.

Eval

Easy

(1)

Eval

Normal

∑

Normal

NormalCorrect

(2)

Eval

Hard

(3)

E. E is corresponding to the number that the output

has one of the original forms at least

Normal. Normal is corresponding to the number of

correct answers in output

NormalCorrect. NormalCorrect is corresponding to

the number of correct answers for each Kaomoji

H. H is corresponding to the number that all output

equals the original forms

N. the number of test data

In Eval

Easy

, the ratio of the number of cor-

rect answers to all evaluation targets is calcula-

ted, in Eval

Norm

, the average accuracy rate of all

evaluation targets is calculated, and in Eval

Hard

Eval

Easy

Calculatethesameratioas, and investigate

the change in accuracy rate depending on the number

of units.

In this evaluation, the average accuracy rate of

this paper is calculated under 10-fold cross validation

using 28,296 pairs of Kaomoji-like character strings

and original forms. In the following sections, we exa-

mine the transition of the average accuracy rate of 10-

fold cross-validation and the time for learning.

4 RESULT

In this section, we describe the results of evaluating

neural network method in ﬁgure 1 by the method des-

cribed in section 3.2. The minimum number of units

in the middle layer is 500, and the results show the

tendency in increments of 500 up to 10,000.

4.1 Evaluation for Loss

Figure 2, ﬁgure 3 show the transition of loss in lear-

ning data and evaluation data. The vertical axis shows

the output from the loss function, and the horizontal

axis shows the Epoch number.

0.005

0.01

0.015

0.02

0.025

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500 1000 1500 2000 2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 2: Loss of training data.

4.2 Evaluation for Training Data

Figure 4, ﬁgure 5, ﬁgure 6 the transition of the evalu-

ation value by the method. The vertical axis indicates

the accuracy rate, and the horizontal axis indicates the

number of Epochs.

A Multi Class Classiﬁcation to Detect Original Form of Kaomoji using Neural Network

379

0.005

0.01

0.015

0.02

0.025

0.03

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500 1000 1500 2000 2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 3: Loss of evaluation data.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500 1000 1500 2000 2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 4: Evaluation method "Easy" for training data.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500 1000 1500 2000 2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 5: Evaluation method "Normal" for training data.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500 1000 1500 2000 2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 6: Evaluation method "Hard" for training data.

4.3 Evaluation for Evaluation Data

Figure 7, ﬁgure 8, ﬁgure 9 the transition of the evalu-

ation value by the method. The vertical axis indicates

the accuracy rate, and the horizontal axis indicates the

number of Epochs.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500 1000 1500 2000 2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 7: Evaluation method "Easy" for training data.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500 1000 1500 2000 2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 8: Evaluation method "Normal" for training data.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

500 1000 1500 2000 2500 3000 3500 4000 4500 5000

500 1000 1500 2000 2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 9: Evaluation method "Hard" for training data.

4.4 Time for Evaluation

Figure 10 shows the time for each evaluation. The

vertical axis shows the required time.

KMIS 2019 - 11th International Conference on Knowledge Management and Information Systems

380

10000

20000

30000

40000

50000

60000

500

1000

1500

2000

2500

3000

3500

4000

4500

5000

5500

6000

6500

7000

7500

8000

8500

9000

9500

10000

500 1000 1500 2000

2500

3000 3500 4000 4500 5000

5500 6000 6500 7000 7500

8000 8500 9000 9500 10000

Figura 10: Time for 10-fold cross-varidation.

5 DISCUSSION

Figure 2, ﬁgure 3 in section 4.1 show that the trai-

ning process is advancing rapidly up to around 1000

Epoch. Moreover, the loss in evaluation data increa-

ses after 1000 Epoch. Therefore, it is sufﬁcient for

our neural network model in the proposed model that

the number of training Epoch is about 1,000 Epoch.

There is no difference depending on the number of

units except 500 units and 1,000 units.

According to each evaluation (Easy, Normal

Hard) shown in section 4.2, in all evaluations, the

evaluation values show the same tendency except in

the case of 500, 1,000, and 1,500 units. The correct-

ness rate is 90% or more for training data by preparing

the middle layer of at least 2,000 units or more from

the evaluation value of around 1000Epoch. Therefore,

learning converges at around 1000 Epoch.

According to each evaluation (Easy, Normal,

Hard) shown in section 4.3, although the evaluation

rate for Easy is only about 10% high, there is no

difference in the evaluation values for Normal and

Hard. From this, when performing multiclass classi-

ﬁcation, it is rarely output more than the total number

of original forms contained in emoticon-like character

strings. The number of primitives output by estima-

tion is equal to the total number of primitives (num-

ber of correct primitives) included in the Kaomoji-like

character sequence. On the other hand, the accuracy

rate is about 50%. Therefore, the future task is to im-

prove performance. The correct answer rate around

1,000 Epoch is the highest with an accuracy of about

42% for 10,000 units, but the number of units calcu-

lated based on the rule of thumb described in section

3.1 is 6500 (= The accuracy rate in the case of (6500

+ 3110) x 2/3 = 6406 = 6500) is about 41%, which is

a small difference. Also, the overall tendency is that

the accuracy rate slightly increases as the number of

units increases, but the accuracy rate may be lower

than in the case of the number of units according to

the rule of thumb, so increasing the number of units

is not better.

Finally, we will consider the time required for

learning shown in section 4.4. The time required for

learning increases as the number of units increases, as

shown in ﬁgure 10.

A smaller number of units is more effective than

a more signiﬁcant number of units from the viewpo-

int of the required time for 10-fold cross-validation.

On the other hand, since it is difﬁcult to improve the

accuracy rate, it is considered better to adopt the nu-

mber of units near the number of units based on he-

uristics. Besides, when the number of units is 9,000

or more, the required time tends to increase rapidly,

so the number of units less than 8,000 is considered

appropriate for the model adopted in this paper.

6 CONCLUSIONS

In this paper, we investigated the number of units in

the middle layer in a feed-forward neural network to

estimate the original form of Kaomoji. We conﬁrmed

experimentally the optimum value of the number of

units based on the empirical rules and found that it is

a model that does not deviate from the empirical rules.

When the number of units in the middle layer is 6,500,

we conﬁrmed that training exceeds 3,500 Epoch and

over 50% in any evaluation method, but the problem

occurs that it takes too much time for training.

As future work, the proposed system is based on

the character Embedding as information to be given

to the input layer, but systems can deal with only the

characters that appear in the database of Kaomoji, so

it is necessary to consider the Embedding method that

can correspond to all Japanese characters. Also, since

the input has a ﬁxed length of 6,400, it is necessary to

apply a model compatible with variable-length input

such as a recurrent neural network and extend it to

a system compatible with an emoticon of any length.

There is. Similarly, in this paper, the middle layer

is considered to be one layer, but we will investigate

the accuracy rate in the case of multiple layers, and

we would like to clarify the relationship between the

number of layers and the number of units as a model

used for the analysis of Kaomoji.

ACKNOWLEDGEMENTS

This work was supported by JSPS KAKENSHI Grant

Number 18K11455.

A Multi Class Classiﬁcation to Detect Original Form of Kaomoji using Neural Network

381

REFERENCES

Bedrick, S., Beckley, R., Roark, B., and Sproat, R. (2012).

Robust kaomoji detection in twitter. In Proceedings of

the Second Workshop on Language in Social Media,

pages 56–64. Association for Computational Linguis-

tics.

Kazama, K., Mizuki, S., and Sakaki, T. (2016). Study

of sentiment analysis using emoticons on twit-

ter(in japanese). The 30th Annual Conference

of the Japanese Society for Artiﬁcial Intelligence,

JSAI2016:3H3OS17a4–3H3OS17a4.

Okumura, N. (2016). (in ja-

panese). Proceedings of the 22nd annual meetin-

gof the Association for Natural Language Processing,

NLP2016:P1–1.

Okumura, N. (2017). A large scale knowledge base re-

presenting the base form of kaomoji. In Proceedings

of the 9th International Joint Conference on Knowle-

dge Discovery, Knowledge Engineering and Knowle-

dge Management - Volume 2: KEOD,, pages 246–252.

INSTICC, SciTePress.

Okumura, R. and Okumura, N. (2018). A method to

estimate the original form of kaomoji using neural

network with character embedding function(in japa-

nese). The 32nd Annual Conference of the Japanese

Society for Artiﬁcial Intelligence, JSAI2018:2B405–

2B405.

Ptaszynski, M., Araki, K., Dybala, P., Rzepka, R., and Ma-

ciejewski, J. (2010). Cao: A fully automatic emoticon

analysis system based on theory of kinesics. IEEE

Transactions on Affective Computing, 1:46–59.

KMIS 2019 - 11th International Conference on Knowledge Management and Information Systems

382