Figure 2: Plot of vertical sums of points (red graph); plot of
horizontal sums of points (blue graph) in Table 3.
show a ranked list of words from the 1st to 10th in the
descending order of cosine similarity with 𝑣
+
𝑣
−𝑣
. The words in the blue cells
matched with the correct answers (i.e., they matched
with word
). The numerals in the rightmost column
show the points earned based on step (6). It is
noteworthy only the partial results are shown in Table
2, whereas the overall results were obtained in the
same format.
Table 4 is a summary of Table 2, with the pair of
word
and word
in column C, and the pair of
word
and word
in the third row. Table 3 shows an
enlarged version of the top three rows and the lower
rows of the left part of Table 4.
For example, in cell E84 of Table 3 where row
84 and column E coincide, word
and word
are “become” and “do not become,”
respectively, and word
and word
are “exist”
and “do not exist,” respectively. In this setting
for the word analogy task, the point earned is
1.0, as indicated in cell E84; this means that the
first place word matches the correct answer.
In Tables 3 and 4, cells with values of 1, 0.5, and
0.25 are indicated in pink, light blue, and light
green, respectively.
The numerals in the bottom row cells of Tables
3 and 4 are the total scores, i.e., the vertical sums
of the points. Additionally, pair word
and
word
in the third row are sorted from left to
right in the descending order of their total
scores.
The maximum value of the total scores is 99
because the 100 pairs of verbs and their negation
shown in Figure 1 are used to generate a word
analogy task alternately.
Owing to space limitations, the right side of
column AZ is not shown in Table 3; however,
the horizontal sums of the points were
calculated. Pair word
and word
in column C
were sorted from top to bottom in the
descending order of the sums.
The numerals in the first and second rows are
the ranking of word
among 100 verbs and the
ranking of word
among 100 verb negations
used in the experiment with respect to the
number of occurrences in the corpus,
respectively. Similarly, the ranking of word
among 100 verbs and the ranking of word
among 100 verb negations are shown in
columns A and B, respectively. The red and blue
cells indicate the top and bottom 30 rankings,
respectively.
2.3 Discussion
In Figure 2, the red graph is a plot of the vertical sums
of the points shown in Table 4, and the blue graph is
a plot of their horizontal sums. We observed that the
red graph varied more significantly than the blue one.
This implies that the difficulty of the word analogy
task depends primarily on word
and word
rather
than word
and word
.
Therefore, we focused on word
and word
. As
shown in the second row of Table 4, many cells on
the left side are red, whereas many cells on the right
side are blue. Because the red and blue cells in the
second row indicate high and low frequencies of
word
occurrence, respectively, if word
appears
frequently in the corpus, then the vertical sum of the
points tends to be large, and vice versa. Furthermore,
if word
appears frequently in the corpus, then the
word vector of word
generated by word2vec is
sufficiently precise to return a correct response to the
word analogy task. Therefore, the vertical sum of
points can be interpreted as a success indicator for the
word analogy task.
In this experiment, the top 100 verbs that appear
frequently in the corpus were used as word
, and the
word vector of word
generated by word2vec should
always be precise.
However, the right side of the second row in
Table 4 contained red cells, where the vertical sum of
the points indicated a low value. These red cells
reflect the difficulty of the word analogy task despite
the high occurrence frequency of word
in the
corpus.