FTT STT ATT TTT TFT ATF
3.2 3.2 3.6 4.6 6.9 6.9
FFT STF FTF FFF TFF
6.9 7.2 7.4 7.7 8.4
course, all of them are novel distance measures intro-
duced in this paper. Furthermore, the following shows
the p-values when we perform the Hommel test let-
ting F-T-T be the control.
FTT STT ATT TTT TFT ATF
– 1.00 0.81 0.84 0.16 0.16
FFT STF FTF FFF TFF
0.14 0.10 0.10 0.07 0.04
We see that there is a clear line between the *-T-T
type and the others: with the significance level of
10%, the superiority of the F-T-T (and S-T-T) dis-
tance to the S-T-F (constrained), F-T-F (Ta
¨
ı), F-F-F
and T-F-F distances is proven statistically significant;
For the A-T-F (degree-two) and T-F-T distances, alt-
hough we cannot reject the null hypothesis, the p-
values are as small as 16%, much smaller than 84%
for the T-T-T distance.
6 CONCLUSION
We have shown that viewing edit distance measures
from a view point of properties of their associated tra-
ces is useful. In particular, by using particular proper-
ties of traces as parameters to describe edit distance
measures, we not only can deal with edit distances
measures consistently but also can engineer new me-
asures systematically. In fact, taking rooted trees as an
example, we have introduced three independent para-
meters, and have shown that the parameters determine
16 edit distance measures. Surprisingly, only three
among them had been known in the literature, and
all the remainder were novel. Furthermore, through
experiments to measure predictive accuracy of each
combination of one of the 16 measures and the k-
NN classifier, we have discovered that a certain novel
class of edit distance measures can perform the best.
The measures belonging to the class are significantly
different from the edit distance measures known in
the literature, since the cost to replace a label with a
different label is set as infinity. This mandates the as-
sociated traces to preserve labels of vertices.
7 ACKNOWLEDGMENTS
This work was partially supported by the Grant-in-
Aid for Scientific Research (JSPS KAKENHI Grant
Number 16K12491 and 17H00762) from the Japan
Society for the Promotion of Science.
REFERENCES
Collins, M. and Duffy, N. (2001). Convolution kernels for
natural language. In Advances in Neural Information
Processing Systems 14 [Neural Information Proces-
sing Systems: Natural and Synthetic, NIPS 2001], pa-
ges 625–632. MIT Press.
Demaine, E. D., Mozes, S., Rossman, B., and Weimann, O.
(2006). An optimal decomposition algorithm for tree
edit distance. ACM Trans. Algo., 6.
Dulucq, S. and Touzet, H. (2003). Analysis of tree edit
distance algorithms. In the 14th annual symposium on
Combinatorial Pattern Matching (CPM), pages 83 –
95.
Hashimoto, K., Goto, S., Kawano, S., Aoki-Kinoshita,
K. F., and Ueda, N. (2006). KEGG as a glycome in-
formatics resource. Glycobiology, 16:63R – 70R.
Kao, M.-Y., Lam, T.-W., Sung, W.-K., and Ting, H.-F.
(2007). An even faster and more unifying algorithm
for comparing trees via unbalanced bipartite matc-
hings.
Kimura, D. and Kashima, H. (2012). Fast computation of
subpath kernel for trees. In ICML.
Klein, P. N. (1998). Computing the edit-distance between
unrooted ordered trees. LNCS, 1461:91–102. ESA’98.
Kuboyama, T., Shin, K., Miyahara, T., and Yasuda, H.
(2005). A theoretical analysis of alignment and edit
problems for trees. In Proc. of Theoretical Computer
Science, The 9th Italian Conference, Lecture Notes in
Computer Science 3701, pages 323–337.
Levenshtein, V. I. (1966). Binary codes capable of cor-
recting deletions, insertions, and reversals. Soviet
Physics Doklady, 10(8):707 – 710.
Lu, C. L., Su, Z. Y., and Tang, G. Y. (2001). A New Measure
of Edit Distance between Labeled Trees. In LNCS, vo-
lume 2108, pages pp. 338–348. Springer-Verlag Hei-
delberg.
Moschitti, A. Example data for TREE KERNELS
IN SVM-LIGHT. http://disi.unitn.it/moschitti/Tree-
Kernel.htm.
Neuhaus, M. and Bunke, H. (2007). Bridging the gap bet-
ween graph edit distance and kernel machines. World
Scientific.
Pawlik, M. and Augsten, N. (2011). Rted: A robust algo-
rithm for the tree edit distance. In Proceedings of the
VLDB Endowment, volume 5, pages 334–345.
Shin, K. (2015). Tree edit distance and maximum
agreement subtree. Inf. Process. Lett., 115(1):69–73.
Ta
¨
ı, K. C. (1979). The tree-to-tree correction problem. jour-
nal of the ACM, 26(3):422–433.
ICAART 2018 - 10th International Conference on Agents and Artificial Intelligence
274