of the two pages. Once they have all the paths
they select the shortest path or path with the most
common subsumer (two kinds of measures) and
then they apply Resnik’s measure(Resnik, 1995).
Like ours, this work uses exclusively Wikipe-
dia/DBpedia categories to represent complex ob-
jects and to measure similarity between them. But
at least two important differences exist between
the two works, probably due to the different types
of objects (words versus LOD resources) : first
we explicitely assign weights to categories and se-
cond our measure combines contributions of all
categories and not only the ones corresponding to
the shortest path.
• (Fouss et al., 2005) : To measure similarity be-
tween elements of a database, authors define a
weighted undirected graph in which nodes corre-
spond to database elements (e.g. movies, people
and movie categories) and edges to links between
them (e.g. has watched). The weight w
i j
of the
edge connecting two nodes i and j is defined as
follows : the more important the relation between
elements i and j, the larger the value of w
i j
. A
Markov chain is defined in which a state is associ-
ated to each node of the graph and the probability
of jumping from a node i to an adjacent node j
is proportional to the weight w
i j
. Using this Mar-
kov chain properties, the authors show that simi-
lar resources are connected by a comparably large
number of short paths and dissimilar resources
have fewer paths connecting them and these paths
will be longer. They also show that a similarity
measure can be extracted from the pseudoinverse
of the Laplacian matrix of the graph. This method
was not designed for linked data and if we want
to adapt it we must add weights to RDF graphs’
edges, i.e. to each RDF triple’s predicate. Such
weights should represent relatedness between the
resources connected by the concerned edges. In
other words, to apply this method to linked data,
we must first compute relatedness between each
pair of resources belonging to an RDF triple. This
is not realistic.
9 CONCLUSION AND FUTURE
WORK
In this work we aimed to define a simple and highly
correlated to human judgment similarity measure for
Linked data. We positively answered the question :
can we measure and explain semantic similarity using
exclusively DBpedia categories. But this work is a
part of a larger project in which we also deal with the
following problems :
1. To show that DBpedia categories can used for a
unified representation for all the linked data re-
sources and not only those of DBpedia.
2. To use machine learning methods to create new
categories and to assign categories to resources.
3. To give a general characterization of feature-
based similarity that the measure presented in this
paper will be a special case.
REFERENCES
Bob
´
ee, B. and Robitaille, R. (1975). Etude sur les coeffi-
cients d’asym
´
etrie et d’aplatissemnt d’un
´
echantillon,
rapport num. 49. Technical report, INRS-Eau, univer-
sit
´
e du Qu
´
ebec, Qu
´
ebec.
Cheekula, S. K., Kapanipathi, P., Doran, D., Jain, P., and
Sheth, A. P. (2015). Entity recommendations using
hierarchical knowledge bases.
Damljanovic, D., Stankovic, M., and Laublet, P. (2012).
Linked data-based concept recommendation: Compa-
rison of different methods in open innovation scena-
rio. In Proceedings of ESWC 2012, pages 24–38.
di Noia, T., Mirizzi, R., Ostuni, V. C., Romito, D., and Zan-
ker, M. (2012). Linked open data to support content-
based recommender systems. In Proceedings of 8th
International Conference on Semantic Systems, pages
1–8.
Fouss, F., Pirotte, A., and Saerens, M. (2005). A novel way
of computing similarities between nodes of a graph,
with application to collaborative recommendation. In
Proceedings of Web Intelligence 2005, pages 550–
556.
Gunaratna, K., Lalithsena, S., Jain, P., Henson, C. A., and
Sheth, A. P. (2011). A systematic property mapping
using category hierarchy and data. Technical report,
http://corescholar.libraries.wright.edu/knoesis/601.
Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Konto-
kostas, D., Mendes, P. N., Hellmann, S., Morsey, M.,
van Kleef, P., Auer, S., and Bizer, C. (2015). Dbpedia
- a large-scale, multilingual knowledge base extracted
from wikipedia. Semantic Web, 6(2):167–195.
Meng, L., Huang, R., and Gu, J. (2013). A review of se-
mantic similarity measures in wordnet. International
Journal of Hybrid Information Technology, 6(1):1–12.
Meymandpour, R. and Davis, J. G. (2016). A semantic
similarity measure for linked data: An information
content-based approach. Knowledge Based Systems,
109:276–293.
Ostuni, V. C., di Noia, T., Mirizzi, R., and Sciascio, E. D.
(2014). A linked data recommender system using a
neighborhood-based graph kernel. In Proceedings of
the 15th EC-Web, pages 89–100.
Passant, A. (2010). Measuring semantic distance on linking
data and using it for resources recommendations. In
KEOD 2018 - 10th International Conference on Knowledge Engineering and Ontology Development
126