Table 2: Comparison of Time taken among the
proposed and existing approaches.
Similarity
Measures
lesk res wup sum f(d,ic,g)
Time(s) 60,984 0,0625 0,1125 61,159 61,259
which merge three individual measures; however
the better result we attain can justify this cost. Table
2 shows the time for the individual measures
compared with the new model. For the sake of
fairness, we run the experiment 10 times and take
the average response time for each measure, so we
can notice that 100 ms is the extra cost that we pay
to gain more accurate measure comparing it with the
sum of the three measure, while we pay 275 ms
comparing it with lesk measure which is the time
consuming measure.
5 CONCLUSIONS
In this paper, we have introduced a new model to
identify the similarity between words using
WordNet. This model combines existing methods
for semantic similarity calculation and finds a
combination of three methods each from a different
category of information sources. We argue that, in
order to achieve better similarity measures all the
information sources; shortest path length between
compared words, depth in the taxonomy hierarchy,
information content, semantic density of compared
words, and the gloss definition of the words should
be taken into account. We evaluate our method on
widely used benchmarking datasets, such as M&C
dataset, R&G dataset, and 353-TC. The experimental
results prove our assumption and fit particularly well
in simulating human judgment on semantic
similarity between words. In future work, we intend
to use this similarity measure in real world
applications such as word sense disambiguation.
ACKNOWLEDGEMENTS
This research is part of the “Search Computing”
(SeCo) project, funded by the European Research
Council (ERC), under the 2008 Call for “IDEAS
Advanced Grants”, dedicated to frontier research.
REFERENCES
Banerjee, S., and Pedersen, T., 2003. Extended gloss
overlaps as a measure of semantic relatedness. In
Proceedings on Artificial Intelligence, Eighteenth
International Joint Conference, Acapulco, pp.805–810.
Budanitsky, A. and Hirst, G., 2006. Evaluating WordNet-
based Measures of Lexical Semantic Relatedness.
Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E.,
Solan, Z., Wolfman, G., and Ruppin, E., 2002. Placing
Search in Context: The Concept Revisited. ACM
Transactions on Information Systems, 20(1):116-131.
Hope, D., 2008. Java WordNet::Similarity
http://www.cogs.susx.ac.uk/users/drh21/
Jiang, J. J. and Conrath, D., 1997. Semantic similarity
based on corpus statistics and lexical taxonomy. In
Proceedings of International Conference on Research
in Computational Linguistics, Taiwan.
Leacock, C., and Chodorow, M., 1998. Combining local
context and WordNet similarity for word sense
identification. pp. 265–283.
Lesk, M., 1986. Automatic sense disambiguation using
machine readable dictionaries: How to tell a pine
cone from an ice cream cone. In Proceedings, Fifth
International Conference on Systems Documentation
(SIGDOC ’86), pages 24–26.
Li, Y., Bandar, Z. A., and McLean, D., 2003. An
Approach for Measuring Semantic Similarity between
Words Using Multiple Information Sources, IEEE
Transactions on knowledge and & engineering, 15(4).
Lin, D., 1998. An information-theoretic definition of
similarity. In Proceedings of the 15th International
Conference on Machine Learning, Madison, WI.
Miller, G. A., and Charles, W. G. 1991. Contextual
correlates of semantic similarity. Language and
Cognitive Processes, 6(1): 1–28.
Miller, G. A., 1995. WordNet: a lexical database for
English. Communications ACM, 38(11), pp. 39-41.
Rada, R., Mili, H., Bicknell, E., and Blettner, M., 1989.
Development and application of a metric on semantic
nets. IEEE Transactions on Systems, Man, and
Cybernetics, 19(1): 17–30.
Resnik, P., 1995. Using information content to evaluate
semantic similarity. In Proceedings of the 14th
International Joint Conference on Artificial
Intelligence, pages 448–453, Montreal.
Rubenstein, H., and Goodenough, John B., 1965.
Contextual correlates of synonymy. Communications
of the ACM, 8(10), pp. 627–633.
Strube, M., and Ponzetto, S., 2006. WikiRelate!
Computing Semantic Relatedness Using Wikipedia,
American Association for Artificial Intelligence.
Wu, Z., and Palmer, M., 1994. Verb semantics and lexical
selection. In 32nd Annual Meeting of the Association
for Computational Linguistics, pp. 133–138.
SEMANTIC MEASURES BASED ON WORDNET USING MULTIPLE INFORMATION SOURCES
503