Measuring Entity Semantic Relatedness using Wikipedia

Liliana Medina, Ana L. N. Fred, Rui Rodrigues, Joaquim Filipe

Abstract

In this paper we propose a semantic relatedness measure between scientific concepts, using Wikipedia as an hierarchical taxonomy. The devised measure examines the length of Wikipedia category path between two concepts, assigning a weight to each category that corresponds to its depth in the hierarchy. This procedure was extended to measure the relatedness between two distinct concept sets (herein referred to as entities), where the amount of shared nodes in the paths computed for all possible concept sets is also integrated in a global relatedness measure index.

References

  1. Gouws, S., Rooyen, G., and Engelbrecht, H. (2010). Measuring conceptual similarity by spreading activation over wikipedia's hyperlink structure. In Proceedings of the 2nd Workshop on The People's Web Meets NLP: Collaboratively Constructed Semantic Resources.
  2. Jiang, J. J. and Conrath, D. W. (1997). Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. In International Conference Research on Computational Linguistics (ROCLING X).
  3. Leacock, C. and Chodorow, M. (1998). Combining Local Context and WordNet Similarity for Word Sense Identification, chapter 11, pages 265-283. The MIT Press.
  4. Liu, J. and Birnbaum, L. (2007). Measuring semantic similarity between named entities by searching the web directory. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 7807, pages 461-465.
  5. Milne, D. and Witten, I. H. (2008). An effective, lowcost measure of semantic relatedness obtained from wikipedia links. In In Proceedings of AAAI 2008.
  6. Nastase, V. and Strube, M. (2008). Decoding wikipedia categories for knowledge acquisition. In AAAI, pages 1219-1224.
  7. Ponzetto, S. P. and Strube, M. (2007). Knowledge derived from wikipedia for computing semantic relatedness. J. Artif. Int. Res., 30:181-212.
  8. Rada, R., Mili, H., Bicknell, E., and Blettner, M. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man and Cybernetics, 19(1):17-30.
  9. Resnik, P. (1999). Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language. Journal of Artificial Intelligence Research, 11:95-130.
  10. Rodrguez, M. A. and Egenhofer, M. J. (2003). Determining semantic similarity among entity classes from different ontologies. IEEE Transactions on Knowledge and Data Engineering, 15:442-456.
  11. Slimani, T., Yaghlane, B. B., and Mellouli, K. (2006). A New Similarity Measure based on Edge Counting. In Proceedings of world academy of science, engineering and technology, volume 17.
  12. Wu, Z. and Palmer, M. (1994). Verbs semantics and lexical selection. In Proceedings of the 32nd annual meeting on Association for Computational Linguistics, ACL 7894, pages 133-138. Association for Computational Linguistics.
  13. Zesch, T., Müller, C., and Gurevych, I. (2008). Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary. In Proceedings of the Conference on Language Resources and Evaluation (LREC).
Download


Paper Citation


in Harvard Style

Medina L., L. N. Fred A., Rodrigues R. and Filipe J. (2012). Measuring Entity Semantic Relatedness using Wikipedia . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2012) ISBN 978-989-8565-29-7, pages 431-437. DOI: 10.5220/0004180204310437


in Bibtex Style

@conference{sstm12,
author={Liliana Medina and Ana L. N. Fred and Rui Rodrigues and Joaquim Filipe},
title={Measuring Entity Semantic Relatedness using Wikipedia},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2012)},
year={2012},
pages={431-437},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004180204310437},
isbn={978-989-8565-29-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2012)
TI - Measuring Entity Semantic Relatedness using Wikipedia
SN - 978-989-8565-29-7
AU - Medina L.
AU - L. N. Fred A.
AU - Rodrigues R.
AU - Filipe J.
PY - 2012
SP - 431
EP - 437
DO - 10.5220/0004180204310437