Semantic Relatedness with Variable Ontology Density

Rui Rodrigues, Joaquim Filipe, Ana L. N. Fred

Abstract

In a previous work, we proposed a semantic relatedness measure between scientific concepts, using Wikipedia categories network as an ontology, based on the length of the category path. After observing substantial differences in the arc density of the categories network, across the whole graph, it was concluded that these irregularities in the ontology density may lead to substantial errors in the computation of the semantic relatedness measure. Now we attempt to correct for this bias and improve this measure by adding the notion of ontology density and proposing a new semantic relatedness measure. The proposed measure computes a weighed length of the category path between two concepts in the ontology graph, assigning a different weight to each arc of the path, depending on the ontology density in its region. This procedure has been extended to measure semantic relatedness between entities, an entity being defined as a set of concepts.

References

  1. Coleman, T. F. and Moré, J. J. (1983). Estimation of sparse Jacobian matrices and graph coloring problems. SIAM Journal on Numerical Analysis, 20(1):187-209.
  2. Gabrilovich, E. and Markovitch, S. (2007). Computing semantic relatedness using wikipedia-based explicit semantic analysis. In Proceedings of the 20th international joint conference on Artifical intelligence, IJCAI'07, pages 1606-1611. Morgan Kaufmann Publishers Inc.
  3. Gouws, S., Rooyen, G., and Engelbrecht, H. (2010). Measuring conceptual similarity by spreading activation over wikipedia's hyperlink structure. In Proceedings of the 2nd Workshop on The People's Web Meets NLP: Collaboratively Constructed Semantic Resources.
  4. Gracia, J. and Mena, E. (2008). Web-based measure of semantic relatedness. In In Proc. of 9th International Conference on Web Information Systems Engineering (WISE 2008), Auckland (New Zealand, pages 136- 150. Springer.
  5. Jiang, J. J. and Conrath, D. W. (1997). Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. In International Conference Research on Computational Linguistics (ROCLING X).
  6. Leacock, C. and Chodorow, M. (1998). Combining Local Context and WordNet Similarity for Word Sense Identification, chapter 11, pages 265-283. The MIT Press.
  7. Liu, J. and Birnbaum, L. (2007). Measuring semantic similarity between named entities by searching the web directory. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 7807, pages 461-465.
  8. Medina, L. A. S., Fred, A. L. N., Rodrigues, R., and Filipe, J. (2012). Measuring entity semantic relatedness using wikipedia. In Fred, A. L. N., Filipe, J., Fred, A. L. N., and Filipe, J., editors, KDIR, pages 431-437. SciTePress.
  9. Milne, D. and Witten, I. H. (2008). An effective, lowcost measure of semantic relatedness obtained from wikipedia links. In In Proceedings of AAAI 2008.
  10. Nastase, V. and Strube, M. (2008). Decoding wikipedia categories for knowledge acquisition. In AAAI, pages 1219-1224.
  11. Ponzetto, S. P. and Strube, M. (2007). Knowledge derived from wikipedia for computing semantic relatedness. J. Artif. Int. Res., 30:181-212.
  12. Rada, R., Mili, H., Bicknell, E., and Blettner, M. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man and Cybernetics, 19(1):17-30.
  13. Resnik, P. (1999). Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language. Journal of Artificial Intelligence Research, 11:95-130.
  14. Slimani, T., Yaghlane, B. B., and Mellouli, K. (2006). A New Similarity Measure based on Edge Counting. In Proceedings of world academy of science, engineering and technology, volume 17.
  15. Wu, Z. and Palmer, M. (1994). Verbs semantics and lexical selection. In Proceedings of the 32nd annual meeting on Association for Computational Linguistics, ACL 7894, pages 133-138. Association for Computational Linguistics.
  16. Zesch, T., Müller, C., and Gurevych, I. (2008). Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary. In Proceedings of the Conference on Language Resources and Evaluation (LREC).
Download


Paper Citation


in Harvard Style

Rodrigues R., Filipe J. and L. N. Fred A. (2014). Semantic Relatedness with Variable Ontology Density . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2014) ISBN 978-989-758-048-2, pages 554-559. DOI: 10.5220/0005189005540559


in Bibtex Style

@conference{sstm14,
author={Rui Rodrigues and Joaquim Filipe and Ana L. N. Fred},
title={Semantic Relatedness with Variable Ontology Density},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2014)},
year={2014},
pages={554-559},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005189005540559},
isbn={978-989-758-048-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2014)
TI - Semantic Relatedness with Variable Ontology Density
SN - 978-989-758-048-2
AU - Rodrigues R.
AU - Filipe J.
AU - L. N. Fred A.
PY - 2014
SP - 554
EP - 559
DO - 10.5220/0005189005540559