EVALUATING THE HYPONYM ATTACHMENTS IN AN UNSUPERVISED TAXONOMY ENRICHMENT FRAMEWORK

Emil Şt. Chifu, Viorica R. Chifu

Abstract

The paper describes an unsupervised framework for domain taxonomy enrichment with new domain-specific concepts extracted from domain text corpora. The framework is based on an extended model of hierarchical self-organizing maps. Terms extracted by mining a text corpus encode contextual content information, in a distributional vector space. The enrichment behaves like a classification of the extracted terms into the existing taxonomy by attaching them as hyponyms for the intermediate and leaf nodes of the taxonomy. We propose an evaluation setting in which we assess the power of attraction of the population of terms towards the branches of the taxonomy (recall) and the precision of attaching correct hyponyms (accuracy).

References

  1. Alfonseca, E., Manandhar, S., 2002. Extending a lexical ontology by a combination of distributional semantics signatures. In A. Gómez-Pérez, V.R. Benjamins (Eds.), 13th International Conference on Knowledge Engineering and Knowledge Management, LNAI. Springer, pp. 1-7.
  2. Alfonseca E., Manandhar, S., 2002. An unsupervised method for general named entity recognition and automated concept discovery. In 1st International Conference on General WordNet.
  3. Buitelaar, P., Cimiano, P., Magnini B., 2005. Ontology learning from text: an overview. In P. Buitelaar, P. Cimiano, B. Magnini (Eds.), Ontology Learning from Text: Methods, Evaluation and Applications, Frontiers in Artificial Intelligence and Applications Series. IOS Press, pp. 1-10.
  4. Chifu, E.S., Letia, I.A, 2006. Unsupervised ontology enrichment with hierarchical self-organizing maps. In I.A. Letia (Ed.), IEEE 2nd International Conference on Intelligent Computer Communication and Processing, pp. 3-9.
  5. Cimiano, P., Völker, J., 2005. Towards large-scale, opendomain and ontology-based named entity classification. In RANLP'05, International Conference on Recent Advances in Natural Language Processing, pp. 166-172.
  6. Craven, M., DiPasquo, D., Freitag, D., McCallum, A., Mitchell, T., Nigam, K., Slattery, S., 2000. Learning to construct knowledge bases from the World Wide Web. Artificial Intelligence 118, pp. 69-113.
  7. Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V., 2002. GATE: a framework and graphical development environment for robust NLP tools and applications. In 40th Anniversary Meeting of the ACL.
  8. Dittenbach, M., Merkl, D., Rauber, A., 2002. Organizing and exploring high-dimensional data with the Growing Hierarchical Self-Organizing Map. In L. Wang, et al. (Eds.), 1st International Conference on Fuzzy Systems and Knowledge Discovery, vol. 2, pp. 626-630.
  9. Fellbaum, Chr. (Ed.), 1998. WordNet: An Electronic Lexical Database, MIT Press. Cambridge, Mass.
  10. Hearst, M.A., Schütze, H., 1993. Customizing a lexicon to better suit a computational task. In ACL SIGLEX Workshop on Acquisition of Lexical Knowledge from Text, 1993.
  11. Kohonen, T., Kaski, S., Lagus, K., Salojärvi, J., Honkela, J., Paatero, V., Saarela, A., 2000. Self-organization of a massive document collection. IEEE Transactions on Neural Networks 11, pp. 574-585.
  12. Maedche, A., Pekar, V., Staab, S., 2003. Ontology learning part one: on discovering taxonomic relations from the Web. In N. Zhong, et al. (Eds.), Web Intelligence, LNCS. Springer, pp. 301-321.
  13. Widdows, D., 2003. Unsupervised methods for developing taxonomies by combining syntactic and statistical information. In HLT-NAACL Conference, pp. 197-204.
  14. Witschel, H.F., 2005. Using decision trees and text mining techniques for extending taxonomies. In Learning and Extending Lexical Ontologies by using Machine Learning Methods, Workshop at ICML-05, pp. 61-68.
Download


Paper Citation


in Harvard Style

Şt. Chifu E. and R. Chifu V. (2008). EVALUATING THE HYPONYM ATTACHMENTS IN AN UNSUPERVISED TAXONOMY ENRICHMENT FRAMEWORK . In Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-8111-27-2, pages 247-252. DOI: 10.5220/0001529302470252


in Bibtex Style

@conference{webist08,
author={Emil Şt. Chifu and Viorica R. Chifu},
title={EVALUATING THE HYPONYM ATTACHMENTS IN AN UNSUPERVISED TAXONOMY ENRICHMENT FRAMEWORK},
booktitle={Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2008},
pages={247-252},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001529302470252},
isbn={978-989-8111-27-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - EVALUATING THE HYPONYM ATTACHMENTS IN AN UNSUPERVISED TAXONOMY ENRICHMENT FRAMEWORK
SN - 978-989-8111-27-2
AU - Şt. Chifu E.
AU - R. Chifu V.
PY - 2008
SP - 247
EP - 252
DO - 10.5220/0001529302470252