A FRAMEWORK AUTOMATING DOMAIN ONTOLOGY CONSTRUCTION

Yin-Fu Huang, Yu-Yu Huang

Abstract

This paper proposed a general framework that could automatically construct domain ontology on a collection of documents with the help of The Free Dictionary, WordNet, and Wikipedia Categories. Both explicit and implicit features of index terms in documents are used to evaluate word correlations and then to construct Is-A relationships in the framework. Thus, the built ontology would consist of 1) concepts, 2) Is-A and Parts-of relationships among concepts, and 3) word relationships. Besides, the built ontology could be further refined by learning from incremental documents periodically. To help users browse the built ontology, an ontology browsing system was implemented and provided different search modes and functionality to facilitate searching a variety of relationships.

References

  1. Trent Apted and Judy Kay, 2002. “Automatic construction of learning ontologies,” Proc. ICCE Workshop on Concepts and Ontologies in Web-based Educational Systems, pp. 1563-1564.
  2. Florian Beil, Martin Ester, and Xiaowei Xu, 2002. “Frequent term-based text clustering,” Proc. 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 436-442.
  3. Richard C. Bodner and Fei Song, 1996. “Knowledgebased approaches to query expansion in information retrieval,” Proc. 11th Biennial Conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence, pp. 146-158.
  4. P. Buitelaar, P. Cimiano, and B. Magnini, 2005. Ontology Learning from Text: Methods, Evaluation and Applications, Frontiers in Artificial Intelligence and Applications Series, Vol. 123.
  5. Rowena Chau and Chung-Hsing Yeh, 2005. “Enabling a semantic smart WWW: a soft computing framework for automatic ontology development,” Proc. International Conference on Intelligent Agents, Web Technologies and Internet Commerce, pp. 1067-1071.
  6. Michael Dittenbach, Dieter Merkl, and Andreas Rauber, 2000. “The growing hierarchical self-organizing map,” Proc. the International Joint Conference on Neural Networks, Vol. 6, pp. 15-19.
  7. Dave Elliman and JRG Pulido, 2001. “Automatic derivation of on-line document ontologies,” Proc. International Workshop on Mechanisms for Enterprise Integration: From Objects to Ontologies (MERIT 2001), the 15th European Conference on Object Oriented Programming.
  8. W. B. Frakes and R. Baeza-Yates, 1992. Information Retrieval: Data Structures and Algorithms, Prentice Hall.
  9. Weipeng Fu, Bin Wu, Qing He, and Zhongzhi Shi, 2001. “Text document clustering and the space of concept on text document automatically generated,” Proc. IEEE ICII Conference, Vol. 3, pp. 107-112.
  10. K. W. Gan, C. Y. Wang, and Brian Mak, 2002. “Knowledge-based sense pruning using the HowNet: an alternative to word sense disambiguation,” Proc. International Symposium of Chinese Spoken Language Processing, pp. 189-192.
  11. Asunción Gómez-Pérez and David Manzano-Macho, 2003. “A survey of ontology learning methods and techniques,” OntoWeb Deliverable 1.5.
  12. Yi Guan, Xiao-Long Wang, Xiang-Yong Kong, and Jian Zhao, 2002. “Quantifying semantic similarity of Chinese words from HowNet,” Proc. International Conference on Machine Learning and Cybernetics, Vol. 1, pp. 234-239.
  13. Yin-Fu Huang and Chun-Hao Hsu, 2007. “PubMed smarter: searching the papers with implicit words based on Gene Ontology,” Proc. 4th International Conference on Information Technology and Applications, Vol. 1, pp. 339-343.
  14. Sin-Jae Kang and Jong-Hyeok Lee, 2001. “Semiautomatic practical ontology construction by using a thesaurus, computational dictionaries, and large corpora,” Proc. Workshop on Human Language Technology and Knowledge Management, pp. 1-8.
  15. M. Kavalec, A. Maedche, and V. Svatek, 2004. “Discovery of lexical entries for non-taxonomic relations in ontology learning,” SOFSEM 2004: Theory and Practice of Computer Science, LNCS 2932, pp. 249-256.
  16. Latifur Khan and Lei Wang, 2002. “Automatic ontology derivation using clustering for image classification,” Proc. Workshop on Multimedia Information Systems, pp. 56-65.
  17. Teuvo Kohonen, Samuel Kaski, Krista Lagus, Jarkko alojärvi, Jukka Honkela, Vesa Paatero, and Antti Saarela, 2000. “Self-organization of a massive document collection,” IEEE Transactions on Neural Networks, Vol. 11, No. 3, pp. 574-585.
  18. Yan-Hwang Kuo, Chang-Shing Lee, Shu-Mei Guo, and YingHsu Chen, 2005. “Apply object-oriented technology to construct Chinese ontology on the internet,” Journal of Internet Technologies, Vol. 6, No. 4, pp. 385-394.
  19. Jim Z. C. Lai and Wen-Feng Wu, 2002. “Design and implementation of a classifier for Chinese e-mails,” Proc. 7th Conference on Artificial Intelligence and Applications, pp. 368-373.
  20. Alexander Maedche and Steffen Staab, 2000. “Mining ontologies from text,” Proc. 12th European Workshop on Knowledge Acquisition, Modeling and Management, pp. 189-202.
  21. Alexander Maedche and Steffen Staab, 2001. “Ontology learning for the semantic web,” IEEE Intelligent Systems, Vol. 16, No. 2, pp. 72-79.
  22. Riichiro Mizoguchi, 2003. “Tutorial on ontological engineering - part 1: introduction to ontological engineering,” New Generation Computing, Vol. 21, No. 4, pp. 365-384.
  23. Thanh Tho Quan, Siu Cheung Hui, and Tru Hoang ao, 2006. “Automatic fuzzy ontology generation for semantic web,” IEEE Transactions on Knowledge and Data Engineering, Vol. 18, No. 6, pp.842-856.
  24. Prieto-Diaz Ruben, 2003. “A faceted approach to building ontologies,” Proc. IEEE International Conference on Information Reuse and Integration, pp. 458-465.
  25. David Sánchez and Antonio Moreno, 2006. “Discovering non-taxonomic relations from the web,” Proc. 7th International Conference on Intelligent Data Engineering and Automated Learning, pp. 629-636.
  26. A. Schutz and P. Buitelaar, 2005. “RelExt: a tool for relation extraction in ontology extension,” Proc. 4th International Semantic Web Conference, pp. 593-606.
  27. Vaclav Snase, Pavel Moravec, and Jaroslav Pokorny, 2005. “WordNet ontology based model for web retrieval,” Proc. International Workshop on Challenges in Web Information Retrieval and Integration, pp. 220-225.
  28. Yuri A. Tijerino, David W. Embley, Deryle W. Lonsdale, and George Nagy, 2003. “Ontology generation from tables,” Proc. 4th International Conference on Web Information Systems Engineering, pp. 242-249.
  29. Ju-in Youn, He-Jue Eun, Cheol-Jung Yoo, and Yong-Sung Kim, 2004. “Adaptive documents classification system based on ontology constructed by fuzzy function and fuzzy relations,” Proc. International Conference on Cyberworlds, pp. 182-187.
Download


Paper Citation


in Harvard Style

Huang Y. and Huang Y. (2008). A FRAMEWORK AUTOMATING DOMAIN ONTOLOGY CONSTRUCTION . In Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-8111-27-2, pages 16-25. DOI: 10.5220/0001516800160025


in Bibtex Style

@conference{webist08,
author={Yin-Fu Huang and Yu-Yu Huang},
title={A FRAMEWORK AUTOMATING DOMAIN ONTOLOGY CONSTRUCTION},
booktitle={Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2008},
pages={16-25},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001516800160025},
isbn={978-989-8111-27-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - A FRAMEWORK AUTOMATING DOMAIN ONTOLOGY CONSTRUCTION
SN - 978-989-8111-27-2
AU - Huang Y.
AU - Huang Y.
PY - 2008
SP - 16
EP - 25
DO - 10.5220/0001516800160025