SEMI-AUTONOMOUS RULE ACQUISITION FRAMEWORK USING CONTROLLED LANGUAGE AND ONTOLOGY

Mye M. Sohn, Yungyu Choi

Abstract

This paper presents a framework for rule extraction from unstructured web documents. To do so, we adopted the controlled language technique to reduce the burden as well as error of a domain expert and suggest a rule extraction framework that uses ontology, to solve the problem of missing variable and value that may be caused by incomplete natural language. Here, it is referred to as NEXUCE (New rule EXtraction Using ontology and Controlled natural languagE). To evaluate the performance of the NEXUCE framework, the natural language statements were collected from the websites of Internet bookstores and the rule extraction capability was analyzed. As a result, it was proven that NEXUCE can have more than 70% of rule extraction from unstructured web documents.

References

  1. Alani, H., Sanghee Kim, Millard, D. E., Weal, M. J., Hall, W., Lewis, P. H. and Shadbolt, N. R., 2003. Automatic ontology-based knowledge extraction from Web documents, IEEE Intelligent Systems, Vol.18, No.10.
  2. Bernstein, A., Kaufmann, E. and Kaiser, C., 2005. Querying the Semantic Web with Ginseng: A Guided Input Natural Language Search Engine, 15th Workshop on Information Technology and Systems (WITS 2005).
  3. Bernstein, A. and Kaufmann, E., 2006. GINO - A Guided Input Natural Language Ontology Editor, 5th International Semantic Web Conference, Vol 4273.
  4. Duke, J., Davies, A. and Stonkus, A., 2002. OntoShare: Using Ontologies for Knowledge Sharing, Proceedings of the WWW Workshop on Semantic Web.
  5. Etchells, T. A. and Lisboa, P. J. G., 2006. Orthogonal search-based rule extraction (OSRE) for trained neural networks: a practical and efficient approach, Neural Networks, Vol.17, No. 2.
  6. Gelbukh, A., 2005. Natural Language Processing, Proceedings of the Fifth International Conference on Hybrid Intelligent Systems (HIS'05).
  7. Handschuh, S., Staab, S. and Ciravegna, F., 2002. SCREAM - Semi-automatic CREAtion of Metadata, Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web, Vol 2473.
  8. Kang, J. and Lee, J. K., 2005. Rule Identification from Web Pages by the XRML approach, Decision Support Systems, Vol. 41, No 1.
  9. Kent, A., Lancour, H., Nasri, W. Z., Daily, J.E., 1975. Encyclopaedia of Library and Information Science, CRC Press.
  10. Klein, D. and Manning, C. D., 2003, Accurate Unlexicalized Parsing, Proceedings of the 41st Meeting of the Association for Computational Linguistics (ACL 7803), Vol. 1.
  11. Lee, J.K. and Sohn, M., 2003. Extensible Rule Markup Language - toward Intelligent Web Platform, Communications of the ACM, Vol. 46, No 5.
  12. Park Sangun and Lee, J. K., 2007. Rule identification using ontology while acquiring rules from Web pages, International Journal of Human-Computer Studies, Vol. 65, No 7.
  13. Ressom, H. W., Varghese, R. S., Orvisky, E., Drake, S. K., Hortin, G. L., Abdel-Hamid, M., Loffredo, C. A. and Goldman, R., 2006. Biomaker Identification and Rule Extraction from Mass Spectral Serum Profiles, Computational Intelligence and Bioinformatics and Computational Biology (CIBCB 7806).
  14. Schwitter, R and Tilbrook, M., 2004. Controlled Natural Language meets the Semantic Web, Australasian Language Technology Workshop 2004 (ALT 7804).
  15. Sullivan, D. (2001) Document Warehousing and Text Mining, Wiley.
  16. Thompson, C. W., Pazandak, P. and Tennant, H. R., 2005. Talk to your Semantic Web, IEEE Internet Computing, and Vol. 9, No. 6.
  17. Vargas-Vera, M., Motta, E. and Domingue J.. 2001. Knowledge Extraction by using an Ontology-based Annotation Tool, K-CAP 2001 workshop on Knowledge Markup and Semantic (K-CAP 7801).
  18. Wang Chong, Miao Xiong, Qi Zhou and Youg Y., 2007. PANTO: A Potable Natural Language Interface to Ontologies, 4th European Semantic Web Conference (ESWC 7807), Vol. 4519.
Download


Paper Citation


in Harvard Style

Sohn M. and Choi Y. (2009). SEMI-AUTONOMOUS RULE ACQUISITION FRAMEWORK USING CONTROLLED LANGUAGE AND ONTOLOGY . In Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8111-66-1, pages 238-245. DOI: 10.5220/0001658702380245


in Bibtex Style

@conference{icaart09,
author={Mye M. Sohn and Yungyu Choi},
title={SEMI-AUTONOMOUS RULE ACQUISITION FRAMEWORK USING CONTROLLED LANGUAGE AND ONTOLOGY},
booktitle={Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2009},
pages={238-245},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001658702380245},
isbn={978-989-8111-66-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - SEMI-AUTONOMOUS RULE ACQUISITION FRAMEWORK USING CONTROLLED LANGUAGE AND ONTOLOGY
SN - 978-989-8111-66-1
AU - Sohn M.
AU - Choi Y.
PY - 2009
SP - 238
EP - 245
DO - 10.5220/0001658702380245