Saying is not Modelling

Christophe Roche

Abstract

In this article we claim that a conceptual model built from text is rarely an ontology. Such a conceptualization is corpus-dependent and does not offer the main properties we expect from an ontology, e.g. reusability and soundness. Furthermore, an ontology extracted from text generally does not match an ontology defined by an expert using a formal language. This result is not surprising, since an ontology is an extra-linguistic conceptualization whereas knowledge extracted from text is the concern of textual linguistics. The incompleteness of text and the use of rhetorical figures, such as synecdoche, deeply modify the perception we may have of the conceptualization. This means that ontological knowledge, which is necessary for text understanding, is in general not embedded in documents. The article ends with some remarks about formal languages. Although they make it possible to define “a specification of a conceptualization”, they nevertheless raise their own issues, mainly due to their epistemological neutrality. Ontology design remains an epistemological issue.



Paper Citation


in Harvard Style

Roche C. (2007). Saying is not Modelling. In Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007) ISBN 978-972-8865-97-9, pages 47-56. DOI: 10.5220/0002426000470056


in Bibtex Style

@conference{nlpcs07,
author={Christophe Roche},
title={Saying is not Modelling},
booktitle={Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007)},
year={2007},
pages={47-56},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002426000470056},
isbn={978-972-8865-97-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007)
TI - Saying is not Modelling
SN - 978-972-8865-97-9
AU - Roche C.
PY - 2007
SP - 47
EP - 56
DO - 10.5220/0002426000470056