FROM LINGUISTICS TO ONTOLOGIES - The Role of Named Entities in the Conceptualisation Process

Nouha Omrane, Adeline Nazarenko, Sylvie Szulman

Abstract

Ontologies that have been built from texts can be associated with lexical information that is crucial for the semantic annotation of texts and all semantic search tasks. However, the entire pocess of building ontologies from texts cannot be fully automated and it is important to guide the knowledge engineer during the building process. This paper presents an enriched version of TERMINAE, which is a text-based methodology for ontology design. It combines a fact-based approach of modeling with the more traditional concept-centric one. We show that named entities can be used to enrich an existing ontology and to bootstrap the acquisition process. In other words, named entities are used for the conceptualisation of ontologies and not only for their population. This approach is illustrated on two use-cases based on policy documents and evaluated by measuring the Precision and Recall of the resulting ontologies with respect to pre-existing ontologies independently built by domain experts.

References

  1. Aussenac-Gilles, N., Bourigault, D., Condamines, A., and Gros, C. (1995). How can knowledge acquisition benefit from terminology? In Proceedings of the 9th Knowledge Acquisition Workshop.
  2. Aussenac-Gilles, N., Despr/'es, S., and Szulman, S. (2008). The terminae method and platform for ontology engineering from texts. In Buitelaar, P. and Cimiano, P., editors, Bridging the Gap between Text and Knowledge, pages 199-223. IOS Press.
  3. Cimiano, P. and Völker, J. (2005). Text2onto - a framework for ontology learning and data-driven change discovery. In Proc. of the 10th Int. Conf. on Applications of Natural Language to Information Systems, pages 227-238.
  4. Faure, D. and Nédellec, C. (1999). Knowledge acquisition of predicate argument structures from technical texts using machine learning: the system asium. In et R. Stude, D. F., editor, Proc. of the 11th Int. Conf. on Knowledge Engineering and Knowledge Management (EKAW'99), pages 329-334. Springer-Verlag.
  5. Giuliano, C. and Gliozzo, A. (2008). Instance-based ontology population exploiting named-entity substitution. In Proc. of the 22nd Int. Conf. on Computational Linguistics (Coling 2008), pages 265-272, Manchester.
  6. LDC (2004). Ace (automatic content extraction) english annotation guidelines for entities. Livrable version 5.6.1 2005.05.23, Linguistic Data Consortium.
  7. Lopes, L. and Vieira, R. (2009). Automatic extraction of composite terms for construction of ontologies: an experiment in the health care area. Electronic Journal of Communication, Information and Innovation in Health, 3(1):72-84.
  8. Magnini, B., Pianta, E., Popescu, O., and Speranza, M. (2006). Ontology population from textual mentions: Task definition and benchmark. In Proc. of the OLP2 workshop on Ontology Population and Learning.
  9. Maynard, D., Li, Y., and Peters, W. (2008). NLP techniques for term extraction and ontology population. In Buitelaar, P. and Cimiano, P., editors, Bridging the Gap between Text and Knowledge, pages 199-223. IOS Press.
  10. Meyer, I., Skuce, D., Bowker, L., and Eck, K. (1992). Towards a new generation of terminological resources : an experiment in building a terminological knowledge base. In Proc. of the 15th Int. Conf. on Computational Linguistics (COLING'92), pages 956-960, Nantes, France.
  11. Morita, T., Fukuta, N., Izumi, N., and Yamaguchi, T. (2008). Doddle-owl: Interactive domain ontology development with open source software in java. IEICE Transactions on Information and Systems, 4:945-958.
  12. Nadeau, D. and Sekine, S. (2007). A survey of named entity recognition and classification. Linguisticae Investigaciones, 30(1):3-26.
  13. Studer, R., Benjamins, V. R., and Fensel, D. (1998). Knowledge engineering: Principles and methods. Data Knowl. Eng., 25:161-197.
  14. Wang, Y., Volker, J., and Haase, P. (2006). Towards semiautomatic ontology building supported by large-scale knowledge acquisition. In AAAI Fall Symposium On Semantic Web for Collaborative Knowledge Acquisition, volume FS-06-06, pages 70-77, Arlington, VA, USA. AAAI, AAAI Press.
Download


Paper Citation


in Harvard Style

Omrane N., Nazarenko A. and Szulman S. (2011). FROM LINGUISTICS TO ONTOLOGIES - The Role of Named Entities in the Conceptualisation Process . In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2011) ISBN 978-989-8425-80-5, pages 249-254. DOI: 10.5220/0003659602490254


in Bibtex Style

@conference{keod11,
author={Nouha Omrane and Adeline Nazarenko and Sylvie Szulman},
title={FROM LINGUISTICS TO ONTOLOGIES - The Role of Named Entities in the Conceptualisation Process},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2011)},
year={2011},
pages={249-254},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003659602490254},
isbn={978-989-8425-80-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2011)
TI - FROM LINGUISTICS TO ONTOLOGIES - The Role of Named Entities in the Conceptualisation Process
SN - 978-989-8425-80-5
AU - Omrane N.
AU - Nazarenko A.
AU - Szulman S.
PY - 2011
SP - 249
EP - 254
DO - 10.5220/0003659602490254