Authors:
Troels Andreasen
1
;
Henrik Bulskov
1
;
Tine Lassen
1
;
Sine Zambach
1
;
Per Anker Jensen
2
;
Bodil Nistrup Madsen
2
;
Hanne Erdman Thomsen
2
;
Jørgen Fischer Nilsson
3
and
Bartlomiej Antoni Szymczak
3
Affiliations:
1
Roskilde University, Universitetsvej 1, Denmark
;
2
Copenhagen Business School, Denmark
;
3
Technical University of Denmark, Denmark
Keyword(s):
Domain modelling, Ontology engineering, Natural language processing, Ontological, Content-oriented text search.
Related
Ontology
Subjects/Areas/Topics:
Applications
;
Artificial Intelligence
;
Data Engineering
;
Domain Analysis and Modeling
;
Enterprise Information Systems
;
Information Systems Analysis and Specification
;
Knowledge Engineering and Ontology Development
;
Knowledge-Based Systems
;
Natural Language Processing
;
Ontologies and the Semantic Web
;
Ontology Engineering
;
Pattern Recognition
;
Symbolic Systems
Abstract:
The scientific aim of the project presented in this paper is to provide an approach to representing, organizing, and accessing conceptual content of biomedical texts using a formal ontology. The ontology is based on UMLS resources supplemented with domain ontologies developed in the project. The approach introduces the notion of ‘generative ontologies’, i.e., ontologies providing increasingly specialized concepts reflecting the phrase structure of natural language. Furthermore, we propose a novel so-called ‘ontological semantics’ which maps noun phrases from texts and queries into nodes in the generative ontology. This enables an advanced form of data mining of texts identifying paraphrases and concept relations and measuring distances between key concepts in texts. Thus, the project gains its identity in its attempt to provide a formal underpinning of conceptual similarity or relatedness of meaning.