Authors:
Tome Eftimov
1
;
Gordana Ispirova
2
;
Peter Korošec
3
and
Barbara Koroušić Seljak
1
Affiliations:
1
Computer Systems Department, Jožef Stefan Institute, Jamova cesta 39, 1000 Ljubljana and Slovenia
;
2
Computer Systems Department, Jožef Stefan Institute, Jamova cesta 39, 1000 Ljubljana, Slovenia, Jožef Stefan International Postgraduate School, Jamova cesta 39, 1000 Ljubljana and Slovenia
;
3
Computer Systems Department, Jožef Stefan Institute, Jamova cesta 39, 1000 Ljubljana, Slovenia, Faculty of Mathematics, Natural Sciences and Information Technologies, Glagoljaška ulica 8, 6000 Koper and Slovenia
Keyword(s):
Semantic Interoperability, RICHFIELDS Ontology, Food Information, Ontology Population, Semantic Annotation.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Information Extraction
;
Knowledge Discovery and Information Retrieval
;
Knowledge-Based Systems
;
Mining Text and Semi-Structured Data
;
Symbolic Systems
Abstract:
In an EU-funded project RICHFIELDS, a data platform was designed with the aim to collect, link and harmonize, analyze, store, and deliver food- and nutrition-related data and information to various stakeholders. To integrate heterogenous food data sets, we propose a RICHFIELDS framework for semantic interoperability of food information, which is a combination of already developed NLP approaches for the food domain. The framework includes i) a food ontology to which foods are linked, ii) a part that explains how the relevant foods can be extracted and represented in a structured way, and iii) a similarity measure that is used to link the foods to the ontology. To evaluate the RICHFIELDS framework, we selected two distinct data sets from different food information systems. The experimental results provided promising results,i.e., 81.5% and 87.5% of the foods from the first and the second data set, respectively, obtained a tag from the ontology (i.e., semantic annotation was performed).
The annotations provided by the framework allow automatic integration of food information provided in both data sets.
(More)