Authors:
Ozgul Unal
and
Hamideh Afsarmanesh
Affiliation:
Informatics Institutes, University of Amsterdam, Netherlands
Keyword(s):
Schema Matching, Linguistic Matching, Collaborative Networks, Biodiversity.
Related
Ontology
Subjects/Areas/Topics:
Business Analytics
;
Communication and Software Technologies and Architectures
;
Data Engineering
;
Data Semantics
;
Data Warehouses and Data Mining
;
e-Business
;
Enterprise Information Systems
Abstract:
In order to deal with the problem of semantic and schematic heterogeneity in collaborative networks, matching components among database schemas need to be identified and heterogeneity needs to be resolved, by creating the corresponding mappings in a process called schema matching. One important step in this process is the identification of the syntactic and semantic similarity among elements from different schemas, usually referred to as Linguistic Matching. The Linguistic Matching component of a schema matching and integration system, called SASMINT, is the focus of this paper. Unlike other systems, which typically utilize only a limited number of similarity metrics, SASMINT makes an effective use of NLP techniques for the Linguistic Matching and proposes a weighted usage of several syntactic and semantic similarity metrics. Since it is not easy for the user to determine the weights, SASMINT provides a component called Sampler as another novelty, to support automatic generation of w
eights.
(More)