spective from a spatio-temporal point of view. In
(Benerecetti et al., 2001) the authors describe three
kinds of perspectives: spatio-temporal, logical, and
cognitive. Heterogeneity resulting from the first two
kinds can be solved by DL-based techniques like
SAT solver (Giunchiglia and Shvaiko, 2003). The
pragmatic heterogeneity (Bouquet et al., 2004)—
which is called semiotic heterogeneity by (Euzenat
and Shvaiko, 2007)—results from differences in inter-
preting entities with regard to a specific context: “The
intended use of entities has a great impact on their
interpretation, therefore, matching entities which are
not meant to be used in the same context is often error-
prone” (Euzenat and Shvaiko, 2007).
In our approach we focus on semantics from a
cognitive perspective which leads to pragmatic het-
erogeneity problems in ontology alignment. There-
fore, we prefer the notion model-pragmatic instead of
model-theoretic semantics. The cognitive perspective
includes the specific purpose of a modeled domain,
and therefore it is related to the (intensional) context
layer (Ehrig, 2007) of an ontology. Additionally, a
possible mismatch risk can occur at the ontology layer
which is called explication mismatch (Klein, 2001).
This mismatch results from differences in modeling
conventions (Chalupsky, 2000), which means dissimi-
larities in describing concepts. More detailed descrip-
tions of heterogeneity and mismatch types have been
given by (Visser et al., 1997), (Chalupsky, 2000),
(Klein, 2001), and (Euzenat and Shvaiko, 2007).
Another problem in ontology alignment is to give
end-users a quick and efficient overview of the source
ontologies. Additionally, they should be supported to
gain insight into the modeling process of those on-
tologies. A method which makes such an outline
feasible can give users an idea about the application
(modeling) context in which the entities are used for
a specific purpose.
This paper is structured as follows: first, we de-
scribe the need of efficient aids for user support in
schema-based ontology alignment. Then we intro-
duce our heuristic-based method align++ and present
details about its two parts. We describe the idea of
encoding context- and structure-based heterogeneity
as possible risk factors in numerical values to approx-
imate a mismatch-at-risk between ontologies. We fi-
nally underpin our research assumptions of align++
Part A with an evaluation survey.
2 APPROACH
In previous works we have proposed that in addi-
tion to the two factors entity labels and relation-
ships among entities the modeling focus on enti-
ties should be additionally considered (Mazak et al.,
2010). Analogous to the demand described by (Jani-
esch, 2010), “[...] we attempt to systematize the cur-
rent perceptions of context as relevant parameters for
the adaption of conceptual modeling methods”; and
relating to (Ehrig et al., 2004), “[...] similar enti-
ties are used in similar context”. In our approach the
entities we focus on are the concepts of ontologies
(or their classes, which are concrete representations
of concepts, respectively). Our approach considers
domain knowledge as meta-information in the form
of two indicators, an importance weighting indica-
tor and an importance outdegree indicator for classes.
We denote with domain knowledge the modeling fo-
cus, which results from the context in which a certain
domain has to be modeled.
Let us assume, for instance, that there are two on-
tologies (O
A
and O
B
) that describe the same domain
of interest, a software tool for conference organiza-
tion support (OAEI, 2009). We assume two differ-
ent usage scenarios for these ontologies. In the first
scenario, the purpose of creating both ontologies is to
describe authors and their papers (Scenario 1). There-
fore, the modeling focus of the ontology engineers is
mainly on the concepts Author, Contribution, and Ar-
ticle, as well as these concepts’ relations to other con-
cepts. In the second scenario, the specific purpose of
ontology O
A
is to describe the events and organiza-
tions of the conference (Scenario 2), while the pur-
pose of ontology O
B
remains the same as in Scenario
1. Therefore, the modeling focus of ontology O
A
in
Scenario 2 is on the concepts Working Event, Admin-
istrative Event, and Organization. The context rep-
resents the environment in which the entities of an
ontology have a certain level (importance level) of
meaning. Thus, the introduced modeling context is
equatable to the notion of application context (Ehrig
et al., 2004). The differences due to the modeling fo-
cus cause semantic, pragmatic, and also terminologi-
cal heterogeneity problems. Therefore, mismatch be-
tween ontologies may occur in the alignment process.
We have designed a heuristic-based method called
align++, which follows the objective to support
the end-user in ontology alignment by making het-
erogeneity between source ontologies visible before
starting a schema-based alignment technique. The
method provides a metric that quantifies the possible
mismatch between ontologies. It helps users to gain
a better understanding of ontologies, and disburdens
them from complex, time-, and cost-intensive tasks.
The name align++ results from the two steps in which
this method is divided, an ex ante and an ex post step.
Firstly, using the techniques of the ex ante step of
KEOD 2010 - International Conference on Knowledge Engineering and Ontology Development
18