Potential of Reuse. Ontology reuse is perhaps the
most obvious motivation to evaluate ontologies. Be-
fore an ontology can be reused, one has to evaluate
the ontology’s quality and most importantly, the on-
tology’s fitness for a particular purpose.
2.2 Ontology Evaluation Methodologies
Several methods for ontology evaluation have been
proposed over the years. The main methods have been
surveyed by (Vrande, 2009; Brewster et al., 2004) and
more recently by (Ouyang et al., 2011) to include the
following:
Comparison against a “Gold Standard”. The gold
standard may itself be an ontology. The problem with
this method is that it is difficult to establish the quality
of the gold standard.
User-based Evaluation. This typically involves eval-
uating the ontology through users’ experiences. The
problem with this method is that it is difficult to es-
tablish objective standards pertaining to the criteria
(metrics) for evaluation. In addition it is also hard to
establish who the right users are.
Application-based Evaluation. This would typically
involve evaluating how effective an ontology is in the
context of an application. While this may be practi-
cal for the purposes of evaluating a single ontology,
it may be challenging to evaluate a number of ontolo-
gies in an application area to determine which one is
best fitted for the application especially in an auto-
mated fashion.
Congruence Evaluation. This involves evaluating
the “fitness” or congruence between the ontology and
a domain of knowledge. Several approaches have
been pursued including comparison of the ontology
to a “gold standard” as discussed above. Another
approach is to evaluate the ontology or ontologies
against knowledge from the domain the ontologies
represent. More specifically, comparison can be made
against a corpus or text extracted from the documents
about the domain (e.g. (Brewster et al., 2004)).
Hybrid Evaluation: User-based Evaluation and
Corpus-based. This method is exemplified by
(Ouyang et al., 2011) which combines the corpus-
based and user-based evaluations. The ontology here
is evaluated against a set of metrics (coverage, coher-
ence and coupling). Users are allowed the flexibility
to weigh the influence of each of the metrics on the
evaluation.
It is important to know that there is no “gold
standard” evaluation; however, one should choose an
evaluation technique based on the purposes (reasons)
of the evaluation (Vrande, 2009).
3 PROPOSED ONTOLOGY
EVALUATION
The ontology evaluation of this paper is an instantia-
tion of the congruence ontology evaluation methodol-
ogy initially proposed by (Brewster et al., 2004). The
main motivation of this research is that, while there
are some general methodologies proposed for ontol-
ogy evaluation, there is a paucity of evidence in sup-
port of these methodologies. This is particularly true
for the congruence evaluation or data-driven ontology
evaluation methodology.
In defining and instantiating this methodology,
(Brewster et al., 2004) considered the domain of
arts. Our paper evaluates ontologies in the domain of
workflow management. The general steps followed
in this investigation are: corpus definition, similarity
calculation and statistical evaluation.
3.1 Corpus Definition and Distance
Measure
The ontologies considered in this paper pertain to the
concept of workflow. A workflow is by definition:
“The automation of a business process, in whole or
part, during which documents, information or tasks
are passed from one participant to another for
action, according to a set of procedural rules”
(WFMC, 1999).
Workflow ontologies model the workflow domain
based on concepts that have some relation to this def-
inition. This is because an ontology is a formal con-
ceptualization of a domain of interest (OMG, 2009;
W3C, 2009; Gruber, 1993). A conceptualization is
an abstraction of that which we wish to represent.The
corpus for the ontology evaluation of this paper con-
sists of text from documents about the workflow and
process modelling domain. These documents con-
sist of one hundred (100) peer reviewed academic ar-
ticles. These documents were obtained through the
assistance of three major search facilities: the IEEE
eXplore, Google Scholar and Primo Central (via the
university library). The key phrases that were used to
search for content are: Workflow modelling, Business
Process modelling, Workflow modelling languages,
Business process modelling languages. We will refer
to this corpus as the domain corpus.
In addition to the domain corpus we also define
the ontology corpus which consists of the concepts
extracted from the ontologies. This forms the docu-
ments to be compared to the domain corpus.
Following the provision of text which eventually
forms the corpus, there is a need for some representa-
KEOD2013-InternationalConferenceonKnowledgeEngineeringandOntologyDevelopment
208