Complexity (IMC) is computed based on the ax-
ioms present (Eq. 3), where a lower value indicates
a more complex ontology with relatively more ax-
ioms declaring properties of the classes compared to
a lightweight ontology or bare taxonomy.
IMC = |C| : (|Sax| + |Eax|) (3)
Second, the OWL features used are analysed twofold:
(i) by calculating the overall percentage of use of S,
R , O, I , Q and (D), i.e., a rough measure of the OWL
2 DL features used; (ii) by converting the DL frag-
ment into a numerical value, where AL is given the
lowest value of 0 and SR OI Q the highest value of
10, to be used in correlation calculations (see below).
The DL fragment and IMC are compared as well, for
they need not be similar (e.g., a bare taxonomy with
one object property declared reflexive already ‘mer-
its’ detection of an R , but actually is still a simple on-
tology with respect to the subject domain represented,
and, vv., an ontology can be comprehensive with re-
spect to the subject domain, but originally developed
in OWL DL but not updated since OWL 2).
Basic correlations are computed for the ontology
sizes and complexities with respect to the pitfalls, and
detailed correlations are computed for certain individ-
ual pitfalls: P5, P11, P13, P25, P26, P27, P28, and
P29 are pitfalls specific to object properties, hence,
the amount of properties in the ontologies may be cor-
related to the amount of pitfalls detected, and likewise
for P3, P6, P7, P10, P21, and P24 for classes, and P8
for classes and ontology elements.
Finally, manual qualitative analyses with ontolo-
gies in Set1 and Set2 were conducted on possible false
positives and additional pitfalls.
3.2 Results
We first present the calculations and statistics, and
subsequently a representative selection of the quali-
tative evaluation of the ontologies in Set1 and Set2.
3.2.1 Aggregated and Analysed Data
The raw data of the ontologies evaluated with
OOPS! are available online at http://www.oeg-
upm.net/oops/material/KEOD2013/pitfallsAnalysis.xlsx.
The type of mistakes made by novice ontology devel-
opers are: P4, P5, P7, P8, P10, P11, P13, P19, P22,
P24, P25, P26, P27, and P29. The percentages of oc-
currence of a pitfall over the total set of 23 ontologies
in Set1 is included in Fig. 1, the average amount of
pitfalls is shown in Fig. 2, and aggregate data is listed
in Table 1. The analogous results for Set3 are shown
in Figs. 1 and Fig. 2, and in Table 1, noting that all
OOPS! pitfalls have been detected in Set3 and that
the median amount of pitfalls/ontology is similar to
that of Set1. The high aggregate values are caused
by a few ontologies each with around 5000 or more
detected pitfalls; without P8 (annotations), there are
three ontologies that have more than 1000 detected
pitfalls at the time of scanning the ontology. The
results obtained with the 21 well-known ontologies
(Set2) can be found in the same table and figures,
and include pitfalls P2, P4, P5, P7, P8, P10, P11,
P12, P13, P19, P20 (0 upon manual assessment),
P21, P22, P24, P25, P26, P27, and P29, noting that
the percentages and averages differ little from those
of the novices and random ones. The high aggregate
values for Set2 is largely due to OBI with a pitfall
count of 3771 for P8 (annotations) and DMOP with a
pitfall count of 866 for P8; without P8, OBI, DMOP,
and the Government Ontology exceeded 100 pitfalls
due to P11 (missing domain and range axioms) and
P13 (missing inverses—but see also below). P8 is an
outlier both in prevalence and in quantity for all three
sets of ontologies and only some of the ontologies
have very many missing annotations, which skews the
average, as can be observed from the large standard
deviations.
For Set1 and Set2, we collected data about the
content of the ontologies and analysed them against
the pitfalls, as described in Section 3.1. The usage of
the OWL 2 DL features in Set1 are: S 44%, R 26%, I
83%, O 26%, Q 52%, and D 17%, whereas for Set2,
the percentages are 62%, 19%, 81%, 24%, 5%, and
86%, respectively; the difference is largely due to the
difference in timing of the development of the ontol-
ogy, with some of the well-known ontologies having
been developed before the OWL2 standard, and the
use of data properties was discouraged in the lectures
for the ontologies in Set1. In order to include the DL
fragment in the analyses, we assigned values to the
fragments prior to analysis, ranging from a value of
0 for an ontology in AL(D) to 10 for an ontology
in S R OI Q (D), and intermediate values for others
(e.g., ALCH I(D) with a value 3 and S H I F with
value 6—see supplementary data). With the calcu-
lated IMC (recall Eq. 3), the correlation between DL
fragment and the IMC is -0.18 for the ontologies in
Set1 and -0.74 for the ontologies in Set2. This pos-
sibly may change a little by tweaking the values as-
signed to the DL fragments, but not such as to obtain
a strong, meaningful correlation between detected DL
fragment and the IMC.
Correlations for several measures are included in
Table 2. The only substantial correlations found are
between all pitfalls per ontology elements and size (in
boldface), although with all pitfalls minus P8, there
is no obvious correlation anymore. p-values were
computed with the 1-tailed unpaired Student t-test,
TheCurrentLandscapeofPitfallsinOntologies
135