Ontologies as Theories

Marco A. Casanova

, Angela Pinheiro

, Jose Antonio F. Macedo

, Eveline Sacramento

and Vania M. P. Vidal

Departamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro, Brazil

Departamento de Computação, Universidade Federal do Ceará, Fortaleza, Brazil

Keywords: Constraint Specification, Ontology Design, Linked Data Publication.

Abstract: This position paper argues that certain familiar ontology design problems are profitably addressed by

treating ontologies as theories and by defining a set of operations that create new ontologies, including their

constraints, out of other ontologies. The paper first illustrates how to use the operations in the context of

ontology reuse. It goes further and shows how to take advantage of the operations to compare different

ontologies, or different versions of an ontology, and to design mediated schemas in a bottom up fashion.

The discussion in this position paper is backed up by a tool that implements the operations and that offers

other facilities to manipulate ontologies.

1 INTRODUCTION

In this position paper we argue that certain familiar

ontology design problems are profitably addressed

by treating ontologies as theories and by defining a

set of operations on ontologies.

In more detail, we define an ontology as a pair

O=(V,



) such that V is a vocabulary and



is a set of

constraints in V. The theory of



is the set of all

constraints that are logical consequences of



. We

emphasize that the constraints in



capture the

semantics of the terms in V and must, therefore, be

brought to the foreground. The theory of



identifies

the constraints that are implicitly defined, but which

must be considered when using the ontology.

The operations we propose create new

ontologies, including their constraints, out of other

ontologies. Such operations extend the idea of

namespaces to take into account constraints and help

address familiar ontology design problems, which

we now outline to further motivate the discussion.

Consider first the problem of designing an

ontology to publish data on the Web. If the designer

follows the Linked Data principles (Bernes-Lee,

2006); (Bizer et al., 2007), he must select known

ontologies, as much as possible, to organize the data

so that applications “can dereference the URIs that

identify vocabulary terms in order to find their

definition”. We argue that the designer should go

further and analyze the constraints of the ontologies

from which he is drawing the terms to construct his

vocabulary. Furthermore, he should publish the data

so that the original semantics of the terms is

preserved. To facilitate ontology design from this

perspective, we introduce three operations on

ontologies, called projection, union and deprecation.

Consider now the problem of comparing the

expressive power of two ontologies, O

=(V



) and

=(V



). If the designer wants to know what they

have in common, he should create a mapping

between their vocabularies and detect which

constraints hold in both ontologies, after the terms

are appropriately mapped. The intersection

operation answers this question. We argued

elsewhere (Casanova et al., 2010) that intersection is

also useful to address the design of mediated

schemas that combine several export schemas in a

way that the data exposed by the mediator is always

consistent.

On the other hand, if the designer wants to know

what holds in O

=(V



), but not in O

=(V



), he

should again create a mapping between their

vocabularies and detect which constraints hold in the

theory of



, but not in the theory of



, after the

terms are appropriately mapped. The difference

operation answers this question.

Finally, a variant of ontology comparison is the

problem of analyzing what changed from one

version of an ontology to the other. Difference is

317

A. Casanova M., Pinheiro A., F. Macedo J., Sacramento E. and M. P. Vidal V..

Ontologies as Theories.

DOI: 10.5220/0004562603170322

In Proceedings of the 15th International Conference on Enterprise Information Systems (ICEIS-2013), pages 317-322

ISBN: 978-989-8565-60-0

 2013 SCITEPRESS (Science and Technology Publications, Lda.)

especially useful here.

The presentation of this position paper is

necessarily informal to stress the major point:

familiar ontology design problems can be properly

addressed by treating ontologies as theories and by

defining a set of operations on ontologies. The

machinery to handle constraints developed in

(Casanova et al., 2010; 2011; 2012a; 2012b)

provides the theoretical foundations of the paper.

Previous work by the authors (Casanova et al., 2011)

introduced the notion of open fragment, which is

captured by projection. The design of mediated

schemas was addressed in (Casanova et al., 2010). A

tool that implements the operations and that offers

other facilities to manipulate ontologies (Pinheiro,

2013) covers the practical aspects of the discussion.

The paper is organized as follows. Section 2

reviews background concepts and notation. Section

3 introduces the operations. Sections 4 and 5 discuss

how to use the operations to address ontology design

problems. Section 6 summarizes related work.

Section 7 contains the conclusions.

2 BACKGROUND

The examples in Sections 4 and 5 use the basic

notation of Description Logic (Baader et al., 2003).

Very briefly, a vocabulary V consists of a set of

atomic concepts, a set of atomic roles, and the

bottom concept . A language in V is a set of

strings, using symbols in V, defining the set of

concept descriptions in V and the set of role

descriptions in V.

An inclusion in V is a string of the form u ⊑ v,

where u and v both are concept descriptions in V or

both are role descriptions in V. Table 1 shows the

common types of inclusions used in the examples.

An ontology is a pair O=(V,



) such that V is a

vocabulary and



is a set of inclusions in V, called

the ontology constraints.

The theory of



(or the theory of O), denoted



[



] (or O[



]), is the set of all logical consequences



We say that two sets of inclusions,



and



, are

equivalent, denoted







, iff their theories are

equal, that is, the set of all logical consequences of



is equal to that of



. Likewise, two ontologies

=(V



) and O

=(V



) are equivalent, also

denoted O

 O

, iff



and



are equivalent.

3 ONTOLOGY OPERATIONS

This section introduces the ontology operations we

propose. Sections 4 and 5 illustrate their application

to ontology design problems.

Definition 1: Let O

=(V



) and O

=(V



) be two

ontologies, W be a subset of V

, and



be a set of

constraints in V

(i) The projection of O

=(V



) over W, denoted



[W](O

), returns the ontology O

=(V



where V

=W and



is the set of constraints in



[



] that use only classes and properties in W.

(ii) The deprecation of



fromO

=(V



denoted



[



](O

), returns the ontology

=(V



), where V

and







(iii) The union of O

=(V



) and O

=(V



denoted O

O

, returns the ontology

=(V



), where V

V

and







(iv) The intersection of O

=(V



) and O

=(V



denoted O

O

, returns the ontology

=(V



), where V

V

and





[



]



[



(v) The difference of O

=(V



) and O

=(V



denoted O

O

, returns the ontology

=(V



), where V

and





[



]



[



Note that deprecation does not reduce to difference

since, in general, we have



[



] =



[







] ≠



[



] 



[



]

We also note that the ontology O

that results from

an operation is unique, by definition. However, there

might be several ontologies that are equivalent to

. For example, if O

= (V



) is the projection of

on W, there might be several sets of constraints

that are equivalent to the set of constraints in the

theory of O

that use only terms in W. This simple

observation impacts the implementation of the

operations, discussed elsewhere (Pinheiro, 2013).

Finally, we observe that we may generalize

union, intersection and difference by considering a

renaming of one or both vocabularies of the

ontologies involved and appropriately renaming the

terms that occur in the constraints when comparing

the theories. This extension is considered in the first

example presented in Section 5 (Table 2).

4 PROJECTION, DEPRECATION

AND UNION

Projection allows the designer to define a set W

ICEIS2013-15thInternationalConferenceonEnterpriseInformationSystems

318

Table 1: Common inclusion types used in conceptual modeling.

Name Inclusion type Informal semantics

Domain Constraint

( 1 P) ⊑ C

property P has class C as domain, that is,

if (a,b) is a pair in P, then a is an individual in C

Range Constraint

( 1 P¯) ⊑ C

property P has class C as range, that is,

if (a,b) is a pair in P, then b is an individual in C

minCardinality

Constraint

C ⊑ ( k P) or

C ⊑ ( k P¯)

property P or its inverse P¯ maps each individual

in class C to at least k distinct individuals

maxCardinality

Constraint

C ⊑( k+1 P) or

C ⊑ ( k+1 P¯)

property P or its inverse P¯ maps each individual

in class C to at most k distinct individuals

Subset Constraint

C ⊑ D

each individual in C is also in D, that is,

class C denotes a subset of class D

Disjointness

Constraint

C ⊑ D

no individual is in both C and D, that is,

classes C and D are disjoint

Table 2: Partial Intersection of the DBLP and Lattes ontologies.

(a) DBLP (b) Lattes (c) Intersection

Article ⊑Publication Article ⊑Document Article ⊑Publication

Conference ⊑Event Book ⊑Document

ConferencePaper ⊑Article Collection ⊑Document

Continent ⊑Place Phdthesis ⊑Document

Proceedings ⊑Publication Proceedings ⊑Document Proceedings ⊑Publication

Professor ⊑Person Series ⊑Document

ConferencePaper ⊑Document ConferencePaper ⊑Publication

containing just a few terms from the vocabulary of

an ontology and retain the semantics of the terms in

W through the constraints, derivable from those of

the ontology, that apply to the terms in W.

Deprecation simply allows the designer to drop

constraints from an ontology. Finally, union allows

the designer to combine two ontologies. These three

operations offer the designer powerful tools to

(partially) reuse vocabularies and to preserve the

semantics of the terms. In the rest of this section, we

further motivate this argument with the help of an

example that uses the Music Ontology (Raimond et

al., 2010).

The Music Ontology (MO) provides concepts

and properties to describe artists, albums, tracks,

performances, arrangements, etc. on the Semantic

Web. It is used by several Linked Data sources,

including MusicBrainz and BBC Music. The Music

Ontology RDF schema uses terms from the Friend

of a Friend (FOAF) (Brickley and Miller, 2010) and

the XML Schema (XSD) vocabularies. We

respectively adopt the prefixes “mo:”, “foaf:” and

“xsd:” to refer to these vocabularies.

Figure 1 shows the class hierarchies of MO

rooted at classes foaf:Agent and foaf:Person. Let us

focus on this fragment of MO.

We first recall that FOAF has a constraint

informally formulated as:

foaf:Person and foaf:Organization are disjoint classes

Let V

be the following set of terms from the FOAF

and the XSD vocabularies, and let V

contain the rest

of the terms that appear in Figure 1:

= { foaf:Agent, foaf:Person, foaf:Group,

foaf:Organization, foaf:name, xsd:string }

= { mo:MusicArtist, mo:CorporateBody,

mo:SoloMusicArtist, mo:MusicGroup,

mo:Label, mo:member_of }

Let O

=(V



) be the ontology obtained by the

projection of FOAF over V

and defined in such a

way that



is the set of constraints over V

that are

logical consequences of the constraints of FOAF:



= { (1 foaf:name) ⊑ foaf:Person,

(1 foaf:name¯

) ⊑ xsd:string,

foaf:Person ⊑ foaf:Organization,

foaf:Group ⊑ foaf:Agent,

foaf:Organization ⊑ foaf:Agent }

Let O

=(V



) be such that



contains just the

subset constraints over V

shown in Figure 1:



= { mo:SoloMusicArtist ⊑mo:MusicArtist,

mo:MusicGroup ⊑mo:MusicArtist,

mo:Label ⊑mo:CorporateBody }

Then, most of Figure 1 is captured by the union of

and O

, defined as the ontology O

=(V



where V

= V

 V

and







The constraints shown in Figure 1, but not

included in O

, are obtained by the union of

OntologiesasTheories

319

Figure 1: The class hierarchies of MO rooted at classes foaf:Agent and foaf:Person.

=(V



) with O

=(V



) (the ontologies have the

same vocabulary), where



= { mo:SoloMusicArtist ⊑foaf:Person,

mo:MusicGroup ⊑foaf:Group,

mo:MusicArtist ⊑foaf:Agent,

mo:CorporateBody ⊑foaf:Organization,

(1 mo:member_of) ⊑foaf:Person,

(1 mo:member_of¯) ⊑foaf:Group }

The union returns the ontology O

=(V



), where V

= V

and







. Finally, we construct

=(V



), the ontology that corresponds to Figure

1 as:

= (([V

](FOAF)  O

)  O

)

The reader is invited to reflect upon the definition of

. We contend that the expression defined using the

operations provides a reasonable explanation of how

is constructed from FOAF and additional terms

and constraints.

5 INTERSECTION

AND DIFFERENCE

Intersection and difference help the designer

compare the expressive power of two ontologies. If

the designer wants to know what the ontologies have

in common, he should use intersection. On the other

hand, if he is interested in what holds in O

=(V



but not in O

=(V



), he should use difference.

To illustrate the use of intersection, we analyze

two data sources from the scientific research

domain, DBLP and Lattes. DBLP (Digital

Bibliographic and Logic Programming) stores

Computer Science bibliographic references – over

half a million references – and links to researchers’

homepages. Lattes is a database, organized by CNPq

– the Brazilian Research Agency, storing

researchers’ CVs and research group descriptions.

Assume that the Lattes vocabulary suffers a

renaming where Document is mapped to Publication.

To simplify the discussion, Table 2 shows just a

few constraints from each data sources. Column (a)

shows the DBLP constraints, Column (b), the Lattes

constraints, and Column (c) the constraints in the

intersection.

For example, Line 1 of the table indicates that Article

⊑Publication is a constraint in both ontologies, after

Document is renamed to Publication, and hence is in

their intersection. Line 7(b) indicates that

ConferencePaper ⊑ Publication is a constraint of the

Lattes ontology, again after Document is renamed to

Publication; whereas Lines 1(a) and 3(a) implies that

ConferencePaper ⊑ Publication is in the theory of the

DBLP ontology; hence this constraint is in the

intersection of the ontologies, as shown in Line 1(c).

To illustrate the use of difference, consider a

scenario where a domain specialist adopted the

version of the FOAF ontology released on January

, 2010 (call it FOAF1). However, on August 9

2010, a new release of the FOAF ontology was

published (call it FOAF2). The specialist then wants

to verify what changed from one version to the

other. He can then compute the difference between

FOAF1 and FOAF2 (and vice-versa).

Given this scenario, Table 3 shows the (partial)

ICEIS2013-15thInternationalConferenceonEnterpriseInformationSystems

320

Table 3: Partial difference between two versions of the FOAF ontology.

(a) FOAF1 (January 1

, 2010) (b) FOAF2 (August 9

, 2010) (c) Difference

Agent ⊑



Document

Project ⊑Document Project ⊑



Document Project ⊑



Image

Person ⊑



Document

Organization ⊑Document Organization ⊑



Document Organization ⊑Image

Group ⊑Agent Group ⊑Agent

Image ⊑Document

difference between FOAF1 and FOAF2.

Line 2(c) indicates that the constraint

Project ⊑ Image is in the difference between

FOAF1 and FOAF2. Indeed, since

Image ⊑Document is in the theory of FOAF1, we

have that Document ⊑Image is also in the theory

of FOAF1. Hence, since Project ⊑ Document is in

the theory of FOAF1 (in fact, it is a constraint of

FOAF1, according to Line 2(a)), we infer that

Project ⊑ Image is in the theory of FOAF1.

However, this constraint is not in the theory of

FOAF2.

Likewise, Line 4(c) indicates that

Organization ⊑Image is in the theory of FOAF1, but

not in the theory of FOAF2.

Finally, Line 6(a) indicates that FOAF1 has a

constraint, Image ⊑ Document, which is not in the

theory of FOAF2.

6 RELATED WORK

The results reported in the paper cover a topic –

improving Linked Data design by constraint reuse –

that is still neglected in the literature. The question

of Linked Data semantics is not new, though. For

example, recent investigation (Halpin and Haynes,

2010); (Jaffrin et al., 2008); (McCuster and

McGuinness, 2010) in fact questions the correct use

of owl:sameAs to inter-link datasets.

Jain et al., (2010) argues that the Linked Open

Data (LoD) Cloud, in its current form, is only of

limited value for furthering the Semantic Web

vision. They discuss that the Linked Open Data

Cloud can be transformed from “merely more data”

to “semantically linked data” by overcoming

problems such as lack of conceptual descriptions for

the datasets, schema heterogeneity and absence of

schema level links. Along this line, we advocated

that the design of Linked Data sources must include

constraints derived from those of the underlying

ontologies.

We note that the problem we cover in this paper

cannot be reduced to a question of ontology

alignment in the context of Linked Data, addressed

for example in (Prateek et al., 2010); (Wang et al.,

2011). Indeed, we stress that the problem we focus

on refers to bootstrapping a new ontology (including

its constraints) through the implementation of

ontology algebra operations (projection, deprecation,

union, intersection and difference) over one or more

existing ontologies.

Some tools, such as Prompt (Noy and Musen,

2000) and ODEMerge (Ramos, 2001), allow the user

to combine two or more ontologies in a

semiautomatic or in an automatic way. Other tools,

such as PromptDiff (Noy et al., 2004) and OntoDiff

(Tury and Bieliková, 2006), deal with ontology

change detection. However, these tools cannot

capture changes in the semantics of the terms, as the

OntologyManagement tool described in (Pinheiro,

2013), which is based on the operations described in

this paper. Furthermore, the OntologyManagement

tool offers to the user a complete environment to

design and maintain ontologies, which allows

applying a series of operations over one or more

ontologies and enabling reuse, versioning, evolution

and integration of ontologies. Volz et al., (2003)

proposes a tool that implements the projection

operation by the creation of a database view

resulting from query execution. However, this tool

does not allow the generation of semantic

information captured by the constraints that apply to

the vocabulary terms.

Finally, previous work by the authors (Casanova

et al., 2011) introduced the notion of open fragment,

captured by the projection operation, and (Casanova

et al., 2012b) covered some of the operations

discussed in this paper.

7 CONCLUSIONS

In this position paper we argued that certain familiar

ontology design problems could be profitably

addressed by treating ontologies as theories and by

defining a set of operations on ontologies. Such

operations extend the idea of namespaces to take

into account constraints.

A tool that implements the operations and that

OntologiesasTheories

321

offers other facilities to manipulate ontologies is

operational (Pinheiro, 2013). This tool and was used

to test the ideas and to generate the examples partly

described in Sections 4 and 5.

As for future work, we intend to integrate the

ontology management tool with the Protégé

ontology editor. The goal of this integration is to

take advantage of all functionalities already

available in Protégé, such as ontology modeling and

visualization, inference and reasoning tasks.

ACKNOWLEDGEMENTS

This work was partly supported by CNPq, grants

301497/2006-0, 483552/2009-7, 308247/2009-4,

475717/2011-2, by FAPERJ under grant E-

26/103.070/2011, by CAPES under grant NF

21/2009, and by FUNCAP under grant GPF

2151/22.

REFERENCES

Artale, A., Calvanese, D., Kontchakov, R.,

Zakharyaschev, M., 2009. The DL-Lite family and

relations. J. of Artificial Intelligence Research 36(1),

1–69.

Baader, F., Nutt, W. 2003. Basic Description Logics. In:

F. Baader, D. Calvanese, D.L. McGuiness, D. Nardi,

P. F. Patel-Schneider (eds), The Description Logic

Handbook: Theory, Implementation and Applications,

Cambridge U. Press, Cambridge, UK, 43–95.

Berners-Lee, T., 2006. Linked Data - Design Issues. W3C.

Bizer, C., Cyganiak, R., Heath, T., 2007. How to publish

Linked Data on the Web. http://www4.wiwiss.fu-

berlin.de/bizer/pub/LinkedDataTutorial/

Brickley, D., Miller, L., 2010. FOAF Vocabulary

Specification 0.98. Namespace Document 9. Marco

Polo Edition.

Casanova, M. A., Lauschner, T., Leme, L. A. P. P.,

Breitman, K. K., Furtado, A. L., Vidal, V. M. P., 2010.

Revising the Constraints of Lightweight Mediated

Schemas. Data & Knowledge Engineering 69(12),

1274–1301.

Casanova, M. A., Breitman, K. K., Furtado, A. L., Vidal,

V. M. P., Macêdo, J. A. F., 2011. The Role of

Constraints in Linked Data. Proceedings of the

Confederated International Conferences: CoopIS,

DOA-SVI, and ODBASE 2011, Part II. Lecture Notes

in Computer Science v.7045. Springer, 781–799.

Casanova, M. A., Breitman, K. K., Furtado, A. L., Vidal,

V. M. P., Macêdo, J. A. F., 2012a. An Efficient Proof

Procedure for a Family of Lightweight Database

Schemas. In: Michael G. Hinchey (ed.), Conquering

Complexity, Springer, 431–461.

Casanova, M. A., Macêdo, J. A. F., Sacramento, E.,

Pinheiro, A. M. A., Vidal, V. M. P., Breitman, K. K.,

Furtado, A. L., 2012b. Operations over Lightweight

Ontologies. Proc. 11th International Conference on

Ontologies, DataBases, and Applications of Semantics

- ODBASE 2012 (Sept. 11-12, 2012), Rome. LNCS

7566, 646–663.

Halpin, H., Hayes, P. J., 2010.When owl:sameAs isn’t the

same: An analysis of identity links on the semantic

web. In Proc. Int’l. Workshop on Linked Data on the

Web.

Jaffri, A., Glaser, H., Millard, I., 2008. URI

disambiguation in the context of linked data. In Proc.

1st Int’l. Workshop on Linked Data on the Web.

Jain, P., Hitzler, P., Yeh, P. Z., Verma, K., Sheth, A. P.,

2010. Linked Data is Merely More Data. In: Proc.

AAAI Spring Symp. ‘Linked Data Meets Artificial

Intelligence’, 82–86.

McCusker, J., McGuinness, D.L., 2010. owl:sameAs

considered harmful to provenance. In Proc. ISCB

Conference on Semantics in Healthcare and Life

Sciences.

Noy N. F., Kunnatur, S., Klein, M. and Musen, M. A.,

2004. Tracking changes during ontology evolution. In

Sheila A. Mcilraith, Dimitris Plexousakis and Frank

van Harmelen. In: Proc. 3rd International Semantic

Web Conference, 259–273, Hiroshima, Japan.

Pinheiro, A. M. A., 2013. “OntologyManagement Tool –

Uma Ferramenta para Gerenciamento de Ontologias

como Teorias Lógicas”. M.Sc. Dissertation, Dept.

Computing, UFC.

Prateek, J., Hitzler, P., Sheth, A. P., Verma, K., Yeh, P. Z.,

2010. Ontology alignment for linked open data. In:

Proc. 9th Int’l. Semantic Web Conf. Springer-Verlag,

402–417.

Raimond, Y., Giasson, F., 2010. Music Ontology

Specification. Specification Document.

Ramos, J. A. Mezcla automática de ontologías y catálogos

electrónicos, 2001, Final YearProject. Facultad de

Informática de la Universidad Politécnica de Madrid.

Spain.

Tury, M. and Bieliková, M., 2006. An Approach to

Detection Ontology Changes. In: Proc. 6th

Internacional Conference on Web Engineering, 14,

New York, NY, USA. ACM Press.

Volz, R., Oberle, D. and Studer, R., 2003. Implementing

Views for Light-Weight Web Ontologies. In: Proc. of

Int’l Database Engineering and Application

Symposium - IDEAS, 160–169, Hong Kong, China.

IEEE Computer Society.

Wang, Z., Zhang, X., Hou, L., Li, J. 2011. RiMOM2: A

Flexible Ontology Matching Framework. In: Proc.

ACM WebSci'11, Koblenz, Germany, 1–2.

ICEIS2013-15thInternationalConferenceonEnterpriseInformationSystems

322