A COMPREHENSIVE APPROACH FOR SOLVING POLICY
HETEROGENEITY
Rodolfo Ferrini and Elisa Bertino
Department of Computer Science, Purdue University, U.S.A.
Keywords:
Policy Analysis, Policy Heterogeneity, Ontology Extraction, Ontology Merging.
Abstract:
With the increasing popularity of collaborative applications, policy-based access control models have become
the usual approach for access control enforcement. In the last years several tools have been proposed in
order to support the maintenance of such policy-based systems. However, none of these tools is able to
deal with heterogeneous policies, that is, policies that belong to different domains and thus adopt different
terminologies. In this paper, we propose a stack of functions that allows us to create a unified vocabulary for a
multidomain policy set. This unified vocabulary can then be exploited by analysis tools, improving the accuracy
of their results and thus their applicability in real-case scenarios. In our model, we represent the vocabulary of a
policy by means of ontologies. With an ontology it is possible to describe a certain domain of interest, providing
richer information than a plain list of terms. On top of this additional semantic data it is possible to define complex
functions such as ontology matching, merging, and extraction, which can be combined in the creation of
the unified terminology for the policies under consideration. Along with the definition of the proposed model,
detailed algorithms are also provided. We also present experimental results which demonstrate the efficiency
and practical value of our approach.
1 INTRODUCTION
The increasing popularity of collaborative applica-
tions and technologies has improved the flexibility
and scalability in data provisioning and resource shar-
ing. Access control in such environments is usually
enforced by using policy-based access control mod-
els supported by specialized services, often based on
the principles of the XACML reference architecture.
Such an approach allows one to decouple access control management from the application logic. However,
even for simple scenarios, the maintenance of consistent access control policy sets is not a trivial task
(in the remainder of the paper we use the term 'policy' as a shorthand for 'access control policy').
Therefore tools supporting the analysis of policies are crucial, especially for highly dynamic
environments. In the last years, several approaches and tools for policy analysis have been proposed (Fisler
et al., 2005; Rao et al., 2008; Kolovski et al., 2007).
However, a common shortcoming of these tools is that the analyses they support are based on string
matching of the terms used in the policies. Such a simple approach assumes that different strings
represent different concepts (a.k.a. the Unique Name Assumption, UNA). This assumption unfortunately
does not always hold. When the analysis is performed on policies from different organizations or
administrative domains, it is often the case that different terminologies are used and thus the same concept
may be represented by different terms. In such a context, current policy analysis tools have limited
applicability.
In this paper we address the problem of policy het-
erogeneity by proposing a stack of technologies that,
when applied to a multidomain policy set, is able to
generate a unified terminology that can be exploited
by the analysis tools without requiring changes to
these tools. The key feature of our approach is the
adoption of ontologies for the specification of the vo-
cabulary of policies. The advantage of using semantic
schemas lies in the powerful techniques that can be used
for generating mappings between entities belonging
to these schemas. However, we cannot assume that
policies define their vocabulary according to an on-
tology and for this reason a comprehensive approach
for ontology management in the specific context of
policy analysis is needed. The goal of our work is to
devise and implement such an approach. We cast our
work in the context of the XACML standard policy
language, because it is a well-known and general language. Our approach can however be almost directly
applied to other attribute-based access control models.
The rest of the paper is organized as follows.
In Section 2 we give background information about
XACML, ontologies, and ontology mapping. In
Section 3 we present an overview of our approach.
Preliminary concepts are introduced in Section 4,
whereas in Section 5 we describe the key functions
underlying our approach. Section 6 reports imple-
mentation details and experimental results. Finally,
Sections 7 and 8 discuss related work and conclu-
sions, respectively.
2 BACKGROUND NOTIONS AND
PRELIMINARY DEFINITIONS
In this section we introduce background notions and
preliminary definitions that are used throughout the
rest of the paper.
2.1 XACML
XACML (eXtensible Access Control Mark-up Lan-
guage) (Moses, 2005) is the OASIS standard language
for the specification of access control policies. It is an
XML language able to express a large variety of poli-
cies, taking into account properties of subjects and
protected objects as well as context information. In
general, a subject can request an action to be executed
on a resource and the policy decides whether to deny
or allow the execution of that action. Several profiles,
such as an RBAC profile, and a privacy profile, have
been defined for XACML. An XACML policy con-
sists of three major components, namely a
Target
,
a
Rule set
, and a
Rule Combining Algorithm
for
conflict resolution. The
Target
identifies the set of
requests that the policy is applicable to. It contains at-
tribute constraints characterizing subjects, resources,
actions, and environments. Each Rule in turn con-
sists of another optional
Target
, a
Condition
, and
an
Effect
element. The rule Target has the same
structure as the policy Target. The Condition spec-
ifies restrictions on the attribute values, provided as
part of the request, that must hold in order for the re-
quest to be permitted or denied as specified by the
Effect. The Effect specifies whether the requested ac-
tions should be allowed (
Permit
) or denied (
Deny
).
The Rule combining algorithm is used to solve con-
flicts among applicable rules with different effects. In
our context, we are interested in the user-defined val-
ues used in policies that from now on we refer to as
policy terms. We thus assume that one can extract
such information from XACML policies; we assume
also that such information is represented in the form
of hattribute,valuei pairs.
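As a purely illustrative sketch of this ⟨attribute, value⟩ representation in Java (the names PolicyTerm and PolicyVocabulary are our own and do not belong to the Sun XACML implementation or to any other library), one could use:

    import java.util.List;

    // One <attribute, value> pair extracted from an XACML Target or Condition,
    // e.g., <role, "doctor"> or <age, 30>.
    public class PolicyTerm {
        public final String attribute;  // the XACML AttributeId (user-defined name)
        public final String value;      // the attribute value, kept in its lexical form
        public final String datatype;   // the XML Schema datatype URI of the value

        public PolicyTerm(String attribute, String value, String datatype) {
            this.attribute = attribute;
            this.value = value;
            this.datatype = datatype;
        }
    }

    // The vocabulary of a policy P_i is then simply the list Pairs(P_i) of its terms.
    interface PolicyVocabulary {
        List<PolicyTerm> pairs();  // Pairs(P_i): all <attribute, value> pairs of the policy
    }

Keeping the datatype alongside the value is convenient later, since the extraction mapping of Section 4.2 treats string and numeric values differently.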
2.2 Ontologies and Ontology Matching
An ontology typically provides the specification of a
domain of interest in terms of classes, class instances
(or individuals), and the relations according to which
these classes are related. The actual W3C standard
Ontology Web Language (OWL) is an XML-based
language with a well defined semantics grounded in
Description Logics (DL). In OWL relations are distin-
guished into Object properties and Datatype proper-
ties: Object properties relate instances of two classes,
whereas Datatype properties relate class instances to
some typed value. Given an ontology O_i, from now on we denote the set of the entities that belong to
O_i as E(O_i). Moreover, we use the terms ontology, knowledge base, and vocabulary of a policy as
synonyms. Ontology matching is the process whereby two ontologies are related at the conceptual
level. From now on we refer to a matching between two ontologies O_i and O_j as π_{O_i,O_j}. State-of-the-art
ontology matching approaches are able to generate mappings that reduce to the same high-level general
form, as proposed in (Shvaiko and Euzenat, 2005). Given two ontologies O_i and O_j, a mapping
element is a tuple ⟨e_{O_i}, e_{O_j}, s⟩, where e_{O_i} ∈ E(O_i), e_{O_j} ∈ E(O_j), and s is a confidence measure in some
mathematical structure (typically in the [0, 1] interval). We discuss the specific ontology matching
algorithm adopted in our approach in Section 4.1.
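To make the notion of mapping element concrete, the following minimal Java sketch shows one possible representation, together with the threshold-based filtering used later in Section 4.3; the class and method names are our own illustration and are not part of the Falcon-AO API.

    import java.util.ArrayList;
    import java.util.List;

    // A minimal, illustrative representation of a mapping element <e_Oi, e_Oj, s>.
    // Entity identifiers are kept as plain strings (e.g., ontology IRIs) for simplicity.
    public class MappingElement {
        public final String source;      // entity e_Oi from ontology O_i
        public final String target;      // entity e_Oj from ontology O_j
        public final double confidence;  // similarity score s, typically in [0, 1]

        public MappingElement(String source, String target, double confidence) {
            this.source = source;
            this.target = target;
            this.confidence = confidence;
        }

        // Keep only the elements whose confidence exceeds an acceptance threshold tau.
        public static List<MappingElement> accept(List<MappingElement> mapping, double tau) {
            List<MappingElement> accepted = new ArrayList<>();
            for (MappingElement me : mapping) {
                if (me.confidence > tau) {
                    accepted.add(me);
                }
            }
            return accepted;
        }
    }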
3 OVERVIEW OF THE
APPROACH
The key idea in our approach is the definition of a process for the creation of a unified vocabulary with
respect to which all the policies of a multidomain policy set can be specified (by multidomain policy set
we refer to a set including policies from different domains or organizations). In our model, we adopt
ontologies for the formalization of the policy vocabulary. Since an ontology provides richer information
than a simple list of strings, it is well suited for representing the terminology of a policy. However, we
cannot assume that all policies in a multidomain policy set adopt the same ontology for the specification
of their vocabularies. The reason is that the development of an ontology is often a complex, time-consuming and
error-prone task. Hence, it is usually a good prac-
tice to reuse, when possible, already defined ontolo-
gies instead of generating new ones. However, dif-
ferent ontologies may describe overlapping domains
and these similarities need to be detected to avoid rep-
etition of entities in the unified vocabulary we aim
to create. Moreover, the definition of the vocabu-
lary becomes even more complicated when a policy
combines entities that belong to more than one on-
tology and plain strings. We can thus summarize the
cases that may arise when dealing with multidomain
policy sets as follows: (i) all policy terms are sim-
ple strings and no ontologies are used; (ii) all policy
terms are associated with concepts in the same ontol-
ogy; (iii) all policy terms are associated with concepts
defined in more than one ontology; and (iv) some of
the policy terms are associated with concepts defined
in more than one ontology, while the remaining terms
are simple strings. Our approach is able to deal with
all these cases by combining together different ontol-
ogy management techniques. From now on we refer
to the terms associated with ontology concepts as
semantic data, while we refer to all other terms as
non-semantic data. Figure 1 shows the architecture
of our model. All the technologies involved in the
ontology creation process are organized into a stack
of functions. The dependencies between the various
approaches (if any) are represented as arrows. The ba-
sic building block is the Ontology Matching process
that is used by all the other methods. On top of such a
process we have the Ontology Merging and Ontology
Extraction processes. These processes are not com-
pletely decoupled because, in the general case, the re-
sult of an ontology merging can be exploited during
the extraction of an ontology from the non-semantic
data of a policy. The topmost blocks represent the cre-
ation of the Policy Reference Ontology and the Refer-
ence Ontology of the policy set.
4 DEFINITIONS AND
FUNCTIONS
In this section we introduce definitions that are used
in the rest of the paper and the key functions in our
model.
Figure 1: The stack of technologies addressing policy het-
erogeneity.
4.1 Ontology Matching
Ontology Matching is the basic function in our archi-
tecture. To implement it, we have adopted the Falcon-
AO approach (Hu et al., 2008). The 2007 Ontology
Alignment Evaluation Initiative (OAEI 07) results
indicate Falcon to be the best performing ontology
matcher available. However, several ontology match-
ers have been proposed, each with different strengths
and weaknesses. Because we adopt a layered architecture, our model can be easily extended by adopting
the ontology matching algorithm that is most suitable for a specific scenario. The only assumption we
make on an ontology matching π_{O_i,O_j} between the ontologies O_i and O_j is that, if ⟨e_{O_i}, e_{O_j}, s⟩ ∈ π_{O_i,O_j},
then there is no other mapping element ⟨e'_{O_i}, e'_{O_j}, s'⟩ ∈ π_{O_i,O_j} such that e'_{O_i} = e_{O_i} or
e'_{O_j} = e_{O_j}; that is, each entity occurs in at most one mapping element.
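This one-to-one assumption can be checked mechanically on the output of any matcher. A minimal sketch, reusing the illustrative MappingElement class of Section 2.2, is:

    import java.util.HashSet;
    import java.util.List;
    import java.util.Set;

    public class MatchingAssumption {
        // Returns true if each entity of O_i and of O_j appears in at most one
        // mapping element, i.e., the matching satisfies the assumption of Section 4.1.
        public static boolean isOneToOne(List<MappingElement> mapping) {
            Set<String> sources = new HashSet<>();
            Set<String> targets = new HashSet<>();
            for (MappingElement me : mapping) {
                if (!sources.add(me.source) || !targets.add(me.target)) {
                    return false;  // some entity participates in two mapping elements
                }
            }
            return true;
        }
    }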
4.2 Ontology Extraction
In this section we address case (i) introduced in Section 3. When a policy does not use semantic data, it
is necessary to create a new ontology by extracting semantic knowledge from the information that can be
deduced from the policy itself. The problem of extracting meaningful knowledge from unstructured data is
usually referred to as Ontology Extraction or Ontology Learning and has been extensively investigated,
especially after the introduction of the Semantic Web paradigm. In our context the data are XACML policies;
thus we can exploit the explicit knowledge provided by the policy language to obtain a first classification
of terms. In doing so, we adopt the mapping from XACML to Description Logics proposed in (Kolovski
et al., 2007). The key idea of such a mapping is that each attribute-value pair in an XACML policy can
be translated by adding two entities to the extracted knowledge base: given the pair ⟨attribute, value⟩, the
relation attribute and the concept value are added to the ontology. Since we work with OWL, we need to
check the data type of the value before translating the attribute into the correct kind of OWL property.
In this paper we deal with the string datatype and all the numeric data types; we plan to extend this
mapping to more data types as part of our future work. In our mapping, if value is a string, then it is
translated into a new concept and attribute becomes an object property. If value is an XML Schema numeric
data type, no concept is added and attribute becomes a datatype property.
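As an illustration of this translation rule, the following sketch applies it to a single ⟨attribute, value⟩ pair with a string value using the OWL API (assuming OWL API 3.x style calls); the namespace is a placeholder of our own, and the hierarchy refinement of Section 5.1 is omitted.

    import org.semanticweb.owlapi.apibinding.OWLManager;
    import org.semanticweb.owlapi.model.*;

    public class PairTranslator {
        private static final String NS = "http://example.org/policy-ontology#"; // placeholder namespace

        // Translates the pair <attribute, value> where value is a string:
        // 'value' becomes a new concept and 'attribute' becomes an object property
        // whose range is that concept (a numeric value would instead yield a datatype property).
        public static OWLOntology translate(String attribute, String value)
                throws OWLOntologyCreationException {
            OWLOntologyManager manager = OWLManager.createOWLOntologyManager();
            OWLDataFactory factory = manager.getOWLDataFactory();
            OWLOntology ontology = manager.createOntology(IRI.create(NS));

            OWLClass concept = factory.getOWLClass(IRI.create(NS + value));
            OWLObjectProperty property = factory.getOWLObjectProperty(IRI.create(NS + attribute));

            manager.addAxiom(ontology, factory.getOWLDeclarationAxiom(concept));
            manager.addAxiom(ontology, factory.getOWLDeclarationAxiom(property));
            manager.addAxiom(ontology, factory.getOWLObjectPropertyRangeAxiom(property, concept));
            return ontology;
        }
    }

For example, the pair ⟨role, "doctor"⟩ would yield a concept Doctor and an object property role whose range is Doctor, while ⟨age, 30⟩ would only yield a datatype property age.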
4.3 Ontology Merging
In the general case a policy may adopt more than one
ontology for the specification of its vocabulary. For
this reason, we have to define an approach for combining the set of exploited ontologies into the
unified vocabulary. The problem of combining two (or more) ontologies into a single knowledge base is
usually referred to as Ontology Merging. The intuitive idea is that, given ontologies O_i and O_j, we aim
at constructing the union of the entities e_i ∈ E(O_i) and e_j ∈ E(O_j) such that, if e_i and e_j can be
considered the same, then just one of them is added to the resulting ontology. For example, we may consider
equivalent the entities e_{O_i} and e_{O_j} belonging to the mapping element ⟨e_{O_i}, e_{O_j}, s⟩ if s is greater
than a certain threshold τ. Thus, in building the merged ontology we can keep just one of the two entities.
Definition 1 (Merged Policy Ontology). Let P_i be a policy. Its merged ontology, denoted as Õ_{P_i}, is the
ontology recursively defined as:

  Õ_0 := ∅
  Õ_t := MERGE(Õ_{t-1}, O_t)
  Õ_{P_i} := MERGE(Õ_{|σ(P_i)|-1}, O_{|σ(P_i)|})

where σ(P_i) is a partially ordered set of the ontologies used in P_i; O_t ∈ σ(P_i) with 1 ≤ t ≤ |σ(P_i)|; and
MERGE(O_i, O_j) is a function that takes in input the two ontologies and adds to O_i the entities e_{O_j} ∈ O_j
for which there is no mapping element ⟨e_{O_i}, e_{O_j}, s⟩ with e_{O_i} ∈ O_i and s greater than an
acceptance threshold τ.
4.4 Hybrid Scenarios
The most complicated scenario is when we have both heterogeneous domains and partial knowledge. However,
we can easily manage such a scenario by combining the approach of Definition 1 with the extraction
technique of Section 4.2. We define the Policy Reference Ontology as follows:

Definition 2 (Policy Reference Ontology). Let P_i be a policy. Its reference ontology, denoted by Ȯ_{P_i},
is defined as follows:

  Ȯ_{P_i} := MERGE(Õ_{P_i}, Ö_{P_i})

where Õ_{P_i} is the merged ontology of Definition 1 and Ö_{P_i} is the ontology extracted from the
non-semantic data of P_i (Section 4.2).

It is important to notice that Ȯ_{P_i} generalizes both Õ_{P_i} and Ö_{P_i}. Such a general definition can be
applied to all of the cases introduced in Section 3. This is the reason why we refer to Ȯ_{P_i} as the Policy
Reference Ontology of policy P_i.
5 CREATING THE UNIFIED
VOCABULARY OF A POLICY
SET
In this section we present the algorithms that we have
developed based on the definitions introduced in Sec-
tion 4.
5.1 Ontology Extraction
Our Ontology Extraction algorithm improves the
mapping defined in Section 4.2 by refining the re-
sulting ontology through a hierarchical organization
of the new entities. The motivation for such refine-
ment is that after the extraction of the entities based on
the XACML-DL mapping, we obtain a simple list of
new properties and concepts. However, the extracted
ontology may be involved in some ontology match-
ing processes. Since ontology matching takes advan-
tage of both the entity names and their organization
in the ontology, a knowledge base without a struc-
ture, that is, without a hierarchical organization of the
entities, is useless for our purposes. For this reason,
we combine the mapping defined in Section 4.2 with the subsumption relations between terms that are
extracted from lexical databases such as WordNet (http://wordnet.princeton.edu/). Details about the
extraction of hierarchies are reported in (Ferrini and Bertino, 2009). The algorithm in Figure 2 gives the
details of the Ontology Extraction process. For simplicity we only show the creation of object properties;
the creation of datatype properties is straightforward because we just need to add the new property without
creating any concept. Lines 2-5 of the algorithm create a new property and a new concept given an
attribute-value pair. Line 6 updates the range of the property with the new concept; finally, line 7 returns
Ö_{P_i} enriched with the hierarchies added by the CREATE_HIERARCHY function.
One may argue that our Ontology Extraction algo-
rithm might be improved by taking into account the
knowledge that can be inferred by comparing different rules within a policy. For example, if subject s_i has
more privileges than subject s_j, we may organize the associated ontology concepts in a specialized subject
hierarchy. However, such additional information is a property of the policies and is not knowledge
describing a given term. Taking it into account could mislead the matching algorithm, since the same concepts
in different policies may be differently related. Moreover, this additional knowledge is typically taken into
account during subsequent steps of the policy analysis process.
INPUT:  Pairs(P_i): the attribute-value pairs of P_i
OUTPUT: Ö_{P_i}: the ontology extracted from the policy P_i

1: Ö_{P_i} = new Ontology();
2: FOR EACH ⟨attribute_j, value_k⟩ ∈ Pairs(P_i)
3:     IF Ö_{P_i}.not_contains(attribute_j)
4:         Ö_{P_i}.add(new ObjectProp(attribute_j));
5:     Ö_{P_i}.add(new Concept(value_k));
6:     Ö_{P_i}.attribute_j.range = Ö_{P_i}.value_k;
7: return CREATE_HIERARCHY(Ö_{P_i});

Figure 2: The Ontology Extraction Algorithm.
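The CREATE_HIERARCHY step relies on subsumption (hypernym) relations extracted from WordNet; its exact algorithm is given in (Ferrini and Bertino, 2009). As a purely illustrative sketch of how such hypernym information can be retrieved with the MIT Java WordNet Interface (JWI), and not as the authors' actual CREATE_HIERARCHY procedure, consider:

    import edu.mit.jwi.Dictionary;
    import edu.mit.jwi.IDictionary;
    import edu.mit.jwi.item.*;
    import java.net.URL;
    import java.util.ArrayList;
    import java.util.List;

    public class HypernymLookup {
        // Returns the lemmas of the direct hypernym synsets of the first noun sense of 'term'.
        // These candidate super-concepts can then be used to arrange extracted concepts
        // into a subsumption hierarchy.
        public static List<String> directHypernyms(URL wordNetDictUrl, String term) throws Exception {
            IDictionary dict = new Dictionary(wordNetDictUrl);
            dict.open();
            List<String> result = new ArrayList<>();
            IIndexWord idx = dict.getIndexWord(term, POS.NOUN);
            if (idx != null && !idx.getWordIDs().isEmpty()) {
                IWord word = dict.getWord(idx.getWordIDs().get(0));
                ISynset synset = word.getSynset();
                for (ISynsetID hid : synset.getRelatedSynsets(Pointer.HYPERNYM)) {
                    for (IWord hw : dict.getSynset(hid).getWords()) {
                        result.add(hw.getLemma());
                    }
                }
            }
            dict.close();
            return result;
        }
    }

For instance, looking up "doctor" would return lemmas such as "medical practitioner", suggesting a candidate super-concept for the extracted concept Doctor.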
5.2 Ontology Merging
In our model, ontology merging plays a crucial role, since it is one of the core functions in the solution stack.
We have developed two different algorithms to address the requirements concerning ontology merging
(cf. Definition 1):
1. MERGE. The base merge algorithm, which takes in input two ontologies and returns the reconciled
knowledge base. This algorithm (see Figure 3) implements the function MERGE of Definition 1.
2. ONTOLOGY_MERGING. The function that applies MERGE to all the ontologies exploited by policy P_i.
This algorithm implements all the components defined in Definition 1. Because of space limitations the
algorithm is not reported here.
5.3 Policy and Policy Set Reference
Ontology
The creation of the Policy and Policy Set Reference
Ontology is the final step in our approach. Such functions are straightforward and, because of space
limitations, are not reported here; a sketch is given below. The Policy Reference Ontology is obtained by
combining the Ontology Merging and the Ontology Extraction functions. For the Policy Set Reference
Ontology, the key idea is to extract the reference ontology of each policy in the multidomain set under
consideration and then merge all the resulting ontologies.
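Both omitted functions are essentially folds of the MERGE operation. The following sketch shows, using hypothetical types of our own (Ontology, Policy, Merger) rather than the prototype's actual code, how ONTOLOGY_MERGING and the construction of the Policy and Policy Set Reference Ontologies could be organized:

    import java.util.List;

    // Hypothetical types standing in for the notions of Sections 4 and 5;
    // they are not part of any concrete library used by the prototype.
    interface Ontology {}
    interface Policy {
        List<Ontology> ontologies();  // the ordered set sigma(P_i) of ontologies used by the policy
    }
    interface Merger {
        Ontology empty();
        Ontology merge(Ontology a, Ontology b);  // the MERGE function of Figure 3
        Ontology extract(Policy p);              // the extraction algorithm of Figure 2
    }

    public class ReferenceOntologyBuilder {
        private final Merger ops;

        public ReferenceOntologyBuilder(Merger ops) {
            this.ops = ops;
        }

        // Policy Reference Ontology (Definition 2): fold MERGE over the policy's ontologies,
        // then merge the result with the ontology extracted from its non-semantic data.
        public Ontology policyReference(Policy p) {
            Ontology merged = ops.empty();
            for (Ontology o : p.ontologies()) {
                merged = ops.merge(merged, o);   // ~O_t := MERGE(~O_{t-1}, O_t)
            }
            return ops.merge(merged, ops.extract(p));
        }

        // Policy Set Reference Ontology: merge the reference ontologies of all policies in the set.
        public Ontology policySetReference(List<Policy> policies) {
            Ontology result = ops.empty();
            for (Policy p : policies) {
                result = ops.merge(result, policyReference(p));
            }
            return result;
        }
    }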
INPUT:  O_i: the first ontology to be merged
        O_j: the second ontology to be merged
OUTPUT: Õ_{i,j}: the merged ontology

 1: Õ_{i,j} = new Ontology();
 2: mapping = MAP(O_i, O_j);
 3: FOR EACH me_k ∈ mapping
 4:     IF me_k.s > τ
 5:         O_j.update(me_k.e_{O_j}, me_k.e_{O_i});
 6:         Õ_{i,j}.add(me_k);
 7: FOR EACH e_{O_i} ∉ Õ_{i,j}
 8:     Õ_{i,j}.add(e_{O_i});
 9: FOR EACH e_{O_j} ∉ Õ_{i,j}
10:     Õ_{i,j}.add(e_{O_j});
11: return Õ_{i,j};

Figure 3: The MERGE Algorithm.
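For concreteness, the following is a simplified Java rendering of the control flow of Figure 3, with entities modeled as plain strings and the illustrative MappingElement class of Section 2.2 standing in for the matcher output; the actual prototype operates on OWL ontologies through the OWL API and Falcon-AO, so this is only a sketch.

    import java.util.LinkedHashSet;
    import java.util.List;
    import java.util.Set;

    public class SimpleMerge {
        // Simplified rendering of Figure 3: entities are plain strings, and 'mapping'
        // is the list of tuples produced by some matcher (MAP in Figure 3).
        public static Set<String> merge(Set<String> oi, Set<String> oj,
                                        List<MappingElement> mapping, double tau) {
            Set<String> merged = new LinkedHashSet<>();
            Set<String> renamedInOj = new LinkedHashSet<>();
            for (MappingElement me : mapping) {
                if (me.confidence > tau) {
                    // The matched O_j entity is "updated" to its O_i counterpart:
                    // only one representative (the O_i name) enters the merged ontology.
                    renamedInOj.add(me.target);
                    merged.add(me.source);
                }
            }
            for (String e : oi) {              // entities of O_i not yet in the merged ontology
                merged.add(e);
            }
            for (String e : oj) {              // unmatched entities of O_j
                if (!renamedInOj.contains(e)) {
                    merged.add(e);
                }
            }
            return merged;
        }
    }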
6 IMPLEMENTATION AND
EXPERIMENTAL RESULTS
We have implemented a Java prototype of the pro-
posed approach. The prototype uses the Sun imple-
mentation of XACML, the OWL API for loading,
updating, and creating ontologies, the Falcon-AO li-
brary for ontology matching, and the MIT Java Word-
Net Interface for managing the WordNet database.
For the experimental evaluation we generated a set of XACML policies. Attributes were randomly
selected from a predefined list, while semantic data was obtained by randomly selecting entities from a set of
ontologies retrieved by using the SWOOGLE ontology search engine. Figure 4 shows the total execution
time of our process for an increasing number of attributes, ranging from 10 to 50 (since our approach
considers attribute-value pairs, it makes more sense to analyze the execution times with respect to the
number of attributes rather than the number of policies). Each column shows the time of: (i) the merge
algorithm, (ii) the extraction algorithm, and (iii) the combination of their results. As expected, most of the
execution time is spent in merging ontologies. Conversely, the extraction is very quick; even for a high
number of attributes, e.g. 100 (not reported in the figure), the execution time is 150 msec.
Table 1 reports data concerning the accuracy of our model. We evaluate the number of detected
similar concepts and the correctness of the related mappings for values of the threshold τ varying from
0.65 to 0.95 (we ran our prototype on a set of policies with an average number of 50 attributes). The
results show that for values of τ between 0.75 and 0.80, our model achieves a good balance between the
correctness of the mappings and the number of detected similarities. More experiments, along with an
optimized version of the ontology matching process, are reported in (Ferrini and Bertino, 2009).
Figure 4: Total execution times for increasing values in the
number of policy attributes.
Table 1: The accuracy of the model.

τ               Similar concepts detected   Correctness
[0.65, 0.70]    86.458%                     58.823%
[0.70, 0.75]    85.416%                     64.705%
[0.75, 0.80]    80.208%                     80.411%
[0.80, 0.85]    58.333%                     85.294%
[0.85, 0.90]    52.083%                     91.176%
[0.90, 0.95]    46.875%                     94.117%
> 0.95          42.708%                     97.059%
7 RELATED WORK
To the best of our knowledge, this is the first approach
that exploits ontology-based techniques for address-
ing policy heterogeneity. However, policy analysis
and ontology-based technologies have already been
investigated. Rein (Kagal et al., 2006) is a gen-
eral policy framework based on semantic web tech-
nologies. Rein is able to support general purpose
policy systems and for this reason it is well suited
for solving mismatches among different policy lan-
guages. However, Rein does not address the prob-
lem of heterogeneity among vocabularies. Kolovski
et al. (Kolovski et al., 2007) propose a mapping be-
tween XACML and Description Logics along with
some interesting analysis services. However, they do
not address the problem of policy heterogeneity. Fi-
nally, Lin et al. (Lin et al., 2007) propose a policy
similarity function exploited as a filter before apply-
ing more accurate analysis tools.
8 CONCLUSIONS
In this paper, we have addressed the problem of het-
erogeneity in the context of policy analysis. Our ap-
proach represents the terminology of a policy through
the use of ontologies and consists of a stack of func-
tions that allows one to generate a unified vocabulary
for a multidomain policy set. This vocabulary can
then be exploited by policy analysis tools for analyzing
and comparing policies. We have implemented a pro-
totype of the proposed approach and analyzed its per-
formance.
REFERENCES
Ferrini, R. and Bertino, E. (2009). A comprehensive ap-
proach for solving policy heterogeneity. Technical re-
port, Purdue University, Department of Computer Sci-
ence, CERIAS.
Fisler, K., Krishnamurthi, S., Meyerovich, L. A., and
Tschantz, M. C. (2005). Verification and change-
impact analysis of access control policies. In Pro-
ceedings of the International Conference on Software
Engineering (ICSE), pages 196–205.
Hu, W., Qu, Y., and Cheng, G. (2008). Matching large
ontologies: A divide-and-conquer approach. Data &
Knowledge Engineering, 67(1):140–160.
Kagal, L., Berners-Lee, T., Connolly, D., and Weitzner, D.
(2006). Using semantic web technologies for policy
management on the web. In 21st National Conference
on Artificial Intelligence (AAAI).
Kolovski, V., Hendler, J., and Parsia, B. (2007). Analyzing
web access control policies. In Proceedings of the In-
ternational World Wide Web Conference WWW 2007,
pages 677–686.
Lin, D., Rao, P., Bertino, E., and Lobo, J. (2007). An ap-
proach to evaluate policy similarity. In SACMAT ’07:
Proceedings of the 12th ACM Symposium on Access
Control Models and Technologies, pages 1–10, New
York, NY, USA. ACM Press.
Moses, T. (2005). Extensible access control markup lan-
guage (XACML) version 2.0. OASIS Standard.
Rao, P., Lin, D., Bertino, E., Li, N., and Lobo, J. (2008).
Exam: An environment for access control policy anal-
ysis and management. In POLICY, pages 238–240.
Shvaiko, P. and Euzenat, J. (2005). A survey of schema-
based matching approaches. Journal on Data Seman-
tics IV, pages 146–171.