then created by applying a rollup function which re-
lates the instances of the level l with the instances of
the level pl. The domain of the personalized level
pl is composed of the k instances representing the
k obtained clusters. The obtained personalized se-
mantic hierarchies would provide new multidimen-
sional ways for analyzing data and obtain more rel-
evant analyses semantically richer.
Our operator is integrated inside Oracle DBMS
where we carried out some experimentation which
validated the relevance of our approach.
The remainder of this paper is organized as fol-
lows: Section 2 presents related works and compares
our approach to existing ones in the literature. In sec-
tion 3, we present our PRoCK operator. The frame-
work of creating semantic hierarchies using PRoCK is
described in section 4. After an illustrative example
presented in section 5, we describe implementation
with some preliminary experimental results in section
6. Finally, conclusions and our expected future work
are given in section 7.
2 RELATED WORKS
Since data warehouses are characterized by volumi-
nous data and are based on a user-centric analy-
sis process, including personalization into the data
warehousing process becomes a new research issue
(Rizzi, 2007). Despite first approaches for person-
alization on data warehouses that focus on user def-
inition with specific data as defined on traditional
databases, there exists some approaches based on
conceptual model and its multidimensional concepts
(fact, dimension, hierarchy, measure, attribute). For
example, using annotations, a new personalization
technique based on user preferences model is pro-
posed in which weights are associated to multidimen-
sional databases components (Ravat and Teste, 2008).
To assign priority weights to attributes of a multidi-
mensional schema, the personalization rules are de-
scribed using the Condition-Action formalism. More
recently, this model has been used for handling the
context notion in order to closely relating user re-
quirements to their current context (Jerbi et al., 2009).
Moreover, the importance of dimension hierar-
chies was reflected in (Bentayeb, 2008) where the au-
thor used data mining techniques as aggregation op-
erators to update dimension hierarchies in data ware-
houses without taking into account user preferences.
Garrigos et al. use the data warehouse multidi-
mensional model, user model and rules for the data
warehouse personalization (Garrig
´
os et al., 2009). As
a result, a data warehouse user is able to work with
a personalized OLAP schema, which best matches
his needs. Based on ECA-rules (Event-Condition-
Action) (Thalhammer et al., 2001)), PRML (Person-
alization Rules Modeling Language is used in (Gar-
rig
´
os et al., 2009) for specification of OLAP person-
alization rules. The structure of such PRML rules can
be presented with following statement: when event do
if condition then action endIf endWhen.
After that, in (Kozmina and Niedrite, 2010), a new
method was proposed which provides exhaustive de-
scription of interaction between user and data ware-
house. A set of user-describing profiles (user prefer-
ence, temporal, spatial, preferential and recommenda-
tional) have been developed. A metamodel which for-
mulates user preferences for OLAP schema elements
and aggregate functions has been proposed. This
model reflects connections among user-describing
profiles.
Recently, inspired by (Kießling and K
¨
ostler, 2002)
and (Golfarelli and Rizzi, 2009), (Golfarelli et al.,
2011) propose an approach to adapt preference con-
structors to multidimensional context. Formulated on
schema, preferences can not only be expressed over
attributes and thus over cuboids but also preferences
can be expressed over numerical values (measures).
The preferences composition is modeled using pred-
icate logic attributes and expressed through Pareto
composition (two preferences are equally relevant) or
Prioritization (a preference is more relevant than an-
other).
We argue that multidimensional structures such as
dimension hierarchies have a strong impact in OLAP
analysis and they should be considered in OLAP per-
sonalization. For this reason, users must be able to
express their preferences on dimension hierarchies.
In fact, preference model is considered a main open
problem in OLAP personalization in (Rizzi, 2007).
Our proposal comes close to a previous work
that proposes structural update of OLAP dimensions
(Bentayeb, 2008). However, it is different, so that, it
proposes personalizing hierarchies by exploiting user
preferences. Our method aims at improving OLAP
analysis process by taking into account the individual
interests of users.
In this section we have reviewed the current ap-
proaches for personalization in data warehouses. We
present a comparative table (table 1) confronting the
panoply of the proposed approaches. We choose some
criteria that we consider relevant to compare person-
alization approaches.
• Source: this criterion presents the object to exploit
for personalization which can be a user profile (in-
terests, preferences, constraints,...), query history
(log file) or user context.
ICEIS2013-15thInternationalConferenceonEnterpriseInformationSystems
194