Despite the fact that the generalized association
rules enable the knowledge exploration in different
levels of abstraction, there still is the need to find
a way to explore the potential of this kind of rules.
Thus, the main objective of this work is to aid the user
in the comprehension and identification of interesting
generalized association rules. Therefore an approach
is proposed to improve the advantages of combining
objective and subjective measures with information
visualization techniques.
2 GARVis: AN APPROACH TO
EVALUATE GENERALIZED
ASSOCIATION RULES
The GARVis approach allows the user to analyze,
through objective measures, a generalized association
rule set using SQL queries and graphic analysis, se-
lecting a subset of rules to be explored by a domain
expert. From the selected rules, subjective measures
can be computed and a subjective measure explo-
ration can be done. That way, the user can analyze a
set of rules observing different aspects. The approach
is divided in four steps, described as follows.
Objective Analysis (1): it is considered, in this step,
that the user already has a generalized association rule
set obtained by the approach proposed by (Carvalho
et al., 2007). Initially, SQL queries are carried out
aiming to select some features and/or objective mea-
sure values that are of application interest in order to
obtain a focus set (the objective measures considered
are the ones described in (Tan et al., 2004)). If the
user is not interested in specific items, the focus set
is formed by the whole rule set. Applying graphic
analysis on the focus set using objective measures,
the focus set is filtered, conducing to the identifica-
tion of a subset of potentially interesting rules (PIR).
The graphic analysis of this step is realized in an in-
teractive way in a X-Y graphic, enabling the user to
directly interact with the graphic, facilitating his/her
comprehension and usability.
PIR Subset Evaluation (2): has as input the PIR sub-
set obtained from Step 1 and has as objective the eval-
uation of these rules by a domain expert. The knowl-
edge expressed in each one of the rules is classified
as irrelevant, obvious, previous, unexpected or use-
ful (one or more classification can be selected). Dur-
ing evaluation, the rules considered as irrelevant, by
a domain expert, are eliminated from the focus set,
as all the other similar rules (rules that contain irrel-
evant items in the same position (same side) of the
rule classified as irrelevant). In cases where there
is a rule with the same items, but in different posi-
tions, the user is asked to check if the rule has also
to be considered irrelevant. During this process, the
user can, in parallel, visualize the rules in a textual
form and make a graphic analysis. In this step, X-Y
graphics and bar charts are available for rule visual-
ization. The user can also visualize redundant, com-
plements and exceptions rules, which can be added
in the PIR subset in order to be evaluated. The de-
finitions of redundancy and exception described in
the (Zaki, 2004) and (Gonalves et al., 2005) works,
are considered respectively. The complement defini-
tion is proposed in this work as follows. Consider
R = {r
1
, ..., r
l
} a set of rules and X = {x
1
, ..., x
k
} a
taxonomy set. Given that r
i
,r
j
∈ R, r
i
is a comple-
ment of r
j
if ((r
i
.LHS = r
j
.LHS ∧ r
i
.RHS\r
j
.RHS =
w ∧ r
j
.RHS \ r
i
.RHS = w
0
) ∨ (r
i
.RHS = r
j
.RHS ∧
r
i
.LHS \ r
j
.LHS = w ∧ r
j
.LHS \ r
i
.LHS = w
0
)) and
w, w
0
have the same ancestral in taxonomy x
h
. The
complement is symmetric, so r
i
is a complement of r
j
and r
j
is a complement of r
i
.
Subjective Processing (3): in this step, for each of
the rules that are not eliminated from the focus set,
the subjective measures conforming, unexpected an-
tecedent, unexpected consequent, and both-side un-
expected, defined by (Liu et al., 2000), are computed.
To compute these measures, the classifications made
in Step 2, for the rules contained in the PIR subset, are
used as domain knowledge. That way, it is possible to
carry out an analysis with the subjective measures to
aid the identification of possible interesting rules not
previously found only by the objective measures anal-
ysis.
Subjective Measures Analysis (4): during the ex-
ploration of the rules contained in the resultant focus
set, using the subjective measures computed in Step
3, the user has the support of graphic analysis using
X-Y graphics and bar charts. The aim, with these vi-
sualizations, is to increase the rule comprehensibil-
ity and to facilitate the identification of interesting
knowledge, since the user has a visual and interactive
exploration option. It is important to mention that the
exploration in the resultant focus set should be car-
ried out according to the goals of the user during the
analysis. For example, if the user wishes to confirm
his/her previous knowledge, he/she can use the con-
forming measure and list the rules that conform to
the rules that had been evaluated as obvious or pre-
vious knowledge in Step 2. During this evaluation the
user can find some rules, not previously found, that
are also interesting for him/her.
After applying the four steps, the approach gener-
ICEIS 2009 - International Conference on Enterprise Information Systems
286