representations are grid-like structures and bar charts
(Figure.1). The grid view consists of a 2D or 3D
matrix of cells where each cell represents a rule
(Kopanakis and Theodoulidis, 2003; Couturier et al.,
2007; Liu and Salvendy, 2006). One matrix
dimension represents rules antecedents and the other
one represents rules consequent. Each cell is filled
with colored bars indicating rule support and
confidence values. However, this representation
often suffers from occlusion. Moreover, it is difficult
to represent rules if there are too many different at-
tributes in the data or if the rules have many items.
Other visualization techniques are based on graph
visualization (Bruzzese and Buono, 2004; ?), the
nodes and the edges respectively representing the
items and the rules (Figures 2). The interestingness
measures are symbolized by colors and sizes. Other
work uses 3D objects to represent association rule.
In (Blanchard et al., 2007), each rule is represented
by a sphere, whose radius maps its support, and by
a cone, whose base width maps its confidence
(Figure 3). Additionally, the colors of the sphere and
cone redundantly represent a weighted average of the
measures. the rule position in the arena represents the
implication intensity. It’s must be noticed that the
presented methods and techniques are generally
supplied with few interestingness measures and
none of these methods represents the relations
between attributes in the rule and the contribution of
each one of them.
3 IMPORTANCE OF
RULE’S
INDIVIDUAL
A
TTRIB
UTES
3.1
Attribute Interaction
Two attributes are correlated if they are not
independent. Two attributes are independent if
changing the value of one does not affect the value
of the other. The lift measure calculates the
correlation between two attributes from the
antecedent or the con- sequent. The correlation
between two attributes rep- resents the amount of
information shared among the two attributes. The
lift measure determines whether attribute1 and
attribute2 have a positive (lift >1) or a negative (lift
<1) correlation. The correlation is considered
positive (negative) if the observed frequency of
example satisfying both attribute1 and attribute2 is
greater (smaller) than the expected frequency
assuming statistical independence
between attribute1
and at- tribute2. The (Freitas, 2001) study showed
that the concept of attributes interaction can be
beneficial to the association rule extraction process
and proposed to introduce attribute interaction in the
design of association rule mining systems.
Attributes interaction allows detection surprising
knowledge which can’t be discovered analyzing the
whole rule. The relationships expressed in a rule
totality is quite different from the relationships
expressed in separate rule parts (antecedent and
consequent).
On the other hand, to discover useful association
rules, the user needs to get insight into the data and
understand the relationships between the attributes
and their statistical properties (Chanda et al., 2010).
Exploring attributes relation enables deeper insight
into the data and learn about the data model. In
many case (biological or genetic context for example)
antecedent items has weak associations with
consequent. However, they interact together in a
complicated way to control the consequent
(Chanda et al., 2010).
3.2
Attribute Importance
An attribute can be important for the user if
regularities are observed in a smaller dataset, while
being unobservable in the entire data. A rule can be
considered as disjunction of rules. The size of a
disjunct (rule) is the number of items composed the
rule’s antecedent and the rule’s consequent. For
example: r : X1 X2 X3
➜ Y1 Y2 is a rule. A
disjunction of rules is r1 :
X1
➜
Y1 Y2, r2 :
X2
➜
Y1 Y2, r3 : X3 ➜ Y1 Y2, r4 : Y1 ➜ X1 X2 X3 and
r5 : Y2
➜ X1 X2 X3. At first sight, it seems that this
small rules has no importance, since they can be
considered as a redundant rules. Based on this view,
all most extraction algorithms do not keep this rules
in the results. However, small rules have the
potential to show unexpected relationships in the
data (Freitas, 1998). (Provost and Aronis, 1996)
proved that small rules were considered interesting in
their field application. Accordingly, it would
beneficial that the user can see automatically this
small rules.
In order to evaluate the contribution of each item
to rule (Freitas, 1998) has proposed the Information
Gain measure which can be positive or negative. Item
with high positive Information Gain is considered as
a good one. Item with high negative Information Gain
is considered as a bad one and should be removed
from the association rule. From a rule interesting
perspective, the user knows already the most
important attributes for its field, and rules containing
these items may not be much interesting. At the
same time, a rule includes attributes with low or
A NEW VISUALIZATION METAPHOR FOR ASSOCIATION RULES
805