term
ij
which does not occur in the constructed rule
must be decreased. The reduction of pheromone of
an unused term is performed by dividing the value of
each τ
ij
by the summation of all τ
ij
. The pheromone
levels of all terms are then normalized.
3 FIRST MODIFICATIONS
The authors of Ant-Miner (Parpinelli et al., 2004;
Parpinelli et al., 2002) suggested two directions for
future research:
1. Extension of Ant-Miner to cope with continuous
attributes;
2. The investigation of the effects of changes in the
main transition rule:
(a) the local heuristic function,
(b) the pheromone updating strategies.
Recently, Galea (Galea and Shen, 2006) proposed
a few modifications in Ant-Miner. Another modi-
fications (Oakes, 2004; Martens et al., 2006) cope
with the problem of attributes having ordered cate-
gorical values, some of them improve the flexibility
of the rule representation language. Finally, more so-
phisticated modifications have been proposed to dis-
cover multi-label classification rules (Chan and Fre-
itas, 2006) and to investigate fuzzy classification rules
(Galea and Shen, 2006). Certainly there are still many
problems and open questions for future research.
3.1 Data Sets used in our Experiments
The evaluation of the performance behavior of dif-
ferent modifications of Ant-Miner was performed us-
ing 5 public-domain data sets from the UCI.Please
note that Ant-Miner cannot cope directly with contin-
uous attributes (i.e. continuous attributes have to be
discretized in a preprocessing step, using the RSES
program (logic.mimuw.edu.pl/˜rses/)). In the original
Ant-Miner and Galea implementation (Galea, 2002),
the discretization was carried out using a method
called C4.5-Disc (Kohavi and Sahami, 1996). C4.5-
Disc is an entropy-based method that applies the
decision-tree algorithm C4.5 to obtain discretization
of the continuous attributes.
Both the original Ant-Miner and our proposal
have some parameters. The first one – the number
of ants will be examined during the experiments.
4 PROPOSED MODIFICATIONS
An Ant Colony Optimization technique is in essence,
a system based on agents which simulate the natural
behavior of ants, incorporating a mechanism of co-
operation and adaptation, especially via pheromone
updates. When solving different problems with the
ACO algorithm we have to analyze three major func-
tions. Choosing these functions appropriately helps
to create better results and prevents stacking in local
optima of the search space.
The first function is a problem-dependentheuristic
function (η) which measures the quality of terms that
can be added to the current partial rule. The heuris-
tic function stays unchanged during the algorithm run
in the classical approach. We want to investigate
whether the heuristic function depends on the previ-
ous well-known approaches in the data-mining area
(C4.5, CART, CN2) and can influence the behavior of
the whole colony,or not. According to the proposition
concerning the heuristic function (Liu et al., 2004),
we also analyze the simplicity of this part of a main
transition rule in Ant-Miner. The motivation is as fol-
lows: in ACO approaches we do not need sophisti-
cated information in the heuristic function, because of
the pheromone value, which compensates some mis-
takes in term selections. Our intention is to explore
the effect of using a simpler heuristic function instead
of a complex one, originally proposed by Parpinelli
(Parpinelli et al., 2004), so we change the formula
presented in the formula 1.
4.1 CART Influences
In the case of a method CART proposed by (Breiman
et al., 1984), the value of InfoT
ij
is determined ac-
cording to the following formula 2.
In foT
ij
= 2· P
L
· P
P
·
k
∑
w=1
|Pw
L
− Pw
P
| (2)
where:
• P
L
– a ratio of a number of objects in which the
specific attribute i has a value j to all objects in a
testable data set,
• P
P
– a ratio of a number of objects in which the
specific attribute i has not an analyzed value j to
all objects in a testable data set,
• Pw
L
– a ratio of a subset of objects belonging to
the decision class w in which the specific attribute
i has a value j to all objects having the value j,
• Pw
P
– the ratio of a subset of objects belonging
to the decision class w in which the specific at-
tribute i has not a value j to all objects having the
value j.
ICEIS 2009 - International Conference on Enterprise Information Systems
36