Table 3: Results of the wine dataset.
Algorithm
Function value
Time
(sec)
F best F average F worst
Hybrid GA-
ACO
16387.7595 16438.1623 16530.5338
216.
01
ACO 16530.5338 16530.5338 16530.5338
68.2
9
GA 16530.5338 16530.5338 16530.5338
226.
68
TS 16530.5338 16785.4592 16837.5356
161.
45
SA 16530.5338 16530.5338 16530.5338
57.2
8
4.2 Paired Comparison Results
As the ACO solution is better than SA, GA and TS
solutions, so our hybrid GA-ACO method is
compared with ACO. Considering the data in Tables
2 and 3, two paired comparison designs are carried
out on 10 pairs of the best solutions of ACO and
hybrid GA-ACO for each dataset to see whether if
any improvements achieved. The related results of
the paired comparison test are illustrated in Table 4.
Design of the comparison test is as follows:
i
X : The i-th best solution of the hybrid GA-ACO
method.
i
Y : The i-th best solution of the ACO method.
n
D
DniYXD
n
i
i
iii
∑
=
=⇒=−=
1
,...,2,1
(11)
()
n
S
D
T
n
DD
S
D
n
i
i
D
=⇒
−
−
=
∑
=
1
1
2
(12)
0:
0:
1
0
≠
=
D
D
H
H
μ
μ
If
],[
1,2/1,2/ −−
−∈
nn
ttT
αα
,
0
H
is accepted.
Table 4: Results of Paired comparison test.
D S
D
n
Test
statistic
Acceptance
Interval
Resu
lt
iri
s
-20.016 7.834 10 -8.079 [-2.262,2.262]
Reje
ct H
0
wi
ne
-692.776 319.53 10 -6.856 [-2.262,2.262]
Reje
ct H
0
By considering the test statistic values from
Table 4, it is concluded that with
05.0=
(
:
Level of significance), the hybrid GA-ACO method
has a meaningful difference with the ACO algorithm
in 10 independent runs.
It is worthy noting that the results obtained by
the hybrid method are superior to that of the ACO,
SA, GA, and TS methods. The associated results
illustrate that our proposed hybrid approach can be
considered as a viable and an efficient method to
find optimal or near-optimal solutions for clustering
problems in order to allocate
N objects to K clusters.
5 CONCLUSIONS
In this paper, a hybrid GA-ACO method has been
developed to solve clustering problems. To evaluate
the performance of our hybrid GA-ACO method, the
associated results are compared with other results
obtained by other meta-heuristic algorithms, i.e.
ACO, GA, SA, and TS by means of the paired test
statistics. Our proposed hybrid method has been
implemented and tested on several real datasets;
preliminary computational experience is very
encouraging in terms of the solution quality found
and in all cases the best dominant solution is
obtained by our proposed hybrid method and even
the average and the worst solutions of this method is
better than or equal to best solutions of the other
algorithms. Although this method needs longer time
to find solutions but the difference between the
solutions of our hybrid GA-ACO method and other
algorithms is totally encouraging and glamorous.
REFERENCES
Al-Sultan, K.S., 1995. A Tabu search approach to the
clustering problem, Pattern Recogn. 28 (9) 1443-1451.
Banfield, J., Raftery, A., 1993. Model-based Gaussian and
non- Gaussian clustering, Biometrics, 49, 803-821.
Dorigo, M., Maniezzo, V. Colorni, A., 1996. The ant
system: Optimization by a colony of cooperating
agents. IEEE Transactions on Systems, Man, and
Cybernetics-Part B 26 (1) 29-41.
Jiang, J.-H., Wang, J.H., Chu, X., Yu, R.-Q., 1997.
Clustering data using a modified integer genetic
algorithm (IGA). Analytica Chimica Acta 354, 263-
274.
Lin, H-J., Yang, F-W., Kao, Y-T., 2005. An efficient GA-
based clustering technique, Tamkang Journal of
Science and Engineering 8 (2) 113-122.
Murthy, C.A., Chowdhury, N., 1996. In search of optimal
clusters using genetic algorithms. Pattern Recognition
Letters. 17 (8) 825- 832.
Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J. 1998.
UCI Repository of machine learning databases,
AN EFFICIENT HYBRID METHOD FOR CLUSTERING PROBLEMS
293