options to consider would be Crossover4 with low se-
lective pressure or Crossover(5,6) with moderate se-
lective pressure. Finally, if the time is not a constraint
and the dataset is high-dimensional, the best option to
consider would be Crossover6 with moderate selec-
tive pressure.
7 CONCLUSIONS
Sequential Projection Pursuit - SPP is a useful
method to find interesting linear projections of multi-
dimensional data. The main problem in SPP is the op-
timization of the PP index function, which measures
how interesting the projection is. Genetic algorithms
are promising optimizer of SPP, but their success is
dependent on the selection of their genetic operators.
One of the most important operators is the crossover,
which is responsible for the rapid exchange of useful
information among solutions. This article addressed
the problem of which crossover to choose in the de-
sign of a GA-based optimizer for SPP. An experimen-
tal study was presented, comparing the performance
of eight crossover operators: three available in litera-
ture (arithmetic crossover and single and multi-point
crossover) and five new proposed here (one single ex-
tension of the arithmetic crossover, two hyperconic
crossovers and two fitness-biased crossovers). The
study was carried out over four public datasets of
increasing dimension. The results showed that one
of the proposed hyperconic operators tends to find
projections with higher fitness than the other oper-
ators, clearly excelling in higher dimensions. This
interesting performance was also observed at differ-
ent stages of evolution. The multipoint crossover op-
erator presented the second best performance in fit-
ness, being competitive with the hyperconic operator
in low dimensional datasets. The final selection of
the crossover operator is dependent on the precision
required, the dimension of the dataset and the tolera-
ble time to get the answer. Some guidelines to select
the most appropriate operator for each situation were
presented.
This study is an important step towards the de-
sign of efficient GA-optimizers for SPP. We are cur-
rently investigating other PP indices and the influ-
ence of mutation on the performance of the pre-
sented crossover operators. Also, different evolution-
ary strategies are being studied to take advantage of
the features of the proposed operators.
ACKNOWLEDGEMENTS
The authors acknowledge the CAPES/ PEC-PG -
Brazil for the scholarship granted to the first author
of this article.
REFERENCES
Berro, A., Marie-Sainte, S. L., and Ruiz-Gazen, A. (2010).
Genetic algorithms and particle swarm optimization
for exploratory projection pursuit. Annals of Math-
ematics and Artificial Intelligence, 60(1-2, SI):153–
178.
Bolton, R. and Krzanowski, W. (2003). Projection pursuit
clustering for exploratory data analysis. Journal of
Computational and Graphical Statistics, 12(1):121–
142.
Chiang, S.-S., Chang, C.-I., and Ginsberg, I. (2001). Unsu-
pervised target detection in hyperspectral images us-
ing projection pursuit. Geoscience and Remote Sens-
ing, IEEE Transactions on, 39(7):1380 –1391.
Cook, D., Buja, A., and Cabrera, J. (1993). Projection
pursuit indexes based on orthonormal function ex-
pansions. Journal of Computational and Graphical
Statistics, 2(3):225–250.
Daszykowski, M., Kaczmarek, K., Heyden, Y. V., and Wal-
czak, B. (2007). Robust statistics in data analysis - a
review: Basic concepts. Chemometrics and Intelligent
Laboratory Systems, 85(2):203– 219.
Demirci, O., Clark, V. P., and Calhoun, V. D. (2008). A pro-
jection pursuit algorithm to classify individuals using
fMRI data: Application to schizophrenia. Neuroim-
age, 39(4):1774–1782.
Frank, A. and Asuncion, A. (2010). UCI machine learning
repository.
Friedman, J. H. (1987). Exploratory projection pursuit.
American Statistical Association, 82(397):249–266.
Friedman, J. H. and Tukey, J. W. (1974). A projection
pursuit algorithm for exploratory data analysis. IEEE
Trans. Comput., 23(9):881–890.
Guo, Q., Wu, W., Massart, D., Boucon, C., and de Jong,
S. (2001). Feature selection in sequential projection
pursuit. Analytica Chimica Acta, 446(1-2):85–96.
Guo, Q., Wu, W., Questier, F., Massart, D., Boucon, C., and
de Jong, S. (2000). Sequential projection pursuit using
genetic algorithms for data mining of analytical data.
Analytical Chemestry, 72(13):2846–2855.
Holland, J. H. (1975). Adaptation in Natural and Artificial
Systems. University of Michigan Press, Ann Arbor,
MI, USA.
Huber, P. J. (1985). Projection pursuit. Annals of Statistics,
13(2):435–475.
Jee, J. R. (2009). Projection pursuit. Wiley Interdisciplinary
Reviews: Computational Statistics, 1(2):208–215.
Jones, M. C. and Sibson, R. (1987). What is projection pur-
suit? Journal of the Royal Statistical Society. Series A
(General), 150(1):pp. 1–37.
ON THE CROSSOVER OPERATOR FOR GA-BASED OPTIMIZERS IN SEQUENTIAL PROJECTION PURSUIT
101