ON THE CROSSOVER OPERATOR FOR GA-BASED OPTIMIZERS IN SEQUENTIAL PROJECTION PURSUIT

Soledad Espezua, Edwin Villanueva, Carlos D. Maciel

Abstract

Sequential Projection Pursuit (SPP) is a useful tool to uncover structures hidden in high-dimensional data by constructing sequentially the basis of a low-dimensional projection space where the structure is exposed. Genetic algorithms (GAs) are promising finders of optimal basis for SPP, but their performance is determined by the choice of the crossover operator. It is unknown until now which operator is more suitable for SPP. In this paper we compare, over four public datasets, the performance of eight crossover operators: three available in literature (arithmetic, single-point and multi-point) and five new proposed here (two hyperconic, two fitness biased and one extension of arithmetic crossover). The proposed hyperconic operators and the multi-point operator showed the best performance, finding high-fitness projections. However, it was noted that the final selection is dependent on the dataset dimension and the timeframe allowed to get the answer. Some guidelines to select the most appropriate operator for each situation are presented.

References

  1. Berro, A., Marie-Sainte, S. L., and Ruiz-Gazen, A. (2010). Genetic algorithms and particle swarm optimization for exploratory projection pursuit. Annals of Mathematics and Artificial Intelligence, 60(1-2, SI):153- 178.
  2. Bolton, R. and Krzanowski, W. (2003). Projection pursuit clustering for exploratory data analysis. Journal of Computational and Graphical Statistics, 12(1):121- 142.
  3. Chiang, S.-S., Chang, C.-I., and Ginsberg, I. (2001). Unsupervised target detection in hyperspectral images using projection pursuit. Geoscience and Remote Sensing, IEEE Transactions on, 39(7):1380 -1391.
  4. Cook, D., Buja, A., and Cabrera, J. (1993). Projection pursuit indexes based on orthonormal function expansions. Journal of Computational and Graphical Statistics, 2(3):225-250.
  5. Daszykowski, M., Kaczmarek, K., Heyden, Y. V., and Walczak, B. (2007). Robust statistics in data analysis - a review: Basic concepts. Chemometrics and Intelligent Laboratory Systems, 85(2):203- 219.
  6. Demirci, O., Clark, V. P., and Calhoun, V. D. (2008). A projection pursuit algorithm to classify individuals using fMRI data: Application to schizophrenia. Neuroimage, 39(4):1774-1782.
  7. Frank, A. and Asuncion, A. (2010). UCI machine learning repository.
  8. Friedman, J. H. (1987). Exploratory projection pursuit. American Statistical Association, 82(397):249-266.
  9. Friedman, J. H. and Tukey, J. W. (1974). A projection pursuit algorithm for exploratory data analysis. IEEE Trans. Comput., 23(9):881-890.
  10. Guo, Q., Wu, W., Massart, D., Boucon, C., and de Jong, S. (2001). Feature selection in sequential projection pursuit. Analytica Chimica Acta, 446(1-2):85-96.
  11. Guo, Q., Wu, W., Questier, F., Massart, D., Boucon, C., and de Jong, S. (2000). Sequential projection pursuit using genetic algorithms for data mining of analytical data. Analytical Chemestry, 72(13):2846-2855.
  12. Holland, J. H. (1975). Adaptation in Natural and Artificial Systems. University of Michigan Press, Ann Arbor, MI, USA.
  13. Huber, P. J. (1985). Projection pursuit. Annals of Statistics, 13(2):435-475.
  14. Jee, J. R. (2009). Projection pursuit. Wiley Interdisciplinary Reviews: Computational Statistics, 1(2):208-215.
  15. Jones, M. C. and Sibson, R. (1987). What is projection pursuit? Journal of the Royal Statistical Society. Series A (General), 150(1):pp. 1-37.
  16. Kruskal, J. (1969). Toward a practical method which helps uncover the structure of a set of multivariate observations by finding the linear transformation which optimizes a new 'index of condensation'. Statistical Computation, pages 427-440.
  17. Lee, E., Cook, D., Klinke, S., and Lumley, T. (2005). Projection pursuit for exploratory supervised classification. Journal of Computational and Graphical Statistics, 14(4):831-846.
  18. Nason, G. (1995). Three-dimensional projection pursuit. J. Royal Statistical Society, Series C, 44:411-430.
  19. Pena, D. and Prieto, F. (2001). Cluster identification using projections. Journal of the American Statistical Association, 96(456):1433-1445. 160th Annual Meeting of the American-Statistical-Association, BOSTON, MASSACHUSETTS, FEB, 2000.
  20. Posse, C. (1990). An Effective 2-dimensional Projection Pursuit algorithm. Communications in StatisticsSimulation and Computation, 19(4):1143-1164.
  21. Posse, C. (1995a). Projection pursuit exploratory data analysis. Computational Statistics and Data Analysis, 20:669-687.
  22. Posse, C. (1995b). Tools for two-dimensional exploratory projection pursuit. Computational and Graphical Statistics, 4(2):83-100.
  23. Ren, Y., Liu, H., Yao, X., and Liu, M. (2007). Prediction of ozone tropospheric degradation rate constants by projection pursuit regression. Analytica Chimica Acta, 589(1):150-158.
  24. Rodriguez-Martinez, E., Goulermas, J. Y., Mu, T., and Ralph, J. F. (2010). Automatic induction of projection pursuit indices. IEEE Transactions on Neural Networks, 21(8):1281-1295.
  25. Srinivas, M. and Patnaik, L. (1994). Genetic Algorithms - A Survey. Computer, 27(6):17-26.
  26. Webb-Robertson, B. J. M., Jarman, K. H., Harvey, S. D., Posse, C., and Wright, B. W. (2005). An improved optimization algorithm and bayes factor termination criterion for sequential projection pursuit. Chemometrics and Intelligent Laboratory Systems, 77(1-2):149- 160.
Download


Paper Citation


in Harvard Style

Espezua S., Villanueva E. and D. Maciel C. (2012). ON THE CROSSOVER OPERATOR FOR GA-BASED OPTIMIZERS IN SEQUENTIAL PROJECTION PURSUIT . In Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-8425-98-0, pages 93-102. DOI: 10.5220/0003784400930102


in Bibtex Style

@conference{icpram12,
author={Soledad Espezua and Edwin Villanueva and Carlos D. Maciel},
title={ON THE CROSSOVER OPERATOR FOR GA-BASED OPTIMIZERS IN SEQUENTIAL PROJECTION PURSUIT},
booktitle={Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2012},
pages={93-102},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003784400930102},
isbn={978-989-8425-98-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - ON THE CROSSOVER OPERATOR FOR GA-BASED OPTIMIZERS IN SEQUENTIAL PROJECTION PURSUIT
SN - 978-989-8425-98-0
AU - Espezua S.
AU - Villanueva E.
AU - D. Maciel C.
PY - 2012
SP - 93
EP - 102
DO - 10.5220/0003784400930102