loading
Documents

Research.Publish.Connect.

Paper

Authors: Amaal R. Al Shorman 1 ; Hossam Faris 1 ; Pedro A. Castillo 2 ; J. J. Merelo 2 and Nailah Al-Madi 3

Affiliations: 1 Business Information Technology Department, King Abdullah II School for Information Technology, The University of Jordan, Amman and Jordan ; 2 Department of Computer Architecture and Computer Technology, ETSIIT and CITIC, University of Granada, Granada and Spain ; 3 Computer Science Department, Princess Sumaya University for Technology, Amman and Jordan

ISBN: 978-989-758-327-8

Keyword(s): Classification, Genetic Programming, Preprocessing, Standardization Methods.

Abstract: Genetic programming (GP) is a powerful classification technique. It is interpretable and it can dynamically build very complex expressions that maximize or minimize some fitness functions. It has a capacity to model very complex problems in the area of Machine Learning, Data Mining and Pattern Recognition. Nevertheless, GP has a high computational complexity time. On the other side, data standardization is one of the most important pre-processing steps in machine learning. The purpose of this step is to unify the scale of all input features to have equal contribution to the model. The objective of this paper is to investigate the influence of input data standardization methods on GP, and how it affects its prediction accuracy. Six different methods of input data standardization were checked in order to determine which one allows to achieve the most accurate result with lowest computational cost. The simulations have been implemented on ten benchmarked datasets with three different sce narios (varying the population size and number of generations). The results showed that the computational efficiency of GP is highly enhanced when coupled with some standardization methods, specifically Min-Max method for scenario I and Vector method for scenario II, and scenario III. Whereas, Manhattan and Z-Score methods had the worst results for all three scenarios. (More)

PDF ImageFull Text

Download
Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.234.51.17

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Shorman, A.; Faris, H.; Castillo, P.; Merelo, J. and Al-Madi, N. (2018). The Influence of Input Data Standardization Methods on the Prediction Accuracy of Genetic Programming Generated Classifiers.In Proceedings of the 10th International Joint Conference on Computational Intelligence - Volume 1: IJCCI, ISBN 978-989-758-327-8, pages 79-85. DOI: 10.5220/0006959000790085

@conference{ijcci18,
author={Amaal R. Al Shorman. and Hossam Faris. and Pedro A. Castillo. and J. J. Merelo. and Nailah Al{-}Madi.},
title={The Influence of Input Data Standardization Methods on the Prediction Accuracy of Genetic Programming Generated Classifiers},
booktitle={Proceedings of the 10th International Joint Conference on Computational Intelligence - Volume 1: IJCCI,},
year={2018},
pages={79-85},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006959000790085},
isbn={978-989-758-327-8},
}

TY - CONF

JO - Proceedings of the 10th International Joint Conference on Computational Intelligence - Volume 1: IJCCI,
TI - The Influence of Input Data Standardization Methods on the Prediction Accuracy of Genetic Programming Generated Classifiers
SN - 978-989-758-327-8
AU - Shorman, A.
AU - Faris, H.
AU - Castillo, P.
AU - Merelo, J.
AU - Al-Madi, N.
PY - 2018
SP - 79
EP - 85
DO - 10.5220/0006959000790085

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.