A CART-based Genetic Algorithm for Constructing Higher Accuracy Decision Trees

Elif Ersoy, Erinç Albey, Enis Kayış

2020

Abstract

Decision trees are among the most popular classification methods due to ease of implementation and simple interpretation. In traditional methods like CART (classification and regression tree), ID4, C4.5; trees are constructed by myopic, greedy top-down induction strategy. In this strategy, the possible impact of future splits in the tree is not considered while determining each split in the tree. Therefore, the generated tree cannot be the optimal solution for the classification problem. In this paper, to improve the accuracy of the decision trees, we propose a genetic algorithm with a genuine chromosome structure. We also address the selection of the initial population by considering a blend of randomly generated solutions and solutions from traditional, greedy tree generation algorithms which is constructed for reduced problem instances. The performance of the proposed genetic algorithm is tested using different datasets, varying bounds on the depth of the resulting trees and using different initial population blends within the mentioned varieties. Results reveal that the performance of the proposed genetic algorithm is superior to that of CART in almost all datasets used in the analysis.

Download


Paper Citation


in Harvard Style

Ersoy E., Albey E. and Kayış E. (2020). A CART-based Genetic Algorithm for Constructing Higher Accuracy Decision Trees.In Proceedings of the 9th International Conference on Data Science, Technology and Applications - Volume 1: DATA, ISBN 978-989-758-440-4, pages 328-338. DOI: 10.5220/0009893903280338


in Bibtex Style

@conference{data20,
author={Elif Ersoy and Erinç Albey and Enis Kayış},
title={A CART-based Genetic Algorithm for Constructing Higher Accuracy Decision Trees},
booktitle={Proceedings of the 9th International Conference on Data Science, Technology and Applications - Volume 1: DATA,},
year={2020},
pages={328-338},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009893903280338},
isbn={978-989-758-440-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 9th International Conference on Data Science, Technology and Applications - Volume 1: DATA,
TI - A CART-based Genetic Algorithm for Constructing Higher Accuracy Decision Trees
SN - 978-989-758-440-4
AU - Ersoy E.
AU - Albey E.
AU - Kayış E.
PY - 2020
SP - 328
EP - 338
DO - 10.5220/0009893903280338