Exploring the Relationships between Data Complexity and Classification Diversity in Ensembles
Nathan Garcia, Frederico Tiggeman, Eduardo Borges, Giancarlo Lucca, Helida Santos, Graçaliz Dimuro, Graçaliz Dimuro
2021
Abstract
Several classification techniques have been proposed in the last years. Each approach is best suited for a particular classification problem, i.e., a classification algorithm may not effectively or efficiently recognize some patterns in complex data. Selecting the best-tuned solution may be prohibitive. Methods for combining classifiers have also been proposed aiming at improving the generalization ability and classification results. In this paper, we analyze geometrical features of the data class distribution and the diversity of the base classifiers to understand better the performance of an ensemble approach based on stacking. The experimental evaluation was conducted using 32 real datasets, twelve data complexity measures, five diversity measures, and five heterogeneous classification algorithms. The results show that stacked generalization outperforms the best individual base classifier when there is a combination of complex and imbalanced data with diverse predictions among weak learners.
DownloadPaper Citation
in Harvard Style
Garcia N., Tiggeman F., Borges E., Lucca G., Santos H. and Dimuro G. (2021). Exploring the Relationships between Data Complexity and Classification Diversity in Ensembles. In Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-509-8, pages 652-659. DOI: 10.5220/0010440006520659
in Bibtex Style
@conference{iceis21,
author={Nathan Garcia and Frederico Tiggeman and Eduardo Borges and Giancarlo Lucca and Helida Santos and Graçaliz Dimuro},
title={Exploring the Relationships between Data Complexity and Classification Diversity in Ensembles},
booktitle={Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2021},
pages={652-659},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010440006520659},
isbn={978-989-758-509-8},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - Exploring the Relationships between Data Complexity and Classification Diversity in Ensembles
SN - 978-989-758-509-8
AU - Garcia N.
AU - Tiggeman F.
AU - Borges E.
AU - Lucca G.
AU - Santos H.
AU - Dimuro G.
PY - 2021
SP - 652
EP - 659
DO - 10.5220/0010440006520659