Comparing Ensemble and Single Classifiers Using KNN Imputation for Incomplete Heart Disease Datasets

Ismail Moatadid, Ibtissam Abnane, Ali Idri

2023

Abstract

Heart disease remains a significant global health challenge, necessitating accurate and reliable classification techniques for early detection and diagnosis. Choosing a suitable classifier model for a dataset containing missing data is a pervasive issue in medical datasets, which can severely impact the performance of classification models. In this work, we present a comparative analysis of three ensemble techniques (i.e. Random Forest (RF), Extreme Gradient Boosting (XGB), and Bagging) and three single technique (i.e. K-nearest neighbor (KNN), Multilayer Perceptron (MLP), and Support Vector Machine (SVM)) applied to four heart disease medical datasets (i.e. Hungarian, Cleveland, Statlog and HeartDisease). The main objective of this study is to compare the performance of ensemble and single classifiers in handling incomplete heart disease datasets using KNN imputation and identify an effective approach for heart disease classification. We found that, overall, MLP outperformed SVM and KNN across datasets. Moreover, we found that ensemble techniques consistently outperformed the single techniques across multiple metrics and datasets. The ensemble models consistently achieved higher accuracy, precision, recall, F1 score, and AUC values. Therefore, for heart disease classification using KNN imputation, the ensemble techniques, particularly RF, Bagging, and XGB, proved to be the most effective models.

Download


Paper Citation


in Harvard Style

Moatadid I., Abnane I. and Idri A. (2023). Comparing Ensemble and Single Classifiers Using KNN Imputation for Incomplete Heart Disease Datasets. In Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR; ISBN 978-989-758-671-2, SciTePress, pages 379-386. DOI: 10.5220/0012208300003598


in Bibtex Style

@conference{kdir23,
author={Ismail Moatadid and Ibtissam Abnane and Ali Idri},
title={Comparing Ensemble and Single Classifiers Using KNN Imputation for Incomplete Heart Disease Datasets},
booktitle={Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR},
year={2023},
pages={379-386},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012208300003598},
isbn={978-989-758-671-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR
TI - Comparing Ensemble and Single Classifiers Using KNN Imputation for Incomplete Heart Disease Datasets
SN - 978-989-758-671-2
AU - Moatadid I.
AU - Abnane I.
AU - Idri A.
PY - 2023
SP - 379
EP - 386
DO - 10.5220/0012208300003598
PB - SciTePress