loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Vincenza Carchiolo and Michele Malgeri

Affiliation: Dip. Ingegneria Elettrica Elettronica Informatica (DIEEI), Università di Catania, Via Santa Sofia 64, Catania, Italy

Keyword(s): Machine Learning, Data Analysis, Health Informatics.

Abstract: The utilization of machine learning in the prevention of serious diseases such as cancer or heart disease is increasingly crucial. Various studies have demonstrated that enhanced forecasting performance can significantly extend patients’ life expectancy. Naturally, having sufficient datasets is vital for employing techniques to classify the clinical situation of patients, facilitating predictions regarding disease onset. However, available datasets often exhibit imbalances, with more records featuring positive metrics than negative ones. Hence, data preprocessing assumes a pivotal role. In this paper, we aim to assess the impact of machine learning and SMOTE (Synthetic Minority Over-sampling Technique) methods on prediction performance using a given set of examples. Furthermore, we will illustrate how the selection of an appropriate SMOTE process significantly enhances performance, as evidenced by several metrics. Nonetheless, in certain instances, the effect of SMOTE is scarcely not iceable, contingent upon the dataset and machine learning methods employed. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.118.189.178

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Carchiolo, V. and Malgeri, M. (2024). Dataset Balancing in Disease Prediction. In Proceedings of the 13th International Conference on Data Science, Technology and Applications - DATA; ISBN 978-989-758-707-8; ISSN 2184-285X, SciTePress, pages 293-300. DOI: 10.5220/0012755700003756

@conference{data24,
author={Vincenza Carchiolo and Michele Malgeri},
title={Dataset Balancing in Disease Prediction},
booktitle={Proceedings of the 13th International Conference on Data Science, Technology and Applications - DATA},
year={2024},
pages={293-300},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012755700003756},
isbn={978-989-758-707-8},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of the 13th International Conference on Data Science, Technology and Applications - DATA
TI - Dataset Balancing in Disease Prediction
SN - 978-989-758-707-8
IS - 2184-285X
AU - Carchiolo, V.
AU - Malgeri, M.
PY - 2024
SP - 293
EP - 300
DO - 10.5220/0012755700003756
PB - SciTePress