A Machine Learning Workflow to Address Credit Default Prediction
Rambod Rahmani, Marco Parola, Mario Cimino
2024
Abstract
Due to the recent increase in interest in Financial Technology (FinTech), applications like credit default prediction (CDP) are gaining significant industrial and academic attention. In this regard, CDP plays a crucial role in assessing the creditworthiness of individuals and businesses, enabling lenders to make informed deci-sions regarding loan approvals and risk management. In this paper, we propose a workflow-based approach to improve CDP, which refers to the task of assessing the probability that a borrower will default on his or her credit obligations. The workflow consists of multiple steps, each designed to leverage the strengths of different techniques featured in machine learning pipelines and, thus best solve the CDP task. We employ a comprehensive and systematic approach starting with data preprocessing using Weight of Evidence encoding, a technique that ensures in a single-shot data scaling by removing outliers, handling missing values, and making data uniform for models working with different data types. Next, we train several families of learning models, introducing ensemble techniques to build more robust models and hyperparameter optimization via multi-objective genetic algorithms to consider both predictive accuracy and financial aspects. Our research aims at contributing to the FinTech industry in providing a tool to move toward more accurate and reliable credit risk assessment, benefiting both lenders and borrowers.
DownloadPaper Citation
in Harvard Style
Rahmani R., Parola M. and Cimino M. (2024). A Machine Learning Workflow to Address Credit Default Prediction. In Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-692-7, SciTePress, pages 714-720. DOI: 10.5220/0012640200003690
in Bibtex Style
@conference{iceis24,
author={Rambod Rahmani and Marco Parola and Mario Cimino},
title={A Machine Learning Workflow to Address Credit Default Prediction},
booktitle={Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2024},
pages={714-720},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012640200003690},
isbn={978-989-758-692-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - A Machine Learning Workflow to Address Credit Default Prediction
SN - 978-989-758-692-7
AU - Rahmani R.
AU - Parola M.
AU - Cimino M.
PY - 2024
SP - 714
EP - 720
DO - 10.5220/0012640200003690
PB - SciTePress