data, thereby improving its generalization
performance when faced with real-world complex
data. However, a number of studies have shown that
determining a universally effective model that
performs well across a majority of institutions is
challenging due to variability in customer
information and credit metrics (Butaru et al 2016).
Therefore, the final model selection and data
processing should depend on the dataset
characteristics, sample distribution, and performance
requirements. In future research, it is recommended
to explore various variants of traditional algorithms
for further enhancement. Simultaneously, expanding
the scope of the study by including a broader range of
economic indicators and user behavioural metrics
into the training data, it is more likely to establish a
model that integrates multiple perspectives,
thoroughly considers data diversity.
4 CONCLUSION
In summary, based on machine learning principles,
this study constructed classification models including
logistic regression, KNN, decision trees and random
forests for predicting credit card default. This study
also compared various SMOTE-based resampling
techniques to correct the data distribution and
evaluate their impact on improving the performance
of predictive models. From the results, the model
based on the random forest algorithm had higher
generalisability and prediction accuracy. The
research also indicates that dealing with class
imbalance data can significantly enhance the
prediction accuracy for minority categories while
maintain robustness for majority groups. Therefore,
the key to building effective default prediction
models lies in the use of sound and superior
algorithms combined with efficient data processing
methods. Future explorations can further delve into
more advanced algorithms and techniques to uncover
more robust results. This will help financial
institutions construct a comprehensive credit risk
prediction and credit assessment system to lower
financial risk. With the continuous strengthening of
financial regulations, default prediction is poised to
become a vital tool in risk management, offering
financial institutions judgment criteria and decision
support, thereby fostering the stable operation and
healthy development of financial markets.
REFERENCES
L. Yin, Y. Ge, K. Xiao, X. Wang, X. Quan,
Neurocomputing, 105, 3-11 (2013).
M. Leo, S. Sharma, K. Maddulety, Risks, 7(1), 29 (2019).
F. Butaru, Q. Chen, B. Clark, et al.,J. Banking & Finance,
72, 218-239 (2016).
J. Zhou, W. Li, J. Wang, S. Ding, C. Xia, Physica A: Stat.
Mech. Appli., 534, 122370 (2019).
Y. Chen, R. Zhang, Complex, 1-13 (2021).
X. Zeng, L. Lu, X. Lu et al. Wireless Internet Tech., 17(18),
166-168 (2020).
E. Kim, J. Lee, H. Shin, et al., Expert Syst Appl, 128, 214-
224 (2019).
E. Ileberi, Y. Sun, Z. Wang, IEEE, 9, 16528-165294 (2021).
S. Hamori, M. Kawai, T. Kume, Y. Murakami, C.
Watanabe, J. Risk Finan. Manage., 11(1), 12 (2018).
Y. Zhu, L. Zhou, C. Xie, G.J Wang, T.V. Nguyen, Int J Prod
Econ, 211, 22-33 (2019).
M. Z. Abedin, C. Guotai, P. Hajek, T. Zhang, Comp. Intel.
Sys., 9(4), 3559-3579 (2023).
T.M. Alam, K. Shaukat, I.A. Hameed, et al., IEEE, 8,
(2020).
N. Chawla, K. Bowyer, L. Hall, J. Arti. Intel. Res., 16, 321-
357 (2002).
H. Han, W. Y. Wang, B. H. Mao, Inter. Conf. Intel. Comp.,
Berlin, Heidelberg: Springer Berlin Heidelberg, (2005).
S. Boughorbel, F. Jarray, M. El-Anbari, PloS one, 12(6),
(2017).