Considering all evaluation metrics, the best-
performing model is AdaBoost.
4 DISCUSSION
Essentially, the prediction problem is a supervised
learning task that uses ML models to implicitly
capture the underlying patterns in the manual
approval process.
The optimal model obtained in this study is
AdaBoost, which is consistent with the choice made
by previous researchers (Kumar et al. 2022).
Specifically, the performance of this model is
significantly better than the results of Mridha et al
(Mridha et al. 2022). This is due to the ensemble
learning algorithm's excellent generalization
capabilities, which were not examined by Mridha et
al. Additionally, the data preprocessing process
differs between the two studies.
The author notes that XGBoost achieved 99%
accuracy on the training set. This indicates that the
model suffered from overfitting during training,
which explains why the DT-based model while
performing best on Precision, does not perform well
on the metrics of Accuracy and F1-score. One
possible explanation is that the training set was too
small.
Compared to previous studies, this paper's
advantage lies in its finer and more reasonable data
preprocessing. This can be seen from the fact that the
author's model still outperforms Mridha et al.'s model
in terms of logistic regression (Mridha et al. 2022).
Due to the shortcomings of this paper, and the
small size of the dataset, some models face the
challenge of balancing overfitting and underfitting
during parameter tuning. Future studies should select
a more appropriate dataset to fully explore the
intrinsic relationships among these variables.
5 CONCLUSION
This paper applies ML techniques to the prediction of
loan approval outcomes. When predicting loan
approvals, financial institutions typically focus on
positive examples. Whether these borrowers repay on
time and with interest determines whether the
financial institution will make a profit or a loss.
Appropriate preprocessing of the collected dataset
was performed and eight models were trained,
including SVM, KNN, LR, DT, RF, XGBoost,
AdaBoost, and GBDT. The performance of the
models was evaluated using Precision, Accuracy, and
F1-score. The author concluded that the DT-based
learning method AdaBoost produced the best
prediction results, with a remarkable accuracy of
84.95%.
This study demonstrates the potential of ML in the
financial sector, specifically in reducing costs and
increasing efficiency. An example of this is the
possibility of building an automatic loan approval
system using the AdaBoost model in the future. This
tool can help financial institutions review loan
applications efficiently and fairly, streamline the loan
approval process, and promote financial inclusion
while improving efficiency.
Also, this research presents an innovative method
for more accurate loan approval in the financial sector
and provides reliable support for intelligent financial
decision-making. The finding extends the empirical
research in financial technology and provides
valuable insights for optimizing decisions during the
current digital transformation in the financial sector.
REFERENCES
A. Gupta, V. Pant, S. Kumar, P. K. Bansal, “Bank Loan
Prediction System using Machine Learning,” in 2020
9th International Conference System Modeling and
Advancement in Research Trends, (IEEE, 2020), pp.
423–426.
P. K. Ozili, Forum for Social Economics 50 (4), 457-479
(2021).
M. A. Sheikh, A. K. Goel, T. Kumar, “An approach for
prediction of loan approval using machine learning
algorithm,” in 2020 International Conference on
Electronics and Sustainable Communication Systems,
(IEEE, 2020), pp. 490–494.
V. Singh, A. Yadav, R. Awasthi, G. N. Partheeban,
“Prediction of modernized loan approval system based
on machine learning approach,” in 2021 International
Conference on Intelligent Technologies, (IEEE, 2021),
pp. 1–4.
P. Tumuluru, L. R. Burra, M. Loukya, S. Bhavana, H.
CSaiBaba, N. Sunanda, “Comparative Analysis of
Customer Loan Approval Prediction using Machine
Learning Algorithms,” in 2022 Second International
Conference on Artificial Intelligence and Smart Energy,
(IEEE, 2022), pp. 349–353.
P. S. Saini, A. Bhatnagar, L. Rani, “Loan Approval
Prediction using Machine Learning: A Comparative
Analysis of Classification Algorithms,” in 2023 3rd
International Conference on Advance Computing and
Innovative Technologies in Engineering, (IEEE, 2023),
pp. 1821–1826.
Loan Prediction Problem Dataset, 2019, available at
https://www.kaggle.com/datasets/altruistdelhite04/loa
n-prediction-problem-dataset.