the Chinese features. The second is the parameters of
the models is assigned without any standards, so the
parameters may affect the results. Therefore, the grid
search is applied for searching the best parameters of
the models and the comparison of the models becomes
exacter, because all performance of the models is the
best with the best parameters.
The standard for evaluation includes the 5-fold
accuracy rate (from 5 times cross validation), the ROC
rate, the fit time, the standard variance and mean of
accuracy rate. Those ratios can provide
comprehensive evaluation in effectiveness, stability
and efficiency. According to the standard, the 3
models: the random forest, decision tree and bagging
show outstanding performance. The random forest has
the highest ROC and the accuracy rate, so the random
forest is the best in effectiveness. The decision tree has
the similar accuracy and ROC, while the fit time is
extremely smaller than the random forest, so the
decision tree is the most efficient model. The bagging
performs best in the standard variance of the accuracy
rates, so the performance of the bagging is steadier
than the others.
For accessing the best result, the paper tests the
effect of feature selection, while the performance is
very bad. The 4 models: the random forest, decision
tree, bagging and SVM suffers from obvious fall in
accuracy, except for the logistic regression.
The ratios calculated in the paper do not have
much academic value, while the comparison results
have practical and academic value in some distance.
Meanwhile, the thinking of data processing, model
training and comparison standard shown above may
inspire some later scholars to test the application of the
new models in risk management field.
REFERENCES
A.~L.~Samuel, “Some Studies in Machine Learning Using
the Game of Checkers,” IBM J. Res. Dev., vol. 3, no. 3,
1959.
D. DURAND, Risk elements in consumer installment
financing, vol. 17. New York: National Bureau of
Economic Research, 1941.
F. Butaru, Q. Chen, B. Clark, S. Das, A. W. Lo, and A.
Siddique, “Risk and risk management in the credit card
industry,” J. Bank. Financ., vol. 72, pp. 218–239, Nov.
2016, doi: 10.1016/j.jbankfin.2016.07.015.
H. Han, W. Y. Wang, and B. H. Mao, “Borderline-SMOTE:
A New Over-Sampling Method in Imbalanced Data
Sets Learning,” Lect. Notes Comput. Sci. (including
Subser. Lect. Notes Artif. Intell. Lect. Notes
Bioinformatics), vol. 3644 LNCS, pp. 878–887, 2005,
doi: 10.1007/11538059_91.
J. B. Heaton, N. G. Polson, and J. H. Witte, “Deep learning
for finance: deep portfolios,” Applied Stochastic
Models in Business and Industry. 2017, doi:
10.1002/asmb.2209.
J. H. Myers and E. W. Forgy, “The Development of
Numerical Credit Evaluation Systems,” J. Am. Stat.
Assoc., vol. 58, no. 303, pp. 799–806, 1963, doi:
10.1080/01621459.1963.10500889.
J. Rong, “A study on the relationship between credit card
quota and credit card consumption attitude(in
Chinese),” Nanjing Univ., p. 67, 2018.
M. K. Linnenluecke, X. Chen, X. Ling, T. Smith, and Y.
Zhu, “Research in finance: A review of influential
publications and a research agenda,” Pacific Basin
Finance Journal, vol. 43. Elsevier B.V., pp. 188–199,
Jun. 01, 2017, doi: 10.1016/j.pacfin.2017.04.005.
M. Leo, S. Sharma, and K. Maddulety, “Machine learning
in banking risk management: A literature review,”
Risks, vol. 7, no. 1, Mar. 2019, doi:
10.3390/risks7010029.
M. Z. Husejinovic Admel, Keco Dino, “Application of
Machine Learning Algorithms in Credit Card Default
Payment Prediction,” Int. J. Sci. Res., vol. 7, no. 10, pp.
425–426, 2018, doi: 10.15373/22778179#husejinovic.
N. Rtayli and N. Enneya, “Enhanced credit card fraud
detection based on SVM-recursive feature elimination
and hyper-parameters optimization,” J. Inf. Secur.
Appl., vol. 55, no. September, p. 102596, 2020, doi:
10.1016/j.jisa.2020.102596.
N. Sariannidis, S. Papadakis, A. Garefalakis, C. Lemonakis,
and T. Kyriaki-Argyro, “Default avoidance on credit
card portfolios using accounting, demographical and
exploratory factors: decision making based on machine
learning (ML) techniques,” Ann. Oper. Res., Nov.
2019, doi: 10.1007/s10479-019-03188-0.
P. K. Ozili and T. Arun, “Spillover of COVID-19: Impact
on the Global Economy,” SSRN Electron. J., Mar.
2020, doi: 10.2139/ssrn.3562570.
R. J. Bolton, and D. J. Hand, “Unsupervised Profiling
Methods for Fraud Detection,” Proc. Credit Scoring
Credit Control VII, 2001.
R. W. Johnson, “Legal, social and economic issues
implementing scoring in the US.,” Thomas, L. C \,
1992.
S. A. Block and P. M. Vaaler, “The price of democracy:
Sovereign risk ratings, bond spreads and political
business cycles in developing countries,” J. Int. Money
Financ., vol. 23, no. 6, pp. 917–946, Oct. 2004, doi:
10.1016/j.jimonfin.2004.05.001.
S. B. Kotsiantis, I. D. Zaharakis, and P. E. Pintelas,
“Machine learning: A review of classification and
combining techniques,” Artif. Intell. Rev., vol. 26, no.
3, pp. 159–190, Nov. 2006, doi: 10.1007/s10462-007-
9052-3.
S. Yang and H. Zhang, “Comparison of Several Data
Mining Methods in Credit Card Default Prediction,”
Intell. Inf. Manag., vol. 10, pp. 115–122, 2018, doi:
10.4236/iim.2018.105010.