fulfilled. With regards to the first objective (identify-
ing the most effective feature set), we extracted differ-
ent features, namely: demographic features, “global”
statistics that are relative to the entire observation
window, dynamic monthly statistics for each month
in the observation window, a vector containing daily
purchase amounts and another vector containing the
number of purchases done towards each MCC. The
best results were achieved when utilising all the fea-
tures except for demographic features.
We fulfilled the second objective (build a setup
that can model and predict customer churn) by ap-
plying the Artificial Neural Network and the Gra-
dient Boosting Model for this problem. The GBM
classifier resulted in the best machine learning frame-
work of this study, obtaining an AUROC score of
0.6927. In addition, we also observed that our learn-
ing framework is capable of correctly identifying 70%
of “Churners”, potentially making it a suitable solu-
tion in Customer Relationship Management.
When handling the third objective (to determine
the minimum amount of customer activity needed in
order to predict its likelihood of churning), exper-
iment results show that decreasing the observation
window to a month’s length does not extensively af-
fect the predictive performance of the classifier, giv-
ing the ability to negotiate between prediction accu-
racy and amount of data observed.
For our final objective (attempting to handle the
cold-start problem using a customer’s demographic
features that can be made accessible upon registra-
tion), we attempted to predict customer churn using
only demographic information and in time, combine
any new purchase data. This experiment showed that
for the current dataset, predicting churn behaviour
using only customer demographics (the customer’s
age, country and currency information), is not any-
where sufficient enough to be able to predict whether
a newly registered customer is going to default or not
in the coming month.
The work described in this paper can be further
improved by augmenting the constructed framework
to a tree-based model in order to extract meaningful
behavioural rules. These can be used to capture the
actual characteristics of churning customers. Further-
more, after addressing the problem of customer churn
prediction, it now makes sense to tackle the problem
of predicting the next purchases of customers. The
approaches performed in collaborative recommenda-
tion systems can be adopted and tweaked to our pur-
pose.
REFERENCES
Bilal Zori
´
c, A. (2016). Predicting customer churn in
banking industry using neural networks. Interdisci-
plinary Description of Complex Systems: INDECS,
14(2):116–124.
Cil, F., Cetinyokus, T., and Gokcen, H. (2018). Knowl-
edge discovery on investment fund transaction his-
tories and socio-demographic characteristics for cus-
tomer churn. International Journal of Intelligent Sys-
tems and Applications in Engineering, 6(4):262–270.
Farquad, M. A. H., Ravi, V., and Raju, S. B. (2014). Churn
prediction using comprehensible support vector ma-
chine: An analytical crm application. Applied Soft
Computing, 19:31–40.
Hsu, T.-C., Liou, S.-T., Wang, Y.-P., Huang, Y.-S., et al.
(2019). Enhanced recurrent neural network for com-
bining static and dynamic features for credit card de-
fault prediction. In ICASSP 2019-2019 IEEE Inter-
national Conference on Acoustics, Speech and Signal
Processing (ICASSP), pages 1572–1576. IEEE.
Jain, S. V. and Jayabalan, M. (2022). Applying machine
learning methods for credit card payment default pre-
diction with cost savings. Biomedical and Business
Applications Using Artificial Neural Networks and
Machine Learning, pages 285–305.
Kaya, E., Dong, X., Suhara, Y., Balcisoy, S., Bozkaya, B.,
et al. (2018). Behavioral attributes and financial churn
prediction. EPJ Data Science, 7(1):41.
Keramati, A., Ghaneei, H., and Mirmohammadi, S. M.
(2016). Developing a prediction model for customer
churn from electronic banking services using data
mining. Financial Innovation, 2(1):10.
Kim, S., Shin, K.-s., and Park, K. (2005). An application
of support vector machines for customer churn analy-
sis: Credit card case. In International Conference on
Natural Computation, pages 636–647. Springer.
Leung, H. C. and Chung, W. (2020). A dynamic classifica-
tion approach to churn prediction in banking industry.
Martens, D., Provost, F., Clark, J., and de Fortuny, E. J.
(2016). Mining massive fine-grained behavior data to
improve predictive analytics. MIS quarterly, 40(4).
Rosa, N. B. d. C. (2019). Gauging and foreseeing customer
churn in the banking industry: a neural network ap-
proach. PhD thesis.
Safinejad, F., Noughabi, E. A. Z., and Far, B. H. (2018). A
fuzzy dynamic model for customer churn prediction in
retail banking industry. In Applications of Data Man-
agement and Analysis, pages 85–101. Springer.
Shirazi, F. and Mohammadi, M. (2019). A big data analyt-
ics model for customer churn prediction in the retiree
segment. International Journal of Information Man-
agement, 48:238–253.
Szmydt, M. (2018). Predicting customer churn in electronic
banking. In International Conference on Business In-
formation Systems, pages 687–696. Springer.
Predicting Customer Behavioural Patterns using a Virtual Credit Card Transactions Dataset
167