4 CONCLUSIONS
This study explores the application of multi-armed
bandit (MAB) algorithms, including UCB1 and
Thompson Sampling, to the problem of credit card
fraud detection. Transactions are categorized into 52
distinct groups, or 'arms,' based on their types and
amounts. The goal of these models is to pinpoint the
arm with the highest likelihood of fraudulent activity,
thereby directing human investigators to the most
suspect transactions within a vast dataset. The
approach is validated by its performance in
minimizing cumulative regret, which enables
financial institutions to efficiently focus on arms
yielding the highest average reward. Furthermore, the
Thompson Sampling algorithm demonstrates
superior performance over the UCB1 algorithm by
achieving lower cumulative regret, exhibiting small
standard errors akin to those of UCB1, and
maintaining low computational complexity. For
future work, the arms can be formed more reasonably
and comprehensively. In this paper, merely
transaction types and transaction amounts are taken
into account. More features of these transactions can
be utilized since the dataset provides additional 20
unused attributes with advanced algorithms including
the incremental Regressions Trees, KNN, etc., to
cluster different transactions into multiple arms. On
the other hand, it’s claimed that fraudsters will
constantly modify their behaviors in order to escape
detection from existing models, known as concept
drift (Soemers et al., 2018). In this sense, the methods
of clustering different transactions into arms should
also take concept drift into consideration and be
updated regularly. Furthermore, more MAB
algorithms such as LinUCB, Efficient-UCBV,
Discounted UCB and Sliding window UCB (Garivier
and Moulines, 2008) can be implemented and tested
so that the computational complexity and cumulative
regret can be further reduced, or the concept drift may
be better handled.
REFERENCES
Lattimore, T., Szepesvári, C., 2020. Bandit Algorithms.
Cambridge University Press.
Mukherjee, S., Naveen, K. P., Nandan, S., Balaraman, R.,
2018. Efficient-UCBV: An almost optimal algorithm
using variance estimates. Proceedings of the AAAI
Conference on Artificial Intelligence.
Soemers, D., Brys, T., Driessens, K., Winands, M., Nowé,
A., 2018. Adapting to concept drift in credit card
transaction data streams using contextual bandits and
decision trees. Proceedings of the AAAI Conference on
Artificial Intelligence.
Awoyemi, J., Adetunmbi, A., and Oluwadare S., 207.
Credit card fraud detection using Machine Learning
Techniques: A Comparative Analysis. 2017
International Conf. on Computing Networking and
Informatics (ICCNI).
Garivier, A., Moulines, E., 2008. On upper-confidence
bound policies for non-stationary bandit problems.
Bernard P., El Mekkaoui De Freitas N., Maillet B., 2019. A
financial fraud detection indicator for investors: An
IDeA. Annals of Operations Research.
Ferreira, F., Meidutė-Kavaliauskienė, I., 2019. Toward a
sustainable supply chain for Social Credit: Learning by
experience using single-valued neutrosophic sets and
fuzzy cognitive maps. Annals of Operations Research.
Sariannidis, N., Papadakis, S., Garefalakis A., Lemonakis
C., Kyriaki-Argyro T., 2019. Default avoidance on
credit card portfolios using accounting,
Demographical and exploratory factors: Decision
making based on machine learning (ML) techniques.
Annals of Operations Research.
Zhang, S., 2022. Challenges in KNN classification. IEEE
Transactions on Knowledge and Data Engineering.
Dornadula, V., Geetha, S., 2019. Credit card fraud
detection using machine learning algorithms. Procedia
Computer Science.