these strategies and exploring hybrid approaches for
better performance on imbalanced datasets.
The classification model in this study was trained
on the HI-small and HI-medium datasets instead of
the HI-large datasets. We chose these three machine
algorithms, which show the performance; the HI-
large dataset will require more GPU power. Thus,
cloud computing services such as Amazon Web
Services (AWS) could be used. Future work will
explore hybrid approaches that combine
oversampling and undersampling strategies to
address the limitations of each model.
ACKNOWLEDGEMENT
This work is partly supported by the International
Science Partnerships Fund (ISPF: 1185068545) and
VC Research (VCR 000233).
REFERENCES
Aldania, A. N. A., Soleh, A. M., & Notodiputro, K. A.
(2023). A comparative study of CatBoost and double
random forest for multi-class classification. Jurnal
RESTI (Rekayasa Sistem Dan Teknologi Informasi),
7(1), 129–137.
Ambe, K. N. (2024). [Enter Paper Title] Analysis of the risk
associated with bank crimes in Africa. Analysis of the
Risk Associated with Bank Crimes in Africa (January
27, 2024). https://papers.ssrn.com/sol3/
papers.cfm?abstract_id=4708322
Arefin, M. S. (2024). A probabilistic approach for missing
data imputation. Journal of Applied Mathematics and
Computation, 8(3), 473–796. Wiley Online Library
Bakry, A. N., Alsharkawy, A. S., Farag, M. S., & Raslan,
K. R. (2024). Combating Financial Crimes with
Unsupervised Learning Techniques: Clustering and
Dimensionality Reduction for Anti-Money Laundering.
Al-Azhar Bulletin of Science, 35, 10–22.
Bao, L., Juan, C., Li, J., & Zhang, Y. (2016). Boosted near-
miss under-sampling on SVM ensembles for concept
detection in large-scale imbalanced datasets.
Neurocomputing, 172, 198–206.
Breiman, L. (2001). [No title found]. Machine Learning,
45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer,
W. P. (2002). SMOTE: Synthetic minority over-
sampling technique. Journal of Artificial Intelligence
Research, 16, 321–357.
Chen, C.-H., & Honavar, V. (1995). A neural architecture
for content as well as address-based storage and recall:
Theory and applications. Connection Science, 7(3),
281–300.
Chen, T., He, T., Benesty, M., Khotilovich, V., Tang, Y.,
Cho, H., Chen, K., Mitchell, R., Cano, I., & Zhou, T.
(2015). Xgboost: Extreme gradient boosting. R
Package Version 0.4-2, 1(4), 1–4.
Cherif, A., Badhib, A., Ammar, H., Alshehri, S., Kalkatawi,
M., & Imine, A. (2023). Credit card fraud detection in
the era of disruptive technologies: A systematic review.
Journal of King Saud University-Computer and
Information Sciences, 35(1), 145–174.
Dastidar, K. G., Caelen, O., & Granitzer, M. (2024).
Machine Learning Methods for Credit Card Fraud
Detection: A Survey. IEEE Access.
https://ieeexplore.ieee.org/abstract/document/1073724
1/
Dobrowolski, Z. (2024). Implementing a Sustainable
Model for Anti-Money Laundering in the Context of
the Sustainable Development Goals. Sustainability,
12(1), 244.
Gaviyau, W., & Sibindi, A. B. (2023). Global anti-money
laundering and combating terrorism financing
regulatory framework: A critique. Journal of Risk and
Financial Management, 16(7), 313.
Gurcan, F., & Soylu, A. (2024). Learning from Imbalanced
Data: Integration of Advanced Resampling Techniques
and Machine Learning Models for Enhanced Cancer
Diagnosis and Prognosis. Cancers, 16(19), 3417.
He, H., Bai, Y., Garcia, E. A., & Li, S. (2008). ADASYN:
Adaptive synthetic sampling approach for imbalanced
learning. 2008 IEEE International Joint Conference on
Neural Networks (IEEE World Congress on
Computational Intelligence), 1322–1328.
https://ieeexplore.ieee.org/abstract/document/4633969/
Hilal, W., Gadsden, S. A., & Yawney, J. (2022). Financial
fraud: A review of anomaly detection techniques and
recent advances. Expert Systems With Applications,
193, 116429.
Khalil, A. A., Liu, Z., Fathalla, A., Ali, A., & Salah, A.
(2024). Machine Learning based Method for Insurance
Fraud Detection on Class Imbalance Datasets with
Missing Values. IEEE Access. https://ieeexplore.
ieee.org/abstract/document/10695046/
Kokori, E., Patel, R., Olatunji, G., Ukoaka, B. M.,
Abraham, I. C., Ajekiigbe, V. O., Kwape, J. M.,
Babalola, A. E., Udam, N. G., & Aderinto, N. (2024).
Machine learning in predicting heart failure survival: A
review of current models and future prospects. Heart
Failure Reviews. https://doi.org/10.1007/s10741-024-
10474-y
Koziarski, M. (2020). Radial-based undersampling for
imbalanced data classification. Pattern Recognition,
102, 107262.
Masrom, S., Tarmizi, M. A., Halid, S., Rahman, R. A., Abd
Rahman, A. S., & Ibrahim, R. (2023). Machine learning
in predicting anti-money laundering compliance with
protection motivation theory among professional
accountants. International Journal of Advanced and
Applied Sciences, 10(7), 48–53.
Moromoke, O., Aro, O., Adepetun, A., & Iwalehin, O.
(2024). Navigating Regulatory Challenges In Digital
Finance: A Strategic Approach.
https://www.researchgate.net/profile/Anthony-
Adepetun/publication/385247337_navigating_regulato