Unbalanced Data Classification in Fraud Detection by Introducing a Multidimensional Space Analysis

Roberto Saia

Abstract

Fraud is an increasingly important problem in the e-commerce age, where an enormous number of financial transactions are carried out with electronic payment instruments such as credit cards. The sheer volume of operations rules out human-driven solutions, so the only viable approach is to adopt automatic solutions able to distinguish legitimate transactions from fraudulent ones. For this reason, the development of techniques that perform this task efficiently is today a very active research field involving a large number of researchers around the world. Unfortunately, this is not an easy task: defining effective fraud detection approaches is made difficult by a series of well-known problems, the most important being the unbalanced class distribution of the data, which significantly reduces the performance of machine learning approaches. The approach proposed in this paper addresses this limitation by exploiting three different similarity metrics to define a three-dimensional evaluation space. Its main objective is a better characterization of financial transactions in terms of the two possible target classes (legitimate or fraudulent), countering the information asymmetry that gives rise to the class imbalance problem. A series of experiments conducted on real-world data of different sizes and imbalance levels demonstrates the effectiveness of the proposed approach with respect to state-of-the-art solutions.
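To make the idea of a three-dimensional evaluation space concrete, the following is a minimal sketch and not the paper's implementation. The abstract does not name the three similarity metrics, so cosine, Euclidean-based, and Manhattan-based similarity are assumptions chosen purely for illustration, as are the function names, the decision to compare only against known legitimate transactions, and the per-axis thresholds.

```python
import math

# ASSUMPTION: the abstract does not name its three metrics; these are
# illustrative stand-ins, each returning higher values for closer vectors.
def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def euclidean_similarity(a, b):
    # Maps Euclidean distance in [0, inf) to a similarity in (0, 1].
    return 1.0 / (1.0 + math.dist(a, b))

def manhattan_similarity(a, b):
    # Maps Manhattan distance in [0, inf) to a similarity in (0, 1].
    return 1.0 / (1.0 + sum(abs(x - y) for x, y in zip(a, b)))

METRICS = (cosine_similarity, euclidean_similarity, manhattan_similarity)

def to_point(transaction, legitimate_history):
    """Map a transaction to a 3D point: its mean similarity to the known
    legitimate transactions under each of the three metrics."""
    n = len(legitimate_history)
    return tuple(
        sum(metric(transaction, past) for past in legitimate_history) / n
        for metric in METRICS
    )

def is_suspicious(transaction, legitimate_history, thresholds):
    """Hypothetical decision rule: flag the transaction if any coordinate of
    its 3D point falls below the corresponding threshold."""
    point = to_point(transaction, legitimate_history)
    return any(coord < limit for coord, limit in zip(point, thresholds))

# Toy numeric feature vectors (hypothetical: amount, item count, flag).
history = [(100.0, 1.0, 0.0), (110.0, 1.0, 0.0), (95.0, 2.0, 0.0)]
near = (105.0, 1.0, 0.0)
far = (5000.0, 9.0, 1.0)
thresholds = (0.9, 0.05, 0.05)  # hypothetical per-axis limits

print(to_point(near, history), is_suspicious(near, history, thresholds))
print(to_point(far, history), is_suspicious(far, history, thresholds))
```

In this sketch, comparing a new transaction only against legitimate history is one possible way to work around the scarcity of fraudulent examples; whether the paper does the same cannot be determined from the abstract.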


Paper Citation


in Harvard Style

Saia R. (2018). Unbalanced Data Classification in Fraud Detection by Introducing a Multidimensional Space Analysis. In Proceedings of the 3rd International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS, ISBN 978-989-758-296-7, pages 29-40. DOI: 10.5220/0006663000290040


in Bibtex Style

@conference{iotbds18,
author={Roberto Saia},
title={Unbalanced Data Classification in Fraud Detection by Introducing a Multidimensional Space Analysis},
booktitle={Proceedings of the 3rd International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS},
year={2018},
pages={29-40},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006663000290040},
isbn={978-989-758-296-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 3rd International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS
TI - Unbalanced Data Classification in Fraud Detection by Introducing a Multidimensional Space Analysis
SN - 978-989-758-296-7
AU - Saia R.
PY - 2018
SP - 29
EP - 40
DO - 10.5220/0006663000290040