SMS Spam Identification and Risk Assessment Evaluations

Alaa Mohasseb, Benjamin Aziz, Andreas Kanavos

2020

Abstract

Short Message Service (SMS) is one of the most widely used communication media. It has become an integral part of people’s lives and, like other communication media, has been used to propagate spam messages. Although a broad range of spam-filtering techniques have been proposed to reduce the frequency of such incidents, many difficulties remain because of text ambiguity: the same words can appear in seemingly similar texts, which makes spam messages harder to identify. In this paper, we propose an approach for identifying and classifying spam SMS based on the syntactical features and patterns of the message. The proposed approach consists of four main parts, namely SMS Pre-processing, Syntactical Features Extraction and Pattern Formulation, Classification, and Risk Analysis. Experimental results show that the proposed approach achieves a good level of accuracy. In addition, to show the effect of handling class imbalance on classification performance, two additional experiments were conducted using an implementation of the SMOTE algorithm. The results showed that handling class imbalance helps improve identification and classification accuracy. Furthermore, based on the above, a risk model is proposed that addresses the risk probability and the impact of spam SMS.
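
The abstract does not say which SMOTE implementation or feature representation was used, so the following is only a minimal sketch of the general SMOTE idea: each synthetic minority sample is an interpolation between a real minority sample and one of its nearest minority neighbours. The function name `smote_sketch`, its parameters, and the toy two-dimensional feature vectors are all assumptions made for illustration and are not taken from the paper.

```python
import random
from math import dist


def smote_sketch(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between minority samples.

    Hypothetical helper for illustration; not the paper's implementation.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Up to k nearest minority neighbours of the base point, excluding itself.
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment from base to other
        synthetic.append(tuple(b + gap * (o - b) for b, o in zip(base, other)))
    return synthetic


# Toy, made-up feature vectors: many legitimate ("ham") messages, few spam.
ham = [(0.1, 0.2), (0.2, 0.1), (0.15, 0.3), (0.3, 0.25), (0.05, 0.1), (0.25, 0.15)]
spam = [(0.9, 0.8), (0.8, 0.9), (1.0, 0.7)]

# Oversample spam until both classes have the same number of samples.
new_spam = smote_sketch(spam, n_new=len(ham) - len(spam))
balanced_spam = spam + new_spam
```

Because every synthetic point lies on a segment between two real spam samples, the oversampled class stays inside the region already occupied by the minority data rather than duplicating existing messages.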

Paper Citation


in Harvard Style

Mohasseb A., Aziz B. and Kanavos A. (2020). SMS Spam Identification and Risk Assessment Evaluations. In Proceedings of the 16th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS, ISBN 978-989-758-478-7, pages 417-424. DOI: 10.5220/0010022404170424


in Bibtex Style

@conference{dmmlacs20,
author={Alaa Mohasseb and Benjamin Aziz and Andreas Kanavos},
title={SMS Spam Identification and Risk Assessment Evaluations},
booktitle={Proceedings of the 16th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS},
year={2020},
pages={417-424},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010022404170424},
isbn={978-989-758-478-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS
TI - SMS Spam Identification and Risk Assessment Evaluations
SN - 978-989-758-478-7
AU - Mohasseb A.
AU - Aziz B.
AU - Kanavos A.
PY - 2020
SP - 417
EP - 424
DO - 10.5220/0010022404170424