SMS Spam Identification and Risk Assessment Evaluations
Alaa Mohasseb, Benjamin Aziz, Andreas Kanavos
2020
Abstract
Short Message Service (SMS) constitutes one of the most used communication medium. It has become an integral part of people’s lives and like other communication media, SMS texts have been used for propagating spam messages. Despite the fact that a broad range of spam techniques have been proposed to reduce the frequency of such incidents, many difficulties are still present due to text ambiguity; there, the same words can be used in seemingly similar texts which makes it more difficult to identify spam messages. In this paper, we propose an approach for identifying and classifying spam SMS based on the Syntactical features and patterns of the message. The proposed approach consists of four main parts, namely, SMS Pre-processing, Syntactical Features Extraction and Pattern Formulation, Classification and, Risk Analysis. Experimental results show that the proposed approach achieves a good level of accuracy. In addition, to show the effectiveness of handling class imbalance on the classification performance, two additional experiments were conducted using the implementation of the SMOTE algorithm. There, the results depicted that handling class imbalance help in improving identification and classification accuracy. Furthermore, based on the above, a risk model has been proposed that addresses the risk probability and the impact of spam SMS.
DownloadPaper Citation
in Harvard Style
Mohasseb A., Aziz B. and Kanavos A. (2020). SMS Spam Identification and Risk Assessment Evaluations.In Proceedings of the 16th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS, ISBN 978-989-758-478-7, pages 417-424. DOI: 10.5220/0010022404170424
in Bibtex Style
@conference{dmmlacs20,
author={Alaa Mohasseb and Benjamin Aziz and Andreas Kanavos},
title={SMS Spam Identification and Risk Assessment Evaluations},
booktitle={Proceedings of the 16th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS,},
year={2020},
pages={417-424},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010022404170424},
isbn={978-989-758-478-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 16th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS,
TI - SMS Spam Identification and Risk Assessment Evaluations
SN - 978-989-758-478-7
AU - Mohasseb A.
AU - Aziz B.
AU - Kanavos A.
PY - 2020
SP - 417
EP - 424
DO - 10.5220/0010022404170424