Yasin, Abuhasan (2016) work, the Naive Bayes’
classifier shows precision and recall of 94% when
applied to a dataset from the Nazario phishing corpus
which consists of 5940 legit and 4598 phishing
emails.
5 CONCLUSION AND FUTURE
WORK
Phishing emails are tricking recipients to either give
their credentials (i.e. bank account number,
passwords) or force them unconsciously to click on a
malicious link which will do nothing but harm to the
receiver's computer.
This work proposed a new approach to applying
NLP and Naïve Bayes’ classifier to detect phishing
emails. Based on the achieved results, the classifier
offers a higher accuracy than other works that used
the same dataset in the literature that is 97.21%.
Future work includes the application of the Apriori
algorithm to find associations between the significant
words in phishing emails. Furthermore, combining all
algorithms and testing the new model on the dataset
without headers is on our future work plans.
REFERENCES
Aggarwal, S., Kumar, V., & Sudarsan, S. D. (2014,
September). Identification and detection of phishing
emails using natural language processing techniques. In
Proceedings of the 7th International Conference on
Security of Information and Networks (pp. 217-222).
Ahmed, I., Guan, D., & Chung, T. C. (2014). Sms
classification based on naive bayes classifier and apriori
algorithm frequent itemset. International Journal of
machine Learning and computing, 4(2), 183.
Alhawi, O. M., Baldwin, J., & Dehghantanha, A. (2018).
Leveraging machine learning techniques for windows
ransomware network traffic detection. In Cyber Threat
Intelligence (pp. 93-106). Springer, Cham.
Basnet, R., Mukkamala, S., & Sung, A. H. (2008).
Detection of phishing attacks: A machine learning
approach. In Soft computing applications in industry
(pp. 373-383). Springer, Berlin, Heidelberg
Basnet, R. B., Sung, A. H., & Liu, Q. (2011). Rule-based
phishing attack detection. In Proceedings of the
International Conference on Security and Management
(SAM) (p. 1). The Steering Committee of The World
Congress in Computer Science, Computer Engineering
and Applied Computing (WorldComp).
Feng, W., Sun, J., Zhang, L., Cao, C., & Yang, Q. (2016,
December). A support vector machine based naive
Bayes algorithm for spam filtering. In 2016 IEEE 35th
International Performance Computing and
Communications Conference (IPCCC) (pp. 1-8). IEEE.
Hasan, K. Z., Hasan, M. Z., & Zahan, N. (2019, July).
Automated Prediction of Phishing Websites Using
Deep Convolutional Neural Network. In International
Conference on Computer, Communication, Chemical,
Materials and Electronic Engineering (IC4ME2) (pp. 1-
4). IEEE.
Hiransha, M., Unnithan, N. A., Vinayakumar, R., Soman,
K., & Verma, A. D. R. (2018, March). Deep learning
based phishing e-mail detection. In Proc. 1st
AntiPhishing Shared Pilot 4th ACM Int. Workshop
Secur. Privacy Anal. (IWSPA). Tempe, AZ, USA.
James, J., Sandhya, L., & Thomas, C. (2013, December).
Detection of phishing URLs using machine learning
techniques. In 2013 international conference on control
communication and computing (ICCC) (pp. 304-309).
IEEE.
Jason Milletary. (2005, May). Technical trends in Phishing
attacks
Ma, L., Ofoghi, B., Watters, P., & Brown, S. (2009, July).
Detecting phishing emails using hybrid features. In
2009 Symposia and Workshops on Ubiquitous,
Autonomic and Trusted Computing (pp. 493-497).
IEEE.
Miyamoto, D., Hazeyama, H., & Kadobayashi, Y. (2008,
November). An evaluation of machine learning-based
methods for detection of phishing sites. In International
Conference on Neural Information Processing (pp. 539-
546). Springer, Berlin, Heidelberg.
Verma, R. M., Zeng, V., & Faridi, H. (2019, November).
Data Quality for Security Challenges: Case Studies of
Phishing, Malware and Intrusion Detection Datasets. In
Proceedings of the 2019 ACM SIGSAC Conference on
Computer and Communications Security (pp. 2605-
2607).
Vinayakumar, R., Barathi Ganesh, H. B., Anand Kumar,
M., & Soman, K. P. (2018). Deep Anti-PhishNet:
Applying Deep Neural Networks for Phishing Email
Detection. CEN-AISecurity@ IWSPA, 40-50.
Vinayakumar, R., Soman, K. P., Velan, K. S., & Ganorkar,
S. (2017, September). Evaluating shallow and deep
networks for ransomware detection and classification.
In 2017 International Conference on Advances in
Computing, Communications and Informatics
(ICACCI) (pp. 259-265). IEEE.
Yasin, A., & Abuhasan, A. (2016). An intelligent
classification model for phishing email detection. arXiv
preprint arXiv:1608.02196.