AUTOMATIC EMAIL CLASSIFICATION USING USER PREFERENCE ONTOLOGY

Niladri Chatterjee, Saroj Kaushik, Smit Rastogi, Varun Dua

Abstract

In this work we have extended and implemented an ontology based approach for email classification based on user characteristics proposed by Kim et al.(2007). The approach focuses on finding relationships between user interests and their responses to emails. Rules and Ontology are created using the data and metadata of user characteristics, their preferences and responses to emails. Rules and ontology are then used to predict the response of a user to a new email. In Kim et al. (2007) approach, labels to emails were provided manually by a human expert. We have endeavored to remove the human intervention by developing an Automated Email Categorizer to provide label to an email based on its contents. We have also proposed a new term weighing method for emails to incorporate prominence of subject terms. Finally, we have integrated and tested the Ontology Based Classifier in conjunction with Email Categorizer where the former effectively uses the label provided by latter to classify an email based on user preferences.

References

  1. Brewer, D., Thirumalai, S., Gomadam, K., Li, K., 2006. Towards an Ontology Driven Spam Filter. In Proceedings of 22nd International Conference on Data Engineering Workshops.
  2. Itskevitch, J. 2001. Automatic hierarchical e-mail classification using association rules. MS Thesis, Simon Fraser University.
  3. Kim, J., Dou, D., Liu, H., Kwak, D., 2007. Constructing A User Preference Ontology for Anti-spam Mail Systems. In Proceedings of the 20th Conference of the Canadian Society For Computational Studies of intelligence on Advances in Artificial intelligence. Montreal, Canada.
  4. Rennie, J. Ken Lang 2010. 20newsgroup dataset http://people.csail.mit.edu/jrennie/20Newsgroups
  5. Youn S., McLeod D., 2006. Ontology Development Tools for Ontology-Based Knowledge Management. Encyclopedia of E-Commerce, E-government and Mobile Commerce, Idea Group Inc.
  6. Youn, S., Mcleod, D., 2007. Spam Email Classification using an Adaptive Ontology. In Proceedings of 4th International Conference on Information Technology: New Generations (ITNG). Las Vegas, NV.
  7. Youn S., Mcleod D., 2009. Spam Decisions on Gray Email using Personalized Ontologies. In Proceedings of the 2009 ACM symposium on Applied Computing SAC09, Honolulu, Hawaii, U.S.A.
  8. Zhang, L. and Yao, T., 2003. Filtering Junk Email with a Maximum Entropy Model. In ICCPOL03, Shen yang, China.
  9. Zhang, L., Zhu, J., Yao, T., 2004. An Evaluation of Spam Filtering Techniques. In ACM transactions on Asian Language Information Processing, Vol.3 No.4
Download


Paper Citation


in Harvard Style

Chatterjee N., Kaushik S., Rastogi S. and Dua V. (2010). AUTOMATIC EMAIL CLASSIFICATION USING USER PREFERENCE ONTOLOGY . In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010) ISBN 978-989-8425-29-4, pages 165-170. DOI: 10.5220/0003061501650170


in Bibtex Style

@conference{keod10,
author={Niladri Chatterjee and Saroj Kaushik and Smit Rastogi and Varun Dua},
title={AUTOMATIC EMAIL CLASSIFICATION USING USER PREFERENCE ONTOLOGY},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010)},
year={2010},
pages={165-170},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003061501650170},
isbn={978-989-8425-29-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010)
TI - AUTOMATIC EMAIL CLASSIFICATION USING USER PREFERENCE ONTOLOGY
SN - 978-989-8425-29-4
AU - Chatterjee N.
AU - Kaushik S.
AU - Rastogi S.
AU - Dua V.
PY - 2010
SP - 165
EP - 170
DO - 10.5220/0003061501650170