Cross-domain Sentiment Classification using an Adapted Naïve Bayes Approach and Features Derived from Syntax Trees

Srilaxmi Cheeti, Ana Stanescu, Doina Caragea

Abstract

Online product reviews contain information that can assist in the decision making process of new customers looking for various products. To assist customers, supervised learning algorithms can be used to categorize the reviews as either positive or negative, if large amounts of labeled data are available. However, some domains have few or no labeled instances (i.e., reviews), yet a large number of unlabeled instances. Therefore, domain adaptation algorithms that can leverage the knowledge from a source domain to label reviews from a target domain are needed. We address the problem of classifying product reviews using domain adaptation algorithms, in particular, an Adapted Naïve Bayes classifier, and features derived from syntax trees. Our experiments on several cross-domain product review datasets show that this approach produces accurate domain adaptation classifiers for the sentiment classification task.

References

  1. Blitzer, J., Dredze, M., and Pereira, F. (2007). Biographies, bollywood, boomboxes and blenders: Domain adaptation for sentiment classification. In ACL.
  2. Blitzer, J., McDonald, R., and Pereira, F. (2006). Domain adaptation with structural correspondence learning. In Proc. of the 2006 Conference on Empirical Methods in Natural Language Processing, EMNLP 7806. ACL.
  3. Cheeti, S. (2012). Cross-domain sentiment classification using grams derived from syntax trees and an adapted naïve bayes approach (thesis).
  4. Dai, W., Xue, G.-R., Yang, Q., and Yu, Y. (2007). Transferring naive bayes classifiers for text classification. In Proc. of the 22nd national conference on Artificial intelligence - Volume 1, AAAI'07. AAAI Press.
  5. Harb, A., Plantié, M., Dray, G., Roche, M., Trousset, F., and Poncelet, P. (2008). Web opinion mining: how to extract opinions from blogs? In Proc. of the 5th international conference on Soft computing as transdisciplinary science and technology, CSTST 7808. ACM.
  6. Klein, D. and Manning, C. D. (2003). Accurate unlexicalized parsing. In Proc. of the 41st Meeting of the ACL.
  7. Nigam, K., McCallum, A., Thrun, S., and Mitchell, T. (1998). Learning to classify text from labeled and unlabeled documents. In Proc. of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence, AAAI 7898/IAAI 7898. AAAI.
  8. Pan, S. J., Ni, X., Sun, J.-T., Yang, Q., and Chen, Z. (2010). Cross-domain sentiment classification via spectral feature alignment. In Proc. of the 19th international conference on World wide web, WWW 7810. ACM.
  9. Tan, S., Cheng, X., Wang, Y., and Xu, H. (2009). Adapting naive bayes to domain adaptation for sentiment analysis. Advances In Information Retrieval Proceedings, 5478.
  10. Zhang, W., Li, P., and Zhu, Q. (2010). Sentiment classification based on syntax tree pruning and tree kernel. In Proc. of the 2010 Seventh Web Information Systems and Applications Conference, WISA 7810. IEEE Computer Society.
Download


Paper Citation


in Harvard Style

Cheeti S., Stanescu A. and Caragea D. (2013). Cross-domain Sentiment Classification using an Adapted Naïve Bayes Approach and Features Derived from Syntax Trees . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013) ISBN 978-989-8565-75-4, pages 169-176. DOI: 10.5220/0004546501690176


in Bibtex Style

@conference{kdir13,
author={Srilaxmi Cheeti and Ana Stanescu and Doina Caragea},
title={Cross-domain Sentiment Classification using an Adapted Naïve Bayes Approach and Features Derived from Syntax Trees},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013)},
year={2013},
pages={169-176},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004546501690176},
isbn={978-989-8565-75-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013)
TI - Cross-domain Sentiment Classification using an Adapted Naïve Bayes Approach and Features Derived from Syntax Trees
SN - 978-989-8565-75-4
AU - Cheeti S.
AU - Stanescu A.
AU - Caragea D.
PY - 2013
SP - 169
EP - 176
DO - 10.5220/0004546501690176