DICTIONARY EXTENSION FOR IMPROVING AUTOMATED SENTIMENT DETECTION
Johannes Liegl, Stefan Gindl, Arno Scharl, Alexander Hubmann-Haidvogel
2010
Abstract
This paper investigates approaches to improve the accuracy of automated sentiment detection in textual knowledge repositories. Many high-throughput sentiment detection algorithms rely on sentiment dictionaries containing terms classified as either positive or negative. To obtain accurate and comprehensive sentiment dictionaries, we merge existing resources into a single dictionary and extend this dictionary by means of semisupervised learning algorithms such as Pointwise Mutual Information - Information Retrieval (PMI-IR) and Latent Semantic Analysis (LSA). The resulting extended dictionary is then evaluated on various datasets from different domains, which were annotated on both the document and sentence level.
References
- Deerwester, S. C., Dumais, S. T., Landauer, T. K., Furnas, G. W., and Harshman, R. A. (1990). Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391-407.
- Gindl, S. and Liegl, J. (2008). Evaluation of different sentiment detection methods for polarity classification on web-based reviews. In Proceedings of the ECAI Workshop on Computational Aspects of Affectual and Emotional Interaction.
- Landauer, T. K. and Dumais, S. T. (1997). A solution to plato's problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104:211-240.
- Mullen, T. and Collier, N. (2004). Sentiment analysis using support vector machines with diverse information sources. In Lin, D. and Wu, D., editors, Proceedings of EMNLP 2004, pages 412-418, Barcelona, Spain. Association for Computational Linguistics.
- Pang, B., Lee, L., and Vaithyanathan, S. (2002). Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP).
- Rafelsberger, W. and Scharl, A. (2009). Games with a purpose for social networking platforms. In Proceedings of the 21st ACM Conference on Hypertext and Hypermedia.
- Read, J. and Carroll, J. (2009). Weakly supervised techniques for domain-independent sentiment classification. In Proceedings of the 1st International CIKM Workshop on Topic-Sentiment Analysis for Mass Opinion Measurement.
- Stone, P. J., Dunphy, D. C., and Smith, M. S. (1966). The General Inquirer : A Computer Approach to Content Analysis. MIT. Press, Cambridge, Mass. [u.a.].
- Turney, P. (2002). Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics.
- Turney, P. D. (2001). Mining the web for synonyms: PMIIR versus LSA on TOEFL. In Proceedings of the 12th European Conference on Machine Learning.
- Wilson, T., Wiebe, J., and Hoffmann, P. (2005). Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of Human Language Technologies Conference/Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), Vancouver, CA.
- Yu, H. and Hatzivassiloglou, V. (2003). Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences. In Proceedings of the 2003 conference on Empirical methods in natural language processing, pages 129-136, Morristown, NJ, USA. Association for Computational Linguistics.
Paper Citation
in Harvard Style
Liegl J., Gindl S., Scharl A. and Hubmann-Haidvogel A. (2010). DICTIONARY EXTENSION FOR IMPROVING AUTOMATED SENTIMENT DETECTION . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010) ISBN 978-989-8425-28-7, pages 404-407. DOI: 10.5220/0003070304040407
in Bibtex Style
@conference{kdir10,
author={Johannes Liegl and Stefan Gindl and Arno Scharl and Alexander Hubmann-Haidvogel},
title={DICTIONARY EXTENSION FOR IMPROVING AUTOMATED SENTIMENT DETECTION},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)},
year={2010},
pages={404-407},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003070304040407},
isbn={978-989-8425-28-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)
TI - DICTIONARY EXTENSION FOR IMPROVING AUTOMATED SENTIMENT DETECTION
SN - 978-989-8425-28-7
AU - Liegl J.
AU - Gindl S.
AU - Scharl A.
AU - Hubmann-Haidvogel A.
PY - 2010
SP - 404
EP - 407
DO - 10.5220/0003070304040407