Identifying Tweets that Contain a ’Heartwarming Story’

Manabu Okumura, Yohei Yamaguchi, Masatomo Suzuki, Hiroko Otsuka

Abstract

We present a rather new task of detecting and collecting tweets that contain heartwarming stories from a huge amount of tweets on Twitter in this paper. We also present a method for identifying heartwarming tweets. Our prediction method is based on a supervised learning algorithm in SVM along with features from the tweets. We found by comparing the feature sets that adding sentiment features mostly improves the performance. However, simply adding the features for detecting a story in a tweet (past tense and tweet length) cannot contribute to improving the performance, while adding all the features to the baseline feature set mostly yields the best performance from among the feature sets.

References

  1. Fan, R., Chen, P., and Lin, C. (2005). Working set selection using second order information for training support vector machines. The Journal of Machine Learning Research.
  2. Hasegawa, T., Kaji, N., and Yoshinaga, N. (2013). Predicting and eliciting addressee's emotion in online dialogue. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL2013), pages 964-972.
  3. Lin, K. H.-Y., Yang, C., and Chen, H.-H. (2008). Emotion classification of online news articles from the reader's perspective. In Proc. of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence (WI'08).
  4. Liu, B. (2012). Sentiment Analysis and Opinion Mining. Morgan & Claypool Publishers.
  5. Pang, B. and Lee, L. (2008). Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1-2):1-135.
  6. Razavi, A. H., Inkpen, D., Uritsky, S., and Matwin, S. (2010). Offensive language detection using multilevel classification. In Proceedings of the 23rd Canadian conference on Advances in Artificial Intelligence (AI'10), pages 16-27.
  7. Suzuki, Y., Takamura, H., and Okumura, M. (2007). Semisupervised learning to classify evaluative expressions from labeled and unlabeled texts. IEICE Transactions on Information and Systems, pages 1516-1522.
  8. Takamura, H., Inui, T., and Okumura, M. (2005). Extracting semantic orientations of words using spin model. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL2005), pages 133-140.
  9. Yang, C., Lin, K. H.-Y., and Chen, H.-H. (2009). Writer meets reader: Emotion analysis of social media from both the writer's and reader's perspectives. In Proc. of the 2009 IEEE/WIC/ACM International Conference on Web Intelligence (WI'09), pages 287-290.
  10. Zhang, Y., Zhang, L., and Wang, Y. (2010). Cluster-based majority under-sampling approaches for class imbalance learning. In Proceedings of the 2nd IEEE International Conference on Information and Financial Engineering (ICIFE), pages 400-404.
Download


Paper Citation


in Harvard Style

Okumura M., Yamaguchi Y., Suzuki M. and Otsuka H. (2014). Identifying Tweets that Contain a ’Heartwarming Story’ . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014) ISBN 978-989-758-048-2, pages 323-326. DOI: 10.5220/0005129503230326


in Bibtex Style

@conference{kdir14,
author={Manabu Okumura and Yohei Yamaguchi and Masatomo Suzuki and Hiroko Otsuka},
title={Identifying Tweets that Contain a ’Heartwarming Story’},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014)},
year={2014},
pages={323-326},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005129503230326},
isbn={978-989-758-048-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014)
TI - Identifying Tweets that Contain a ’Heartwarming Story’
SN - 978-989-758-048-2
AU - Okumura M.
AU - Yamaguchi Y.
AU - Suzuki M.
AU - Otsuka H.
PY - 2014
SP - 323
EP - 326
DO - 10.5220/0005129503230326