Identifying Tweets that Contain a ’Heartwarming Story’
Manabu Okumura
1
, Yohei Yamaguchi
1
, Masatomo Suzuki
2
and Hiroko Otsuka
2
1
Tokyo Institute of Technology, 4259 Nagatsuta, Midori, Yokohama 226-8503, Japan
2
Future University Hakodate, 116-2 Kamedanakano-cho, Hakodate-shi, Hokkaido 041-8655, Japan
Keywords:
Social Media, Twitter, Emotion Analysis, Readers’ Emotion.
Abstract:
We present a rather new task of detecting and collecting tweets that contain heartwarming stories from a huge
amount of tweets on Twitter in this paper. We also present a method for identifying heartwarming tweets. Our
prediction method is based on a supervised learning algorithm in SVM along with features from the tweets.
We found by comparing the feature sets that adding sentiment features mostly improves the performance.
However, simply adding the features for detecting a story in a tweet (past tense and tweet length) cannot
contribute to improving the performance, while adding all the features to the baseline feature set mostly yields
the best performance from among the feature sets.
1 INTRODUCTION
Since users of social media such as blogs and Twit-
ter have been increasing with the development of the
Internet, we now have a huge amount and variety of
information on the Web. It is now very common to
effectively use such a vast amount of information dis-
seminated from many people worldwide to find out
their opinions and feelings. Therefore, the research
topics, sentiment analysis, and opinion mining, have
recently received more attention(Pang and Lee, 2008;
Liu, 2012).
Although a lot of work have recently been con-
ducted on sentiment analysis and opinion mining,
most of the work specifically targeted the writer’s
opinions and feelings. However, only a given few
came from the reader’s perspective. We believe sen-
timent analysis and opinion mining from the reader’s
perspective is also useful in many applications. When
we can devise a method for identifying what docu-
ments make readers happy, we will then be able to
more effectively mine a collection of ’heartwarming
stories.’ When we can identify what sentences or ex-
pressions make readers feel unpleasant, we will be
able to develop a system of supporting writing that
prevents writers from using rude expressions.
We were able to find the following few work
on sentiment analysis from the reader’s perspective.
Yang et al.(Yang et al., 2009) tried to construct
not only a writer-emotion corpus but also a reader-
emotion corpus. Furthermore, they tried to statisti-
cally analyze the corpus. Lin et al.(Lin et al., 2008)
built a reader-emotion classifier that classifies a doc-
ument into one of eight reader emotion classes. They
used Support Vector Machines (SVM) and the fol-
lowing types of features: character bigrams, words,
affix similarities, and emotional words. Hasegawa et
al.(Hasegawa et al., 2013) tried to predict the emotion
of the addressee and to generate a response that elicits
a specific emotion in the addressee’s mind. They tar-
geted Japanese Twitter posts as a source of dialogue
data and automatically built training data for learning
the predictors and generators.
Offensive content detection is not a new field,
but there are only a small number of existing work.
Razavi et al.(Razavi et al., 2010) treated the offen-
sive language detection problem as a machine learn-
ing task and adopted three-level classifications with
bag-of-words features that are based on an abusive ex-
pression dictionary.
In this paper, we first present a rather new task of
detecting and collecting documents that contain heart-
warming stories from a huge variety of contents on the
Web. More specifically, since we target tweets on the
social networking site Twitter(https://twitter.com/),
our task is to detect tweets that contain heartwarm-
ing stories from the huge amount of tweets posted
on Twitter. We then present a method for identifying
heartwarming tweets. We believe the task for detect-
ing documents (tweets) containing heartwarming sto-
ries is new in that they should not only make readers
happy (have positive sentiment from the reader’s per-
spective) but contain the writer’s actual experiences.
323
Okumura M., Yamaguchi Y., Suzuki M. and Otsuka H..
Identifying Tweets that Contain a ’Heartwarming Story’.
DOI: 10.5220/0005129503230326
In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (KDIR-2014), pages 323-326
ISBN: 978-989-758-048-2
Copyright
c
2014 SCITEPRESS (Science and Technology Publications, Lda.)