TRENDSPOTTER DETECTION SYSTEM FOR TWITTER
Wataru Shirakihara
Graduate School of Information Science and Electrical Engineering, Kyushu University, Fukuoka City, Japan
Tetsuya Oishi
†
, Ryuzo Hasegawa
‡
, Hiroshi Hujita
‡
, Miyuki Koshimura
‡
†
Research Institute for Information Technology, Kyushu University, Fukuoka City, Japan
‡
Faculty of Information Science and Electrical Engineering, Kyushu University, Fukuoka City, Japan
Keywords:
Twitter, Data stream algorithms, Information recommendation.
Abstract:
It is too difficult for us to find out trends with search engines. Twitter, a popular microblogging tool, has seen
a lot of growth since it launched in October, 2006. Information about the trends is posted by many twitterers.
If we find out trendspotters from twitterers, and follow them, we can get it more easily. Our trendspotter
detection system uses the burst detection algorithm, and we verified its effectiveness for Twitter’s posts. We
succeeded in detecting the 24 trendspotters by 5277 users.
1 INTRODUCTION
Over the past few decades Internet has developed
rapidly. Many people use Internet when they want
to get some pieces of information, but information we
find on the Internet is out of date unless it is renewed.
Therefore it is too difficult for us to find out trends
with search engines.
Twitter, a popular microblogging tool, has seen a
lot of growth since it launched in October, 2006(Java
et al., 2007). According to Netratings, in Japan, Twit-
ter had about 4,730,000 users as of January, 2010
1
.
However, the figures include only the users who use
Twitter from Twitter’s website
2
, that is to say, the
users from mobile phone, or Twitter client are not in-
cluded. To sum up, actually, there are much more
users(Kanda, 2009).
Twitter has a lot of differentiating good factors
from other SNS or blogging, and is used by many
users as an area of exchange of information.
Users (Twitterers) can broadcast an unlimited
amount of messages (tweets) to a group of other Twit-
terers who have opted to subscribe to these broadcasts
(followers). Twitterers also recieve broadcasts from
other users. Individual tweets are made within a limit
of 140 characters(Starbird and Palen, 2010).
1
http://www.netratings.co.jp/
New news/News02242010.htm
2
http://twitter.com
Twitter has four main characteristics. The first is
that twitterers can ’retweet’ someone else’s post by
copying the post and the person’s username. The
retweeted post is shared with all of their followers.
The second is that the hashtag convention(#[hash-
tag term]) is used inline to call out user-chosen key-
words. Hashtags tag or markup a tweet to help oth-
ers understand the content context, as well as support
keyword term-searching.
The third is that the posts may be directed to a par-
ticular person by putting an @username at the bigin-
ning of the post. Even though the post is directed to
a person, others can still view it(Ehrlich and Shami,
2010).
The fourth is that twitterers can use Twitter’s API.
Many client tools were invented with API. API allows
other web services to integrate with Twitter. Buzzt-
ter
3
, one of the Twitter’s web services, shows buzz
terms in Twitter.
In Twitter, because of its characteristics, informa-
tion about the trends is posted by many twitterers. If
we find out trendspotters from twitterers, and follow
them, we can get it more easily. The purpose of this
paper is to develop a system that detects the trendspot-
ters.
For detecting trendspotters, our system uses burst
detection algorithm(Fujiki et al., 2004). In blog-
ging or bulletin board, a particular phrase appears fre-
3
http://buzztter.com
625
Shirakihara W., Oishi T., Hasegawa R., Hujita H. and Koshimura M..
TRENDSPOTTER DETECTION SYSTEM FOR TWITTER.
DOI: 10.5220/0003183806250628
In Proceedings of the 3rd International Conference on Agents and Artificial Intelligence (ICAART-2011), pages 625-628
ISBN: 978-989-8425-40-9
Copyright
c
2011 SCITEPRESS (Science and Technology Publications, Lda.)