User Influence and Follower Metrics in a Large Twitter Dataset

Jari Veijalainen, Alexander Semenov, Miika Reinikainen

Abstract

Social media has become an important means of conveying information. The microblogging service Twitter, with about 284 million users and currently over 500 million tweets per day, is one example. The site stores every tweet once it is sent so that it can be retrieved later. The site has a rather simple site ontology, i.e. the set of concepts it implements: each user is represented by a profile, users can follow other users, and a user can retweet a received tweet to all of his or her followers. In this paper we investigate the diffusion of messages and the influence of users on other users, mainly on the basis of retweet cascade size and the attenuation patterns inside a cascade. We rely on a large data set collected after the Boston Marathon bombing on April 15, 2013. It contains about 8 million tweets and retweets sent by over 4 million different users. It was collected through the Twitter API, which selects all messages containing given keywords, including hashtags. During 2014 we also collected all 7-8 billion followers of these users. The follower relation is also used in some of the influence estimations. The largest cascades originate from the users with the most followers, and a cascade dies out after two or three frequency peaks.


Paper Citation


in Harvard Style

Veijalainen J., Semenov A. and Reinikainen M. (2015). User Influence and Follower Metrics in a Large Twitter Dataset. In Proceedings of the 11th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-106-9, pages 487-497. DOI: 10.5220/0005410004870497


in Bibtex Style

@conference{webist15,
author={Jari Veijalainen and Alexander Semenov and Miika Reinikainen},
title={User Influence and Follower Metrics in a Large Twitter Dataset},
booktitle={Proceedings of the 11th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2015},
pages={487-497},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005410004870497},
isbn={978-989-758-106-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - User Influence and Follower Metrics in a Large Twitter Dataset
SN - 978-989-758-106-9
AU - Veijalainen J.
AU - Semenov A.
AU - Reinikainen M.
PY - 2015
SP - 487
EP - 497
DO - 10.5220/0005410004870497