ANALYSIS OF WEBLOG LINK STRUCTURE – A COMMUNITY PERSPECTIVE

Ying Zhou, Joseph Davis

Abstract

In this paper, we report a two-level study of weblog link structures. At the micro level, we carried out an in-depth investigation of individual weblogs. Our goal was to obtain a preliminary understanding of the different types of links that might indicate underlying communities of bloggers. Complete and detailed link data were collected from eight weblogs and subjected to a variety of analyses. The results show that both incoming and outgoing links follow a Zipf-like distribution in terms of the sources of those links. This suggests clustering patterns (communities) within the whole blogspace. We also examine the temporal aspects of weblogs. The average life span of a weblog entry is fairly long in most of our sample cases. In addition, analysis of individual comment authors shows that, on average, active comment authors maintain a rather long relationship with a given weblog. This provides evidence that historical data may be useful in understanding weblog communities. On a larger scale, we developed a program to collect complete link data from a large number of interconnected weblogs and performed cluster analysis on the data. Communities with common topics were successfully extracted using these link data.

Paper Citation


in Harvard Style

Zhou Y. and Davis J. (2006). ANALYSIS OF WEBLOG LINK STRUCTURE – A COMMUNITY PERSPECTIVE. In Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-972-8865-47-4, pages 13-20. DOI: 10.5220/0001248200130020


in Bibtex Style

@conference{webist06,
author={Ying Zhou and Joseph Davis},
title={ANALYSIS OF WEBLOG LINK STRUCTURE – A COMMUNITY PERSPECTIVE},
booktitle={Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 2: WEBIST},
year={2006},
pages={13-20},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001248200130020},
isbn={978-972-8865-47-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 2: WEBIST
TI - ANALYSIS OF WEBLOG LINK STRUCTURE – A COMMUNITY PERSPECTIVE
SN - 978-972-8865-47-4
AU - Zhou Y.
AU - Davis J.
PY - 2006
SP - 13
EP - 20
DO - 10.5220/0001248200130020