QUALITY ASSESSMENT OF WIKIPEDIA EXTERNAL LINKS
Paraskevi Tzekou, Sofia Stamou, Nikos Kirtsis, Nikos Zotos
2011
Abstract
Wikipedia is a unique source of information that has been collectively supplied by thousands of people. Since its nascence in 2001, Wikipedia is continuously evolving and like most websites it is interconnected via hyperlinks to other web information sources. Wikipedia articles contain two types of links: internal and external. Internal links point to other Wikipedia articles, while external links point outside Wikipedia and normally they are not used in the body of the article. Although there exist specific guidelines about both the style and the purpose of the article external links, no approach has been recorded that tries to capture in a systematic manner the quality of Wikipedia external links. In this paper, we study the quality of Wikipedia external links by assessing the degree to which these conform to their intended purpose; that is to formulate a comprehensive list of accurate information sources about the article contents. For our study, we estimate the decay of Wikipedia external links and we investigate their distribution in the Wikipedia articles. Our measurements give perceptible evidence for the value of external links and may imply their corresponding articles' quality in a holistic Wikipedia evaluation.
References
- Adler, N. T., de Alfaro, L. 2007. A content-driven reputation system for the Wikipedia. In Proceedings of the 16th International World Wide Web Conference, pp. 261-270.
- Bar-Yossef, Z., Broder, A. Z., Kumar, R., Tomkins, A. 2004. Sic transit gloria telae: towards an understanding of the web's decay. In Proceedings of the 13th International World Wide Web Conference, pp. 328-337.
- Blumenstock, J. E. 2008(a). Automatically Assessing the Quality of Wikipedia Articles. UCBiSchool Report 021.
- Blumenstock, J. E. 2008(b). Size matters: Word count as a measure of quality on Wikipedia. In Proceedings of the 17th International World Wide Web Conference, pp. 1095-1096.
- Broder, A. Z., Glassman, S. C., Manasse, M. S., Zweig, G. 1997. Syntactic clustering of the web. In Proceedings of the 6th International World Wide Web Conference, pp. 391-404.
- Buriol, J., Castillo, C., Donato, D., Leonardi, S., Millozzi, S. 2006. Temporal evolution of the wikigraph. In Proceedings of the IEEE Web Intelligence Conference, pp. 45-51.
- Cross, T. 2006. Puppy smoothies: improving the reliability of open, collaborative wikis. First Monday 11(9).
- Denning, P., Horning, J., Parnas, D., Weinstein, L. 2005. Wikipedia risks. Communications of the ACM, vol.48, no.12.
- Emigh, W., Herring, S. 2005. Collaborative authoring on the Web. In Proceedings of the 38th Hawaii Intl. Conference on System Sciences.
- Fetterly, D., Manasee, M., Najork, M., Wiener, J.L. 2003. A large-scale study of the evolution of web pages. In Proceedings of the 12th International World Wide Web Conference, pp. 669-678.
- Giles, J. 2005. Internet encyclopedias go head to head. Nature, 438, pp. 900-901.
- Jatowt, A., Kawai, Y., Tanaka, K. 2007. Detecting age of page content. In Proceedings of the 9th Annual ACM International Workshop on Web Information and Data Management, pp. 137-144.
- Kamps, J., Koolen M. 2009. Is Wikipedia link structure different? In Proceedings of the 2nd International Conference on Web Search and Data Mining, pp. 232-241.
- Kirtsis N., Stamou S., Tzekou P., Zotos N. 2010. Information Uniqueness in Wikipedia. In Proceedings of the 6th International Conference on Web Information Systems and Technologies (WebIST), Valencia, Spain.
- Koolen, M., Kamps, J. 2009. What's in a link? Form document importance to topical relevance. In Proceedings of the 2nd International Conference on the Theory of Information Retrieval, pp. 313-321.
- Lee, T., Kim, J., Kim, J. W., Kim, R. S., Park, K. 2009. Detecting soft errors by redirection classification. In Proceedings of the 18th International Web Conference, pp. 1119-1120.
- Mrishima, A., Nakamizo, A., Iida, T., Sugimoto, S., Kitagawa, H. 2008. PageCasher: A tool for the automatic correction of broken web links. In Proceedings of the 24th International IEEE Conference on Data Engineering, pp. 1486-1488.
- Nielsen, F. A. 2007. Scientific citations in Wikipedia. Computing Research Repository.
- Popitsch, N. P., Haslhofer, B. 2010. DSNotify: Handing Broken Links in the Web of Data. In Proceedings of the World Wide Web Conference.
- Riehle, D. 2005. How and why Wikipedia works: an interview. In Proceedings of the Intl. Symposium on Wikis, pp. 3-8.
- Stvilia, B., Twidale, M. B., Smith, L. C., Gasser, L. 2005(a). Assessing information quality of a community-based encyclopedia. In Proceedings of the International Conference on Information Quality, pp. 442- 454.
- Stvilia, B., Twidale, M. B., Gasser, L., Smith, L. C. 2005.(b) Information quality discussions in Wikipedia. In Proceedings of the International Conference on Knowledge Management.
- Wilkinson, D. M., Huberman, B. A. 2007. Assessing the value of cooperation in Wikipedia. First Monday 12(4).
Paper Citation
in Harvard Style
Tzekou P., Stamou S., Kirtsis N. and Zotos N. (2011). QUALITY ASSESSMENT OF WIKIPEDIA EXTERNAL LINKS . In Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8425-51-5, pages 248-254. DOI: 10.5220/0003299502480254
in Bibtex Style
@conference{webist11,
author={Paraskevi Tzekou and Sofia Stamou and Nikos Kirtsis and Nikos Zotos},
title={QUALITY ASSESSMENT OF WIKIPEDIA EXTERNAL LINKS},
booktitle={Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2011},
pages={248-254},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003299502480254},
isbn={978-989-8425-51-5},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - QUALITY ASSESSMENT OF WIKIPEDIA EXTERNAL LINKS
SN - 978-989-8425-51-5
AU - Tzekou P.
AU - Stamou S.
AU - Kirtsis N.
AU - Zotos N.
PY - 2011
SP - 248
EP - 254
DO - 10.5220/0003299502480254