A STATISTICAL APPROACH TO THE IMPACT OF FEATURED ARTICLES IN WIKIPEDIA
Antonio J. Reinoso, Felipe Ortega, Jesus M. Gonzalez-Barahona, Israel Herraiz
2010
Abstract
This paper presents an empirical study of the impact of featured articles on the attention that Wikipedia’s articles attract, and of how this behavior differs across editions of Wikipedia. The study is based on the analysis of the log lines registered by the Wikimedia Foundation Squid servers after they have sent the appropriate content in response to a request submitted by any Wikipedia user. The analysis covers the six most visited editions of Wikipedia and involves more than 4,100 million log lines corresponding to the traffic of September, October and November 2009. The methodology has mainly consisted of parsing the requests sent by users and subsequently filtering them according to the study directives. Relevant information fields have finally been stored in a database for persistence and further characterization. The main results of this paper are twofold: it shows how to use the traffic log to extract information about the use of Wikipedia, which is a novel research approach without precedent in the research community, and it analyzes whether the featured articles mechanism succeeds in attracting more attention.
Paper Citation
in Harvard Style
Reinoso A. J., Ortega F., Gonzalez-Barahona J. M. and Herraiz I. (2010). A STATISTICAL APPROACH TO THE IMPACT OF FEATURED ARTICLES IN WIKIPEDIA. In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010) ISBN 978-989-8425-29-4, pages 420-423. DOI: 10.5220/0003101404200423
in Bibtex Style
@conference{keod10,
author={Antonio J. Reinoso and Felipe Ortega and Jesus M. Gonzalez-Barahona and Israel Herraiz},
title={A STATISTICAL APPROACH TO THE IMPACT OF FEATURED ARTICLES IN WIKIPEDIA},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010)},
year={2010},
pages={420-423},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003101404200423},
isbn={978-989-8425-29-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010)
TI - A STATISTICAL APPROACH TO THE IMPACT OF FEATURED ARTICLES IN WIKIPEDIA
SN - 978-989-8425-29-4
AU - Reinoso, Antonio J.
AU - Ortega, Felipe
AU - Gonzalez-Barahona, Jesus M.
AU - Herraiz, Israel
PY - 2010
SP - 420
EP - 423
DO - 10.5220/0003101404200423