A DECENTRALIZED WIKI ENGINE FOR COLLABORATIVE WIKIPEDIA HOSTING

Guido Urdaneta, Guillaume Pierre, Maarten van Steen

Abstract

This paper presents the design of a decentralized system for hosting large-scale wiki web sites like Wikipedia, using a collaborative approach. Our design focuses on distributing the pages that compose the wiki across a network of nodes provided by individuals and organizations willing to collaborate in hosting the wiki. We present algorithms for placing the pages so that the capacity of the nodes is not exceeded and the load is balanced, and algorithms for routing client requests to the appropriate nodes. We also address fault tolerance and security issues.



Paper Citation


in Harvard Style

Urdaneta G., Pierre G. and van Steen M. (2007). A DECENTRALIZED WIKI ENGINE FOR COLLABORATIVE WIKIPEDIA HOSTING. In Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-77-1, pages 156-163. DOI: 10.5220/0001268901560163


in Bibtex Style

@conference{webist07,
author={Guido Urdaneta and Guillaume Pierre and Maarten van Steen},
title={A DECENTRALIZED WIKI ENGINE FOR COLLABORATIVE WIKIPEDIA HOSTING},
booktitle={Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2007},
pages={156-163},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001268901560163},
isbn={978-972-8865-77-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - A DECENTRALIZED WIKI ENGINE FOR COLLABORATIVE WIKIPEDIA HOSTING
SN - 978-972-8865-77-1
AU - Urdaneta G.
AU - Pierre G.
AU - van Steen M.
PY - 2007
SP - 156
EP - 163
DO - 10.5220/0001268901560163