DISPATCHING REQUESTS IN PARTIALLY REPLICATED WEB CLUSTERS - An Adaptation of the LARD Algorithm

Jose Daniel Garcia, Laura Prada, Jesus Carretero, Felix Garcia, Javier Fernandez, Luis Miguel Sanchez

2007

Abstract

Traditional alternatives for Web content allocation have been full replication and full distribution. An hybrid alternative is partial replication where each content element is replicated to a subset of server nodes. Partial replication gives advantages in terms of balancing reliability and storage capacity. However, partial replication has architectural implications. In this paper we present a Web cluster architecture which may be used in single switched Web clusters and multiple switched Web clusters. We present an algorithm for Web content allocation which determines the number of replicas for each content based on its relative importance and that performs the allocation keeping in mind resource constraints in clusters with heterogeneous storage capacity. We also provide an adaptation of the LARD algorithm for request dispatching that copes with the fact that contents are partially replicated. Our evaluations show that performance of partial replication solutions is comparable to performance of traditional fully replicated solutions.

References

  1. Andresen, D., Yang, T., and Ibarra, O. H. (1997). Toward a scalable distributed www server on workstation clusters. Journal on Parallel and Distributed Computing, 42(1):91-100.
  2. Apostolopoulos, G., Aubespin, D., Peris, V., Pradham, P., and Saha, D. (2000). Design, implementation and performance of a content-based switch. In Proceedings of the Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM 2000), volume 3, pages 1117-1126.
  3. Aron, M., Druschel, P., and Zwaenepoel, W. (1999). Efficient support for P-HTTP in cluster-based web servers. In Proceedings of the 1999 USENIX Annual Technical Conference, pages 185-198.
  4. Aron, M., Sanders, D., Druschel, P., and Zwaenepoel, W. (2000). Scalable content-aware request distribution in cluster-based network servers. In Proceedings of the 2000 USENIX Annual Technical Conference, pages 232-336.
  5. Aversa, L. and Bestavros, A. (2000). Load balancing a cluster of web servers: using distributed packet rewriting. In Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (IPCCC 2000), pages 24-29.
  6. Baker, S. M. and Moon, B. (1999). Distributed cooperative web servers. Computer Networks, 31(11-16):1215- 1229.
  7. Banga, G., Druschel, P., and Mogul, J. C. (1998). Better operating system features for faster network servers. Performance Evaluation Review, 26(3):23-30.
  8. Barford, P. and Crovella, M. (1998). Generating representative web workloads for network and server performance evaluation. SIGMETRICS Performance Evaluation Review, 26(1):151-160.
  9. Breslau, L., Cao, P., Fan, L., Phillips, G., and Shenker, S. (1999). Web caching and Zipf-like distributions: Evidence and implications. In Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOMM 7899), volume 1, pages 126-134.
  10. Brisco, T. (1995). DNS Support for Load Balancing. RFC 1794. Internet Engineering Task Force.
  11. Cardellini, V. (2003). Request redirection algorithms for distributed web systems. IEEE Transactions on Parallel and Distributed Systems, 14(4):355-368.
  12. Cardellini, V., Casalicchio, E., Colajanni, M., and Yu, P. S. (2002). The state of the art in locally distributed Webserver systems. ACM Computing Surveys, 34(2):263- 311.
  13. Casalicchio, E. and Colajanni, M. (2001). A client-aware dispatching algorithm for web clusters providing multiple services. In Proceedings of the tenth international conference on World Wide Web, pages 535-544.
  14. Colajanni, M., Yu, P. S., and Dias, D. M. (1998). Analysis of task assignment policies in scalable distributed web-server systems. IEEE Transactions on Parallel and Distributed Systems, 9(6):585-600.
  15. Devlin, B., Gray, J., Laing, B., and Spix, G. (1999). Scalability terminology: Farms, clones, partitions, and packs: RACS and RAPS. Technical Report MS-TR99-85, Microsoft Research.
  16. Garcia, J. D., Carretero, J., Garcia, F., Calderon, A., Fernandez, J., and Singh, D. E. (2006a). On the reliability of web clusters with partial replication of contents. In First International Conference on Availability, Reliability and Security, 2006. ARES 2006., pages 617- 624.
  17. Garcia, J. D., Carretero, J., Garcia, F., Fernandez, J., Calderon, A., and Singh., D. E. (2006b). A quantitative justification to partial replication of web contents. In International Conference on Computational Science and its Applications, volume 3983 of Lecture Notes in Computer Science, pages 1136-1145.
  18. Garcia, J. D., Carretero, J., Prez, J. M., Garcia, F., and Fernandez, J. (2003). A distributed web switch for partially replicated contents. In Proceedings of the 7th World Multiconference on Systemics, Cybernetics and Informatics (SCI 2003), volume VIII, pages 1-6, Orlando, FL, USA.
  19. Kwan, T. T., McGrath, R. E., and Reed, D. A. (1995). NCSA's world wide web server: design and performance. IEEE Computer, 28(11):68-74.
  20. Li, Q. and Moon, B. (2001). Distributed cooperative Apache Web server. In Proceedings of the tenth international conference on World Wide Web, pages 555- 564.
  21. Luo, M.-Y., Tseng, C.-W., and Yang, C.-S. (2002). URL formalization: An efficient technique to speedup content-aware switching. IEEE Communications Letters, 6(12):553-555.
  22. Maltz, D. A. and Bhagwat, P. (1999). TCP splice for application layer proxy performance. Journal of High Speed Networks, 8(3):225-240.
  23. Pai, V. S., Aron, M., Banga, G., Svendsen, M., Zwaenepoel, P. D. W., and Nahum, E. (1998). Locality-aware request distribution in cluster-based network servers. ACM SIGPLAN Notices, 33(11):205-216.
  24. Pai, V. S., Druschel, P., and Zwaenepoel, W. (1999). Flash: An efficient and portable Web server. In Proceedings of the USENIX 1999 Annual Technical Conference, pages 199-212.
  25. Pai, V. S., Druschel, P., and Zwaenepoel, W. (2000). IOLite: A unified I/O buffering and caching system. ACM Transactions on Computer Systems, 18(1):37- 66.
  26. Schroeder, T., Goddard, S., and Ramamurthy, B. (2000). Scalable web server clustering technologies. IEEE Network, 14(3):38-45.
  27. Shukla, A., Li, L., Ward, A. S. P., and Brecht, T. (2004). Evaluating the performance of user-space and kernelspace web servers. In Proceedings of the 2004 Conference of the Centre for Advanced Studies on Collaborative Research, pages 189-201.
  28. Tse, S. S. H. (2005). Approximate algorithms for document placement in distributed web servers. Transactions on Parallel and Distributed Systems, 16(6):489-496.
  29. Vaidya, S. and Christensen, K. J. (2001). A single system image server cluster using duplicated MAC and IP addresses. In Proceedings of the 26th Annual IEEE Conference on Local Computer Networks (LCN 2001), pages 206-214.
  30. Zhuo, L., Wang, C.-L., and Lau, F. C. M. (2003). Document replication and distribution in extensible geographically distributed web servers. Journal of Parallel and Distributed Computing, 63(10):927-944.
Download


Paper Citation


in Harvard Style

Daniel Garcia J., Prada L., Carretero J., Garcia F., Fernandez J. and Miguel Sanchez L. (2007). DISPATCHING REQUESTS IN PARTIALLY REPLICATED WEB CLUSTERS - An Adaptation of the LARD Algorithm . In Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-77-1, pages 141-149. DOI: 10.5220/0001265201410149


in Bibtex Style

@conference{webist07,
author={Jose Daniel Garcia and Laura Prada and Jesus Carretero and Felix Garcia and Javier Fernandez and Luis Miguel Sanchez},
title={DISPATCHING REQUESTS IN PARTIALLY REPLICATED WEB CLUSTERS - An Adaptation of the LARD Algorithm},
booktitle={Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2007},
pages={141-149},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001265201410149},
isbn={978-972-8865-77-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - DISPATCHING REQUESTS IN PARTIALLY REPLICATED WEB CLUSTERS - An Adaptation of the LARD Algorithm
SN - 978-972-8865-77-1
AU - Daniel Garcia J.
AU - Prada L.
AU - Carretero J.
AU - Garcia F.
AU - Fernandez J.
AU - Miguel Sanchez L.
PY - 2007
SP - 141
EP - 149
DO - 10.5220/0001265201410149