ON REVENUE DRIVEN SERVER MANAGEMENT IN CLOUD

Laiping Zhao, Kouichi Sakurai

2012

Abstract

As failures become more frequent with the increasing scale of data centers, Service Level Agreement (SLA) violations often occur at cloud providers, disrupting the normal execution of job requests and incurring high penalty costs. To this end, we examine the problem of managing a server farm so as to reduce the penalty caused by server failures under an Infrastructure-as-a-Service model. We incorporate the malfunction and recovery states into the server management process, and improve the cost efficiency of server management by leveraging failure predictors. We also design a utility model describing the expected net revenue obtained from providing service. The basic idea is that a job can be rejected or migrated to another server if a negative utility is anticipated. Formal and experimental analysis demonstrates the improvement in expected net revenue.
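The reject-or-migrate idea can be illustrated with a minimal sketch. The abstract does not give the paper's actual utility model, so everything below is an assumption made for illustration: the `Job` fields, the linear form of `expected_utility`, the flat `migration_cost`, and the rule of trying the current server before a single alternative. The failure probabilities stand in for the output of a failure predictor.

```python
from dataclasses import dataclass


@dataclass
class Job:
    # All fields are hypothetical; the abstract does not define the model's terms.
    price: float     # revenue earned if the job completes
    penalty: float   # SLA penalty paid if the server fails during the job
    run_cost: float  # cost of running the job on a server


def expected_utility(job: Job, failure_prob: float) -> float:
    """Expected net revenue of running `job` on a server that fails
    with probability `failure_prob` (assumed linear form)."""
    expected_revenue = (1.0 - failure_prob) * job.price
    expected_penalty = failure_prob * job.penalty
    return expected_revenue - expected_penalty - job.run_cost


def decide(job: Job, p_fail_here: float, p_fail_other: float,
           migration_cost: float) -> str:
    """Return 'run', 'migrate', or 'reject' for one job.

    The job stays if its expected utility on the current server is
    non-negative; otherwise it moves to the other server if that is
    non-negative after paying the migration cost; otherwise it is rejected.
    """
    if expected_utility(job, p_fail_here) >= 0:
        return "run"
    if expected_utility(job, p_fail_other) - migration_cost >= 0:
        return "migrate"
    return "reject"


if __name__ == "__main__":
    job = Job(price=10.0, penalty=50.0, run_cost=2.0)
    print(decide(job, p_fail_here=0.05, p_fail_other=0.05, migration_cost=1.0))
    print(decide(job, p_fail_here=0.30, p_fail_other=0.05, migration_cost=1.0))
    print(decide(job, p_fail_here=0.30, p_fail_other=0.30, migration_cost=1.0))
```

In this sketch a high SLA penalty relative to the job's price makes the decision sensitive to even modest failure probabilities, which is why a failure predictor can change whether a job is accepted at all.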



Paper Citation


in Harvard Style

Zhao L. and Sakurai K. (2012). ON REVENUE DRIVEN SERVER MANAGEMENT IN CLOUD. In Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER, ISBN 978-989-8565-05-1, pages 295-305. DOI: 10.5220/0003901002950305




in Bibtex Style

@conference{closer12,
author={Laiping Zhao and Kouichi Sakurai},
title={ON REVENUE DRIVEN SERVER MANAGEMENT IN CLOUD},
booktitle={Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER},
year={2012},
pages={295-305},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003901002950305},
isbn={978-989-8565-05-1},
}




in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER
TI - ON REVENUE DRIVEN SERVER MANAGEMENT IN CLOUD
SN - 978-989-8565-05-1
AU - Zhao L.
AU - Sakurai K.
PY - 2012
SP - 295
EP - 305
DO - 10.5220/0003901002950305

