Adaptive Resource Management for Balancing Availability and Performance in Cloud Computing

Ravi Jhawar, Vincenzo Piuri

Abstract

Security, availability and performance are critical to meet service level agreements in most Cloud computing services. In this paper, we build on the virtual machine technology that allows software components to be cheaply moved, replicated, and allocated on the hardware infrastructure to devise a solution that ensures users availability and performance requirements in Cloud environments. To deal with failures and vulnerabilities also due to cyber-attacks, we formulate the availability and performance attributes in the users perspective and show that the two attributes may often be competing for a given application. We then present a heuristicsbased approach that restores application’s requirements in the failure and recovery events. Our algorithm uses Markov chains and queuing networks to estimate the availability and performance of different deployment contexts, and generates a set of actions to re-deploy a given application. By simulation, we show that our proposed approach improves the availability and lowers the degradation of system’s response time compared to traditional static schemes.

References

  1. Buyya, R., Garg, S. K., and Calheiros, R. N. (2011). Slaoriented resource provisioning for cloud computing: Challenges, architecture, and solutions. In Proc. of the 2011 International Conference on Cloud and Service Computing, pages 1-10, Washington, DC, USA.
  2. Cully, B., Lefebvre, G., Meyer, D., Feeley, M., Hutchinson, N., and Warfield, A. (2008). Remus: high availability via asynchronous virtual machine replication. In Proc. of the 5th USENIX Symposium on Networked Systems Design and Implementation, pages 161-174, San Francisco, California.
  3. De Capitani di Vimercati, S., Foresti, S., and Samarati, P. (2012). Managing and accessing data in the cloud: Privacy risks and approaches. In Proc. of the 7th International Conference on Risks and Security of Internet and Systems, Cork, Ireland.
  4. Franks, G., Al-Omari, T., Woodside, M., Das, O., and Derisavi, S. (2009). Enhanced modeling and solution of layered queueing networks. IEEE Transactions on Software Engineering, 35(2):148-161.
  5. Gilbert, S. and Lynch, N. (2002). Brewer's conjecture and the feasibility of Consistent, Available, Partitiontolerant web services. SIGACT News, 33(2):51-59.
  6. Gill, P., Jain, N., and Nagappan, N. (2011). Understanding network failures in data centers: measurement, analysis, and implications. ACM Computer Communication Review, 41(4):350-361.
  7. Hermenier, F., Lawall, J., Menaud, J.-M., and Muller, G. (2011). Dynamic Consolidation of Highly Available Web Applications. Technical Report RR-7545, INRIA.
  8. Jensen, P. A. (2011). Operations Research Models and Methods - Markov Analysis Tools. Available at www.me.utexas.edu/jensen/ormm/excel/markov.html.
  9. Jhawar, R. and Piuri, V. (2012). Fault tolerance management in iaas clouds. In Proc. of 2012 IEEE First AESS European Conference on Satellite Telecommunications, pages 1-6, Rome, Italy.
  10. Jhawar, R., Piuri, V., and Samarati, P. (2012a). Supporting security requirements for resource management in cloud computing. In Proc. of the 15th IEEE International Conference on Computational Science and Engineering, Paphos, Cyprus.
  11. Jhawar, R., Piuri, V., and Santambrogio, M. (2012b). Fault tolerance management in cloud computing: A systemlevel perspective. IEEE Systems Journal, PP(99).
  12. Jung, G., Joshi, K., Hiltunen, M., Schlichting, R., and Pu, C. (2010). Performance and availability aware regeneration for cloud based multitier applications. In Proc. of 2010 IEEE/IFIP International Conference on Dependable Systems and Networks, pages 497-506, Chicago, IL, USA.
  13. Jung, G., Joshi, K. R., Hiltunen, M. A., Schlichting, R. D., and Pu, C. (2008). Generating adaptation policies for multi-tier applications in consolidated server environments. In Proc. of the 2008 International Conference on Autonomic Computing, pages 23-32, Washington, DC, USA.
  14. Kim, S., Machida, F., and Trivedi, K. (2009). Availability modeling and analysis of virtualized system. In Proc. of 15th IEEE Pacific Rim International Symposium on Dependable Computing, pages 365-371, Shanghai, China.
  15. Kopetz, H., Damm, A., Koza, C., Mulazzani, M., Schwabl, W., Senft, C., and Zainlinger, R. (1989). Distributed Fault-Tolerant Real-Time Systems: The Mars Approach. IEEE Micro, 9(1):25-40.
  16. Machida, F., Kawato, M., and Maeno, Y. (2010). Redundant virtual machine placement for fault-tolerant consolidated server clusters. In Proc. of Network Operations and Management Symposium, pages 32-39, Osaka, Japan.
  17. Pu, C., Noe, J., and Proudfoot, A. (1988). Regeneration of replicated objects: a technique and its eden implementation. IEEE Transactions on Software Engineering, 14(7):936-945.
  18. Qian, H., Medhi, D., and Trivedi, T. (2011). A hierarchical model to evaluate quality of experience of online services hosted by cloud computing. In Proc. of IFIP/IEEE International Symposium on Integrated Network Management, pages 105-112, Dublin, Ireland.
  19. Sahai, A., Machiraju, V., Sayal, M., Moorsel, A. P. A. v., and Casati, F. (2002). Automated sla monitoring for web services. In Proc. of the 13th IFIP/IEEE International Workshop on Distributed Systems: Operations and Management: Management Technologies for ECommerce and E-Business Applications, pages 28- 41, London, UK.
  20. Samarati, P. and De Capitani di Vimercati, S. (2010). Data protection in outsourcing scenarios: issues and directions. In Proc. of the 5th ACM Symposium on Information, Computer and Communications Security, pages 1-14, Beijing, China.
  21. Seiden, S. S. (2002). On the online bin packing problem. ACM Journal, 49(5).
  22. Shin, K., Krishna, C. M., and hang Lee, Y. (1989). Optimal dynamic control of resources in a distributed system. IEEE Transactions on Software Engineering, 15(10):1188-1198.
  23. Smith, W. E., Trivedi, K. S., Tomek, L. A., and Ackaret, J. (2008). Availability analysis of blade server systems. IBM Systems Journal, 47(4):621-640.
  24. Tang, C., Steinder, M., Spreitzer, M., and Pacifici, G. (2007). A scalable application placement controller for enterprise data centers. In Proc. of 16th International conference on World Wide Web, pages 331-340, Alberta, Canada.
  25. Urgaonkar, B., Pacifici, G., Shenoy, P., Spreitzer, M., and Tantawi, A. (2005). An analytical model for multi-tier internet services and its applications. SIGMETRICS Performance Evaluation Review, 33(1):291-302.
  26. VMware (2007). White paper: Vmware high availability concepts, implementation and best practices.
Download


Paper Citation


in Harvard Style

Jhawar R. and Piuri V. (2013). Adaptive Resource Management for Balancing Availability and Performance in Cloud Computing . In Proceedings of the 10th International Conference on Security and Cryptography - Volume 1: SECRYPT, (ICETE 2013) ISBN 978-989-8565-73-0, pages 254-264. DOI: 10.5220/0004535902540264


in Bibtex Style

@conference{secrypt13,
author={Ravi Jhawar and Vincenzo Piuri},
title={Adaptive Resource Management for Balancing Availability and Performance in Cloud Computing},
booktitle={Proceedings of the 10th International Conference on Security and Cryptography - Volume 1: SECRYPT, (ICETE 2013)},
year={2013},
pages={254-264},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004535902540264},
isbn={978-989-8565-73-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 10th International Conference on Security and Cryptography - Volume 1: SECRYPT, (ICETE 2013)
TI - Adaptive Resource Management for Balancing Availability and Performance in Cloud Computing
SN - 978-989-8565-73-0
AU - Jhawar R.
AU - Piuri V.
PY - 2013
SP - 254
EP - 264
DO - 10.5220/0004535902540264