Recovery-Oriented Resource Management in Hybrid Cloud Environments

Yasser Aldwyan, Richard O. Sinnott


Cloud-based systems face an increased risk of individual server failures because of their scale. When failures happen, resource utilization and system reliability can be negatively affected. Hybrid cloud models combine local resources in private clouds with resources from public clouds, acquired as and when needed through cloudbursting. There is an urgent need to develop cloudbursting approaches that are cognisant of the reliability and fault tolerance of external cloud environments. Recovery-oriented computing (ROC) is a new approach to building reliable services that emphasizes recovering from failures rather than avoiding them completely, since even the most dependable systems will eventually fail. All fault-tolerant techniques aim to reduce time to recover (TTR). In this paper, we develop a ROC-based fault-tolerant approach for managing resources in hybrid clouds by proposing failure models with associated feedback control that support a local resource-aware resource provisioning algorithm. We present a recovery-oriented virtual infrastructure management system (RVIMS). Results show that RVIMS is more reliable than single cloud environments, even though TTR in the single cloud environments is about 10% lower than in RVIMS.



Paper Citation

in Harvard Style

Aldwyan Y. and Sinnott R. (2017). Recovery-Oriented Resource Management in Hybrid Cloud Environments. In Proceedings of the 7th International Conference on Cloud Computing and Services Science - Volume 1: CLOSER, ISBN 978-989-758-243-1, pages 253-265. DOI: 10.5220/0006359602530265

in Bibtex Style

author={Yasser Aldwyan and Richard O. Sinnott},
title={Recovery-Oriented Resource Management in Hybrid Cloud Environments},
booktitle={Proceedings of the 7th International Conference on Cloud Computing and Services Science - Volume 1: CLOSER},

in EndNote Style

JO - Proceedings of the 7th International Conference on Cloud Computing and Services Science - Volume 1: CLOSER
TI - Recovery-Oriented Resource Management in Hybrid Cloud Environments
SN - 978-989-758-243-1
AU - Aldwyan Y.
AU - Sinnott R.
PY - 2017
SP - 253
EP - 265
DO - 10.5220/0006359602530265