and airlines) need management services such as
monitoring, patching, backup, change control, high
availability and disaster recovery to support systems
running complex applications with stringent IT
process control and quality-of-service (QoS)
requirements. Such features are typically offered by
IT service providers in strategic outsourcing (SO)
engagements, a business model for which the
provider takes over several aspects of management
of a customer’s datacenter resources, software
assets, and processes. Servers with such support are
characterized as being managed.
This should be contrasted with unmanaged
servers provisioned using basic Amazon Web
Services (AWS) (Miller, 2010; AWS Corporation,
2017) and IBM’s SoftLayer (SoftLayer, 2017)
offerings, where the cloud provider offers automated
server provisioning. To make a server managed,
these cloud service providers have networked with
other service partners that customers can engage to
fill all the gaps up and down the stack. This enables
the user to add services to the provisioned server,
but the cloud provider assumes no responsibility for
their upkeep or the additional services added.
Therefore, it puts burden on the customer to obtain a
fully managed solution for their enterprise workload
rather than the cloud service providing an end-to-end
fully managed solution for the customers.
AWS provides the IT resources so that the
customers can launch entire SAP enterprise software
stacks on the AWS Cloud. AWS Cloud is SAP
verified and certified. AWS provides highly reliable
services and multiple fault-tolerant Availability
Zones for disaster recovery implementations.
The IBM Cloud Managed Services (CMS)
product (IBM Corporation, 2017) from IBM is an
enterprise cloud which provides managed services
for critical workloads and enterprise-level SLA
mechanisms. CMS supports several software
services on CMS, such CMS4SAP CMS4ORCALE
and AMM4SAP.
HANA is fully certified to run on VMware
platform (King, 2014). vSphere 5.5 has a limitation
in that the largest VM can be created with 1 TB of
disk storage only. Depending on the usage of the
data, both warm and cold data can reside together on
the disk. This enables extension of the total size of
the SAP HANA database above 1 TB. Currently,
several cloud providers that are enabling themselves
to support more options for SAP and SAP HANA
workloads.
In (Dekel, 2003), the authors have described a
system that focuses on performance aware high
availability which is achieved through cloning and
replication of application’s state. Our work focuses
on a resiliency framework to determine and deploy
the optimal resiliency support for a given workload
based on its characteristics.
8 LESSONS LEARNT AND
CONCLUSIONS
During enablement of enterprise workloads in the
IBM’s CMS cloud, several points became apparent.
First insight is that each enterprise customer has a
varied set of resiliency requirements for the
workload that they are running depending on the
nature of their business. Therefore, the cloud service
providers must handle such heterogeneous
requirements with least amount of customization
possible that must be delivered in a tight scheduled
while maintaining the low cost.
Second insight is that there is a variety of cluster
set up configurations that may be possible and the
required set up may vary from workload to
workload. Additionally, the cluster set up may
evolve overtime based on the changing requirements
of the workload. Additionally, the cloud provider
must support the application level replication
technology depending on the applications being
deployed. As the requirements are highly variable
and may evolve overtime as the workload evolves, it
is crucial to systematize and standardize the end to
end process of the resiliency solution planning,
implementation, testing and delivery.
Another insight is that multiple levels of resiliency
at infrastructure, middleware and application levels
are required for increased system reliability.
Implementing multiple levels of resiliency delivers a
more robust system, while enabling operation of
these different levels of resiliency seamlessly.
Enterprise-class customers, such as banks,
financial institutions, hospitals, governments, utility
companies, etc. can suffer high business losses even
from short outages and service interrupts in the IT
infrastructure. Cost of downtime could dissolve
business, or cause irreparable brand damage, loss of
customer data and reputation. A structured and
continuously improving mechanism is required to
deliver the level of resiliency needed by the various
enterprise applications.
We introduced an end-to-end business resiliency
framework and resiliency life cycle. We further
discussed various resiliency patterns implemented
for enterprise applications using a diverse set of
platforms in the IBM CMS cloud offering. To