Figure 1: Use-case: An online collaborative meeting room
with participants from across the globe.
ruptions, no stuttering, no connection loss, etc.). Note
that users can join or leave these meetings at any point
in time, leading to large fluctuations in terms of num-
ber of tenants currently using the system.
The algorithms proposed in this paper deal with
the situation where resource scaling must be per-
formed for each individual service component of the
overall application workflow. The runtime behavior
of each component is captured by a generic moni-
toring mechanism, and is used to keep track of how
this component is behaving based on its current ten-
ant load and assigned resources. Thus, we intend to
automatically intervene if the load on a particular ser-
vice is becoming too high for it to keep its SLA. Like-
wise, resource assignment should also be automati-
cally downscaled to save on resources and/or budget.
The rest of this paper is structured as follows. Re-
lated work is presented in Section 2, while Section 3
present the problem statement. Following this, Sec-
tion 4 introduces the SLA monitoring-based resource
provisioning algorithms. Section 5 discusses the
evaluation setup after which Section 6 discusses the
CloudSim evaluation results of the proposed heuristic
algorithms. Finally, Section 7 concludes.
2 RELATED WORK
A lot of work has been performed with regards to
Cloud resource provisioning strategies for IaaS (In-
frastructure as service), PaaS (Platform as a service)
and SaaS (Software as a service) providers. More-
over, research on multi-tenancy in cloud applications
(Guo et al., 2007) with SLA-driven simulations (An-
tonescu and Braun, 2014) is not uncommon today.
(Espadasa et al., 2013) have focused on under and
over-provisioning of resources in SaaS and its influ-
ence on cost-effectiveness. In their work, a multi-
tenant based resource allocation model has been de-
signed. Research done by (Bellenger et al., 2011)
discussed semi-automatic and automatic scaling. The
authors provide an overview of the pros and cons of
semi-automatic (users are forced to balance request-
ing more resources to avoid under-provisioning ver-
sus releasing resources to avoid over-provisioning)
versus automatic scaling (users follow workloads).
User satisfaction is the key concern of cloud ser-
vices, which, in certain situations, can be adversely
affected by SLA violations. (Morshedlou and Mey-
bodi, 2014) state that SLA violations depend on some
user characteristics, and eventually define two types
of user characteristics to reduce the impact of SLA vi-
olation. Another interesting work (L. Wu and Buyya,
2011), deals with algorithms for automated resource
provisioning of SaaS services based on their SLA.
This work was further extended to develop a method
for admission control (L. Wu and Buyya, 2012) of
user requests, thus facilitating prevention of addi-
tional user requests from being accepted which in turn
would lead to violating the SLA of the service. In
continuation to this, (L. Wu et al., 2014) also focused
on Customer Satisfaction Level (CSL) which depends
on the SLA violations. To improve CSL and re-
duce SLA violations, various algorithms are designed
based on resource reservation and request reschedul-
ing. The work presented in this paper differs from
all the works discussed above, owing to our focus on
workflows of service endpoints, each with indepen-
dent runtime behavior, but contributing to the overall
application workflow’s SLA adherence as well.
Various other studies (Taheri et al., 2014) (Glitho,
2011) show that auto-scaling for multimedia services
is an actively studied topic of research. A recent work
by (Soltanian et al., 2015) lay their focus on a very
specific sub-problem of scaling media services. This
work differs from the work presented in this paper as
our focus is to make generic and robust algorithms for
the entire service workflow spectrum.
3 PROBLEM DESCRIPTION
This section presents a concise model of multi-tenant,
multi-component SaaS workflows, with an introduc-
tion of its basic concepts followed by a formal de-
scription of the ARP-M (Automatic Resource Provi-
sioning under Multi-tenancy) problem. Table 1 sum-
marizes the notations used in the rest of the paper.
The basis of the issue at hand is the observation
that cloud-based SaaS applications currently are, a lot
of times, built as workflows of multiple existing ser-
vices (albeit with the necessary custom glue code to
tie all of them together). In multi-tenant usage sce-
CLOSER 2016 - 6th International Conference on Cloud Computing and Services Science
222