execute BoDT, whose data is globally distributed all
over the world. It is challenging to decide how to as-
sign tasks to Cloud VMs based on a user’s budget con-
straint while minimising the execution time.
The above problem was mathematically modelled
in this paper. We also proposed a heuristic approach
which assigned BoDT to Cloud VM(s) in order to
maximise performance and to satisfy the allowed cost
provided by a user.
Furthermore, we implemented a dynamic reas-
signment feature to utilise the idle time of a VM
that completes execution ahead of others by assigning
tasks from other VMs onto it. This feature reduces the
overall execution time when a number of VMs take
longer to finish their execution due to service failure
or network instability.
Our approach was evaluated and able to provide
execution plans which satisfied given budget con-
straints. Compared to the centralised and round robin
approaches, our approach reduced the execution time
on average by 27%. Our approach was also able to
satisfy the low budget while the others did not.
In the future, we plan to further improve dynamic
resource provisioning and tasks scheduling so that
they can be performed during execution in order to
handle expected events, e.g. network instability or
machine failure. Moreover, the different types of
Cloud instances, which have varying performance and
cost will be taken into account.
ACKNOWLEDGEMENTS
This research is supported by the EPSRC grant
‘Working Together: Constraint Programming and
Cloud Computing’ (EP/K015745/1), a Royal Society
Industry Fellowship, an Impact Acceleration Account
Grant (IAA) and an Amazon Web Services (AWS)
Education Research Grant.
REFERENCES
Chun, B., Culler, D., Roscoe, T., Bavier, A., Peterson, L.,
Wawrzoniak, M., and Bowman, M. (2003). Planet-
lab: An overlay testbed for broad-coverage services.
SIGCOMM Comput. Commun. Rev., 33(3):3–12.
Kaya, K. and Aykanat, C. (2006). Iterative-improvement-
based heuristics for adaptive scheduling of tasks shar-
ing files on heterogeneous master-slave environments.
Parallel and Distributed Systems, IEEE Transactions
on, 17(8):883–896.
Luckeneder, M. and Barker, A. (2013). Location, location,
location: Data-intensive distributed computing in the
cloud. In In Proceedings of IEEE CloudCom 2013,
pages 647–653.
Mao, M., Li, J., and Humphrey, M. (2010). Cloud auto-
scaling with deadline and budget constraints. In Grid
Computing (GRID), 2010 11th IEEE/ACM Interna-
tional Conference on, pages 41–48.
Oprescu, A. and Kielmann, T. (2010). Bag-of-tasks
scheduling under budget constraints. In Cloud Com-
puting Technology and Science (CloudCom), 2010
IEEE Second International Conference on, pages
351–359.
Ranganathan, K. and Foster, I. (2002). Decoupling compu-
tation and data scheduling in distributed data-intensive
applications. In Proceedings of the 11th IEEE Interna-
tional Symposium on High Performance Distributed
Computing, HPDC ’02, pages 352–, Washington, DC,
USA. IEEE Computer Society.
Ryden, M., Oh, K., Chandra, A., and Weissman, J. B.
(2014). Nebula: Distributed edge cloud for data in-
tensive computing.
Thai, L., Barker, A., Varghese, B., Akgun, O., and Miguel,
I. (2014a). Optimal deployment of geographically dis-
tributed workflow engines on the cloud. In 6th IEEE
International Conference on Cloud Computing Tech-
nology and Science (CloudCom 2014).
Thai, L., Varghese, B., and Barker, A. (2014b). Execut-
ing bag of distributed tasks on the cloud: Investi-
gating the trade-offs between performance and cost.
In Cloud Computing Technology and Science (Cloud-
Com), 2014 IEEE 6th International Conference on,
pages 400–407.
Venugopal, S. and Buyya, R. (2005). A deadline and bud-
get constrained scheduling algorithm for escience ap-
plications on data grids. In in Proc. of 6th Inter-
national Conference on Algorithms and Architectures
for Parallel Processing (ICA3PP-2005, pages 60–72.
Springer-Verlag.
CLOSER2015-5thInternationalConferenceonCloudComputingandServicesScience
380