itoring, several examples can be found in letterature
in different application domains but we are not aware
of any work applying it to the monitoring of MapRe-
duce jobs in a cloud environment. EC has been used
in various fields to verify the compliance of a sys-
tem to user-defined behavioral properties. For exam-
ple, (Spanoudakis and Mahbub, 2006), (Farrel et al.,
2005) exploit ad-hoc event processing algorithms to
manipulate events and fluents, written in JAVA. Dif-
ferently from MOBUCON they do not have an under-
lying formal basis, and they cannot take advantage of
the expressiveness and computational power of logic
programming.
Several authors – (Giannakopoulou and Havelund,
2001), (Bauer et al., 2011) – have investigated the use
of temporal logics – Linear Temporal Logic (LTL)
in particular – as a declarative language for specify-
ing properties to be verified at runtime. Neverthe-
less, these approaches lack the support of quantitative
time constraints, non-atomic activities with identifier-
based correlation, and data-aware conditions. These
characteristics – supported by MOBUCON – are in-
stead very important in our application domain.
5 CONCLUSIONS
This work present a framework architecture that en-
capsulates an application level platform for data-
processing. The system lends the Map Reduce in-
frastructure the ability to autonomously check the
execution, detecting bottlenecks and constraint vio-
lations through Business Process Management tech-
niques with a best effort approach.
Focusing on activities and constraints, the use of
Declare language has shown significant advantages
in the monitoring system implementation and cus-
tomization.
Although this work represents just a first step to-
wards an auto-scaling engine for Map Reduce, its
declarative approach to the monitoring issue shows
promising results, both regarding the reactivity to crit-
ical conditions and the simplification in monitoring
constraint definition.
For the future, we plan to employ the defined
framework architecture to test various diagnosis and
recovery policies and verify the efficacy of the over-
all auto-scaling engine in a wider scenario (i.e., with
a higher number of Map Reduce workers involved).
Finally, particular attention will be given to the
hybrid cloud scenario, where the HyIaaS component
is employed to transparently perform VM provision-
ing either on an on-premise internal or an off-premise
public cloud. In case of a hybrid deploy, several ad-
ditional constraints will need to be taken into account
(e.g., the limited inter-cloud bandwidth), thus further
complicating the implemented monitoring and recov-
ery policies. Nevertheless, we believe that a declara-
tive approach to the problem can contribute to signif-
icantly simplify the implementation of the solution.
REFERENCES
Amazon Cloud Watch (2015). Amazon cloud monitor
system. https://aws.amazon.com/it/cloudwatch/. Web
Page, last visited in Dec. 2015.
Apache Hadoop (2015). Apache software foundation.
https://hadoop.apache.org/. Web Page, last visited in
Dec. 2015.
Apache Spark (2015). Apache software foundation.
http://spark.apache.org. Web Page, last visited in
Dec. 2015.
Armbrust, M., Fox, O., and R., G. (2009). Above the
clouds: A berkeley view of cloud computing. Techni-
cal report, Electrical Engineering and Computer Sci-
ences University of California at Berkeley.
Bauer, A., Leucker, M., and Schallhart, C. (2011). Runtime
verification for ltl and tltl. ACM Trans. Softw. Eng.
Methodol., 20(4):14:1–14:64.
Bragaglia, S., Chesani, F., Mello, P., Montali, M., and Tor-
roni, P. (2012). Reactive event calculus for monitoring
global computing applications. In Logic Programs,
Norms and Action. Springer.
Ceilometer, O. (2015). the openstack monitoring module.
https://wiki.openstack.org/wiki/ceilometer.
Chen, K., Powers, J., Guo, S., and Tian, F. (2014a). Cresp:
Towards optimal resource provisioning for mapreduce
computing in public clouds. Parallel and Distributed
Systems, IEEE Transactions on, 25(6):1403–1412.
Chen, M., Mao, S., and Liu, Y. (2014b). Big data: A
survey. Mobile Networks and Applications, Volume
19(2):171–209.
Collins, E. (2014). Intersection of the cloud and big data.
Cloud Computing, IEEE, 1(1):84–85.
Dean, J. and Ghemawat, S. (2008). Mapreduce: Simpli-
fied data processing on large clusters. Commun. ACM,
51(1):107–113.
Farrel, A., Sergot, M., Sall
`
e, M., and Bartolini, C. (2005).
Using the event calculus for tracking the normative
state of contracts. International Journal of Coopera-
tive Information Systems, 14(02n03):99–129.
Giannakopoulou, D. and Havelund, K. (2001). Automata-
based verification of temporal properties on running
programs. In Automated Software Engineering, 2001.
(ASE 2001). Proceedings. 16th Annual International
Conference on, pages 412–416.
Kailasam, S., Dhawalia, P., Balaji, S., Iyer, G., and Dha-
ranipragada, J. (2014). Extending mapreduce across
clouds with bstream. Cloud Computing, IEEE Trans-
actions on, 2(3):362–376.
CLOSER 2016 - 6th International Conference on Cloud Computing and Services Science
104