Agarwal, M., Appleby, K., Gupta, M., and Kar, G. (2004).
Problem determination using dependency graphs and
run-time behavior models. Utility Computing, pages
171–182.
Appleby, K., Goldszmidt, G., and Steinder, M. (2001).
Yemanja-a layered event correlation engine for multi-
domain server farms. In Integrated Network Man-
agement Proceedings, 2001 IEEE/IFIP International
Symposium on, volume 00, pages 329–344. IEEE.
Barham, P., Isaacs, R., and Mortier, R. (2003). Magpie:
Online modelling and performance-aware systems. In
Proceedings of the Ninth Workshop on Hot Topics
in Operating Systems.
Chen, M., Kiciman, E., and Fratkin, E. (2002). Pinpoint:
Problem determination in large, dynamic internet ser-
vices. In Proc. 2002 Intl. Conf. on Dependable Sys-
tems and Networks.
Ejarque, J., Fitó, J. O., Katsaros, G., Luis, J., and Martinez,
P. (2011). OPTIMIS Deliverable Requirements Anal-
ysis (M16). Technical report, NTUA, ATOS, SCAI,
SAP, BT, CITY, LUH, 451G, FLEXIANT, ULEEDS.
ESPER (2013). Home page of esper.
http://esper.codehaus.org/index.html. [Online; accessed
26-March-2013].
Gruschke, B. and Others (1998). Integrated event manage-
ment: Event correlation using dependency graphs. In
Proceedings of the 9th IFIP/IEEE International Work-
shop on Distributed Systems: Operations & Manage-
ment (DSOM 98), pages 130–141.
Hanemann, A. (2007). Automated IT Service Fault Diag-
nosis Based on Event Correlation Techniques. PhD
thesis.
Hasselmeyer, P. and D’Heureuse, N. (2010). Towards holis-
tic multi-tenant monitoring for virtual data centers.
2010 IEEE/IFIP Network Operations and Manage-
ment Symposium Workshops, pages 350–356.
Hoke, E., Sun, J., Strunk, J., and Ganger, G. (2006). Inte-
Mon: continuous mining of sensor data in large-scale
self-infrastructures. ACM SIGOPS Operating Systems
Review, 40(3):38–44.
Jeune, G. L., García, E., Peribáñez, J. M., and Muñoz,
H. (2012). 4CaaSt Scientific and Technical Report
D5.1.1. Technical report, Seventh Framework Pro-
gramme.
Kang, H., Chen, H., and Jiang, G. (2010). PeerWatch: a
fault detection and diagnosis tool for virtualized con-
solidation systems. In Proceedings of the 7th inter-
national conference on Autonomic computing, pages
119–128.
Kang, H., Zhu, X., and Wong, J. (2012). DAPA: diagnos-
ing application performance anomalies for virtualized
infrastructures. 2nd USENIX workshop on Hot-ICE.
Katsaros, G., Kübert, R., and Gallizo, G. (2011). Building a
Service-Oriented Monitoring Framework with REST
and Nagios. 2011 IEEE International Conference on
Services Computing, 567:426–431.
Massie, M. (2004). The ganglia distributed monitoring sys-
tem: design, implementation, and experience. Parallel
Computing, 30(7):817–840.
Molenkamp, G. (2002). Diagnosing quality of service faults
in distributed applications. Performance, Computing,
and Communications Conference, 2002. 21st IEEE
International.
Nagios (2013). Home page of nagios. http://www.nagios.org/.
[Online; accessed 26-March-2013].
O’Hara, R. B. and Sillanpää, M. J. (2009). A review of
Bayesian variable selection methods: what, how and
which. Bayesian Analysis, 4(1):85–117.
OpenShift (2013). Home page of openshift.
https://www.openshift.com/. [Online; accessed
26-March-2013].
OpenStack (2013). Home page of openstack.
http://www.openstack.org/. [Online; accessed
26-March-2013].
OpenTSDB (2013). Home page of opentsdb. http://opentsdb.net/.
[Online; accessed 26-March-2013].
OpenView (2013). HP OpenView - Wikipedia, the free
encyclopedia.
http://en.wikipedia.org/w/index.php?title=HP_OpenView&oldid=547020972.
[Online; accessed 26-March-2013].
Rak, M., Venticinque, S., Máhr, T., Echevarria, G., and Es-
nal, G. (2011). Cloud Application Monitoring: The
mOSAIC Approach. 2011 IEEE Third International
Conference on Cloud Computing Technology and Sci-
ence, pages 758–763.
Sharma, B., Jayachandran, P., Verma, A., and Das, C.
(2012). CloudPD: Problem Determination and Diag-
nosis in Shared Dynamic Clouds. cse.psu.edu, pages
1–30.
Stratan, I. L., Newman, H., Voicu, R., Cirstoiu, C., Grigo-
ras, C., Dobre, C., Muraru, A., Costan, A., Dediu, M.,
and C. (2009). MONALISA: An Agent based, Dy-
namic Service System to Monitor, Control and Op-
timize Grid based Applications The Distributed Ser-
vices. Computer Physics Communications, 180:2472–
2498.
Tan, Y., Nguyen, H., and Shen, Z. (2012). PREPARE: Pre-
dictive Performance Anomaly Prevention for Virtual-
ized Cloud Systems. In Distributed Computing Sys-
tems (ICDCS), 2012 IEEE 32nd International Confer-
ence on.
Tivoli (2013). Home page of ibm tivoli. http://www.tivoli.com/.
[Online; accessed 26-March-2013].
Yaqub, E., Wieder, P., Kotsokalis, C., Mazza, V., Pasquale,
L., Rueda, J. L., Gómez, S. G., and Chimeno, A. E.
(2011). A generic platform for conducting SLA negoti-
ations. In Service Level Agreements for Cloud Com-
puting, pages 187–206. Springer.
Yaqub, E., Yahyapour, R., Wieder, P., and Lu, K. (2012).
A protocol development framework for SLA negotia-
tions in cloud and service computing. In Service
Level Agreements for Cloud Computing, pages 1–15.
Springer.
CLOSER 2013 - 3rd International Conference on Cloud Computing and Services Science