8 CONCLUSION AND FUTURE
WORK
The combination of monitoring performance metrics
and log files allows the unification of necessary
system parameters into one predictive model. In this
paper, we propose 1) a method mix to unify numerical
and textual data as well as 2) a method to obtain
(automated) guidance on what sort of model to
construct for the prediction of alarm states.
On RQ1: The described mixed application of logistic
regression and decision trees accomplishes the
unified use of continuous monitoring with discrete
event data in the same model.
On RQ2: Limited to our experimental setup, our
results show that the occurrence of log file events
does not have any impact on the system state turning
critical so far. Hence, prediction based on monitoring
performance metrics seems to be the most promising
way to predict incoming critical system states.
On RQ3: We clearly see that the different
configurations influence the relevance of the
variables as well as the accuracy.
Future work will complement our analysis with a
posteriori analysis of the respective prediction
models. Thus, we will statistically compare the
significance and importance that variables play in the
respective models.
ACKNOWLEDGEMENTS
The authors would like to thank Stefanie Alex,
Corinna Cichy and Roxane Stelzel for having made
invaluable suggestions to the content of the paper.
REFERENCES
Cater-Steel, A., Tan, W.-G. and Toleman, M., 2008.
“Summary of ITSM standards and frameworks survey
responses” in Proc. of the itSMF Australia 2007 Conf..
Toowoomba, Australia.
Ester, M., Kriegel, H.-P., Sander, J. and Xiaowei, X., 1996.
“A density-based algorithm for discovering clusters in
large spatial databases with noise” in Proc. of the
Second Int. Conf. on Knowledge Discovery and Data
Mining (KDD’96). Portland, OR, USA.
Hochstein, A., Tamm, G. and Brenner, W., 2005. “Service-
Oriented IT Management: Benefit, Cost and Success
Factors” in Proc. of the 13th European Conf. on
Information Systems. Regensburg, Germany.
Hosmer, D.W. and Lemeshow, S., 2000. “Applied Logistic
Regression”, Wiley, New York et al., 2
nd
edition.
Imai, K., 2017. "Quantitative Social Science: An
Introduction". Woodstock, Oxfordshire, GB: Princeton
University Press.
Kiernan, J. and Terzi, E., 2009. “Constructing
comprehensive summaries of large event sequences”
ACM Transactions on Knowledge Discovery from
Data (TKDD), vol. 3, no. 4, Art. No. 21.
Kubiak, P., Rass, S., 2018. “An overview of data-driven
techniques for IT-service-management”. IEEE Access,
vol. 6, pp. 63664–63688.
Li, T., Liang, F., Ma, S. and Peng, W., 2005. “An integrated
framework on mining logs files for computing system
management” in Proc. of the eleventh ACM SIGKDD
int. Conf. on Knowledge discovery in data mining.
Chicago, IL, USA.
Luo, C., Lou, J.G., Lin, Q., Fu, Q., Ding R., Zhang, D.,
Wang, Z., 2014. “Correlating events with time series for
incident diagnosis” in Proc. of the 20th ACM SIGKDD
int. Conf. on Knowledge discovery and data mining.
New York, NY, USA.
Ma, S. and Hellerstein, J.L., 2001. “Mining partially
periodic event patterns with unknown periods” in Proc.
of the IEEE Int. Conf. on Data Engineering. Heidelberg,
Germany.
Peng, W., Perng, C., Li, T. and Wang, H., 2007. “Event
summarization for system management” in Proc. of the
13th ACM SIGKDD int. Conf. on Knowledge
discovery and data mining. San Jose, CA, USA.
Potgieter, B.C., Botha, J.H. and Lew, C., 2005. “Evidence
that use of the ITIL framework is effective” in Proc. of
the 8th Annual Conf. of the national advisory
committee on computing qualifications. Tauranga, New
Zealand.
Rokach, L. and Maimon, O., 2010. “Data Mining and
Knowledge Discovery Handbook”, Springer, New
York, 2
nd
edition.
Salfner, F., Lenk, M. and Malek, M., 2010. “A survey of
online failure prediction methods”. ACM Computing
Surveys (CSUR), vol. 42, no. 3, Art. No. 10.
Tang, L., Li, T. and Shwartz, L., 2012. “Discovering lag
intervals for temporal dependencies” in Proc. of the
18th ACM SIGKDD int. Conf. on Knowledge
discovery and data mining. Beijing, China.
Wang, P., Wang, H., Liu, M. and Wang, W., 2010. “An
algorithmic approach to event summarization” in Proc.
of the 2010 ACM SIGMOD Int. Conf. on Management
of data. Indianapolis, IN, USA.
Xu, W., Liu, X. and Gong, Y.,2003. “Document clustering
based on non-negative matrix factorization” in Proc. of
the 26th annual int. ACM SIGIR Conf. on Research and
development in information retrieval. Toronto, Canada.
Zöller, M.-A., Baum, M. and Huber, M. F., 2017.
“Framework for mining event correlations and time
lags in large event sequences” in Proc. of the IEEE 15th
Int. Conf. on Industrial Informatics (INDIN). Emden,
Germany.