5 CONCLUSIONS
Our paper evaluated machine learning models across
three algorithms, two encoding methods, and two
clustering techniques for alarm prediction using
data from an Italian company. XGB, particularly
with K-Means++ clustering, showed the highest
performance. Despite hyperparameter tuning,
improvements were marginal. SHAP analysis em-
phasized key features like cluster identification and
alarm time. However, further study is needed as our
best scenarios fell below 0.5 AP. As future work,
we intend to explore techniques for dealing with
highly unbalanced datasets or one-class classification
algorithms.
ACKNOWLEDGEMENTS
K. Abreu, J. Reis, and A. Santos are grateful to
CAPES and FAPEMIG for funding different parts of
this work.
REFERENCES
Arthur, D. and Vassilvitskii, S. (2007). K-means++: The
advantages of careful seeding. In Proceedings of the
Eighteenth Annual ACM-SIAM Symposium on Dis-
crete Algorithms, SODA ’07, page 1027–1035, USA.
Society for Industrial and Applied Mathematics.
Au-Yeung, W.-T. M., Sahani, A. K., Isselbacher, E. M., and
Armoundas, A. A. (2019). Reduction of false alarms
in the intensive care unit using an optimized ma-
chine learning based approach. npj Digital Medicine,
2(1):86.
Chen, T. and Guestrin, C. (2016). XGBoost. In Proceedings
of the 22nd ACM SIGKDD International Conference
on Knowledge Discovery and Data Mining. ACM.
Developers, X. (2023). Xgboost documentation.
https://xgboost.readthedocs.io/en/stable/index.html.
Accessed on: November 30, 2023.
DMLC (2023). Xgboost: Scalable and flexible gradient
boosting. https://github.com/dmlc/xgboost.
Ekanayake, I., Meddage, D., and Rathnayake, U. (2022). A
novel approach to explain the black-box nature of ma-
chine learning in compressive strength predictions of
concrete using shapley additive explanations (shap).
Case Studies in Construction Materials, 16:e01059.
Hand, D. J. and Yu, K. (2001). Idiot’s bayes: Not so stupid
after all? International Statistical Review / Revue In-
ternationale de Statistique, 69(3):385–398.
Jain, A. (2016). Mastering xgboost parameter tuning: A
complete guide with python codes. Accessed on 2023-
11-30.
Lateano, F., Ayoub, O., Musumeci, F., and Tornatore, M.
(2023). Machine-learning-assisted failure prediction
in microwave networks based on equipment alarms.
In 2023 19th International Conference on the Design
of Reliable Communication Networks (DRCN), pages
1–7.
Li, Z. (2022). Extracting spatial effects from machine learn-
ing model using local interpretation method: An ex-
ample of shap and xgboost. Computers, Environment
and Urban Systems, 96:101845.
Lundberg, S. M. and Lee, S.-I. (2017). A unified approach
to interpreting model predictions. In Proceedings of
the 31st International Conference on Neural Informa-
tion Processing Systems, NIPS’17, page 4768–4777,
Red Hook, NY, USA. Curran Associates Inc.
Mangalathu, S., Hwang, S.-H., and Jeon, J.-S. (2020).
Failure mode and effects analysis of rc members
based on machine-learning-based shapley additive ex-
planations (shap) approach. Engineering Structures,
219:110927.
Meng, Y. and Kwok, L.-f. (2012). Adaptive false alarm fil-
ter using machine learning in intrusion detection. In
Wang, Y. and Li, T., editors, Practical Applications
of Intelligent Systems, pages 573–584, Berlin, Heidel-
berg. Springer Berlin Heidelberg.
Murtagh, F. (1991). Multilayer perceptrons for classifica-
tion and regression. Neurocomputing, 2(5):183–197.
Pezze, D. D., Masiero, C., Tosato, D., Beghi, A., and Susto,
G. A. (2022). Formula: A deep learning approach for
rare alarms predictions in industrial equipment. IEEE
Transactions on Automation Science and Engineer-
ing, 19(3):1491–1502.
Quinn, C. (2020). Alarm forecasting in natural gas
pipelines. Master’s thesis, Milwaukee, Wisconsin.
Rutgers University (2009). Rutgers study finds
alarm systems are valuable crime-fighting tool.
https://www.rutgers.edu/news/rutgers-study-finds-
alarm-systems-are-valuable-crime-fighting-tool.
Accessed on May 15, 2023.
Zhang, M. and Wang, D. (2019). Machine learning based
alarm analysis and failure forecast in optical net-
works. In 2019 24th OptoElectronics and Commu-
nications Conference (OECC) and 2019 International
Conference on Photonics in Switching and Computing
(PSC), pages 1–3.
Zhu, J., Wang, C., Li, C., Gao, X., and Zhao, J. (2016).
Dynamic alarm prediction for critical alarms using a
probabilistic model. Chinese Journal of Chemical En-
gineering, 24(7):881–885.
Zhuang, H., Zhao, Y., Yu, X., Li, Y., Wang, Y., and Zhang,
J. (2020). Machine-learning-based alarm prediction
with gans-based self-optimizing data augmentation in
large-scale optical transport networks. In 2020 Inter-
national Conference on Computing, Networking and
Communications (ICNC), pages 294–298.
Zucchi, G., Corr
ˆ
ea, V. H., Santos, A. G., Iori, M., and Yag-
iura, M. (2022). A metaheuristic algorithm for a multi-
period orienteering problem arising in a car patrolling
application. In INOC, pages 99–104.
Explainable Machine Learning for Alarm Prediction
697