
outliers effectively, preventing them from nega-
tively impacting model performance.
• Continuous monitoring and improvement: Estab-
lish continuous monitoring mechanisms to track
the performance of the AI model over time. Im-
plement feedback loops that allow the model to
adapt and improve based on corrected data, main-
taining accuracy in dynamic operational environ-
ments.
In summary, the experiments provide empirical evi-
dence supporting the critical role of accurate datasets
in machinery functional safety setting. By adhering
to these recommendations, practitioners can build AI
systems that are more resilient, accurate, and reliable,
ultimately contributing to enhanced safety outcomes
in high-risk applications.
5 CONCLUSION
The empirical experiments conducted to investigate
the impact of dataset accuracy on AI model perfor-
mance in the realm of machinery functional safety
have yielded valuable insights into the critical na-
ture of high-quality data in ensuring the reliability
and effectiveness of AI systems. It is evident that in-
accuracies in the training dataset lead to diminished
predictive capabilities, potentially compromising the
safety and reliability of machinery in industrial set-
tings. The findings in this work underscore two key
points, namely the importance of dataset quality and
recommendations for data quality assurance.
Future work in this domain could explore ad-
vanced techniques for enhancing dataset quality, such
as the integration of anomaly detection algorithms
and robust preprocessing methods. Investigating
the adaptability of the AI model to dynamic opera-
tional environments and evolving machinery condi-
tions would be valuable. Another avenue for research
involves experimentation with a combination of real-
world datasets and synthetic datasets. This approach
would allow for a more comprehensive evaluation of
model performance and generalizability by incorpo-
rating the complexities and nuances present in real-
world data, while still maintaining the benefits of con-
trolled experimentation offered by synthetic datasets.
REFERENCES
Budach, Lukas, e. (2022). The effects of data quality on ma-
chine learning performance. arXiv:2207.14529. arXiv
preprint, https://arxiv.org/abs/2207.14529.
IEC62061 (2022). Safety of machinery—Functional
safety of safety-related electrical, electronic and pro-
grammable electronic control systems. IEC.
ISO13849 (2023). Safety of machinery—Safety-related
parts of control systems—Part 1: General principles
for design. ISO.
Katsuki, T. and Osogami, T. (2023). Regression with sen-
sor data containing incomplete observations. In Pro-
ceedings of the 40th International Conference on Ma-
chine Learning, volume 202 of Proceedings of Ma-
chine Learning Research, Honolulu, Hawaii, USA.
PMLR. Copyright 2023 by the author(s).
Lwakatare, L. E., R
˚
ange, E., Crnkovic, I., and Bosch,
J. (2021). On the experiences of adopting auto-
mated data validation in an industrial machine learn-
ing project. CoRR, abs/2103.04095.
Northcutt, C. G., Athalye, A., and Mueller, J. (2021). Perva-
sive label errors in test sets destabilize machine learn-
ing benchmarks.
Pedregosa, F. e. (2011). Scikit-learn: Machine learning
in Python. Journal of Machine Learning Research,
12:2825–2830.
Perez-Cerrolaza, e. (2023). Artificial intelligence for safety-
critical systems in industrial and transportation do-
mains: A survey. ACM Comput. Surv. Just Accepted.
Polyzotis, N., Zinkevich, M., Roy, S., Breck, E., and
Whang, S. (2019). Data validation for machine learn-
ing. In Talwalkar, A., Smith, V., and Zaharia, M., ed-
itors, Proceedings of Machine Learning and Systems,
volume 1, pages 334–347.
Priestley, M., O’donnell, F., and Simperl, E. (2023). A
Survey of Data Quality Requirements That Matter in
ML Development Pipelines. J. Data and Information
Quality, 15(2).
Schelter, S., Rukat, T., and Biessmann, F. (2020). Learn-
ing to validate the predictions of black box classi-
fiers on unseen data. In 2020 ACM SIGMOD Interna-
tional Conference on Management of Data, SIGMOD
’20, pages 1289–1299, New York, NY. Association for
Computing Machinery.
Schelter, S., Rukat, T., and Biessmann, F. (2021). JENGA -
A framework to study the impact of data errors on the
predictions of machine learning models. In EDBT,
pages 529–534. OpenProceedings.org.
Sessions, V. and Valtorta, M. (2009). Towards a method for
data accuracy assessment utilizing a bayesian network
learning algorithm. J. Data and Information Quality,
1(3).
Shankar, S., Fawaz, L., Gyllstrom, K., and Parameswaran,
A. (2023). Automatic and precise data validation for
machine learning. page 2198–2207, New York, NY,
USA. Association for Computing Machinery.
Teh, H., Kempa-Liehr, A., and Wang, K. (2020). Sensor
data quality: A systematic review. Journal of Big
Data, 7:11.
Exploring the Impact of Dataset Accuracy on Machinery Functional Safety: Insights from an AI-Based Predictive Maintenance System
497