8 SUMMARY AND FUTURE
WORK
This paper explores the potential of time series
analysis of sensor data from heating systems in
operation for detecting and predicting errors, a critical
area complicated by the significant distance between
users and manufacturers. A procedure based on
Fayyad's model was implemented and applied to an
air-to-water heat pump system to identify and forecast
specific control faults in the compressor.
A RFC model was developed to recognize system
status and assess the impact of parameter weights on
fault detection. This model successfully determined
the status of the systems, achieving a detection accu-
racy of 97.6% and a precision of 97.4%. A key chal-
lenge was the limited dataset, which complicated the
expert validation and underscored the necessity for a
larger data foundation. The analysis underscored the
significance of certain parameters, particularly tem-
perature readings, in fault detection. Experts valida-
ted these findings, emphasizing the need for ongoing
adjustment of weight factors.
The limited availability of fault data and the lack
of system information restricts the effectiveness of
the RFR model. This limitation stems from the sys-
tem's lifecycle; after sale, third-party service and
maintenance companies oversee installation and
upkeep, while manufacturers conduct field monito-
ring for a brief period. As a result, failure data collec-
tion is primarily limited to this monitoring phase, thus
affecting the model's ability to predict accurately.
Future research directions, inspired by this work,
will explore the potential of Random Forest models
to analyze more extensive datasets with increased
error instances and assess other machine learning
algorithms for error detection and prediction in heat
pump systems. An optimized dataset, including
detailed parameter and fault information, is crucial
for developing models that accurately reflect system
reliability and behavior. Additionally, future studies
should explore the reliability of specific system
components and their impact on overall system
reliability. Future investigations should incorporate
not only existing data but also laboratory results,
simulations, and physical models. The integration of
physics-based models will be explored to establish
causal relationships between system parameters and
fault occurrences, thereby enhancing the model’s
ability to predict and diagnose faults with higher
accuracy. This approach is expected to improve the
overall effectiveness of the system, contributing to a
deeper understanding of system dynamics, and
advancing control strategies for heating systems.
REFERENCES
Arora, S.-J., & Rabe, M. (2023). Predictive maintenance:
Assessment of potentials for residential heating
systems. International Journal of Computer Integrated
Manufacturing, 1--25. https://doi.org/10.1080/
0951192X.2023.2204471.
Başağaoğlu, H., Chakraborty, D., Lago, C. D., Gutierrez,
L., Şahinli, M. A., Giacomoni, M., Furl, C., Mirchi, A.,
Moriasi, D., & Şengör, S. S. (2022). A review on
interpretable and explainable artificial intelligence in
hydroclimatic applications. Water, 14(8), 1230.
https://doi.org/10.3390/w14081230.
Biau, G., & Scornet, E. (2016). A random forest guided
tour. TEST, 25(1), 197--227. https://doi.org/
10.1007/s11749-016-0481-7.
Bode, G., Thul, S., Baranski, M., & Müller, D. (2020).
Real-world application of machine-learning-based fault
detection trained with experimental data. Energy, 198,
323. https://doi.org/10.1016/j.energy.2020.117323.
Breiman, L. (2001). Random forests in machine learning.
Springer, New York, NY, 5--32. https://doi.org
/10.1023/A:1010933404324.
Chen, J. (2013). Model-based fault diagnosis in dynamic
systems using identification techniques. Springer,
London, United Kingdom. ISBN: 978-1-4471-3829-7.
https://doi.org/10.1007/978-1-4471-3829-7.
Chiang, L. H. (2001). Fault detection and diagnosis in
industrial systems. Springer, London, United Kingdom.
https://doi.org/10.1088/0957-0233/12/10/706.
Corrales, D., Corrales, J., & Ledezma, A. (2018). How to
address the data quality issues in regression models: A
guided process for data cleaning. Symmetry, 10(4), 99.
https://doi.org/10.3390/sym10040099.
Cutler, A., Cutler, D. R., & Stevens, J. R. (2012). Random
forests in Ensemble Machine Learning. Springer, New
York, NY, 157--175. https://doi.org/10.1007/978-1-
4419-9326-7_5.
Dey, M., Rana, S. P., & Dudley, S. (2020). Smart building
creation in large scale HVAC environments through
automated fault detection and diagnosis. Future
Generation Computer Systems, 108, 950--966.
https://doi.org/10.1016/j.future.2018.02.019.
Ebrahimifakhar, A., Kabirikopaei, A., & Yuill, D. (2020).
Data-driven fault detection and diagnosis for packaged
rooftop units using statistical machine learning
classification methods. Energy and Buildings, 225, 318.
https://doi.org/10.1016/j.enbuild.2020.110318.
Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). The
KDD process for extracting useful knowledge from
volumes of data. Communications of the ACM, 39,
27--34. https://doi.org/10.1145/240455.240464.
García, S., Luengo, J., & Herrera, F. (2015). Data
preprocessing in data mining. Springer International
Publishing, Cham, Switzerland, 19--38.
https://doi.org/10.1007/978-3-319-10247-4.
Gharsellaoui, S., Mansouri, M., Trabelsi, M., Harkat, M.-
F., Refaat, S. S., & Messaoud, H. (2020). Interval-
valued features based machine learning technique for
fault detection and diagnosis of uncertain HVAC