approach to find patterns in a system state that takes
place priorly to the fault.
Most industrial processes are complex, so they
cannot be designed faultless and cannot be properly
modelled in advance. It is also hard to tell which
observing variables could be used for detection of the
cases when something is wrong with the process.
Moreover, even the process experts cannot always list
the states and conditions by which we could identify
the situations when process could cause the system
fault. If we could have an adequate mathematical
model of the target system, it could be used to predict
the future system state by the inputs and previous
states. In that case, if we know the future system state,
we can predict the fault. But due to complexity of the
production process there is no mathematical model.
But in the case when the most of processes
characteristics are being monitored, we have
observations, that we can use to build data-driven
models.
All the above lead the fault prediction to be based
on analysis of the data that corresponds to stable
functioning and the data that is prior to the fault. The
goal of the prediction system is to identify the patterns
that lead the system to the fault. It is important to
mention that not all the fault prediction problems can
be considered initially as a regression or classification
problems. We consider a case, in which we only know
the time the fault happened and there are only a few
faults occurred during the comparatively large time
interval. Here we need to reduce the initial problem
to regression problem by adjusting the criteria and
auxiliary variable construction. Then we apply
statistical learning methods to the adjusted dataset to
build up a prediction system.
Machine learning algorithms are being widely
utilized to find the relation between the input and the
output of the system (Kuhn and Johnsson, 2016).
There are studies on applying the machine learning
algorithms for solving the fault prediction problem
for supervised learning, but the most of these studies
are focused on specific processes. Since the fault
prediction requires recognition of specific patterns in
data, that cause the system fault, by fault prediction
we would mean the risk estimation problem. By risk
we mean some variable, that indicates the degree of
how dangerous the current situation is, this
interpretation is a simplification of the risk definition
done by (Kaplan and Garrick, 1981), so we are not
estimating the consequences and probabilities. In
(Paltrinieri et. al., 2019) the machine learning based
approach is considered as a promising tool of solving
risk estimation problems. The difference in
approaches is the following: is in that paper there is a
variable that can be used for risk estimation, and in
our case, we need to construct it first. Other
approaches of fault detection can be based on training
model on labelled observations of the system with
and without faults (Bondyra et al., 2018), but these
approaches require observations for both regular and
fault system states. In this study we are interested in
recognition pre-fault state instead of the fault state.
In study (Rakhshani et al., 2009) authors consider
the fault prediction problem for a power plant boiler,
where the risk estimation is based on the dataset with
labeled observations. There is continuous variable
that equals its max value for normal system states and
min value for the faults. The risk estimation is the
prediction of that variable and its values become the
base for the fault detection system. Depending on the
value, the state can be classified as normal, low fault
risk and fault. But it could be too late to prevent the
fault if we detect the fault by the time it has happened
and this case we consider in this paper. It has also
been considered in the study (Hujanen, 2019), where
the problem was reduced to the classification problem
and deep neural networks were applied to find a
model. In this study we propose different approach,
where the risk is assumed to grow constantly starting
from the time prior to the fault. Also, the risk
modeling is adjusted according to uncertainty of the
actual risk level for the observations that is not in this
prior to the fault interval, since there is no prior
information that these observations are of the low or
high risk.
In this paper we describe the reduction of the
initial fault detection problem, the way to construct
the risk variable and adjusted criterion and making
data-driven models. We also discuss the possibility of
using the risk prediction models for identification of
relation between different fault cases.
2 RISK ESTIMATION
APPROACH
Today computational resources allow us to make the
data-driven solutions based on the artificial neural
networks and the other computationally intensive
algorithms (Chollet and Allaire, 2018), (Goodfellow
et al., 2016). These methods and their
implementations are becoming more important in the
era of Industry 4.0 (Brink et al., 2016), when the
collected data could be analyzed and used as
decision-making systems for improving
performance.
The considered process state can be characterized
by different inputs that correspond to the sensor data