text a method to detect and remove structural breaks.
Following a structural break means a rapid and strong
shift of the mean demand of a certain article. In other
words, one can find two different distributions which
can be separated at a certain point in time. Figure 1
shows an example for such a strong and rapid shift
of the mean. One can see that in week 69 the behav-
ior of the time series changes dramatically. The mean
demand changes from 181 items based on the deliver-
ies until week 69 to 119 items based on the deliveries
starting from week 70. Since safety stock levels are
often affected by variance or standard deviation, an
estimation of stock level based on the complete time
series can lead to overstocking in cases as described
above.
Taking the previous considerations into account,
the task can be summarized as follows: If a set of data
is likely to correspond to two differentdistributions, is
there a point in time which can be used to differentiate
between both distributions, or do the random numbers
come alternating from both distributions? Having ob-
tained the results from section 2.1 it is necessary to
find an adequate transformation routine.
A well-known clustering algorithm is the k-
means-clustering as described e.g. in (Press et al.,
2007). Clustering is known to be NP-hard in stan-
dard scenarios, hence polynomial clustering heuris-
tics like k-means-clustering do not guarantee optimal
solutions. Since in this case clustering is performed
for only one dimension the algorithm convergesto the
optimum as described in (Hartigan and Wong, 1979).
The goal of the algorithm is to find k clusters in n-
dimensional space, where a cluster is described by its
n-dimensional mean vector. Whereas some modifica-
tions of the algorithm allow an adaptive fit of k to the
data samples, the problem described above requires
to set k = 2, as the task is to find two separate distri-
butions of samples. Unfortunately, using exactly two
clusters brings up a weakness of this method concern-
ing outliers. In order to prevent identifying outliers as
a cluster, it is recommended to remove outliers prior
to the analysis, e.g. by using the 3σ rule, i.e. elimi-
nating samples which deviate from the mean value by
more than three times the standard deviation, compare
(Wadsworth, 1997).
Having found two clusters, C
0
and C
1
, the trans-
formation routine P
C
: R → {0,1} can be defined as
follows and the time series x
i
∈ R can be transformed
to y
i
∈ {0,1}
y
i
=
0 x ∈ C
0
1 x ∈ C
1
(11)
Just as the design of the transformation routine de-
pends on the task considered, certain parameters have
to be set depending on it. Since the task in this case
is to detect a clustering of samples from different dis-
tributions, it is recommended to set the length of the
analyzed subsequence τ in section 2.1 equals
τ = min{|C
0
|,|C
1
|} (12)
by default. In order to reduce the number of false
alarms it is helpful to define an offset. This has the
effect that a time series can only be reduced to a cer-
tain minimum number of samples. Another strategy
to prevent false alarms is to demand a minimum size
of each cluster. Both points are justified by the goal to
analyze whether the distribution of samples has reli-
ably changed and choice of settings should depend on
risks associated with increasing either type I or type
II error.
2.3 Dealing with Multiple Structural
Breaks
In order to deal with multiple structural breaks, an it-
erative procedure of the algorithm presented within
this paper is applied. Given a time series x
1
,...,x
s
and the algorithm detects a structural break at time
index b, the algorithm is applied again on time series
x
b
,...,x
s
until convergence, i.e. no further change-
point is detected on the subsequence. If one is inter-
ested in identifying all change-points, the procedure
can be applied to all remaining subsequence until con-
vergence.
2.4 A Brief Note on Dealing with
Nonadditive Changes
Nonadditive changes are defined in e.g. (Basseville
and Nikiforov, 1993) as changes in variance, corre-
lations, spectral characteristics, and dynamics of the
signal or system. Hence, these types of changes are
considered to be more complex to detect than addi-
tive changes, i.e. shifts in the mean value. Although
additive changes play the central role in the follow-
ing application on real data, the algorithm can easily
be adapted to detect nonadditive changes. In order
to demonstrate the flexibility of the novel approach, a
rough recipe for this adaptation is provided.
The task of detecting either additive or nonaddi-
tive changes can be summarized as generating resid-
uals of the original measurements that reflect the
changes of interest, which are in this particular case
of nonadditive nature. As stated above, instead of
residuals the algorithm introduced within this paper
demands a sequence of binary numbers, which is in-
terpreted as the outcome of a binary stochastic pro-
cess {Y
i
}
i∈N
with Ω := {0,1}. Afterwards, the se-
FORECAST ERROR REDUCTION BY PREPROCESSED HIGH-PERFORMANCE STRUCTURAL BREAK
DETECTION
265