mainly at 11 o’clock and 12 o’clock. In the
following, we only use the data representing
weekdays.
Table 2: Data sample.
ID hour Bedroom Grid Refrigerator
499 2015-05-
01 07:00
0 0.67 0.2
499 2015-05-
01 08:00
0 0.39 0.12
499 2015-05-
01 09:00
0 0.04 0.17
3.2 Results and Discussions
Figure 5: Visualization of different test days using Auto-
encoders reconstruction. Different colors represent the
reconstruction MSE.
We applied two different approach for two different
scenarios. The first one is trying to detect anomalous
days without localizing the anomaly. This can be
useful to building managers to better understand
consumers’ behaviors and for making energy
efficient home improvements. Fig. 4 represents the
actual power consumption values of the test days.
Fig. 5 shows a visualization of the Auto-Encoder
reconstruction with 5 hidden layers as following [24,
50, 20, 2, 20, 50, 24]. Different colors represent the
reconstruction MSE (Mean Square Error). The
reconstruction error is then compared with a
threshold in order to determine if the day in normal
or not. For a threshold of 0.04, only days 4, 6 and 7
are considered as anomalous as can be seen in Fig.
5.
In the second scenario, we tried to localize the
anomaly using the method we proposed in section 3.
Fig. 6, Fig.7 and Fig. 8 illustrate the results of our
proposed method on the same test days. As can be
seen, our method localized two anomalies for day 4
(at 8 a.m. and 10 a.m.) and one anomaly for day 7
(at 10 a.m.). This can explain why days 4 and 7 have
been considered anomalous by the auto encoders and
can also explain day 4 has a higher reconstruction
error than day 7 since day 4 has 2 anomalies
whereas day 7 has only one anomaly. Contrary to
the Auto-Encoder that considered the day number 6
as anomalous, our method conserves a consistent
overall prediction as can be seen in Fig.7.
4 CONCLUSION
Finding anomalies in time series data is a very
promising topic permitting to reduce the waste of
energy and to better monitor building energy
consumption. In this paper, we present a hybrid
model combining LSTM and K-means algorithm in
order to detect outliers in time series data. Auto-
Encoders detects abnormal days, whereas the
proposed algorithm identifies the typical scenario
permitting to localize the detected anomalies.
Despite of these encouraging results, this work
needs the assistance of real expert users and analysts
in order to better define the anomaly in this domain.
Experts can also provide some annotations for the
learning data in order to give us the possibility of
applying semi-supervised approaches in this domain.
ACKNOWLEDGEMENTS
This work has been supported by SOLOTEC project
which is financed by the European Union and
Champagne Ardenne region.
REFERENCES
S. Bilgen, Structure and environmental impact of global
energy consumption, Renewable and Sustainable
Energy Reviews, Volume 38,2014,Pages 890-
902,ISSN 1364-0321,
https://doi.org/10.1016/j.rser.2014.07.004.(http://www
.sciencedirect.com/science/article/pii/S136403211400
4560).
Jovane, F., Yoshikawa, H., Alting, L., Boer, C. R.,
Westkamper, E., Williams, D., ... & Paci, A. M.
(2008). The incoming global technological and
industrial revolution towards competitive sustainable
manufacturing. Cirp Annals, 57(2), 641-659.