ing scenarios, showcasing its ability to detect anoma-
lies without relying on color information.
The most likely reason for false detections is con-
sidered to be the inadequate performance of feature
extraction by the VAE. In this approach, anomaly de-
tection relies on the difference in feature vectors be-
tween that of the predicted normal map at t + 1 and
the normal map at t + 1. Hence, the performance of
the VAE’s encoder plays a crucial role in influencing
the outcomes. Enhancing the detection performance
is anticipated by achieving a more accurate feature ex-
traction for unknown normal maps using the VAE.
While there is still significant room for improve-
ment in avoiding the misclassification of normal (non-
abnormal) frames as anomaly frames in both Sec-
tion 4.2.2 and Section 4.2.3, the results presented
above effectively highlight the efficacy of the pro-
posed method. This approach, utilizing normal maps
and anomaly detection, demonstrates its effectiveness
in detecting anomalies on the road.
5 CONCLUSION
In this paper, we propose a novel approach for detect-
ing road surface anomalies using normal maps and
anomaly detection. When walking, individuals may
unconsciously perceive that there is no danger based
solely on the color information of the road surface.
However, in reality, there could be anomalies that lead
to significant accidents. Our method aims to address
the potential risks posed by these anomalies by pre-
dicting the normal map of the ground surface one is
about to walk on, leveraging a time series of normal
maps, and generating anomaly scores. The effective-
ness of our proposed method has been demonstrated
through experiments using the custom datasets. This
research, combining normal maps with anomaly de-
tection, contributes to advancements in the fields of
pedestrian assistance and anomaly detection.
REFERENCES
A. Kirillov, E. Mintun, N. Ravi, H. Mao, C. Rolland, L.
Gustafson, T. Xiao, S. Whitehead, A. C. Berg, W.
Lo, P. Dollar, and R. Girshick (2023). Segment any-
thing. In Proceedings of the IEEE/CVF International
Conference on Computer Vision (ICCV), pages 4015–
4026.
D. Ha and J. Schmidhuber (2018). World models. arXiv
preprint arXiv:1803.10122.
D. T. Shipmon, J. M. Gurevitch, P. M. Piselli, and S. T.
Edwards (2017). Time series anomaly detection; de-
tection of anomalous drops with limited features and
sparse examples in noisy highly periodic data. arXiv
preprint arXiv:1708.03665.
D.P. Kingma and J. Ba (2015). Adam: A method for
stochastic optimization. In International Conference
on Learning Representations (ICLR), pages 1–15.
Hunter, J. S. (1986). The exponentially weighted moving
average. Journal of quality technology, 18(4):203–
210.
I. Higgins, L. Matthey, A. Pal, C. Burgess, X. Glorot,
M. Botvinick, S. Mohamed, and A. Lerchner (2016).
beta-vae: Learning basic visual concepts with a con-
strained variational framework. In International Con-
ference on Learning Representations.
I. Vasiljevic, N. Kolkin, S. Zhang, R. Luo, H. Wang, F. Z.
Dai, A. F. Daniele, M. Mostajabi, S. Basart, M. R.
Walter, and G. Shakhnarovich (2019). Diode: A dense
indoor and outdoor depth dataset. arXiv preprint
arXiv:1908.00463.
J. Goh, S. Adepu, M. Tan, and Z. S. Lee (2017).
Anomaly detection in cyber-physical systems using
recurrent neural networks. In IEEE 18th International
Symposium on High Assurance Systems Engineering
(HASE), pages 140–145.
K. Hundman, V. Constantinou, C. Laporte, I. Colwell, and
T. Soderstrom (2018). Detecting spacecraft anomalies
using lstms and nonparametric dynamic thresholding.
In Proceedings of the 24th ACM SIGKDD Interna-
tional Conference on Knowledge Discovery & Data
Mining, pages 387–395.
K. Imai, I. Kitahara, and Y. Kameda (2017). Detecting
walkable plane areas by using rgb-d camera and ac-
celerometer for visually impaired people. In 3DTV
Conference: The True Vision-Capture, Transmission
and Display of 3D Video (3DTV-CON), pages 1–4.
K. Yanagihara, H. Takefuji, P. Sarakon, and H. Kawano
(2020). A method to detect steps on the sidewalks for
supporting visually impaired people in walking. In
Proceedings of the Fuzzy System Symposium (Japan
Society for Fuzzy Theory and Intelligent Informatics),
volume 36, pages 395–398.
L. Shen, Z. Li, and J. Kwok (2020). Time series anomaly
detection using temporal hierarchical one-class net-
work. In Advances in Neural Information Processing
Systems 33, pages 13016–13026.
N. Ding, H. Ma, H. Gao, Y. Ma, and G. Tan (2019).
Real-time anomaly detection based on long short-term
memory and gaussian mixture model. Computers &
Electrical Engineering, 79.
R.R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D.
Parikh, and D. Batra (2017). Grad-cam: Visual ex-
planations from deep networks via gradient-based lo-
calization. In IEEE International Conference on Com-
puter Vision, pages 618–626.
S. Hochreiter and J. Schmidhuber (2015). Anomaly detec-
tion in ecg time signals via deep long short-term mem-
ory networks. In 2015 IEEE International Conference
on Data Science and Advanced Analytics (DSAA),
pages 1–7.
T. Voj
´
ı
ˇ
r and J. Matas (2023). Image-consistent detection of
road anomalies as unpredictable patches. In Proceed-
VISAPP 2024 - 19th International Conference on Computer Vision Theory and Applications
254