
6 CONCLUSION
In this article, we introduced a novel method for unsu-
pervised surface anomaly detection, centered around
a contrastively selected embedding designed to ag-
gregate the most pertinent features for the task of
defect detection. Leveraging the representational
capabilities of deep features extracted from a pre-
trained model, our approach achieves state-of-the-art
performance in surface defect detection on both the
MVTEC AD dataset and the TILDA dataset. Through
the employment of a compact network comprised of
pointwise convolutions and a judicious selection of
samples for inference comparison, our method en-
sures that inference speed is solely contingent on
the chosen pre-trained classifier for deep feature ex-
traction. This design leads to state-of-the-art perfor-
mance in terms of model latency. However, it is cru-
cial to acknowledge the potential limitations of our
method. The primary constraint is associated with the
choice of the feature extractor and our substantial re-
liance on its representational power. As we focus on
deep features, it becomes challenging to unbias the
extracted features if the anomaly is not discernible
within them. Another constraint lies in the process of
defect generation during training, which significantly
slows down model training, resulting in a relatively
extended training duration compared to other state-
of-the-art approaches. In conclusion, we posit that
this methodology holds considerable promise in the
field of surface defect detection, and we earnestly en-
courage researchers to explore and further investigate
such approaches.
REFERENCES
Akcay, S., Ameln, D., Vaidya, A., Lakshmanan, B., Ahuja,
N., and Genc, U. (2022). Anomalib: A deep learning
library for anomaly detection.
Bergmann, P., Fauser, M., Sattlegger, D., and Steger, C.
(2019). MVTec AD — a comprehensive real-world
dataset for unsupervised anomaly detection. In 2019
IEEE/CVF Conference on Computer Vision and Pat-
tern Recognition (CVPR), pages 9584–9592. IEEE.
Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S., and
Vedaldi, A. (2014). Describing textures in the wild. In
2014 IEEE Conference on Computer Vision and Pat-
tern Recognition, pages 3606–3613. IEEE.
Defard, T., Setkov, A., Loesch, A., and Audigier, R. PaDiM:
a patch distribution modeling framework for anomaly
detection and localization. In 2021 ICPR Interna-
tional Workshops and Challenges.
Deng, H. and Li, X. (2022). Anomaly detection via reverse
distillation from one-class embedding. In Proceedings
of the IEEE/CVF Conference on Computer Vision and
Pattern Recognition (CVPR), pages 9737–9746.
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-
Fei, L. ImageNet: A large-scale hierarchical image
database. In 2009 IEEE Conference on Computer Vi-
sion and Pattern Recognition, pages 248–255. ISSN:
1063-6919.
DFG (1996). TILDA textile texture-database.
Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B.,
Warde-Farley, D., Ozair, S., Courville, A., and Ben-
gio, Y. Generative adversarial networks. In Advances
in neural information processing systems. 2014.
Han, S., Hu, X., Huang, H., Jiang, M., and Zhao, Y. (2022).
ADBench: Anomaly detection benchmark.
He, K., Zhang, X., Ren, S., and Sun, J. Deep residual learn-
ing for image recognition. In 2016 IEEE Conference
on Computer Vision and Pattern Recognition (CVPR.
Hinton, G., Vinyals, O., and Dean, J. (2015). Distilling the
knowledge in a neural network. In NIPS 2014 Deep
Learning Workshop.
Kingma, D. P. and Ba, J. (2017). Adam: A method for
stochastic optimization. In 2015 International Con-
ference on Learning Representations (ICLR).
Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Im-
ageNet classification with deep convolutional neural
networks. In NIPS’12: Proceedings of the 25th Inter-
national Conference on Neural Information Process-
ing Systems, volume 60, pages 84–90.
Lee, S., Lee, S., and Song, B. C. CFA: Coupled-
hypersphere-based feature adaptation for target-
oriented anomaly localization. In IEEE Access Vol-
ume 10 Pages 78446-78454 2022.
Li, C.-L., Sohn, K., Yoon, J., and Pfister, T. (2021). Cut-
Paste: Self-supervised learning for anomaly detection
and localization. In 2021 IEEE Conference on Com-
puter Vision and Pattern Recognition.
Liang, Y., Zhang, J., Zhao, S., Wu, R., Liu, Y., and Pan, S.
(2022). Omni-frequency channel-selection represen-
tations for unsupervised anomaly detection. In IEEE
Transactions on Image Processing 2022.
Mei, S., Wang, Y., and Wen, G. (2018). Automatic fabric
defect detection with a multi-scale convolutional de-
noising autoencoder network model. In Sensors 2018,
volume 18, page 1064.
Nguyen, Q. P., Lim, K. W., Divakaran, D. M., Low,
K. H., and Chan, M. C. (2019). GEE: A gradient-
based explainable variational autoencoder for network
anomaly detection. In IEEE Conference on Commu-
nications and Network Security (CNS) 2019.
Pourreza, M., Mohammadi, B., Khaki, M., Bouindour, S.,
Snoussi, H., and Sabokrou, M. (2021). G2d: Generate
to detect anomaly. In 2021 IEEE Winter Conference
on Applications of Computer Vision (WACV), pages
2002–2011. IEEE. event-place: Waikoloa, HI, USA.
Rezende, D. J. and Mohamed, S. (2016). Variational infer-
ence with normalizing flows. In Proceedings of the
32nd International Conference on Machine Learning
2016.
Roth, K., Pemula, L., Zepeda, J., Sch
¨
olkopf, B., Brox, T.,
and Gehler, P. (2021). Towards total recall in in-
VISAPP 2024 - 19th International Conference on Computer Vision Theory and Applications
288