
taminants is challenging, our results demonstrate that
carefully selected unsupervised learning methods can
offer meaningful detection capabilities. The robust-
ness of these methods to unknown contaminants is
crucial for the practical implementation of water qual-
ity monitoring systems.
6 CONCLUSION
This paper presented a novel HS dataset for detect-
ing anomalous substances in water that are visually
indistinguishable. Our comprehensive spectral analy-
sis demonstrates the superiority of HS imaging over
conventional RGB imaging in capturing subtle dif-
ferences between normal water and water contami-
nated with anomalous substances. Experimental eval-
uations of various unsupervised anomaly detection
methods shows the effectiveness of distance-based
and subspace-based approaches, particularly when
utilizing combined visible and near-infrared spectral
data.
While challenges remain in detecting unknown
anomalies, our findings provide a foundation for fu-
ture work. Further work could explore advanced deep
learning techniques for HS data and methods to im-
prove generalization. Expanding the dataset to in-
clude more substances could also enhance this work’s
scope.
ACKNOWLEDGEMENTS
This research was supported by the KDDI Founda-
tion. We are grateful to our laboratory members for
their invaluable assistance in this work. Their meticu-
lous work in data acquisition is crucial to the success
of our work.
REFERENCES
Brando, V. E. and Dekker, A. G. (2003). Satellite hy-
perspectral remote sensing for estimating estuarine
and coastal water quality. IEEE transactions on geo-
science and remote sensing, 41(6):1378–1387.
Breunig, M. M., Kriegel, H.-P., Ng, R. T., and Sander, J.
(2000). Lof: identifying density-based local outliers.
SIGMOD Rec., 29(2):93–104.
Cao, Q., Yu, G., Sun, S., Dou, Y., Li, H., and Qiao, Z.
(2021). Monitoring water quality of the haihe river
based on ground-based hyperspectral remote sensing.
Water, 14(1):22.
Du, Q., Tang, B., Xie, W., and Li, W. (2021). Parallel and
distributed computing for anomaly detection from hy-
perspectral remote sensing imagery. Proceedings of
the IEEE, 109(8):1306–1319.
Eba Japan Co.Ltd. Hyper spetral camera NH9. https:
//ebajapan.jp/products/hyper-spectral-camera/.
He, Z., Xu, X., and Deng, S. (2003). Discovering
cluster-based local outliers. Pattern Recogn. Lett.,
24(9–10):1641–1650.
Kriegel, H.-P., Kr
¨
oger, P., Schubert, E., and Zimek, A.
(2009). Outlier detection in axis-parallel subspaces
of high dimensional data. In Proceedings of the
13th Pacific-Asia Conference on Advances in Knowl-
edge Discovery and Data Mining, PAKDD ’09, page
831–838, Berlin, Heidelberg. Springer-Verlag.
Kriegel, H.-P., Schubert, M., and Zimek, A. (2008). Angle-
based outlier detection in high-dimensional data. In
Knowledge Discovery and Data Mining.
Liu, F. T., Ting, K. M., and Zhou, Z.-H. (2008). Isolation
forest. In 2008 Eighth IEEE International Conference
on Data Mining, pages 413–422.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V.,
Thirion, B., Grisel, O., Blondel, M., Prettenhofer,
P., Weiss, R., Dubourg, V., Vanderplas, J., Passos,
A., Cournapeau, D., Brucher, M., Perrot, M., and
Duchesnay, E. (2011). Scikit-learn: Machine learning
in Python. Journal of Machine Learning Research,
12:2825–2830.
Ramaswamy, S., Rastogi, R., and Shim, K. (2000). Effi-
cient algorithms for mining outliers from large data
sets. SIGMOD Rec., 29(2):427–438.
Scholkopf, B., Platt, J. C., Shawe-Taylor, J., Smola, A., and
Williamson, R. C. (2001). Estimating the support of
a high-dimensional distribution. Neural Computation,
13:1443–1471.
Shyu, M.-L., Chen, S., Sarinnapakorn, K., and Chang, L.
(2003). A novel anomaly detection scheme based on
principal component classifier.
Su, H., Wu, Z., Zhang, H., and Du, Q. (2021). Hyperspec-
tral anomaly detection: A survey. IEEE Geoscience
and Remote Sensing Magazine, 10(1):64–90.
Tang, J., Chen, Z., Fu, A. W.-C., and Cheung, D. W.-L.
(2002). Enhancing effectiveness of outlier detections
for low density patterns. In Pacific-Asia Conference
on Knowledge Discovery and Data Mining.
VanderWoude, A. and Marshall, L. (2021). Hyperspectral
imagery to study harmful algal blooms (habs) of lake
erie, lake st. clair, and saginaw bay, lake huron of the
great lakes region.
Zhao, Y., Nasrullah, Z., and Li, Z. (2019). Pyod: A python
toolbox for scalable outlier detection. Journal of Ma-
chine Learning Research, 20(96):1–7.
Anomalous Water Dataset Captured by Hyperspectral Cameras
449