Figure 7: Zoom on one image leads to the next image.
9 CONCLUSION
We have developed hyperSAX, a representation and
method for indexing multidimensional arrays. We
have shown that we can use it to index millions of
images, and perform very fast approximate searches
on those images. We have introduced a way to dy-
namically reduce the discretization, i.e. increase word
length, when it is appropriate, rather than providing
a constant word length as required by iSAX and its
derivatives. There is, however, room for improve-
ment, such as improving the splitting policy from Sec-
tion 5.1 to ensure more balanced splits. While com-
paring images using the Frobenius distance measure
may not be optimal, it is still likely to produce good
enough results (from approximate search), as long as
an index contains enough images.
REFERENCES
Andr
´
e-J
¨
onsson, H. (2002). Indexing Strategies for Time Se-
ries Data. Department of Computer and Information
Science, Link
¨
opings universitet.
Bach, J. R., Fuller, C., Gupta, A., Hampapur, A., Horowitz,
B., Humphrey, R., Jain, R., and Shu, C.-F. (1996).
Virage image search engine: an open framework for
image management. In Sethi, I. K. and Jain, R. C.,
editors, Storage and Retrieval for Still Image and
Video Databases IV, volume 2670 of Society of Photo-
Optical Instrumentation Engineers (SPIE) Conference
Series, pages 76–87.
Camerra, A., Palpanas, T., Shieh, J., and Keogh, E. (2010).
iSAX 2.0: Indexing and mining one billion time se-
ries. In Proceedings of the 2010 IEEE International
Conference on Data Mining, pages 58–67. IEEE
Computer Society.
Camerra, A., Shieh, J., Palpanas, T., Rakthanmanon, T., and
Keogh, E. J. (2014). Beyond one billion time series:
indexing and mining very large time series collections
with iSAX2+. Knowl. Inf. Syst., 39(1):123–151.
Cheng, S.-C. and Wu, T.-L. (2006). Speeding up the sim-
ilarity search in high-dimensional image database by
multiscale filtering and dynamic programming. Image
Vision Comput., 24(5):424–435.
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-
Fei, L. (2009). ImageNet: A Large-Scale Hierarchi-
cal Image Database. In Computer Vision and Pattern
Recognition.
Faloutsos, C., Ranganathan, M., and Manolopoulos, Y.
(1994). Fast subsequence matching in time-series
databases. SIGMOD Rec., 23(2):419–429.
Flickner, M., Sawhney, H., Niblack, W., Ashley, J., Huang,
Q., Dom, B., Gorkani, M., Hafner, J., Lee, D.,
Petkovic, D., Steele, D., and Yanker, P. (1995). Query
by image and video content: the QBIC system. Com-
puter, 28(9):23–32.
Gaede, V. and G
¨
unther, O. (1998). Multidimensional access
methods. ACM Comput. Surv., 30(2):170–231.
Jain, A. K. and Vailaya, A. (1996). Image retrieval using
color and shape. Pattern Recognition, 29(8):1233 –
1244.
Kasson, J. M. and Plouffe, W. (1992). An analysis of
selected computer interchange color spaces. ACM
Trans. Graph., 11(4):373–405.
Keogh, E., Chakrabarti, K., Pazzani, M., and Mehrotra, S.
(2001a). Dimensionality reduction for fast similarity
search in large time series databases. Knowl. Inf. Syst.,
3(3):263–286.
Keogh, E., Chakrabarti, K., Pazzani, M., and Mehrotra, S.
(2001b). Locally adaptive dimensionality reduction
for indexing large time series databases. SIGMOD
Rec., 30(2):151–162.
Lin, J., Keogh, E., Lonardi, S., and Chiu, B. (2003). A sym-
bolic representation of time series, with implications
for streaming algorithms. In Proceedings of the 8th
ACM SIGMOD Workshop on Research Issues in Data
Mining and Knowledge Discovery, pages 2–11. ACM.
Shieh, J. and Keogh, E. (2008). iSAX: Indexing and mining
terabyte sized time series. In Proceedings of the 14th
ACM SIGKDD International Conference on Knowl-
edge Discovery and Data Mining, pages 623–631.
ACM.
Torralba, A., Fergus, R., and Freeman, W. (2008). 80 mil-
lion tiny images: A large data set for nonparamet-
ric object and scene recognition. In IEEE Transac-
tions on Pattern Analysis and Machine Intelligence,
30(11):1958–1970.
Yi, B.-K. and Faloutsos, C. (2000). Fast time sequence
indexing for arbitrary Lp norms. In Proceedings of
the 26th International Conference on Very Large Data
Bases, VLDB ’00, pages 385–394, San Francisco,
CA, USA. Morgan Kaufmann Publishers Inc.
ICPRAM2015-InternationalConferenceonPatternRecognitionApplicationsandMethods
198