opportunity: the data-adaptive linear approximation
is relatively easy to scale in length and amplitude,
which could make our ABC-based approach a good
candidate for identifying similarity even among time
series of varying lengths. We intend to evaluate this
aspect in future work.
Finally, the problem of similarity search in time
series is often used in combination with streaming
sets of data. For such scenarios however the entire
similarity search approach needs to be designed in
consideration of this incremental nature of the data
sets.
As a consequence, in future work we plan to
analyse the applicability of the ABC distance
measure in combination with an index structure in
the context of online similarity search, where
sequences are represented by continuously flowing
streams of data.
REFERENCES
Aghabozorgi, S. and Teh, Y. W., 2014. Stock market co-
movement assessment using a three-phase clustering
method. Expert Systems with Applications, 41(4), pp.
1301-1314.
Agrawal, R., Faloutsos, C. and Arun, S., 1993. Efficient
Similarity Search in sequence databases. Proceedings
of the 4th international Conference on Foundations of
Data Organization and Algorithms, pp. 69-84.
Batista, G., Keogh, E., Tataw, O. M. and de Souza, V. M.
A., 2014. CID: an efficient complexity-invariant
distance for time series. Data Mining and Knowledge
Discovery, 28(3), pp. 634-669.
Chen, Q. et al., 2007. Indexable PLA for Efficient
Similarity Search. Proceedings of the 33rd
international conference on Very Large Data Bases,
pp. 435-446.
Faloutsos, C., Ranganathan, M. and Manolopoulos, Y.,
1994. Fast subsequence matching in time-series
databases. Proceedings of the 1994 Annual ACM
SIGMOD Conference, pp. 419-429.
Fulcher, B. D. and Jones, N. S., 2014. Highly Comparative
Feature-Based Time-Series Classification. IEEE
Transactions on Knowledge and Data Engineering,
26(12), pp. 3026-3037.
Fu, T.-c., 2011. A review on time series data mining.
Engineering Applications of Artificial Intelligence,
24(1), pp. 164-181.
Fu, T.-c., Chung, F.-l., Lunk, R. and Ng, C.-m., 2005.
Preventing meaningless Stock Time Series Pattern
Discovery by Changing Perceptually Important Point
Detection. Fuzzy Systems and Knowledge Discovery,
pp. 1171-1174.
Keogh, E., 2003. Probabilistic Discovery of Time Series
Motifs. Proceedings of the ninth ACM SIGKDD
international conference on Knowledge discovery and
data mining, pp. 493-498.
Keogh, E., Chakrabarti, K., Pazzani, M. and Mehrotra, S.,
2001. Dimensionality Reduction for fast similarity
search in large time series databases. Knowledge and
information Systems, Volume 3, pp. 263-286.
Keogh, E., Chakrabarti, K., Pazzani, M. and Mehrotra, S.,
2001. Locally Adaptive Dimensionality Reduction for
Indexing Large Time Series Databases. ACM
SIGMOD Record, Volume 30, pp. 151-162.
Keogh, E., Chu, S., Hart, D. and Pazzani, M., 2001. An
online algorithm for segmenting time series.
Proceedings IEEE International Conference on Data
Mining, pp. 289-296.
Keogh, E. and Pazzani, M., 2001. Derivative Dynamic
Time Warping. SDM, Volume 1, pp. 5-7.
Keogh, E., Zhu, Q., Hu, B., Hao. Y., Xi, X., Wei, L. and
Ratanamahatana, C. A., 2011. The UCR Time Series
Classification/Clustering Homepage
Lines, J. and Bagnall, A., 2014. Time series classification
with ensembles of elastic distance measures. Data
Mining and Knowledge Discovery, 29(3), pp. 565-592.
Lin, J., Khade, R. and Li, Y., 2012. Rotation-invariant
similarity in time series using bag-of-patterns
representation. Journal of Intelligent Information
Systems, 38(2), pp. 287-315.
Lin, J., Williamson, S., Borne, K. and DeBarr, D., 2012.
Pattern Recognition in Time Series. Advances in
Machine Learning and Data Mining for Astronomy,
Volume 1, pp. 617-645.
Olszewski, R., 2001. Generalized Feature Extraction for
Structural Pattern Matching in Time-Series Data.
Pazzani, M. and Keogh, E., 1998. An enhanced
representation of time series which allows fast and
accurate classification, clustering and relevance
feedback. KDD, Volume 98, pp. 239-243.
Ratanamahatana, C. A. and Keogh, E., 2005. Exact
indexing of dynamic time warping. Knowledge and
information systems, 7(3), pp. 358-386.
Shatkay, H. and Zdonik, S., 1996. Approximate Queries
and Representation for Large Data Sequences.
Proceedings of the Twelfth International Conference
on Data Engineering, pp. 536-545.
Shieh, J. and Keogh, E., 2008. iSAX: Indexing and
Mining Terabyte Sized Time Series. Proceedings of
the 14th ACM SIGKDD international conference on
Knowledge discovery and data mining, pp. 623-631.
Toshniwal, D. and Joshi, R. C., 2005. Similarity Search in
Time Series Data Using Time Weighted Slopes.
Informatica, 29(1).
Wang, X. et al., 2013. Experimental comparison of
representation methods and distance measures for time
series data. Data Mining and Knowledge Discovery,
pp. 275-309.