Similarity Range Queries in Streaming Time Series

Maria Kontaki, Apostolos N. Papadopoulos, Yannis Manolopoulos

2004

Abstract

Similarity search in time series databases is an important research direction. Several methods have been proposed in order to provide algorithms for efficient query processing in the case of static time series of fixed length. In streaming time series the similarity problem is more complex, since the dynamic nature of streaming data make these methods inappropriate. In this paper, we propose a new method to evaluate similarity range queries in streaming time series. The method is based on the use of a multidimensional access method, the R∗ -tree, which is used to store features of the time series extracted by means of the DFT (Discrete Fourier Transform). We take advantage of the incremental computation of the DFT and equip the R∗ -tree with a deferred update policy in order to improve maintenance costs. The experimental evaluation based on synthetic random walk time series and on real stock market data shows that significant performance improvement is achieved in comparison to the sequential scanning of the database.

References

  1. R. Agrawal, C. Faloutsos, and A. Swami: \E±cient Similarity Search In Sequence Databases", Proc. FODO, pp. 69-84, Evanston, Illinois, USA, October 1993.
  2. B. Babcock, S. Babu, M. Datar, R. Motwani, J. Widom: \Models and Issues in Data Stream Systems", Proc. ACM PODS, pp. 1-16, Madison, Wisconsin, June 2002.
  3. N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger: \The R¤-tree: an E±cient and Robust Access Method for Points and Rectangles", Proc. ACM SIGMOD, pp. 322-331, Atlantic City, NJ, May 1990.
  4. S. Babu, and J. Widom: \Continuous Queries over Data Streams", SIGMOD Record, Vol. 30, No. 3, pp. 109-120, September 2001.
  5. T. Bozkaya, N. Yazdani, and M. Ozsoyoglu: \Matching and Indexing Sequences of Di®erent Lengths", Proc. CIKM, Las Vegas, NV, USA, 1997.
  6. K. Chan, and A. W. Fu: \E±cient Time Series Matching by Wavelets", Proc. IEEE ICDE, pp. 126-133, 1999.
  7. S. Chandrasekaran, and M. J. Franklin: \Streaming Queries over Streaming Data", Proc. VLDB, Hong Kong, China, August 2002.
  8. C. Faloutsos, M. Ranganathan, and Y. Manolopoulos: \Fast Subsequence Matching in Time-Series Databases", Proc. ACM SIGMOD, pp. 419-429, Minneapolis, Minnesota, USA, May 1994.
  9. L. Gao, and X. S. Wang: \Continually Evaluating Similarity-Based Pattern Queries on a Streaming Time Series", Proc. ACM SIGMOD, Madison, Wisconsin 2002.
  10. L. Gao, Z. Yao, and X. S. Wang: \Evaluating Continuous Nearest Neighbor Queries for Streaming Time Series via Pre-fetching", Proc. VLDB, Hong Kong, China, August 2002.
  11. E. J. Keogh, and M. J. Pazzani: \An Indexing Scheme for Fast Similarity Search in Large Time Series Databases", Proc. SSDBM, Clevelant, Ohio, 1999.
  12. X. Liu, and H. Ferhatosmanoglu: \E±cient k-NN Search on Streaming Data Series". Proc. SSTD, Santorini, Greece, July 2003.
  13. A. Nanopoulos, Y. Theodoridis and Y. Manolopoulos: \C2P: Clustering based on Closest-Pairs", Proc. VLDB, pp.331-340, 2001.
  14. S. Park, W. W. Chu, J. Yoon, and C. Hsu: \E±cient Searches for Similar Subsequences of Di®erent Lengths in Sequence Databases", Proc. IEEE ICDE, 2000.
  15. R. Weber, H.-J. Schek and S. Blott: \A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces", Proc. VLDB, pp.194-205, New York City, New York, August 1998.
  16. B.-K. Yi, and C. Faloutsos: \Fast Time Sequence Indexing for Arbitrary Lp Norms", Proc. VLDB, Cairo, Egypt, 2000.
  17. B. Yi, H V. Jagadish, and C. Faloutsos: \E±cient Retrieval of Similar Time Sequences Under Time Wraping", Proc IEEE ICDE, pp. 201-208, Orlando, Florida, February 1998.
Download


Paper Citation


in Harvard Style

Kontaki M., N. Papadopoulos A. and Manolopoulos Y. (2004). Similarity Range Queries in Streaming Time Series . In Proceedings of the 4th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2004) ISBN 972-8865-01-5, pages 69-79. DOI: 10.5220/0002675700690079


in Bibtex Style

@conference{pris04,
author={Maria Kontaki and Apostolos N. Papadopoulos and Yannis Manolopoulos},
title={Similarity Range Queries in Streaming Time Series},
booktitle={Proceedings of the 4th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2004)},
year={2004},
pages={69-79},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002675700690079},
isbn={972-8865-01-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2004)
TI - Similarity Range Queries in Streaming Time Series
SN - 972-8865-01-5
AU - Kontaki M.
AU - N. Papadopoulos A.
AU - Manolopoulos Y.
PY - 2004
SP - 69
EP - 79
DO - 10.5220/0002675700690079