Eliminating Noise in the Matrix Profile
Dieter De Paepe, Olivier Janssens, Sofie Van Hoecke
2019
Abstract
As companies increasingly measure their products and services, the amount of time series data is rising, and techniques to extract usable information are needed. One recently developed data mining technique for time series is the Matrix Profile. It consists of the smallest z-normalized Euclidean distance from each subsequence of a time series to all subsequences of another series. It has been used for motif and discord discovery, for segmentation, and as a building block for other techniques. One side effect of the z-normalization is that small fluctuations on flat signals are upscaled. This can lead to high and unintuitive distances for very similar subsequences from noisy data. We determined an analytic method to estimate and remove the effects of this noise, adding only a single, intuitive parameter to the calculation of the Matrix Profile. This paper explains our method and demonstrates it by performing discord discovery on the Numenta Anomaly Benchmark and by segmenting the PAMAP2 activity dataset. We find that our technique results in a more intuitive Matrix Profile and provides improved results in both use cases for series containing many flat, noisy subsequences. Since our technique is an extension of the Matrix Profile, it can be applied to any of the tasks that the Matrix Profile can solve, improving results where data contains flat and noisy sequences.
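The side effect described in the abstract can be illustrated with a small sketch. The code below is not the authors' noise-elimination method; it only shows a brute-force Matrix Profile between two series and how z-normalization inflates the distance between two flat, noisy subsequences compared with two noisy sine subsequences. The function names, the noise level of 0.1, the subsequence length of 50, and the synthetic signals are all assumptions chosen for illustration.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def znorm_rows(windows):
    """Z-normalize each row (zero mean, unit standard deviation).

    Assumes no row is perfectly constant, which holds for noisy data.
    """
    mean = windows.mean(axis=1, keepdims=True)
    std = windows.std(axis=1, keepdims=True)
    return (windows - mean) / std


def znorm_dist(a, b):
    """Z-normalized Euclidean distance between two equal-length subsequences."""
    a = znorm_rows(np.asarray(a, dtype=float)[None, :])
    b = znorm_rows(np.asarray(b, dtype=float)[None, :])
    return float(np.linalg.norm(a - b))


def naive_matrix_profile(series_a, series_b, m):
    """Brute-force Matrix Profile of series_a against series_b.

    For every length-m subsequence of series_a, return the smallest
    z-normalized Euclidean distance to any length-m subsequence of series_b.
    Illustrative only: time and memory grow quadratically with series length.
    """
    wa = znorm_rows(sliding_window_view(np.asarray(series_a, dtype=float), m))
    wb = znorm_rows(sliding_window_view(np.asarray(series_b, dtype=float), m))
    dists = np.linalg.norm(wa[:, None, :] - wb[None, :, :], axis=2)
    return dists.min(axis=1)


# Hypothetical demo data: the same Gaussian noise level on a flat signal
# and on a sine wave.
rng = np.random.default_rng(0)
m = 50
noise_std = 0.1
t = np.linspace(0, 2 * np.pi, m)

flat_a = rng.normal(0.0, noise_std, m)
flat_b = rng.normal(0.0, noise_std, m)
sine_a = np.sin(t) + rng.normal(0.0, noise_std, m)
sine_b = np.sin(t) + rng.normal(0.0, noise_std, m)

# Both pairs differ only by noise, yet the flat pair ends up much further
# apart because z-normalization rescales the noise to unit variance.
flat_dist = znorm_dist(flat_a, flat_b)
sine_dist = znorm_dist(sine_a, sine_b)
print(f"flat pair: {flat_dist:.2f}, sine pair: {sine_dist:.2f}")

# Matrix Profile of one noisy series against another.
series_a = np.sin(np.linspace(0, 8 * np.pi, 200)) + rng.normal(0.0, noise_std, 200)
series_b = np.sin(np.linspace(0, 8 * np.pi, 200)) + rng.normal(0.0, noise_std, 200)
profile = naive_matrix_profile(series_a, series_b, m)
```

The two flat subsequences look nearly identical to a human reader, but after z-normalization they are essentially uncorrelated noise, so their distance is far larger than that of the sine pair. This is the kind of unintuitive distance the abstract refers to.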
Paper Citation
in Harvard Style
De Paepe D., Janssens O. and Van Hoecke S. (2019). Eliminating Noise in the Matrix Profile. In Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-351-3, pages 83-93. DOI: 10.5220/0007314100830093
in Bibtex Style
@conference{icpram19,
author={Dieter De Paepe and Olivier Janssens and Sofie Van Hoecke},
title={Eliminating Noise in the Matrix Profile},
booktitle={Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2019},
pages={83-93},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007314100830093},
isbn={978-989-758-351-3},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Eliminating Noise in the Matrix Profile
SN - 978-989-758-351-3
AU - De Paepe D.
AU - Janssens O.
AU - Van Hoecke S.
PY - 2019
SP - 83
EP - 93
DO - 10.5220/0007314100830093