our approach. The fourth version is the online SVM
algorithm which is trained with the same first 3000
examples and then increases the initial training set
with the following 3000
samples. The difference
between this and our method is that this model is re-
trained every time a new instance arrives (full
instance memory). Table 1 shows the average
accuracy the methods achieved in predicting the
price of stock for 3000 examples. The incremental
SVM algorithm achieves the highest accuracy which
is increased with the increase of the training set. It
achieves 71% accuracy for the first half of the
training set and 77% for the second half.
Table 1: Accuracy of various SVM algorithms.
ALGORITHM ACCURACY
Classic SVM 52%
Static Incremental SVM 60%
Online SVM 62%
Incremental SVM Learning 74%
5 CONCLUSION
In this paper we proposed two different data miming
techniques for time series and data streams. The first
is associated with the problem of lag correlation
discovery of time series. The proposed HSLC
algorithm achieved a reduction in time complexity in
contrast to the state-of-the-art method PLC while
preserving high accuracy in the results. Furthermore,
it has an infinitesimal error for both the value of the
lag and the value of the correlation coefficient for
every detected correlated pair of series. In the
second part of this paper, we examined the problem
of classification of data streams and evaluated
several approaches on a stock prediction case.
Specifically, an incremental SVM learning
algorithm used for data mining on streams was
employed on the KOSPI dataset. The algorithm
achieved higher accuracy compared to three other
versions of the SVM algorithm concluding that the
training models for the stock prediction problem
should follow an incremental iteration methodology.
ACKNOWLEDGEMENTS
This work was supported in part by the European
Union’s Horizon 2020 research and innovation
programme under grant agreement No 690140.
REFERENCES
Diehl, C. P. and Cauwenberghs, C. (2003) SVM
Incremental Learning, Adaption and Optimization. In:
Proceedings of the IEEE International Joint
Conference on Neural Networks, pp. 2685-2690.
Edwards, R.D., Magee, J. and Bassetti, W.H.C. (2007)
Technical Analysis of Stock Trends, 9
th
ed.
Kim, Kyoung-jae (2003) Financial time series forecasting
using support vector machines. Neurocomputing
Journal, 55(1-2), pp. 307-319.
Mporas I., Tsirka, V., Zacharaki, E. I., Koutroumanidis,
M., Richardson, M., Megalooikonomou, V. (2015)
Seizure detection using EEG and ECG signals for
computer-based monitoring, analysis and management
of epileptic patients. Expert Systems with Applications,
42(6), pp. 3227-3233.
Poggio, T. and Cauwenberghs, C. (2001) Incremental and
Decremental Support Vector Machine Learning. In:
Proceedings of the 2000 Conference on Advances in
Neural Information Processing Systems, Vol. 13, MIT
Press.
Shin, K.S., Lee, T.S. and Kim, H. (2005) An application
of support vector machines in bankruptcy model.
Expert Systems with Applications, 28(1), pp. 127-135.
Syed, N. A., Liu, H. and Sung, K. (1999) Incremental
Learning with Support Vector Machines. In:
Proceedings of the International Joint Conference on
Artificial Intelligence.
Yi, B.-K., Sidiropoulos, N.D., Johnson, T., Jagadish, H.V.,
Faloutsos, C., Biliris, A. (2000) Online Data Mining
for Co-Evolving Time Sequences. In Proceedings of
the 16
th
International Conference on Data
Engineering, pp. 13-22.
Zhang, T., Yue, D., Gu, Y., Wang, Y. and Yu, G. (2009)
Adaptive correlation analysis in stream time series
with sliding windows. Computers & Mathematics with
Applications, 57(6), pp. 937-948.
Zhang, T., Yue, D., Wang, Y. and Yu, G. (2011) A Novel
Approach for Mining Multiple Data Streams based on
Lag Correlation. In: Proceedings of the Control and
Decision Conference, pp. 2377-2382.
Zhu, Y. and Shasha, D. (2002) StatStream: Statistical
Monitoring of Thousands of Data Streams in Real
Time: In: Proceedings of the 28
th
International
Conference on Very Large Data Bases, pp. 358-369.
Google/Finance. (2014). [Online] Available from:
http://www.google.com/finance. [Accessed 14/3/17].
Microsoft StreamInsight. (2016). [Software]
[Online]Available from: https://technet.microsoft.com/
enus/library/ee362541.
Diehl, C. (2011). Github, inc. [Online] Available from:
https://github.com/diehl/Incremental-SVM-Learning-
in-MATLAB.
IoTBDS 2017 - 2nd International Conference on Internet of Things, Big Data and Security