during upward trends. The question is whether a ma-
jority of positive news will be reached before the mar-
ket trend starts to go up. In such a case there would
be perhaps a small forecast possibility.
The Fig. 1 shows both linear and nonlinear prog-
nosis as well as the DAX values. The linear and non-
linear classifications are almost identical. Theoreti-
cally, a nonlinear SVM should classify better (Zanni
and Zanghirati, 2003). However, this was not the
case since our dataset was biased towards the DOWN
class. Therefore, the classifier classifies most of the
DOWN news correctly, since they are more promi-
nent.
During the time when the trend stays stable, i.e.
DOWN-phase 2000, about 90% - 95% of news are
correctly classified. When the trend becomes unsta-
ble, the precision of classification is going down to
40%, i.e. in UP-phase 2001, DOWN-phase 2004,
DOWN-phase 2006.
We can see that the news trend changed from
DOWN to UP with July 2002 but the market trend
changed from DOWN to UP with a delay in the 11.
week of 2003. In other break points, this phenomenon
cannot be seen so clearly but we can see the change of
the news trend from UP to DOWN at the end of 2006,
i.e. some months before the subprime crisis started to
break the long-term market trend.
This can be explained by investor psychology.
During the enthusiastic UP market trend, investors do
not want to accept the coming negative news and be-
cause of that the market trend changes with a delay.
Similarly, during a DOWN trend, investors interpret
the positive news with too much pessimism and be-
cause of that the market trend changes again, albeit
with a delay.
Unfortunately, we haven’t got enough data to pos-
tulate some new laws based on our investigations. In
such a way, we can investigate only changes of big
trends and they are rare. We haven’t got news in elec-
tronical form before 1999.
7 CONCLUSIONS
In this paper, we focused on the classification of large
sets of textual news by Support Vector Machine. In
technical terms, the SVM worked relatively well with
succesful classification rates of about 80%. It’s also
noteable that with 6 min during linear-parallel classi-
fication, the classification time was relatively low. As
described above, the nonlinear approach should have
yielded a better result (Zanni and Zanghirati, 2003).
Since the runtime of 6 minutes using linear PGPDT
is a lot smaller than 2 hours on nonlinear GPDT and
they differ only in about 0,8% classification quality,
the linear-parallel approach proved to be the best in
our experiments.
It is very interesting to note that PGPDT and
8 process-OpenMPI cut the runtime of the nonlin-
ear runs down from the extremely long runtime of
48hours to 0,04% of that - 2 hours. Also, paral-
lel solving greatly improved linear runtime as well.
Therefore we advise to make full use of the parallel
implementation, the differences in speed we discov-
ered were immense.
We found that SVM is a viable tool to classify
large sets of stock market messages. However, the
market processes are known to be chaotic (Peters,
1996). This means that any prediction and any fore-
cast mechanism is on principle questionable. We
could argue from our experimentation that SVMs are
a viable means of forecasting large movements in the
stock market. Also, we explained above the possi-
ble psychologic reasons of the delayed reaction of in-
vestors on changed news. But since our dataset is very
limited (our set of news in electronic form starts at the
end of 1999) and long-time stock market data is hard
to come by, no conclusive statement about the fore-
cast quality can be reached in a mathematically cor-
rect way. But for all that, we find the news classifi-
cation results and their correlation with the long-term
market trends interesting.
In the further work, we will try to support our
hypothesis by using more sophisticated classification
methods. Also, we will look for correlation with other
market indices.
REFERENCES
Boser, B., Guyon, I., and Vapnik, V. (1992). A training algo-
rithm for optimal margin classifiers. In In: Haussler,
D. (Ed.): Proceedings of 5th Annual ACM Workshop
on COLT, pages 144-152. Pittsburgh, PA, ACM Press.
Braverman, E., Aizerman, M., and Rozonoer, L. (1964).
Theoretical foundations of the potential function
methodin pattern recognition learning. In Automation
and Remote Control 25, pp. 821 - 837.
Janetzko, R. (2008). Using Support Vector Machines for
Classification of News. TU Chemnitz, (In German).
Joachims, T. (1998a). Making large-scale svm learning
practical. In LS8-Report, 24. Universitaet Dortmund.
Joachims, T. (1998b). Text categorization with sup-
port vector machines: Learning with many rele-
vant features. In http://www.cs.cornell.edu/ Peo-
ple/tj/publications/joachims98a.
Joachims, T. (2001). Learning to classify text using Support
Vector Machines. Kluwer Academic Publishers.
A CASE STUDY - Classification of Stock Exchange News by Support Vector Machines
335