5 CONCLUSIONS
In this paper, we have presented a new approach to the
analysis of topics changing over time by considering
changes in the gobal contexts of terms as indicative
of a change of meaning. First results carried out us-
ing data from contemporary news corpora for German
and English indicate the validity of the approach. In
particular, it could be shown that the proposed mea-
sure of a term’s volatility of meaning is highly inde-
pendent from a term’s frequency.
In a next step, the analysis proposed can be ex-
tended to look at individual topics changing over
those time spans identified as interesting. Instead of
only looking at the terms that change their meaning
over time, it might also be of value to look at those
terms that for some time span retain a “stable” mean-
ing, expressing a societie’s unquestioned consensus
on a topic, as it were. In the long run, this approach
might lead to an infrastructure for easily analyzing di-
achronic text corpora with many useful and interest-
ing applications in trend and technology mining, mar-
keting, and E-Humanities.
REFERENCES
Allan, J. (2002). Introduction to topic detection and track-
ing, pages 1–16. Kluwer Academic Publishers, Nor-
well, MA, USA.
Allan, J., Carbonell, J., Doddington, G., Yamron, J., Yang,
Y., Umass, J. A., Cmu, B. A., Cmu, D. B., Cmu, A. B.,
Cmu, R. B., Dragon, I. C., Darpa, G. D., Cmu, A. H.,
Cmu, J. L., Umass, V. L., Cmu, X. L., Dragon, S. L.,
Dragon, P. V. M., Umass, R. P., Cmu, T. P., Umass,
J. P., and Umass, M. S. (1998). Topic detection and
tracking pilot study final report. In In Proceedings of
the DARPA Broadcast News Transcription and Under-
standing Workshop, pages 194–218.
Heyer, G., Quasthoff, U., and Wittig, T. (2008). Text Min-
ing: Wissensrohstoff Text – Konzepte, Algorithmen,
Ergebnisse. W3L-Verlag, 2nd edition.
Kleinberg, J. (2002). Bursty and hierarchical structure in
streams. In KDD ’02: Proceedings of the eighth ACM
SIGKDD international conference on Knowledge dis-
covery and data mining, pages 91–101, New York,
NY, USA. ACM Press.
Kumaran, G. and Allan, J. (2004). Text classification and
named entities for new event detection. In SIGIR ’04:
Proceedings of the 27th annual international ACM SI-
GIR conference on Research and development in in-
formation retrieval, pages 297–304, New York, NY,
USA. ACM.
Quasthoff, U. and Wolff, C. (2002). The poisson col-
locations measure and its application. In Workshop
on Computational Approaches to Collocations, Wien,
Austria.
Swan, R. and Allan, J. (1999). Extracting significant time
varying features from text. In CIKM ’99: Proceedings
of the eighth international conference on Information
and knowledge management, pages 38–45, New York,
NY, USA. ACM.
Swan, R. and Allan, J. (2000). Automatic generation of
overview timelines. In SIGIR ’00: Proceedings of the
23rd annual international ACM SIGIR conference on
Research and development in information retrieval,
pages 49–56, New York, NY, USA. ACM.
Taylor, S. J. (2007). Introduction to asset price dynam-
ics, volatility, and prediction. In Asset Price Dynam-
ics, Volatility, and Prediction, Introductory Chapters.
Princeton University Press.
Wang, X. and McCallum, A. (2006). Topics over time: a
non-markov continuous-time model of topical trends.
In KDD ’06: Proceedings of the 12th ACM SIGKDD
international conference on Knowledge discovery and
data mining, pages 424–433, New York, NY, USA.
ACM.
KDIR 2009 - International Conference on Knowledge Discovery and Information Retrieval
228