GENERATING A VISUAL OVERVIEW OF LARGE DIACHRONIC DOCUMENT COLLECTIONS BASED ON THE DETECTION OF TOPIC CHANGE

Florian Holz, Sven Teresniak, Gerhard Heyer, Gerik Scheuermann

Abstract

Large digital diachronic document collections are a central source of information in science, business, and for the general public. One challenge for the efficient visualization of these collections is the automatic calculation and visualization of the main topics. These topics can then serve as the basis for an overview of the content and any subsequent interactive visual analysis. We introduce the new language processing concept of volatility of terms measured as the change of the context of terms. We demonstrate that volatility can serve as an excellent basis for the visual overview of large collections using two different examples.

References

  1. Allan, J. (2002). Introduction to topic detection and tracking, pages 1-16. Kluwer Academic Publishers, Norwell, MA, USA.
  2. Allan, J. et al. (1998). Topic detection and tracking pilot study final report. In Proc. DARPA Broadcast News Transcription and Understanding Workshop, pages 194-218.
  3. Dunning, T. E. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1):61-74.
  4. Heyer, G., Quasthoff, U., and Wittig, T. (2008). Text Mining: Wissensrohstoff Text - Konzepte, Algorithmen, Ergebnisse. W3L-Verlag, 2nd edition.
  5. Holz, F. and Teresniak, S. (2010). Towards automatic detection and tracking of topic change. In Gelbukh, A., editor, Proc. CICLing 2010: Conference on Intelligent Text Processing and Computational Linguistics, LNCS 6008. Springer LNCS.
  6. Taylor, S. J. (2007). Introduction to asset price dynamics, volatility, and prediction. In Asset Price Dynamics, Volatility, and Prediction, Introductory Chapters. Princeton University Press.
Download


Paper Citation


in Harvard Style

Holz F., Teresniak S., Heyer G. and Scheuermann G. (2010). GENERATING A VISUAL OVERVIEW OF LARGE DIACHRONIC DOCUMENT COLLECTIONS BASED ON THE DETECTION OF TOPIC CHANGE . In Proceedings of the International Conference on Imaging Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2010) ISBN 978-989-674-027-6, pages 153-156. DOI: 10.5220/0002836701530156


in Bibtex Style

@conference{ivapp10,
author={Florian Holz and Sven Teresniak and Gerhard Heyer and Gerik Scheuermann},
title={GENERATING A VISUAL OVERVIEW OF LARGE DIACHRONIC DOCUMENT COLLECTIONS BASED ON THE DETECTION OF TOPIC CHANGE},
booktitle={Proceedings of the International Conference on Imaging Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2010)},
year={2010},
pages={153-156},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002836701530156},
isbn={978-989-674-027-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Imaging Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2010)
TI - GENERATING A VISUAL OVERVIEW OF LARGE DIACHRONIC DOCUMENT COLLECTIONS BASED ON THE DETECTION OF TOPIC CHANGE
SN - 978-989-674-027-6
AU - Holz F.
AU - Teresniak S.
AU - Heyer G.
AU - Scheuermann G.
PY - 2010
SP - 153
EP - 156
DO - 10.5220/0002836701530156