SEGMENTING OF RECORDED LECTURE VIDEOS - The Algorithm VoiceSeg

Stephan Repp; Christoph Meinel

doi:10.5220/0001570603170322

SEGMENTING OF RECORDED LECTURE VIDEOS - The Algorithm VoiceSeg

Stephan Repp, Christoph Meinel

2006

Abstract

In the past decade, we have witnessed a dramatic increase in the availability of online academic lecture videos. There are technical problems in the use of recorded lectures for learning: the problem of easy access to the multimedia lecture video content and the problem of finding the semantically appropriate information very quickly. The first step to a semantic lecture-browser is the segmenting of the large video-corpus into a smaller cohesion area. The task of breaking documents into topically coherent subparts is called topic segmentation. In this paper, we present a segmenting algorithm for recorded lecture videos based on their imperfect transcripts. The recorded lectures are transcripted by an out-of-the-box speech recognition software with a accuracy of approximately 70%-80%. Words as well as a time stamp for each word are stored in a database. This data acts as the input to our algorithm. We will show that the clustering of similar words, the generation of vectors with the values from the clusters and the calculation of the cosine-mass of adjacent vectors, leads to a better segmenting result compared to a standard algorithm.

References

Schillings V.; Meinel, C., 2002. tele-TASK - Teleteaching Anywhere Solution Kit. In Proceedings. ACM SIGUCCS 2002, 130-133. Providence, USA.
Hürst, W., 2003. A qualitative study towards using large vocabulary automatic speech recognition to index recorded presentations for search and access over the web. IADIS International Journal on WWW/Internet, Volume I, Number 1: 43-58.
Chau, M.; Jay, F.; Nunamaker, Jr.; Ming, L., Chen, H. ,2004. Segmentation of Lecture Videos Based on Text: A Method Combining Multiple Linguistic Features. In Proceedings of the 37th Hawaii International Conference on System Sciences. Hawaii, USA.
Baeza-Yates, R.; Ribeiro-Neto, B., 1999. Modern Information Retrieval. New York, USA: AddisonWesley.
Glass, J.; Hazen, T.J.; Hetherington, L.; Wang, C., 2004. Analysis and Processing of Lecture Audio Data: Preliminary Investigations. In Proceedings of the HLT-NAACL 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, 9-12. Boston, MA, USA.
Linckels, S.; Meinel, Ch.; Engel, T., 2005. Teaching in theCyber Age: Technologies, Experiments, and Realizations. In Proceedings of 3. Deutschen eLearning Fachtagung der Gesellschaft für Informatik (DeLFI), 225 - 236. Rostock, Germany.
Repp, S.; Meinel, C., 2006. Semantic Indexing for Recorded Educational Lecture Videos. In Proceedings of the Fourth Annual IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOMW'06), 240-245. Pisa, Italy.
Nicola, S., 2004. Applications of Lexical Cohesion Analysis in the Topic Detection and Tracking Domain. Ph.D. diss., Dept. of Computer Science, University College Dublin.
Hearst, Marti A., 1997. TextTiling: Segmenting Text into Multi-paragraph Subtopic Passages. Computational Linguistics 23, 33-64. Cambridge, MA: MIT Press.
Reynar, J. C., 1998. Topic Segmentation: Algorithms and application. Ph.D. diss., University of Pennsylvania.
Morris, J.; Hirst, G., 1991. Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics 17, 21-48. Cambridge, MA: MIT Press.
Tür, G; Hakkani-Tür, D; Shriberg, E., 2001. Integrating Prosodic and Lexical Cues for Automatic Topic. In Segmentation CoRR
Beeferman, D; Adam, L.; Berger, A.; Lafferty, J., 1999. Statistical Models for Text Segmentation. In Machine Learning 34
Choi, F., 2000. Advance in domain independent linear text segmentation. In Proceedings of NAACL
Galley, M.; McKeown, K.; Fosler-Lussier, E.; Jing, H., 2003. Discourse Segmentation of Multi-Party Conversation In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics

Download

Paper Citation

in Harvard Style

Repp S. and Meinel C. (2006). SEGMENTING OF RECORDED LECTURE VIDEOS - The Algorithm VoiceSeg . In Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2006) ISBN 978-972-8865-64-1, pages 317-322. DOI: 10.5220/0001570603170322

in Bibtex Style

@conference{sigmap06,
author={Stephan Repp and Christoph Meinel},
title={SEGMENTING OF RECORDED LECTURE VIDEOS - The Algorithm VoiceSeg},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2006)},
year={2006},
pages={317-322},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001570603170322},
isbn={978-972-8865-64-1},
}

in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2006)
TI - SEGMENTING OF RECORDED LECTURE VIDEOS - The Algorithm VoiceSeg
SN - 978-972-8865-64-1
AU - Repp S.
AU - Meinel C.
PY - 2006
SP - 317
EP - 322
DO - 10.5220/0001570603170322