Intended Boundaries detection in Topic Change Tracking for Text Segmentation
Alexandre Labadié, Violaine Prince
2008
Abstract
This paper presents a topical text segmentation method based on intended boundaries detection and compares it to a well known default boundaries detection method, c99. Running the two methods on a corpus of twenty two French political discourse and results showed that intended boundaries detection performs better than default boundaries detection on well structured texts.
References
- Kaszkiel, M., Zobel, J.: Passage retrieval revisited. Proceedings of theTwentieth International Conference on Research and Development in Information Access (ACMSIGIR) (1997) 178- 185
- Prince, V., Labadié, A.: Text segmentation based on document understanding for information retrieval. In Proceedings of NLDB'07 (2007) 295-304
- Kan, M., Klavans, J.L., McKeown, K.R.: Linear segmentation and segment significance. Proceedings of WVLC-6 (1998) 197-205
- Hearst, M.A.: Text-tilling : segmenting text into multi-paragraph subtopic passages. Computational Linguistics (1997) 59-66
- Pevzner, L., Hearst, M.: A critique and improvement of anevaluation metric for text segmentation. Computational Linguistics (2002) 113-125
- Choi, F.Y.Y.: Advances in domain independent linear text segmentation. Proceedings of NAACL-00 (2000) 26-33
- Morris, J., Hirst, G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics 17 (1991) 20-48
- Bestgen, Y., Piérard, S.: Comment évaluer les algorithmes de segmentation automatiques ? essai de construction d'un matriel de référence. Proceedings of TALN'06 (2006)
- Choi, F.Y.Y., Wiemer-Hastings, P., Moore, J.: Latent semantic analysis for text segmentation. Proceedings of EMNLP (2001) 109-117
- Reynar, J.C.: Topic Segmentation: Algorithms and Applications. Phd thesis, University of Pennsylvania (1998)
- Passonneau, R.J., Litman, D.: Lintention-based segmentation: Humanreliability and correlation with linguistic cues. Proceedings of the 31st Annual Meeting of theAssociation for Computational Linguistics, (1993) 148-155
- Chauché, J.: Un outil multidimensionnel de l'analyse du discours. Proceedings of Coling'84 1 (1984) 11-15
- Roget, P.: Thesaurus of English Words and Phrases. Longman, London (1852)
- Larousse: Thésaurus Larousse - des idées aux mots, des mots aux idées. Larousse, Paris (1992)
- Chauché, J., Prince, V.: Classifying texts through natural language parsing and semantic filtering. In Proceedings of LTC'03 (2007)
- Labadié, A., Chauché: Segmentation thématique par calcul de distance sémantique. Proceedings of DEFT'06 1 (2006) 45-59
- Lelu, A., M., C., Aubain, S.: Coopération multiniveau d'approches non-supervises et supervises pour la détection des ruptures thématiques dans les discours présidentiels franc¸ais. In Proceedings of DEFT'06 (2006)
- Azé, J., Heitz, T., Mela, A., Mezaour, A., Peinl, P., Roche, M.: Présentation de deft'06 (defi fouille de textes). Proceedings of DEFT'06 1 (2006) 3-12
Paper Citation
in Harvard Style
Labadié A. and Prince V. (2008). Intended Boundaries detection in Topic Change Tracking for Text Segmentation . In Proceedings of the 5th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2008) ISBN 978-989-8111-45-6, pages 13-21. DOI: 10.5220/0001728200130021
in Bibtex Style
@conference{nlpcs08,
author={Alexandre Labadié and Violaine Prince},
title={Intended Boundaries detection in Topic Change Tracking for Text Segmentation},
booktitle={Proceedings of the 5th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2008)},
year={2008},
pages={13-21},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001728200130021},
isbn={978-989-8111-45-6},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 5th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2008)
TI - Intended Boundaries detection in Topic Change Tracking for Text Segmentation
SN - 978-989-8111-45-6
AU - Labadié A.
AU - Prince V.
PY - 2008
SP - 13
EP - 21
DO - 10.5220/0001728200130021