Automatic Summarization Based on Sentence Morpho-Syntactic Structure: Narrative Sentences Compression

Mehdi Yousfi-Monod, Violaine Prince

2005

Abstract

We propose an approach to automatic text summarization through sentence compression. The approach relies on each constituent's syntactic function and its position in the sentence's syntactic tree. We first define the notion of a constituent and its role as an information provider, then analyze the losses in content and discourse consistency caused by deleting such a constituent. We explain why the method works best on narrative texts. Using a rule-based system built on SYGFRAN's morphosyntactic analysis for French [1], we select removable constituents. Results are satisfactory at the sentence level but less so at the level of the whole text, which we explain by the difference in impact between constituents and relations.
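As an illustration only, the idea of selecting removable constituents by syntactic function and tree position can be sketched in a few lines of Python. This is not the authors' rule set and does not use SYGFRAN: the `Constituent` class, the function labels, the `REMOVABLE` rules, and the English toy sentence are all assumptions made for the example. Position is approximated here by the function of the parent node.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set, Tuple


@dataclass
class Constituent:
    """A node of a toy syntactic tree (hypothetical structure, not SYGFRAN output)."""
    function: str                      # assumed label, e.g. "subject", "adjunct"
    text: str = ""                     # surface words, used for leaves only
    children: List["Constituent"] = field(default_factory=list)


# Hypothetical rules: (constituent function, parent function) pairs
# that this sketch treats as removable.
REMOVABLE: Set[Tuple[str, str]] = {
    ("adjunct", "sentence"),
    ("modifier", "subject"),
}


def compress(node: Constituent, parent_function: Optional[str] = None) -> List[str]:
    """Return the words kept after dropping constituents matched by REMOVABLE."""
    if parent_function is not None and (node.function, parent_function) in REMOVABLE:
        return []                      # the whole subtree is deleted
    if not node.children:
        return node.text.split()
    kept: List[str] = []
    for child in node.children:
        kept.extend(compress(child, node.function))
    return kept


# Toy sentence: "yesterday the old sailor told a story in the harbor"
sentence = Constituent("sentence", children=[
    Constituent("adjunct", "yesterday"),
    Constituent("subject", children=[
        Constituent("determiner", "the"),
        Constituent("modifier", "old"),
        Constituent("head", "sailor"),
    ]),
    Constituent("verb", "told"),
    Constituent("object", "a story"),
    Constituent("adjunct", "in the harbor"),
])

compressed = " ".join(compress(sentence))
print(compressed)
```

Under these assumed rules, the subject, verb, and object are kept while the sentence-level adjuncts and the subject's modifier are dropped. A sketch like this works one sentence at a time and does not model relations between sentences, which the abstract identifies as the source of weaker results at the whole-text level.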

References

  1. Chauché, J.: Un outil multidimensionnel de l'analyse du discours. In: Coling'84, Stanford University, California (1984) 11-15
  2. Knight, K., Marcu, D.: Summarization beyond sentence extraction: a probabilistic approach to sentence compression. Artificial Intelligence 139(1) (2002) 91-107
  3. Siddharthan, A.: Resolving relative clause attachment ambiguities using machine learning techniques and wordnet hierarchies. In: 5th National Colloquium for Computational Linguistics in the UK (CLUK 2002). (2002) 45-49

Paper Citation


in Harvard Style

Yousfi-Monod M. and Prince V. (2005). Automatic Summarization Based on Sentence Morpho-Syntactic Structure: Narrative Sentences Compression. In Proceedings of the 2nd International Workshop on Natural Language Understanding and Cognitive Science - Volume 1: NLUCS, (ICEIS 2005) ISBN 972-8865-23-6X, pages 161-167. DOI: 10.5220/0002570201610167


in Bibtex Style

@conference{nlucs05,
author={Mehdi Yousfi-Monod and Violaine Prince},
title={Automatic Summarization Based on Sentence Morpho-Syntactic Structure: Narrative Sentences Compression},
booktitle={Proceedings of the 2nd International Workshop on Natural Language Understanding and Cognitive Science - Volume 1: NLUCS, (ICEIS 2005)},
year={2005},
pages={161-167},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002570201610167},
isbn={972-8865-23-6X},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Workshop on Natural Language Understanding and Cognitive Science - Volume 1: NLUCS, (ICEIS 2005)
TI - Automatic Summarization Based on Sentence Morpho-Syntactic Structure: Narrative Sentences Compression
SN - 972-8865-23-6X
AU - Yousfi-Monod M.
AU - Prince V.
PY - 2005
SP - 161
EP - 167
DO - 10.5220/0002570201610167