USING GENETIC ALGORITHMS WITH LEXICAL CHAINS FOR AUTOMATIC TEXT SUMMARIZATION

Mine Berker, Tunga Güngör

Abstract

Automatic text summarization takes an input text and extracts its most important content. How important a piece of content is depends on several factors. In this paper, we combine two approaches that have been used in text summarization. The first uses genetic algorithms to learn the patterns in documents that lead to their summaries. The second uses lexical chains as a representation of the lexical cohesion that exists in the text. We propose a novel approach that incorporates lexical chains into the model as a feature and learns the feature weights with genetic algorithms. The experiments showed that combining different types of features, including lexical chains, outperforms the classical approaches.
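The idea of learning feature weights with a genetic algorithm can be illustrated with a minimal sketch. It is not the authors' implementation: the three features (position, term frequency, lexical chain score), the toy feature values, the fitness definition (overlap with a reference summary), and the choice of truncation selection, single-point crossover, and Gaussian mutation are all assumptions made for illustration.

```python
import random

# Hypothetical toy data: one row per sentence, with columns
# [position score, term-frequency score, lexical-chain score].
FEATURES = [
    [1.0, 0.2, 0.9],
    [0.8, 0.9, 0.1],
    [0.6, 0.7, 0.2],
    [0.4, 0.1, 0.8],
    [0.2, 0.5, 0.3],
]
# Hypothetical reference summary: indices of the sentences a human selected.
REFERENCE = {0, 3}


def score_sentences(features, weights):
    """Score each sentence as a weighted sum of its feature values."""
    return [sum(w * f for w, f in zip(weights, row)) for row in features]


def select_summary(features, weights, k):
    """Return the indices of the k highest-scoring sentences."""
    scores = score_sentences(features, weights)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return set(ranked[:k])


def fitness(weights, features, reference):
    """Fraction of reference sentences recovered by this weight vector."""
    chosen = select_summary(features, weights, len(reference))
    return len(chosen & reference) / len(reference)


def evolve_weights(features, reference, pop_size=20, generations=30,
                   mutation_rate=0.2, seed=0):
    """Evolve a weight vector in [0, 1]^n (requires at least 2 features)."""
    rng = random.Random(seed)
    n = len(features[0])
    population = [[rng.random() for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        # Truncation selection: the fitter half survives unchanged.
        population.sort(key=lambda w: fitness(w, features, reference),
                        reverse=True)
        parents = population[:pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)          # single-point crossover
            child = a[:cut] + b[cut:]
            # Gaussian mutation, clipped to keep weights inside [0, 1].
            child = [
                min(1.0, max(0.0, g + rng.gauss(0, 0.1)))
                if rng.random() < mutation_rate else g
                for g in child
            ]
            children.append(child)
        population = parents + children
    return max(population, key=lambda w: fitness(w, features, reference))


best = evolve_weights(FEATURES, REFERENCE)
print("weights:", best, "fitness:", fitness(best, FEATURES, REFERENCE))
```

In this toy setting, a weight vector that relies only on the lexical chain column, such as [0, 0, 1], already recovers the reference summary. A realistic setup would compute the fitness over a training corpus of documents paired with reference summaries.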


Paper Citation


in Harvard Style

Berker M. and Güngör T. (2012). USING GENETIC ALGORITHMS WITH LEXICAL CHAINS FOR AUTOMATIC TEXT SUMMARIZATION. In Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: SSML, (ICAART 2012), ISBN 978-989-8425-95-9, pages 595-600. DOI: 10.5220/0003882405950600


in Bibtex Style

@conference{ssml12,
author={Mine Berker and Tunga Güngör},
title={USING GENETIC ALGORITHMS WITH LEXICAL CHAINS FOR AUTOMATIC TEXT SUMMARIZATION},
booktitle={Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: SSML, (ICAART 2012)},
year={2012},
pages={595-600},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003882405950600},
isbn={978-989-8425-95-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: SSML, (ICAART 2012)
TI - USING GENETIC ALGORITHMS WITH LEXICAL CHAINS FOR AUTOMATIC TEXT SUMMARIZATION
SN - 978-989-8425-95-9
AU - Berker M.
AU - Güngör T.
PY - 2012
SP - 595
EP - 600
DO - 10.5220/0003882405950600