SINGLE DOCUMENT TEXT SUMMARIZATION USING RANDOM INDEXING AND NEURAL NETWORKS

Niladri Chatterjee, Avikant Bhardwaj

2010

Abstract

This paper presents a new extraction-based summarization technique developed using neural networks and Random Indexing. The technique exploits the advantages that a neural network provides in terms of compatibility and adaptability of a system as per the user. A neural network is made to learn the important properties of sentences that should be included in the summary through training. The trained neural network is then used as a sieve to filter out the sentences relevant for corresponding summary. Neural network along with Random Indexing extracts the semantic similarity between sentences in order to remove redundancy from the text to great success. One major advantage of the proposed scheme is that it takes care of human subjectivity as well.

References

  1. Bishop, C. M. (2005). Neural Networks for Pattern Recognition. Oxford Press.
  2. Chatterjee, N. and Mohan, S. (2007). Extraction-based single-document summarization using random indexing. In ICTAI (2), pages 448-455.
  3. Edmundson, H. P. (1969). New methods in automatic extracting. J. ACM, 16(2):264-285.
  4. Higgins, D., Burstein, J., Marcu, D., and Gentile, C. (2004). Evaluating multiple aspects of coherence in student essays. In HLT-NAACL, pages 185-192.
  5. Kaikhah, K. (2004). Text summarization using neural networks. WSEAS Transactions on Systems3, 3:960-963.
  6. Kanerva, P. (1988). Sparse distributed memory. Cambridge: MIT Press.
  7. Karlgren, J. and Sahlgren, M. (2001). From words to understanding: Foundations of real-world intelligence. CSLI Publications.
  8. Kupiec, J., Pedersen, J. O., and Chen, F. (1995). A trainable document summarizer. In SIGIR, pages 68-73.
  9. Luhn, H. (1958). The automatic creation of literature abstracts. IBM Journal of Research and Development.
  10. Mani, I. and Bloedorn, E. (1999). Summarizing similarities and differences among related documents. Inf. Retr., 1(1-2):35-67.
  11. Rojas, R. (1996). Neural Networks: A Systematic Introduction. Springer-Verlag, Berlin.
  12. Sahlgren, M. (2005). An introduction to random indexing. In Proceedings of the Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering.
  13. Sahlgren, M. (2006). The Word-Space Model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in highdimensional vector spaces. Ph.D. dissertation, Department of Linguistics, Stockholm University.
  14. Yates, R. and Neto, B. (1999). Modern Information Retrieval. Pearson Education.
Download


Paper Citation


in Harvard Style

Chatterjee N. and Bhardwaj A. (2010). SINGLE DOCUMENT TEXT SUMMARIZATION USING RANDOM INDEXING AND NEURAL NETWORKS . In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010) ISBN 978-989-8425-29-4, pages 171-176. DOI: 10.5220/0003066601710176


in Bibtex Style

@conference{keod10,
author={Niladri Chatterjee and Avikant Bhardwaj},
title={SINGLE DOCUMENT TEXT SUMMARIZATION USING RANDOM INDEXING AND NEURAL NETWORKS},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010)},
year={2010},
pages={171-176},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003066601710176},
isbn={978-989-8425-29-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2010)
TI - SINGLE DOCUMENT TEXT SUMMARIZATION USING RANDOM INDEXING AND NEURAL NETWORKS
SN - 978-989-8425-29-4
AU - Chatterjee N.
AU - Bhardwaj A.
PY - 2010
SP - 171
EP - 176
DO - 10.5220/0003066601710176