performance using only these four features.
The second column in Table 4 shows the weights
of the features learned during training and the
success rate of the model on the test corpus. We can
see that the result is almost the same as the
performance of the system when all the features are
considered.
Table 4: Feature weights and success rates for the
alternative models.
Feature weight
(top features)
Feature weight
(modified criterion)
Another alternative model is decreasing the
threshold for determining strong lexical chains.
Assuming that an increase in the lexical chain scores
might affect the performance of the system, we
changed Eqn. 12 in such a way that chains whose
scores are more than one standard deviation from the
average are accepted as strong chains. The four
features in the previous experiment were used in this
model also. The results are shown in the last column
of Table 4. The success is a bit higher, but the
difference is not statistically significant. In order to
observe the effect of the threshold more clearly, the
experiments should be repeated with other
thresholds on corpora of different sizes.
4 CONCLUSIONS
In this work, we combined two approaches used in
automatic text summarization: lexical chains and
genetic algorithms. Different from previous works,
this paper combines information from different
levels of text analysis. The lexical chain concept is
included as a feature in the proposed model. We also
make use of machine learning to determine the
coefficients of the feature combinations. The results
showed that the combination of the features yields
better success rates than any individual feature.
Also, incorporating lexical chains into the model as
a feature increases the success of the overall model.
ACKNOWLEDGEMENTS
This work was supported by the Scientific and
Technological Research Council of Turkey
(TÜBİTAK) BİDEB under the programme 2219.
REFERENCES
Barzilay, R., 1997. Lexical Chains for Summarization.
M.Sc. Thesis, Ben-Gurion University of the Negev,
Department of Mathematics and Computer Science.
Barzilay, R., Elhadad, M., 1997. Using Lexical Chains for
Text Summarization. In ACL/EACL Workshop on
Intelligent Scalable Text Summarization, pp. 10-17.
Brandow, R., Mitze, K., Rau, L., 1994. Automatic
Condensation of Electronic Publications by Sentence
Selection. Information Processing and Management,
31(5), 675-685.
Brunn, M., Chali, Y., Pinchak, C. J., 2001. Text
Summarization Using Lexical Chains. In Document
Understanding Conference, pp. 135-140.
Dehkordi, P. K., Khosravi, H., Kumarci, F., 2009. Text
Summarization Based on Genetic Programming.
International Journal of Computing and ICT
Research, 3(1), 57-64.
Edmundson, H. P., 1969. New Methods in Automatic
Abstracting. Journal of the Association for Computing
Machinery, 16(2), 264-285.
Fattah, M. A., Ren, F., 2009. GA, MR, FFNN, PNN and
GMM Based Models for Automatic Text
Summarization. Computer Science and Language, 23,
126-144.
Fuentes, M., Rodriguez, H., 2002. Using Cohesive
Properties of Text for Automatic Summarization. In
Workshop on Processing and Information Retrieval.
Illinois Named Entity Tagger. http://cogcomp.cs.illinois.
edu/page/software/
Kiani, A., Akbarzadeh, M. R., 2006. Automatic Text
Summarization Using Hybrid Fuzzy GA-GP. In IEEE
International Conference on Fuzzy Systems, pp. 5465-
5471.
Kupiec, J., Pedersen, J., Chen, F., 1995. A Trainable
Document Summarizer. In ACM-SIGIR.
Li, J., Sun, L., Kit, C., Webster, J., 2007. A Query-
Focused Multi-Document Summarizer Based on
Lexical Chains. In Document Understanding
Conference.
LingPipe. http://alias-i.com/lingpipe/
Mani, I., 2001. Automatic Summarization. John
Benjamins, Amsterdam.
Mani, I., Bloedorn, E., 1998. Machine Learning of
Generic and User-Focused Summarization. In 15th
National Conference on Artificial Intelligence, pp.
821-826.
Paice, C., Jones, P., 1993. The Identification of Important
Concepts in Highly Structured Technical Papers. In
ACM-SIGIR.
Silber, H. G., McCoy, K. F., 2000. Efficient Text
Summarization Using Lexical Chains. In 5th
International Conference on Intelligent User
Interfaces, pp. 252-255.
ICAART 2012 - International Conference on Agents and Artificial Intelligence
600