rules managing to fit fuzzy logic rules and verifying
that equivalent results are obtained. In Figure 3, the
interface for the term weight generator is shown.
4 RESULTS
As we did in (Ropero et al., 2007) tests are based on
the use of standard-questions as user consultations.
The first goal of these tests is to check that the
system makes a correct identification of standard-
questions with an index of certainty higher than 0.7.
The use of fuzzy logic makes it possible to identify
not only the corresponding standard-question but
others as well. This is related to recall, though it
does not match that exact definition (Ruiz &
Srinivasan, 1998). The second goal is to check if the
required standard-question is among the three
answers with higher degree of certainty. These three
answers should be presented to the user, with the
correct one among these three options. This is
related to precision. The obtained results are shown
in Table 2.
Table 2: Obtained Results.
Type of system First
answer
Among
the first
three
answers
Out of
the first
three
answers
Failed
answer
Intuitive Term
Weighting
77 %
20.5 %
1 %
1.5 %
Automatic Term
Weighting
79 %
17 %
2 %
2 %
Comparative tests between the results obtained
with the fuzzy logic engine and the ones proposed
by the Knowledge Engineer System are very
satisfactory as it is observed that rules fit correctly to
produce a few functionally equal coefficients to the
wished ones.
Besides, there are two important advantages for
the new method: on the one hand, term weighting is
automatic; on the second hand, the level of required
expertise is much lower, as there is no need for an
operator to know very much about the way fuzzy
logic engine works, but only to know how many
times a keyword appears in every set and the answer
to some simple questions – Does a keyword
undoubtedly define an object by itself? Is a keyword
tied to another one? - .
5 CONCLUSIONS
As said above, comparative tests between the results
obtained with the fuzzy logic engine and the ones
obtained by Knowledge Engineer System’s expertise
are very similar. Taking into account that there are
two fundamental advantages with our new method -
automation and less level of expertise required – we
must conclude that the method is suitable for Term
Weighting in Information Retrieval. This method
will be used for the design of a Web Intelligent
Agent which will soon start to work for the
University of Seville web page.
ACKNOWLEDGEMENTS
The work described in this paper has been supported
by the Spanish Ministry of Education and Science
(MEC: Ministerio de Educación y Ciencia) through
project reference number DPI2006-15467-C02-02.
REFERENCES
Aronson, A.R, Rindflesch, T.C, Browne, A. C., 1994.
Exploiting a large thesaurus for information retrieval.
Proceedings of RIAO, pp. 197-216.
Lee, D.L., Chuang, H., Seamons, K., 1997. Document
ranking and the vector-space model. IEEE Software,
Vol. 14, Issue 2, p. 67 – 75.
Liu, S., Dong, M., Zhang, H., Li, R. Shi, Z., 2001. An
approach of multi-hierarchy text classification
Proceedings of the International Conferences on Info-
tech and Info-net, 2001. Beijing. Vol 3, pp. 95 – 100.
Lu, M., Hu, K., Wu, Y., Lu, Y., Zhou, L., 2002. SECTCS:
towards improving VSM and Naive Bayesian
classifier. IEEE International Conference on Systems,
Man and Cybernetics, Vol. 5, p. 5.
Ropero J., Gomez, A., Leon, C., Carrasco, A. 2007.
Information Extraction in a Set of Knowledge Using a
Fuzzy Logic Based Intelligent Agent. Lecture Notes in
Computer Science. Vol. 4707, pp. 811-820.
Ruiz, M.E., Srinivasan, P., 1998. Automatic Text
Categorization Using Neural Networks. Advances in
Classification Research vol. 8: Proceedings of the 8th
ASIS SIG/CR Classification Research Workshop. Ed.
Efthimis Efthimiadis. Information Today,
Medford:New Jersey. 1998. pp 59-72.
Xu, J., Wang, Z. 2003. TCBLSA: A new method of text
clustering. International Conference on Machine
Learning and Cybernetics. Vol. 1, pp. 63-66.
Zhao, Y., Karypis, G., 2002. Improving precategorized
collection retrieval by using supervised term
weighting schemes. Proceedings of the International
Conference on Information Technology: Coding and
Computing, 2002. pp 16 – 21.
A NOVEL TERM WEIGHTING SCHEME FOR A FUZZY LOGIC BASED INTELLIGENT WEB AGENT
499