confirm one of them. If there is a picture associated
with any of these sections, it is displayed to get
further confirmation. If the data entered by the user
is still not enough to confirm or rule out a category,
suspected categories are presented to the user with
links to their original section as a reference to the
user.
9 CONCLUSION
The objective of our research is to help Web users to
quickly and easily find an answer to some given
diagnostic question they have from specific
section(s) in some given document set. To achieve
this goal, we have constructed a web mining
technique that can extract information from the web
and create knowledge from it. Our system has been
built in the agricultural domain to extract
information from its related web pages, and to index
the diagnostic sections in it. The constructed index is
used for finding relevant knowledge to answer a user
query.
Our system has three main phases: the
categorization phase, the indexing phase, and the
search phase. The categorization phase has been
tested on a training web pages set, which is a
collection of extension documents. It automatically
generated 100 main categories, 145 sub categories,
and 127 sub-subcategory items. These categories
are used by the indexing component to assign for
each section in an input web page, a category if
possible. The indexing and search phases are still
under construction. Also, there are still some
problems must need to be solved like inheritance
from more than one category, and synonymous
words used in different web pages content.
REFERENCES
Borges, J. and Levene, M., 1999. Data mining of user
navigation patterns, In Web Usage Analysis and User
Profiling, vol. 1836, pp. 92-111.
Chen, H. and Chau, M., 2004. Web Mining: Machine
Learning for Web Applications. In the Annual Review
of Information Science and Technology, vol. 38, pp.
289-329.
Doherty, P., 2000. Web Mining - The E-Tailer's Holy
Grail. In DM Direct.
El-Beltagy, S. R., Rafea, A. and Abdelhamid, Y., 2004.
Using Dynamically Acquired Background Knowledge
For Information Extraction And Intelligent Search. In
M. Mohammadian, (Ed.) Intelligent Agents for Data
Mining and Information Retrieval, Idea Group
Publishing, Hershey, PA, USA, pp. 196-207.
Guan, T. and Wong, K., 1999. KPS: a Web information
mining algorithm. In Proceedings 8th Int. World Wide
Web Conf., Canada, pp. 417-429.
Hsu, J., 2002. Web Mining: A Survey of World Wide
Web Data Mining Research and Applications. In
Decision Sciences Institute Annual Meeting
Proceedings, PP. 753-758.
Kosala, R. and Blockeel, H., 2000. Web Mining Research:
A Survey. In SIGKDD Explorations, vol. 2, no. 1,pp
1-15.
Liu, B., Chin, Ch. W. and Ng, H. T., 2003. Mining Topic-
Specific Concepts and Definitions on the Web, In
Proceedings of the twelfth international World Wide
Web conference (WWW-2003), Budapest, Hungry, pp.
20-24.
Loh, S., Wives, L. K. and de Oliveira, J. P. M., 2000.
Concept-Based Knowledge Discovery. In Texts
Extracted from the Web SIGKDD Explorations, vol. 2,
no. 1, pp. 29-39.
Madria, S.K., Bhowmick, S.S., Ng, W.K. and Lim, E.P.,
1999. Research issues in web data mining. in
Proceedings 1st International Conf. On Data
Warehousing and Knowledge Discovery Florence
Italy, PP. 303-312.
Pal, S., Talwar, V., and Mitra, P., 2002. Web Mining in
Soft Computing Framework: Relevance, State of the
Art and Future Directions. IEEE Trans. on Neural
Networks, 13(5):1163 -1177, 2002.
Rafea, A. and Shaalan, K.,1993. Lexical Analysis of An
Inflected Arabic Word Using Exhaustive Search of an
Augmented Transition Network, In Software Practice
& Experience, vol. 23, no. 6, pp. 567-588.
Scime, A., 2004. Guest Editor's Introduction: Special Issue
on Web Content Mining. In Journal of Intelligent
Information Systems, vol. 22, no. 3, pp. 211-213.
Xu, J., Huang, Y. and Madey, G., 2003. A Research
Support System Framework for Web Data mining
Research. In Workshop on Applications, Products and
Services of Web-based Support Systems at the Joint
International Conference on Web Intelligence (2003
IEEE/WIC) and Intelligent Agent Technology,
Halifax, Canada, October 2003, 37-41.
Zaiane, O. R., 1999. Resource and Knowledge Discovery
from the Internet and Multimedia Repositories, Ph.D.
thesis, Simon Fraser University.
ICEIS 2005 - ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS
308