without demanding manual updating on the source
code. The ArchCollect software current version
presents a significant improvement to the interaction
pattern and to the monitored application markup
language implementation, when compared to the
previous version (Lima, 2003).
A direct interaction acquisition mechanism can
generate fast and accurate n-dimensional
information cube used as a collaborative filtering
algorithm input, to establish some patterns based on
usage, layout, content and performance focus. None
of the related works collect the element page
directly, what compromises the accuracy, making
future preprocessing necessary.
The acquisition mechanism compatibility to the
Netscape© browser is not complete. It cannot
capture the end of the session in Netscape©
browsers, because these assume that regions b and c
form a single region. For this browser type only the
interactions in region b are captured.
A possible limitation, not only to the ArchCollect
acquisition mechanism, but to every Web usage
mining tool that collects data from the user, is the
risk of the user disabling the monitoring in his/her
browser. That is valid for tools that use cookies, Java
applets or any plugin to be installed.
The collecting mechanism can be extended to
collect sounds and facial images produced by users,
and also to collect defined interactions as XML
elements/metatags from distributed systems.
Finally, it is necessary to extend the acquisition
mechanism compatibility, which is so far limited to
the Internet Explorer© and Netscape© browsers.
REFERENCES
Etzioni, O., 1999. The world wide web: Quaqmire or gold
mine. Communications of the ACM, 39(11):65-68.
Shahabi, C., Banaei-Kashani, F., Faruque, J., 2001. A
reliable, eficient, and scalable system for web usage
data acquisition. WebKDD'01 Workshop, ACM-
SIGKDD 2001, São Francisco, CA.
Spiliopoulou, M., 2000. Web usage mining for site
evolution: Making a site better fit its users. Special
Section of the Communications of ACM on
“Personalization Technologies with Data Mining”,
43(8):127-134, August, 2000.
Perkowitz, M., and Etzioni, O., 2000. Toward adaptive
Web sites: conceptual framework and case study.
Artificial Intelligence 118, p.p245-275, 2000.
Buchner, A.G., and Mulvenna, M.D., 1998. Discovering
Internet Marketing Intelligence through Online
Analytical Web Usage Mining. ACM SIGMOD
Record, ISSN 0163-5808, Vol. 27, No.4, p.p 54-61,
1998.
Sarwar, B.M., Karypis, G., Kostan, J.A., and Riedl, R.,
2000. Analysis of Recommender Algorithms for E-
Commerce. ACM E-Commerce’00 Conference.
October, 2000.
Srivastava, J., Cooley, R., Deshpande, M., Tan, P., 2000.
Web usage minig : Discovery and applications of
usage patterns from web data, SIGKDD, January,
2000.
Cooley, R., Tan, P., Srivastava, J., 2000. Discovery of
Interesting Usage Patterns from Web Data. Advances
in Web Usage Analysis and User Profiling, Lecture
Notes in Computer Science, Vol. 1836, Springer-
Verlag, 2000.
Gomory, S., Hoch, R., Lee, J., Poldlaseck, M., Schonberg,
E., 1999. Ecommerce Intelligence : Measuring,
Analyzing, and Reporting on Merchandising
Effectiveness of Online Stores, IBM Watson Research
Center.
Wu, K., Yu, P.S., and Ballman, A., 1999. SpeedTracer: A
web usage mining and analysis tool. IBM Systems
Journal, 37(1), 1999.
Zaiane, O.R., Xin, M., Han, J., 1999. Discovering Web
Access Patterns and Trends by Applying OLAP and
Data Mining Technology on Web Logs, Proc. of
Advances in Digital Libraries Conference, 1999.
Ackerman M. D., et al. 1997. “Learning Probabilistic user
profiles: Applications to finding interesting Web sites,
notifying users of relevant changes to the Web pages,
and locating grant opportunities”. AI Magazine 18(2)
47-56, 1997.
Lieberman H., 1995. “ Letizia: An agent that assists Web
browsing”. Proceedings of the international joint
conference on Artificial Intelligence, Montreal,
August 1995.
Alladvantage - http://www.alladvantage.com
NetZero - http://www.netzero.com
Lee, J., Lee, H.S., Wang, P., 2001. Design and
Implementation of a Visual Online Product Catalog
Interface. ICEIS (2) 2001: 1010-1017.
Ghani, R., Fano, A., 2002. Towards Semantic Data
Mining: Creating and Using a Knowledge Base of
Product Semantics, KDD 2002, Edmonton, Canada.
Menascé, Daniel A. & Almeida, Virgilio A.F., "Capacity
Planning for WEB Performance - Metrics, Models &
Methods", Prentice Hall, PTR, 1998.
Lima J.C., Carneiro T.G.S., Pagliares R.M., et. al, 2003.
ArchCollect: A set of Components directed towards
web users' interaction, ICEIS 2003 Conference,
Angers - France.
Lima J.C., Carneiro T.G.S., Esmin, A.A.A., 2003.
ArchCollect: concepts of na architecture that offers
services for analyzing and understanding Web users’
interactions. Conexão Ciência Magazine– Year I, n°
01, pp. 39-46 - Formiga – Brazil – 2003.
Kimball, R., 2002. Data Webhouse Toolkit. Editora
Campus.
ICEIS 2004 - SOFTWARE AGENTS AND INTERNET COMPUTING
262