
The proposed extension could also be used for
recommendation purposes. Rules with a high
forward confidence indicate possible future requests.
However, to be useful for recommendation
purposes, it is needed to expand the proposed
concept by allowing more than one item in the
antecedent of the rules.
Market basket analysis was the first application
field of association rules. There was no ordering of
the items within a transaction because all the items
bought were placed in a basket and then finally paid.
It was impossible to detect the order in which the
customer took the products from the shelves.
Nowadays, a large number of supermarkets are
doing experiments with self scanning devices. The
customers scan the code of every product they buy
and when finished shopping they go with the
scanner and products to the counter. At the counter
the payment is done and no clerks are needed to scan
all the products again. However, some random
checks are performed to prevent fraud from the
customers. This method not only reduces the amount
of employees needed or the waiting times but also
creates a lot of worthwhile information. The
marketeers can see how customers wander through
the shop and adopt the layout of the shelves to
improve sales. It can be expected that ordinary
association rules will not suffice to analyse this kind
of sequential data. It is for example impossible to
detect with association rules if customers go mostly
from shelf A to shelf B or vice versa. It can not be
seen why diapers and beer are frequently sold
together. Were diapers the first objective and was
the beer only picked up when passing by? The
proposed extension might prove useful for
answering some of these questions.
4 RELATED RESEARCH
The paper by Agrawal et al.(1995) is probably the
most-cited work concerning sequence analysis. They
assume a database similar to Table 2 but instead of
having time stamps associated with every item, they
associate only one time stamp with every
transaction. Also, their problem of finding sequential
patterns is concerned with finding patterns among
items of multiple transactions, whereas our problem
concerns finding patterns among items within one
transaction. In the successor of their successful
paper (Agrawal et al., 1996), some generalizations to
sequential patterns are made. These generalizations,
such as the introduction of time constraints, are quite
similar to the extension that we proposed but their
problem remains a search for inter-transactional
patterns while our problem concerns the detection of
intra-transaction patterns.
In Mannila et al. (1995), another approach to
analyse sequential data is suggested. Their goal is to
find frequent episodes. These are collections of
events occurring frequently together. The search for
these frequent episodes is performed within an event
sequence. An event sequence can be compared with
one single transaction from Table 2. We on the
contrary are interested in finding patterns that occur
in many transactions.
Many other approaches to find patterns in
sequential data have been suggested. Many of them
differ only in a few aspects. Therefore some
attempts have been made to unify these different
approaches. A detailed discussion is out of the
context of this paper but can be found in Joshi et al.
(1999).
5 CONCLUSION
In this paper, we presented an extension of the basic
association rule framework that is capable of
analyzing sequential data. The concept was tested on
the log data of an online wine shop and it was shown
how it helped filtering out the interesting rules. It
was also suggested to use the proposed extension for
the analysis of data generated by self-scanning
devices. Finally, a short overview of related research
was given.
In future research, the introduced concept will be
used to analyze the logs from other web sites and it
will be expanded to allow more than one item in the
body of the rules.
REFERENCES
Agrawal R., Imielinski T. & Swami A., 1993. Mining
Association Rules between Sets of Items in Massive
Databases. In Proceedings of the ACM SIGMOD
International Conference on Management of Data,
Washington D.C., USA. pp. 207-216.
Agrawal, R., & Srikant, R,. 1995. Mining sequential
patterns. In Proceedings of the 11th International
Conference on Data Engineering. pp. 3-14.
Catledge L. & Pitkow J., 1995. Characterizing Browsing
Strategies in the World-Wide Web. Journal of
Computer Networks and ISDN systems, Volume 27,
nr. 6. pp. 1065-1073.
Cooley R., Mobasher R. & Srivastava J., 1999. Data
Preparation for Mining World Wide Web Browsing
Patterns. Knowledge and Information Systems,
volume 1. pp. 5-32.
WEB USAGE MINING WITH TIME CONSTRAINED ASSOCIATION RULES
347