have applied the constraint programming models to
the MSNBC database from the UCI repository, hav-
ing successful results.
Our models where simple enough to work with
academic examples, but further experimentation is re-
quired, focussing on escalation (both: length of trans-
action and database size). In this sense, our research
is directed towards the use of reified constraints that
optimize the constraint programming model.
Note that we have presented two independent
models: one for mining item repetitions within item-
sets and another one or mining the itemsets with or-
der relations. The combination of both can lead us
into a complete sequence patter mining. Once the
combination is done, this model will be equiparable
to classical algorithms used nowadays like prefixspan
or clospan with the advantage of being declarative and
easily extensible.
ACKNOWLEDGEMENTS
This work is supported by the research projects
CTQ2008-06865-C02-02 and DPI2011-24929, and
the grant FPU-AP2009-2831, all of them funded
by the Spanish Government. The MSNBC dataset
has been extracted from the UCI Machine Reposi-
tory (Frank and Asuncion, 2010).
REFERENCES
(2012). Frequent itemset mining dataset repository. http://
fimi.ua.ac.be/data/.
Agrawal, R. and Srikant, R. (1994). Fast algorithms for
mining association rules, volume 1215, pages 487–
499. Morgan Kaufmann.
Agrawal, R. and Srikant, R. (1995). Mining sequential pat-
terns. In Proceedings of the Eleventh International
Conference on Data Engineering, ICDE ’95, pages 3–
14, Washington, DC, USA. IEEE Computer Society.
Bonchi, F., Giannotti, F., Lucchese, C., Orlando, S., Perego,
R., and Trasarti, R. (2009). A constraint-based query-
ing system for exploratory pattern discovery. Informa-
tion Systems, 34(1):3–27.
Bonchi, F. and Lucchese, C. (2007). Soft constraint-based
pattern mining. Data Knowl. Eng., 60:377–399.
Brand, M. (1998). Pattern discovery via entropy minimiza-
tion. Technical report, MERL - A Mitsubishi Electric
Research Laboratory.
Burke, R. (1999). The wasabi personal shopper: a case-
based recommender system. In Proceedings of the
sixteenth national conference on Artificial intelligence
and the eleventh Innovative applications of artificial
intelligence conference innovative applications of ar-
tificial intelligence, AAAI ’99/IAAI ’99, pages 844–
849, Menlo Park, CA, USA. American Association
for Artificial Intelligence.
Cadez, I., Heckerman, D., Meek, C., Smyth, P., and White,
S. (2000). Visualization of navigation patterns on a
web site using model-based clustering. In Proceed-
ings of the sixth ACM SIGKDD international confer-
ence on Knowledge discovery and data mining, KDD
’00, pages 280–284, New York, NY, USA. ACM.
David, J. and Nourine, L. (Submitted to Theoretical
Computer Science). A generic algorithm for
sequence mining. 15 pages. http://www-lipn.univ-
paris13.fr/ david/old.version/articles/generation.pdf
[Accessed: 9.2.2012].
De Raedt, L., Guns, T., and Nijssen, S. (2008). Con-
straint programming for itemset mining. In Proceed-
ings of the 14th ACM SIGKDD international confer-
ence on Knowledge discovery and data mining, KDD
’08, pages 204–212, New York, NY, USA. ACM.
De Raedt, L., Guns, T., and Nijssen, S. (2010). Constraint
programming for data mining and machine learning.
In AAAI, pages 1671–1675.
De Raedt, L. and Kramer, S. (2001). The levelwise ver-
sion space algorithm and its application to molecular
fragment finding. In Proceedings of the 17th inter-
national joint conference on Artificial intelligence -
Volume 2, pages 853–859, San Francisco, CA, USA.
Morgan Kaufmann Publishers Inc.
Frank, A. and Asuncion, A. (2010). UCI machine learning
repository. http://archive.ics.uci.edu/ml.
Frisch, A., Harvey, W., Jefferson, C., Martinez-Hernandez,
B., and Miguel, I. (2008). Essence: A constraint lan-
guage for specifying combinatorial problems. Con-
straints, 13:268–306. 10.1007/s10601-008-9047-y.
Gal, Y., Reddy, S., Shieber, S. M., Rubin, A., and Grosz,
B. J. (2012). Plan recognition in exploratory domains.
Artificial Intelligence, 176(1):2270 – 2290.
Guns, T., Nijssen, S., and Raedt, L. D. (2011). Itemset min-
ing: A constraint programming perspective. Artificial
Intelligence, 175(12-13):1951 – 1983.
Han, J., Cheng, H., Xin, D., and Yao, X. (2007). Frequent
pattern mining: current status and future directions.
Data Min Knowl Disc, 15:55–86.
Nethercote, N., Stuckey, P. J., Becket, R., Brand, S., Duck,
G. J., and Tack, G. (2007). Minizinc: towards a stan-
dard cp modelling language. In Proceedings of the
13th international conference on Principles and prac-
tice of constraint programming, CP’07, pages 529–
543, Berlin, Heidelberg. Springer-Verlag.
Schmidt, C., Sridharan, N., and Goodson, J. (1978). The
plan recognition problem: An intersection of psychol-
ogy and artificial intelligence. Artificial Intelligence,
11(1-2):45 – 83. Applications to the Sciences and
Medicine.
Soulet, A., Kl
´
ema, J., and Cremilleux, B. (2006). Effi-
cient mining under flexible constraints through sev-
eral datasets. In D
ˇ
zeroski, S. and Struyf, J., editors,
Proceedings of 5th International Workshop on Knowl-
edge Discovery in Inductive Databases, pages 131–
142, Berlin, Germany. Humbolt Universit
¨
at Berlin,
2006, s.
Constraint-programmingApproachforMultisetandSequenceMining
219