As future work we will explore some approaches
that aim to improve the labels through a general-
ization process. We want to explore the impact of
generic labels on P and RF to analyze if the results
of the labeling methods can be improved. From this
generalization process we intend to discover a topic
for each cluster considering the context given by the
user through ontology. Given, for example, the labels
“rice”, “bean” and “salad”, the topic could be food
or lunch, depending on the knowledge codified in the
ontology.
ACKNOWLEDGEMENTS
We wish to thank Fundac¸˜ao de Amparo `a Pesquisa do
Estado de S˜ao Paulo (FAPESP) (processes numbers:
2010/07879-0 and 2011/19850-9) and Fundac¸˜ao para
o Desenvolvimento da Unesp (FUNDUNESP) for the
financial support.
REFERENCES
Agrawal, R. and Srikant, R. (1994). Fast algorithms for
mining association rules. In Bocca, J. B., Jarke, M.,
and Zaniolo, C., editors, Proceedings of the 20th Inter-
national Conference on Very Large Data Bases, VLDB
1994, pages 487–499.
Carvalho, V. O., Santos, F. F., and Rezende, S. O. (2011).
Post-processing association rules with clustering and
objective measures. In Proceedings of the 13th Inter-
national Conference on Enterprise Information Sys-
tems, volume 1, pages 54–63.
Carvalho, V. O., Santos, F. F., Rezende, S. O., and Padua,
R. (2012). PAR-COM: A new methodology for post-
processing association rules. Lecture Notes in Busi-
ness Information Processing, 102. In press. Available
due May 19.
Cutting, D. R., Karger, D. R., Pedersen, J. O., and Tukey,
J. W. (1992). Scatter/gather: a cluster-based approach
to browsing large document collections. In Proceed-
ings of the 15th Annual International ACM SIGIR
Conference on Research and Development in Infor-
mation Retrieval, pages 318–329.
Fung, B. C. M., Wang, K., and Ester, M. (2003). Hier-
archical document clustering using frequent itemsets.
In Proceedings of the 3rd SIAM International Confer-
ence on Data Mining, pages 59–70.
Glover, E. J., Pennock, D. M., Lawrence, S., and Krovetz,
R. (2002). Inferring hierarchical descriptions. In
Proceedings of the 11th International Conference on
Information and Knowledge Management, pages 507–
514.
Jorge, A. (2004). Hierarchical clustering for thematic
browsing and summarization of large sets of associ-
ation rules. In Proceedings of the 4th SIAM Interna-
tional Conference on Data Mining. 10p.
Kashyap, V., Ramakrishnan, C., Thomas, C., and Sheth, A.
(2005). Taxaminer: An experimentation framework
for automated taxonomy bootstrapping. International
Journal of Web and Grid Services, 1(2):240–266.
Larsen, B. and Aone, C. (1999). Fast and effective text
mining using linear-time document clustering. In Pro-
ceedings of the 5th ACM SIGKDD International Con-
ference on Knowledge Discovery and Data Mining,
pages 16–22.
Lopes, A. A., Pinho, R., Paulovich, F. V., and Minghim,
R. (2007). Visual text mining using association rules.
Computers & Graphics, 31(3):316–326.
Manning, C. D., Raghavan, P., and Sch¨utze, H. (2009).
An Introduction to Information Retrieval. Cambridge
University Press. 544p.
Moura, M. F. and Rezende, S. O. (2010). A simple method
for labeling hierarchical document clusters. In Pro-
ceedings of the 10th IASTED International Confer-
ence on Artificial Intelligence and Applications, pages
336–371.
Natarajan, R. and Shekar, B. (2005). Interestingness of as-
sociation rules in data mining: Issues relevant to e-
commerce. S
¯
ADHAN
¯
A – Academy Proceedings in En-
gineering Sciences (The Indian Academy of Sciences),
30(Parts 2&3):291–310.
Popescul, A. and Ungar, L. (2000). Automatic label-
ing of document clusters. Unpublished manuscript.
http://www.cis.upenn.edu/∼popescul/Publications/po
pescul00labeling.pdf.
Reynolds, A. P., Richards, G., de la Iglesia, B., and
Rayward-Smith, V. J. (2006). Clustering rules: A
comparison of partitioning and hierarchical clustering
algorithms. Journal of Mathematical Modelling and
Algorithms, 5(4):475–504.
Sahar, S. (2002). Exploring interestingness through cluster-
ing: A framework. In Proceedings of the IEEE Inter-
national Conference on Data Mining, pages 677–680.
Toivonen, H., Klemettinen, M., Ronkainen, P., H¨at¨onen,
K., and Mannila, H. (1995). Pruning and grouping
discovered association rules. In Workshop Notes of
the ECML Workshop on Statistics, Machine Learning,
and Knowledge Discovery in Databases, pages 47–52.
Zhao, Y., Zhang, C., and Cao, L. (2009). Post-mining of
association rules: Techniques for effective knowledge
extraction. Information Science Reference. 372p.
LabelingMethodsforAssociationRuleClustering
111