Labeling Methods for Association Rule Clustering

Veronica Oliveira de Carvalho, Daniel Savoia Biondi, Fabiano Fernandes dos Santos, Solange Oliveira Rezende

2012

Abstract

Although association mining has been highlighted in the last years, the huge number of rules that are generated hamper its use. To overcome this problem, many post-processing approaches were suggested, such as clustering, which organizes the rules in groups that contain, somehow, similar knowledge. Nevertheless, clustering can aid the user only if good descriptors be associated with each group. This is a relevant issue, since the labels will provide to the user a view of the topics to be explored, helping to guide its search. This is interesting, for example, when the user doesn’t have, a priori, an idea where to start. Thus, the analysis of different labeling methods for association rule clustering is important. Considering the exposed arguments, this paper analyzes some labeling methods through two measures that are proposed. One of them, Precision, measures how much the methods can find labels that represent as accurately as possible the rules contained in its group and Repetition Frequency determines how the labels are distributed along the clusters. As a result, it was possible to identify the methods and the domain organizations with the best performances that can be applied in clusters of association rules.

References

  1. Agrawal, R. and Srikant, R. (1994). Fast algorithms for mining association rules. In Bocca, J. B., Jarke, M., and Zaniolo, C., editors, Proceedings of the 20th International Conference on Very Large Data Bases, VLDB 1994, pages 487-499.
  2. Carvalho, V. O., Santos, F. F., and Rezende, S. O. (2011). Post-processing association rules with clustering and objective measures. In Proceedings of the 13th International Conference on Enterprise Information Systems, volume 1, pages 54-63.
  3. Carvalho, V. O., Santos, F. F., Rezende, S. O., and Padua, R. (2012). PAR-COM: A new methodology for postprocessing association rules. Lecture Notes in Business Information Processing, 102. In press. Available due May 19.
  4. Cutting, D. R., Karger, D. R., Pedersen, J. O., and Tukey, J. W. (1992). Scatter/gather: a cluster-based approach to browsing large document collections. In Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 318-329.
  5. Fung, B. C. M., Wang, K., and Ester, M. (2003). Hierarchical document clustering using frequent itemsets. In Proceedings of the 3rd SIAM International Conference on Data Mining, pages 59-70.
  6. Glover, E. J., Pennock, D. M., Lawrence, S., and Krovetz, R. (2002). Inferring hierarchical descriptions. In Proceedings of the 11th International Conference on Information and Knowledge Management, pages 507- 514.
  7. Jorge, A. (2004). Hierarchical clustering for thematic browsing and summarization of large sets of associ-
Download


Paper Citation


in Harvard Style

Oliveira de Carvalho V., Savoia Biondi D., Fernandes dos Santos F. and Oliveira Rezende S. (2012). Labeling Methods for Association Rule Clustering . In Proceedings of the 14th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8565-10-5, pages 105-111. DOI: 10.5220/0003970001050111


in Bibtex Style

@conference{iceis12,
author={Veronica Oliveira de Carvalho and Daniel Savoia Biondi and Fabiano Fernandes dos Santos and Solange Oliveira Rezende},
title={Labeling Methods for Association Rule Clustering},
booktitle={Proceedings of the 14th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2012},
pages={105-111},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003970001050111},
isbn={978-989-8565-10-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 14th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - Labeling Methods for Association Rule Clustering
SN - 978-989-8565-10-5
AU - Oliveira de Carvalho V.
AU - Savoia Biondi D.
AU - Fernandes dos Santos F.
AU - Oliveira Rezende S.
PY - 2012
SP - 105
EP - 111
DO - 10.5220/0003970001050111