MINING PATTERNS IN THE PRESENCE OF DOMAIN KNOWLEDGE

Cláudia Antunes

Abstract

One of the main difficulties of pattern mining is to deal with items of different nature in the same itemset, which can occur in any domain except basket analysis. Indeed, if we consider the analysis of any transactional database composed by several entities and relationships, it is easy to understand that the equality function may be different for each element, which difficult the identification of frequent patterns. This situation is just one example of the need for using domain knowledge to manage the discovery process, but several other, no less important can be enumerated, such the need to consider patterns at higher levels of abstraction or the ability to deal with structured data. In this paper, we show how the Onto4AR framework can be explored to overcome these situations in a natural way, illustrating its use in the analysis of two distinct case studies. In the first one, exploring a cinematographic dataset, we capture patterns that characterize kinds of movies in accordance to the actors present in their casts and their roles. In the second one, identifying molecular fragments, we find structured patterns, including chains, rings and stars.

References

  1. Agrawal, R., Imielinsky, T., and Swami, A. Mining Association Rules between Sets of Items in Large Databases. In Proc. ACM SIGMOD Conf Management of Data. 1993. 207-216
  2. Antunes, C., and Oliveira, A.L., Constraint Relaxations for Discovering Unknown Sequential Patterns. In Knowledge Discovery in Inductive Databases: Third International Workshop, Springer, 2005, 11-32
  3. Antunes, C. Onto4AR: a framework for mining association rules. In Proc. Int'l Workshop on Constraint-Based Mining and Learning, 2007. 37-48
  4. Antunes, C. An ontology-based method for mining frequent patterns. Technical report, Instituto Superior Técnico. 2008.
  5. Bayardo, R.J., The Many Roles of Constraints in Data Mining. In SIGKDD Explorations, vol. 4, nr. 1 pp. i-ii, 2002.
  6. Garofalakis, M.N., Rastogi, R., and Shim, K., SPIRIT: Sequential Pattern Mining with Regular Expression Constraints. In Proc. Very Large Databases Conf. 1999, 223-234
  7. Maedche, A., Ontology Learning for the Semantic Web, Kluwer Academic Publishers, 2002.
  8. Wiederhold, G., Movies Database Documentation, 1989.
Download


Paper Citation


in Harvard Style

Antunes C. (2009). MINING PATTERNS IN THE PRESENCE OF DOMAIN KNOWLEDGE . In Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 978-989-8111-85-2, pages 188-193. DOI: 10.5220/0001995001880193


in Bibtex Style

@conference{iceis09,
author={Cláudia Antunes},
title={MINING PATTERNS IN THE PRESENCE OF DOMAIN KNOWLEDGE},
booktitle={Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2009},
pages={188-193},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001995001880193},
isbn={978-989-8111-85-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - MINING PATTERNS IN THE PRESENCE OF DOMAIN KNOWLEDGE
SN - 978-989-8111-85-2
AU - Antunes C.
PY - 2009
SP - 188
EP - 193
DO - 10.5220/0001995001880193