Authors:
Francisco Guil
1
;
Francisco Palacios
2
;
Manuel Campos
3
and
Roque Marín
3
Affiliations:
1
Computer Science School, University of Almería, Spain
;
2
University Hospital of Getafe, Spain
;
3
Computer Science Faculty, University of Murcia, Spain
Keyword(s):
Temporal data mining, Theory of evidence, Entropy, Specificity.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Biomedical Engineering
;
Business Analytics
;
Cardiovascular Technologies
;
Computing and Telecommunications in Cardiology
;
Data Engineering
;
Data Mining
;
Databases and Information Systems Integration
;
Datamining
;
Enterprise Information Systems
;
Health Engineering and Technology Applications
;
Health Information Systems
;
Medical and Nursing Informatics
;
Sensor Networks
;
Signal Processing
;
Soft Computing
Abstract:
Frequent sequences (or temporal associations) mining is a very important topic within the temporal data mining area. Syntactic simplicity, combined with the dual characteristics (descriptive and predictive) of the mined temporal patterns, allow the extraction of useful knowledge from dynamic domains, which are timevarying in nature. Some of the most representative algorithms for mining sequential patterns or frequent associations are Apriori-like algorithms and, therefore, they cannot handle numeric attributes or items. This peculiarity makes it necessary to add a new process in the data preparation step, the discretization process. An important fact is that, depending on the discretization technique used, the number and type of discovered temporal patterns change dramatically. In this paper, we propose a method based on the Shafer’s Theory of Evidence that uses two information measures proposed by Yager for the quality evaluation of the extracted sets of temporal patterns. From a pr
actical point of view, the main goal is to select, for a given dataset, the best discretization technique that leads to the discovery of useful knowledge. Nevertheless the underlying idea is to propose a formal method for assessing the mined patterns, seen as a belief structure, in terms of certainty in the information that represents. In this work, we also present a practical example, describing an application of this proposal in the Intensive Care Burn Unit domain.
(More)