Authors:
Fabio Fassetti
and
Fabrizio Angiulli
Affiliation:
DEIS, University of Calabria, Italy
Keyword(s):
Data mining, Example-based outlier detection, Genetic algorithms.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Biomedical Engineering
;
Biomedical Signal Processing
;
Data Manipulation
;
Data Mining
;
Databases and Information Systems Integration
;
Enterprise Information Systems
;
Health Engineering and Technology Applications
;
Human-Computer Interaction
;
Methodologies and Methods
;
Neurocomputing
;
Neurotechnology, Electronics and Informatics
;
Pattern Recognition
;
Physiological Computing Systems
;
Sensor Networks
;
Signal Processing
;
Soft Computing
Abstract:
In this work an example-based outlier detection method exploiting both positive (that is, outlier) and negative (that is, inlier) examples in order to guide the search for anomalies in an unlabelled data set, is introduced.
The key idea of the method is to find the subspace where positive examples mostly exhibit their outlierness while at the same time negative examples mostly exhibit their inlierness. The degree to which an example is an outlier is measured by means of well-known unsupervised outlier scores evaluated on the collection of unlabelled data.
A subspace discovery algorithm is designed, which searches for the most discriminating subspace. Experimental results show that the method is able to detect a near optimal solution, and that the method is promising from the point of view of the knowledge mined.