Authors:
Chiheb-Eddine Ben N'Cir
1
and
Nadia Essoussi
2
Affiliations:
1
ISG of Tunis and University of Tunis, Tunisia
;
2
FSEG Nabeul and University of Carthage, Tunisia
Keyword(s):
Overlapping Clustering, Multi-labels, Non disjoint Clusters, Additive Clustering.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Business Analytics
;
Clustering and Classification Methods
;
Computational Intelligence
;
Data Analytics
;
Data Engineering
;
Evolutionary Computing
;
Knowledge Discovery and Information Retrieval
;
Knowledge-Based Systems
;
Machine Learning
;
Soft Computing
;
Symbolic Systems
Abstract:
Clustering is an unsupervised learning technique which aims to fit structures for unlabeled data sets. Identifying non disjoint groups is an important issue in clustering. This issue arises naturally because many real life applications need to assign each observation to one or several clusters. To deal with this problem, recent proposed methods are based on theoretical, rather than heuristic, model and introduce overlaps in their optimized criteria. In order to model overlaps between clusters, some of these methods use the average of clusters’ prototypes while other methods are based on the sum of clusters’ prototypes. The use of SUM or AVERAGE can have significant impact on the theoretical validity of the method and affects induced patterns. Therefore, we study in this paper patterns induced by these approaches through the comparison of patterns induced by Overlapping k-means (OKM) and Alternating Least Square (ALS) methods which generalize k-means for overlapping clustering and are
based on AVERAGE and SUM approaches respectively.
(More)