Authors:
F. Jorge F. Duarte
1
;
Ana L. N. Fred
2
;
Fátima Rodrigues
1
;
João M. M. Duarte
1
and
André Lourenço
2
Affiliations:
1
GECAD – Knowledge Engineering and Decision Support Group, Instituto Superior de Engenharia do Porto, Instituto Superior Politécnico, Portugal
;
2
Instituto de Telecomunicações, Instituto Superior Técnico, Portugal
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Artificial Intelligence and Decision Support Systems
;
Biomedical Engineering
;
Business Analytics
;
Data Engineering
;
Data Mining
;
Databases and Information Systems Integration
;
Datamining
;
Enterprise Information Systems
;
Health Information Systems
;
Information Systems Analysis and Specification
;
Knowledge Management
;
Ontologies and the Semantic Web
;
Sensor Networks
;
Signal Processing
;
Society, e-Business and e-Government
;
Soft Computing
;
Verification and Validation of Knowledge-Based Systems
;
Web Information Systems and Technologies
Abstract:
We introduce an approach based on evidence accumulation (EAC) for combining partitions in a clustering ensemble. EAC uses a voting mechanism to produce a co-association matrix based on the pairwise associations obtained from N partitions and where each partition has equal weight in the combination process. By applying a clustering algorithm to this co-association matrix we obtain the final data partition. In this paper we propose a clustering ensemble combination approach that uses subsampling and that weights differently the partitions (WEACS). We use two ways of weighting each partition: SWEACS, using a single validation index, and JWEACS, using a committee of indices. We compare combination results with the EAC technique and the HGPA, MCLA and CSPA methods by Strehl and Gosh using subsampling, and conclude that the WEACS approaches generally obtain better results. As a complementary step to the WEACS approach, we combine all the final data partitions produced by the different vari
ations of the method and use the Ward Link algorithm to obtain the final data partition.
(More)