Efficient Evidence Accumulation Clustering for Large Datasets

Diogo Silva; Helena Aidos; Ana Fred

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Efficient Evidence Accumulation Clustering for Large Datasets

Topics: Clustering; Ensemble Methods; Sparsity

In Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods ICPRAM - Volume 1, 367-374, 2016 , Rome, Italy

Authors: Diogo Silva ¹ ; Helena Aidos ² and Ana Fred ²

Affiliations: ¹ Portuguese Air Force Academy, Portugal ; ² Instituto Superior Técnico, Portugal

Keyword(s): Clustering Methods, EAC, K-Means, MST, GPGPU, CUDA, Sparse Matrices, Single-Link.

Related Ontology Subjects/Areas/Topics: Clustering ; Ensemble Methods ; Pattern Recognition ; Sparsity ; Theory and Methods

Abstract: The unprecedented collection and storage of data in electronic format has given rise to an interest in automated analysis for generation of knowledge and new insights. Cluster analysis is a good candidate since it makes as few assumptions about the data as possible. A vast body of work on clustering methods exist, yet, typically, no single method is able to respond to the specificities of all kinds of data. Evidence Accumulation Clustering (EAC) is a robust state of the art ensemble algorithm that has shown good results. However, this robustness comes with higher computational cost. Currently, its application is slow or restricted to small datasets. The objective of the present work is to scale EAC, allowing its applicability to big datasets, with technology available at a typical workstation. Three approaches for different parts of EAC are presented: a parallel GPU K-Means implementation, a novel strategy to build a sparse CSR matrix specialized to EAC and Single-Link based on Minim um Spanning Trees using an external memory sorting algorithm. Combining these approaches, the application of EAC to much larger datasets than before was accomplished. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 18.225.56.185

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Silva, D., Aidos, H. and Fred, A. (2016). Efficient Evidence Accumulation Clustering for Large Datasets. In Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-173-1; ISSN 2184-4313, SciTePress, pages 367-374. DOI: 10.5220/0005770803670374

@conference{icpram16,
author={Diogo Silva and Helena Aidos and Ana Fred},
title={Efficient Evidence Accumulation Clustering for Large Datasets},
booktitle={Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2016},
pages={367-374},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005770803670374},
isbn={978-989-758-173-1},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Efficient Evidence Accumulation Clustering for Large Datasets
SN - 978-989-758-173-1
IS - 2184-4313
AU - Silva, D.
AU - Aidos, H.
AU - Fred, A.
PY - 2016
SP - 367
EP - 374
DO - 10.5220/0005770803670374
PB - SciTePress