Information Fusion for Semi-supervised Cluster Labelings

Huaying Li, Aleksandar Jeremic

Abstract

Clustering analysis is a widely used technique to find hidden patterns of a data set. Combining multiple clustering results into a consensus clustering (cluster ensemble) is a popular and efficient method to improve the quality of clustering analysis. Many algorithms were proposed in the literature and most of which are unsupervised learning techniques. In this paper, we proposed a semi-supervised cluster ensemble algorithm. It is so-called semi-supervised because labels of some data points in the given data set are known or provided by experts. To evaluate the performance of the proposed algorithm, we compare it with other well-known algorithms, such as MCLA and BCE.

References

  1. Bache, K. and Lichman, M. (2013). UCI machine learning repository.
  2. Fred, A. L. and Jain, A. K. (2005). Combining multiple clusterings using evidence accumulation. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 27(6):835-850.
  3. Pickett, J. P. (2006). The American heritage dictionary of the English language. Houghton Mifflin.
  4. Shariff, A., Kangas, J., Coelho, L. P., Quinn, S., and Murphy, R. F. (2010). Automated image analysis for highcontent screening and analysis. Journal of biomolecular screening, 15(7):726-734.
  5. Strehl, A. and Ghosh, J. (2003). Cluster ensemblesa knowledge reuse framework for combining multiple partitions. The Journal of Machine Learning Research, 3:583-617.
  6. Topchy, A. P., Jain, A. K., and Punch, W. F. (2004). A mixture model for clustering ensembles. In SDM, pages 379-390. SIAM.
  7. Vega-Pons, S. and Ruiz-Shulcloper, J. (2011). A survey of clustering ensemble algorithms. International Journal of Pattern Recognition and Artificial Intelligence, 25(03):337-372.
  8. Wang, H., Shan, H., and Banerjee, A. (2011). Bayesian cluster ensembles. Statistical Analysis and Data Mining, 4(1):54-70.
  9. Xu, R. and Wunsch, D. (2008). Clustering, volume 10. John Wiley & Sons.
Download


Paper Citation


in Harvard Style

Li H. and Jeremic A. (2015). Information Fusion for Semi-supervised Cluster Labelings . In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2015) ISBN 978-989-758-069-7, pages 318-323. DOI: 10.5220/0005282203180323


in Bibtex Style

@conference{biosignals15,
author={Huaying Li and Aleksandar Jeremic},
title={Information Fusion for Semi-supervised Cluster Labelings},
booktitle={Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2015)},
year={2015},
pages={318-323},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005282203180323},
isbn={978-989-758-069-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2015)
TI - Information Fusion for Semi-supervised Cluster Labelings
SN - 978-989-758-069-7
AU - Li H.
AU - Jeremic A.
PY - 2015
SP - 318
EP - 323
DO - 10.5220/0005282203180323