Parallel Classification System based on an Ensemble of Mixture of Experts

Benjamín Moreno-Montiel, René MacKinney-Romero

2014

Abstract

The classification of large amounts of data is a challenging problem that only a small number of classification algorithms can handle. In this paper we propose a Parallel Classification System based on an Ensemble of Mixture of Experts (PCEM). The system uses MIMD (Multiple Instruction and Multiple Data Stream) architecture, using a set of process that communicates via messages. PCEM is implemented using parallel schemes of traditional classifiers, for the mixture of experts, and using a parallel version of a Genetic Algorithm to implement a voting weighted criterion. The PCEM is a novel algorithm since it allows us to classify large amounts of data with low execution times and high performance measures, which makes it an excellent tool for in classification of large amounts of data. A series of tests were performed with well known databases that allowed us to measure how PCEM performs with many datasets and how well it does compared with other systems available.

References

  1. Graf, H. P. et al. (2005). Parallel support vector machines: The cascade svm. In Advances in Neural Information Processing Systems, pages 521-528.
  2. Houser, D. and Xiao, E. (2011). Classification of natural language messages using a coordination game. Experimental Economics, 14:1-14.
  3. Levchenko, K. et al. (2011). Click trajectories: Endtoend analysis of the spam value chain. in Proceedings of the IEEE Symposium and Security and Privacy.
  4. Menahem, E., Rokach, L., and Elovici, Y. (2009). Troika - an improved stacking schema for classification tasks. Inf. Sci., 179(24):4097-4122.
  5. Miller, D. J. and Uyar, H. S. (1997). A mixture of experts classifier with learning based on both labeled and unlabeled data. Neural Information Processing Systems, 9:571-577.
  6. Moreno-Montiel, B. and MacKinney-Romero, R. (2011). A hybrid classifier with genetic weighting. in Proceedings of the Sixth International Conference on Software and Data Technologies, 2:359-364.
  7. Moreno-Montiel, B. and MacKinney-Romero, R. (2013). Paraltabs: A parallel scheme of decision tables. Mexican International Conference on Computer Science.
  8. Moreno-Montiel, B. and Moreno-Montiel, C. H. (2013). Prediction system of larynx cancer. in Proceedings of the The Fourth International Conference on Computational Logics, Algebras, Programming, Tools, and Benchmarking Computation Tools 2013, 2:23-30.
  9. Peralta, R. et al. (2010). Increased expression of cellular retinol-binding protein 1 in laryngeal squamous cell carcinoma. Journal of Cancer Research and Clinical Oncology, 136:931-938.
  10. Polikar, R. (2006). Ensemble based systems in decision making. IEEE Circuits and Systems Mag., 6:21-45.
  11. Rauber, T. (2010). Parallel programming: for multicore and cluster systems. Springer, 1st Edition.
  12. Serhat, O. and Yilmaz, A. (2002). Classification and prediction in a data mining application. Journal of Marmara for Pure and Applied Sciences, 18:159-174.
  13. Sun, S. (2010). Local within-class accuracies for weighting individual outputs in multiple classifier systems. Pattern Recognition Letters, 31(2):119 - 124.
  14. Sun, S. and Zhang, C. (2007). The selective random subspace predictor for traffic flow forecasting. Intelligent Transportation Systems, IEEE Transactions on, 8(2):367-373.
  15. Sun, S., Zhang, C., and Lu, Y. (2008). The random electrode selection ensemble for fEEGg signal classification. Pattern Recognition, 41(5):1663 - 1675.
  16. Wu, X. et al. (2009). Top 10 algorithms in data mining. Knowledge and Information Systems, 14:1-37.
  17. Zhang, Y. et al. (2006). The study of parallel k-means algorithm. Proceedings of the 6th World Congress on Intelligent Control and Automation, pages 241-259.
Download


Paper Citation


in Harvard Style

Moreno-Montiel B. and MacKinney-Romero R. (2014). Parallel Classification System based on an Ensemble of Mixture of Experts . In Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-018-5, pages 271-278. DOI: 10.5220/0004828902710278


in Bibtex Style

@conference{icpram14,
author={Benjamín Moreno-Montiel and René MacKinney-Romero},
title={Parallel Classification System based on an Ensemble of Mixture of Experts},
booktitle={Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2014},
pages={271-278},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004828902710278},
isbn={978-989-758-018-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Parallel Classification System based on an Ensemble of Mixture of Experts
SN - 978-989-758-018-5
AU - Moreno-Montiel B.
AU - MacKinney-Romero R.
PY - 2014
SP - 271
EP - 278
DO - 10.5220/0004828902710278