COMPARISON OF DIFFERENT CLASSIFICATION TECHNIQUES ON PIMA INDIAN DIABETES DATA

Farhana Afroz, Rashedur M. Rahman

Abstract

The development of data-mining applications such as classification and clustering has been applied to large scale data. In this research, we present comparative study of different classification techniques using three data mining tools named WEKA, TANAGRA and MATLAB. The aim of this paper is to analyze the performance of different classification techniques for a set of large data. The algorithm or classifiers tested are Multilayer Perceptron, Bayes Network, J48graft (c4.5), Fuzzy Lattice Reasoning (FLR), NaiveBayes, JRip (RIPPER), Fuzzy Inference System (FIS), Adaptive Neuro-Fuzzy Inference Systems(ANFIS). A fundamental review on the selected technique is presented for introduction purposes. The diabetes data with a total instance of 768 and 9 attributes (8 for input and 1 for output) will be used to test and justify the differences between the classification methods or algorithms. Subsequently, the classification technique that has the potential to significantly improve the common or conventional methods will be suggested for use in large scale data, bioinformatics or other general applications.

References

  1. FLT, 2011. The mathworks - fuzzy logic toolbox, from http://www.mathworks.ch/access/helpdesk r13/help/ toolbox/fuzzy/fuzzy.html
  2. Han J., Kamber, M., 2000. Data Mining Concept and Techniques, Morgan Kaufmann Publishers
  3. John, G. H., Langley, P., 1995. Estimating Continuous Distributions in Bayesian Classifiers. In: Proc. of the 11th Conf. on Uncertainty in Artificial Intelligence.
  4. Jyh, S., Roger, J., 1993.Anfis: Adaptive-network-based fuzzy inference system, IEEE Transactions on Systems, Man, and Cybernetics, vol. 23, pp. 665-685.
  5. Kappa Statistic, 2011. Link http://www.dmi.columbia. edu/homepages/chuangj/kappa.
  6. Nilsson, N. J., 2011. Introduction to Machine Learning, http://ai.stanford.edu/nilsson/mlbook.html.
  7. Quinlan, J., 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo.
  8. UCI machine learning repository, 2011. Link: http://www. ics.uci.edu/mlearn/MLRepository.html
  9. Werbos, P., 1974. Beyond Regression: New Tools for Prediction and Analysis in the Behavioural Sciences, PhD Thesis, Harvard University, 1974.
Download


Paper Citation


in Harvard Style

Afroz F. and M. Rahman R. (2011). COMPARISON OF DIFFERENT CLASSIFICATION TECHNIQUES ON PIMA INDIAN DIABETES DATA . In Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8425-53-9, pages 365-368. DOI: 10.5220/0003496803650368


in Bibtex Style

@conference{iceis11,
author={Farhana Afroz and Rashedur M. Rahman},
title={COMPARISON OF DIFFERENT CLASSIFICATION TECHNIQUES ON PIMA INDIAN DIABETES DATA},
booktitle={Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2011},
pages={365-368},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003496803650368},
isbn={978-989-8425-53-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - COMPARISON OF DIFFERENT CLASSIFICATION TECHNIQUES ON PIMA INDIAN DIABETES DATA
SN - 978-989-8425-53-9
AU - Afroz F.
AU - M. Rahman R.
PY - 2011
SP - 365
EP - 368
DO - 10.5220/0003496803650368