much more complex algorithms such as neural
networks or support vector machine. From the results
of this research, it can be concluded that parameter
optimized k-NN combine with genetic algorithms as
feature selection is superior when compared to other
feature selection algorithms on five benchmarked
medical datasets.
5 CONCLUSIONS
Genetic algorithms are applied to select features and
optimizing k parameter for k-nearest neighbors to
improve accuracy of five benchmarked medical
datasets. Proposed method is proven effective to be
able improve accuracy, and furthermore the different
test results among five datasets produce significant
difference.
Comparison of the feature selection algorithms are
proposed to compare the accuracy of the results
among genetic algorithms, forward selection,
backward elimination and greedy feature selection.
Genetic algorithms are proven to have the highest
accuracy compared with any others feature selection
algorithms.
In this research, in general, genetic algorithms
applied to select features and optimizing parameters
to improve accuracy of five benchmarked medical
datasets. In further research, some things can be
applied to enhance the research, which uses other
algorithms for parameter optimizing or other methods
to reduce dimensionality of medical datasets.
ACKNOWLEDGEMENTS
This research is supported by The Ministries of
Research, Technology, And Higher Education of
Republic Indonesia
REFERENCES
Abe, S., 2005. Modified Backward Feature Selection by
Cross Validation. Bruges, European Symposium on
Artificial Neural Networks, pp. 163-168.
Abe, S., 2010. Support Vector Machine for Pattern
Classification. Second Edition ed. New York: Springer
London.
Amato, F. et al., 2013. Artificial neural networks in medical
diagnosis. Journal of Applied Biomedicine, 11(2), pp.
47-58.
Antal, B. & Hajdu, A., 2014. An ensemble-based system
for automatic screening of diabetic retinopathy.
Knowledge-Based Systems, Volume 60, pp. 20-27.
Ayres-de-campos, D. et al., 2000. SisPorto 2.0: A Program
for Automated Analysis of Cardiotocograms. The
Journal of Maternal-Fetal Medicine, Volume 9, pp.
311-318.
Babu, G. S. & Suresh, S., 2013. Meta-cognitive RBF
network and its projection based learning algorithm for
classification problems. Applied Soft Computing
Journal, 13(1), pp. 654-666.
Bharti, K. K. & Singh, P. K., 2014. A three-stage
unsupervised dimension reduction method for text
clustering. Journal of Computational Science, 5(2), pp.
156-169.
Blanchet, F. G., Legendre, P. & Borcard, D., 2008. Forward
Selection of Explanatory Variables. Ecology, 89(9), pp.
2623-2632.
Brameier, M. & Banzhaf, W., 2001. A comparison of linear
genetic programming and neural networks in medical
data mining. IEEE Transactions on Evolutionary
Computation , 5(1), pp. 17-26.
Chang, P.-C., Lin, J.-J. & Liu, C.-H., 2012. An attribute
weight assignment and particle swarm optimization
algorithm for medical database classifications.
Computer Methods and Programs in Biomedicine,
107(3), pp. 382-392.
Derksen, S. & Keselman, H. J., 1992. Backward, Forward
and Stepwise Automated Subset Selection Algorithms.
British Journal of Mathematical and Statistical
Psychology, Volume 45, pp. 265-282.
Dyer, E. L., Sankaranarayanan, A. C. & Baraniuk, R. G.,
2013. Greedy Feature Selection for Subspace
Clustering. Journal of Machine Learning Research,
Volume 14, pp. 2487-2517.
Farahat, A. K., Ghodsi, A. & Kamel, M. S., 2013. Efficient
Greedy Feature Selection for Unsupervised Learning.
Knowledge Information System, Volume 35, pp. 285-
310.
Gorunescu, F., 2011. Data Mining. Berlin: Springer.
Gorunescu, F., 2011. Data Mining: Concepts, Models, and
Techniques. Verlag Berlin Heidelberg: Springer.
Guyon, I. & Elisseeff, A., 2003. An Introduction to
Variable and Feature Selection. Journal of Machine
Learning Research, Volume 3, pp. 1157-1182.
Han, J. & Kamber, M., 2007. Data Mining: Concepts and
Techniques: 2nd (second) Edition. Amsterdam:
Elsevier Science.
Han, J., Kamber, M. & Pei, J., 2012. Data Mining Concepts
and Techniques. San Fransisco: Morgan Kauffman.
Harrington, P., 2012. Machine Learning in Action. New
York: Manning Publication.
Holland, J. H., 1975. Adaption in Natural and Artificial
Systems. Cambridge: MIT Press.
Inbarani, H. H., Azar, A. T. & Jothi, G., 2014. Supervised
hybrid feature selection based on PSO and rough sets
for medical diagnosis. Computer Methods and
Programs in Biomedicine, 113(1), pp. 175-185.
Jabbar, M. A., Deekshatulu, B. L. & Chandra, P., 2013.
Classification of Heart Disease Using K- Nearest
Features Selection and k-NN Parameters Optimization based on Genetic Algorithm for Medical Datasets Classification