VISUAL SVM

François Poulet

2005

Abstract

We present a cooperative approach using both Support Vector Machine (SVM) algorithms and visualization methods. SVM are widely used today and often give high quality results, but they are used as "black-box", (it is very difficult to explain the obtained results) and cannot treat easily very large datasets. We have developed graphical methods to help the user to evaluate and explain the SVM results. The first method is a graphical representation of the separating frontier quality (it is presented for the SVM case, but can be used for any other boundary like decision tree cuts, regression lines, etc). Then it is linked with other graphical methods to help the user explaining SVM results. The information provided by these graphical methods can also be used in the SVM parameter tuning stage. These graphical methods are then used together with automatic algorithms to deal with very large datasets on standard personal computers. We present an evaluation of our approach with the UCI and the Kent Ridge Bio-medical data sets.

References

  1. Becker R., Cleveland W. and Wilks A., 1987. Dynamics Graphics for Data Analysis, Statistical Science, 2:355- 395.
  2. Blake C. and Merz C., 1998. UCI Repository of Machine Learning Databases. http://www.ics.uci.edu/mlearn/ML-Repository.html.
  3. Caragea, D., Cook, D. and Honavar, V., 2003. Towards Simple, Easy-to-Understand, yet Accurate Classifiers, in proc. of VDM@ICDM'03, the 3rd Int. Workshop on Visual Data Mining, Melbourne, USA, pp. 19-31.
  4. Collobert, R., Bengio, S. and Bengio, Y., 2002. A parallel Mixture of SVMs for Very Large Scale Problems, in proc. of Advances in Neural Information Processing Systems, NIPS'02, Vol. 14, MIT Press, pp. 633-640.
  5. Cristianini, N. and Shawe-Taylor, J., 2000. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods, Cambridge University Press.
  6. Fayyad U., Piatetsky-Shapiro G., Smyth P., Uthurusamy R., 1996. Advances in Knowledge Discovery and Data Mining, AAAI Press.
  7. Fung, G. and Mangasarian O., 2001. Proximal Support Vector Machine Classifiers, in proc. of the 7th ACM SIGKDD, Int. Conf. on KDD'01, San Francisco, USA, pp. 77-86.
  8. Fung G., Mangasarian O. and Shavlik J., 2002. Knowledge-Based Support Vector Machine Classifiers, in proc. of Neural Information Processing Systems, NIPS'2002, Vancouver.
  9. Fung G. and Mangasarian O., 2004. A Feature Selection Newton Method for Support Vector Machine Classification, Computational Optimization and Applications, 28(2):185-202.
  10. Inselberg A. and Avidan T., 1999. The Automated Multidimensional Detective, in proc. of IEEE Infoviz'99, 112-119.
  11. Jinyan, L. and Huiqing, L., 2002. Kent Ridge Bio-medical Data Set Repository. http://sdmc.lit.org.sg/GEDatasets.
  12. Lee, Y-J. and Mangasarian, O., 2000. RSVM, Reduced Support Vector Machines, Data Mining Institute Technical Report 00-07, Computer Sciences Department, University of Wisconsin, Madison, USA.
  13. Poulet F., 2002. Cooperation between Automatic Algorithms, Interactive Algorithms and Visualization Tools for Visual Data Mining, in proc. VDM@ECML/PKDD'2002, the 2nd Int. Workshop on Visual Data Mining, Helsinki, Finland.
  14. Poulet, F., 2004, Towards Visual Data Mining, in proc. of ICEIS'04, the 6th Int. Conf. on Enterprise Information Systems, Porto, Portugal, Vol. 2, pp. 349-356.
  15. Poulet, F. and Do, T-N., 2004. Mining Very Large Datasets with Support Vector Machine Algorithms, in Enterprise Information Systems V, Camp O., Piattini M. and Hammoudi S. Eds, Kluwer, 177-184.
  16. Poulet F., 2002. FullView: A Visual Data Mining Environment, in International Journal of Image and Graphics, 2(1):127-143.
  17. Shneiderman B., 2002. Inventing Discovery Tools: Combining Information Visualization with Data Mining, in Information Visualization 1(1), 5-12.
  18. Vapnik V., 1995, The Nature of Statistical Learning Theory, Springer-Verlag, New York.
  19. Wong P., 1999. Visual Data Mining, in IEEE Computer Graphics and Applications, 19(5), 20-21.
Download


Paper Citation


in Harvard Style

Poulet F. (2005). VISUAL SVM . In Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 972-8865-19-8, pages 309-314. DOI: 10.5220/0002521003090314


in Bibtex Style

@conference{iceis05,
author={François Poulet},
title={VISUAL SVM},
booktitle={Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2005},
pages={309-314},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002521003090314},
isbn={972-8865-19-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - VISUAL SVM
SN - 972-8865-19-8
AU - Poulet F.
PY - 2005
SP - 309
EP - 314
DO - 10.5220/0002521003090314