RESULT COMPARISON OF TWO ROUGH SET BASED DISCRETIZATION ALGORITHMS

Shanchan Wu, Wenyuan Wang

Abstract

The area of knowledge discovery and data mining is growing rapidly. A large number of methods are employed to mine knowledge. Many of the methods rely of discrete data. However, most of the datasets used in real application have attributes with continuous values. To make the data mining techniques useful for such datasets, discretization is performed as a preprocessing step of the data mining. In this paper, we discuss rough set based discretization. We use UCI data sets to do experiments to compare the quality of Local discretization and Global discretization based on rough set. Our experiments show that Global discretization and Local discretization are dataset sensitive. Neither of them is always better than the other, though in some cases Global discretization generates far better results than Local discretization.

References

  1. Pawlak Z (1982, November 5). Rough Sets. Int'l J. Computer & Science [J], 11, 341-356.
  2. Nguyen H S, Skowron A (1995). Quantization of real value attributes. Proceedings of Second Joint Annual Conf. on Information Science, Wrightsville Beach, North Carolina, 34-37.
  3. Nguyen H S (1997). Discretization of Real Value Attributes: Boolean reasoning Approach [PhD Dissertation]. Warsaw University Warsaw, Poland.
  4. Hung Son Nguyen, Sinh Hoa Nguyen (1996). Some efficient algorithms for rough set methods. In 6th International conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, 1451-1456.
  5. Jian-Hua Dai, Yuan-Xiang Li (2002, November 4-5). Study on discretization based on rough set theory. Proceedings of the First International Conference on Machine Learning and Cybernetics, 3, 1371-1373.
Download


Paper Citation


in Harvard Style

Wu S. and Wang W. (2004). RESULT COMPARISON OF TWO ROUGH SET BASED DISCRETIZATION ALGORITHMS . In Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 972-8865-00-7, pages 511-514. DOI: 10.5220/0002611505110514


in Bibtex Style

@conference{iceis04,
author={Shanchan Wu and Wenyuan Wang},
title={RESULT COMPARISON OF TWO ROUGH SET BASED DISCRETIZATION ALGORITHMS},
booktitle={Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2004},
pages={511-514},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002611505110514},
isbn={972-8865-00-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - RESULT COMPARISON OF TWO ROUGH SET BASED DISCRETIZATION ALGORITHMS
SN - 972-8865-00-7
AU - Wu S.
AU - Wang W.
PY - 2004
SP - 511
EP - 514
DO - 10.5220/0002611505110514