where the new proposed method using bagging gets
better accuracy in classification with this kind of
datasets. These conclusions have been validated by
applying statistical techniques to analyze the behav-
ior of different methods in the experiments.
Improve the discretization of numerical attributes
in small size datasets is important like previous step to
carry out feature selection in microarrays data which
is a topic of current interest and we want to carry out
the feature selection using a fuzzy ensemble which
needs a partition of the numerical attributes.
ACKNOWLEDGEMENTS
Supported by the project TIN2011-27696-C02-02 of
the Ministry of Economy and Competitiveness of
Spain. Thanks also to “Fundaci
´
on S
´
eneca - Agen-
cia de Ciencia y Tecnolog
´
ıa de la Regi
´
on de Murcia”
(Spain) for the support given to Raquel Mart
´
ınez by
the scholarship program FPI.
REFERENCES
Antonelli, M., Ducange, P., Lazzerini, B., and Marcel-
loni, F. (2011). Learning knowledge bases of multi-
objective evolutionary fuzzy systems by simultane-
ously optimizing accuracy, complexity and partition
integrity. Soft Computing, 15:2335–2354.
Armengol, E. and Garc
´
ıa-Cerdana, A. (2012). Refining dis-
cretizations of continuous-valued attributes. In The
9th International Conference on Modeling Decisions
for Artificial Intelligence, pages 258–269.
Au, W. H., Chan, K. C., and Wong, A. (2006). A fuzzy ap-
proach to partitioning continuous attributes for classi-
fication. IEEE Tran, Knowledge and Data Engineer-
ing, 18(5):715–719.
Bonissone, P. P., Cadenas, J. M., Garrido, M. C., and
D
´
ıaz-Valladares, R. A. (2010). A fuzzy random for-
est. International Journal of Approximate Reasoning ,
51(7):729–747.
Breiman, L. (1996a). Bagging predictors. Maching Learn-
ing, 24(2):123–140.
Breiman, L. (1996b). Heuristics of instability and sta-
bilization in model selection. Annals of Statistics,
24(6):2350–2383.
Cadenas, J. M., Garrido, M. C., Mart
´
ınez, R., and Bonis-
sone, P. P. (2012a). Extending information processing
in a fuzzy random forest ensemble. Soft Computing,
16(5):845–861.
Cadenas, J. M., Garrido, M. C., Mart
´
ınez, R., and Bonis-
sone, P. P. (2012b). Ofp class: a hybrid method to
generate optimized fuzzy partitions for classification.
Soft Computing, 16:667–682.
Cox, E. (2005). Fuzzy Modeling and Genetic Algorithms
for Data Mining and Exploration. Morgan Kaufmann
Publishers, New York, USA.
Der-Chiang, L. and Chiao-Wen, L. (2012). Extending at-
tribute information for small data set classification.
IEEE Transactions on Knowledge and Data Engineer-
ing, 24:452–564.
Der-Chiang, L., Chien-Chih, C., Che-Jung, C., and Wu-
Kuo, L. (2012). A tree-based-trend-diffusion predic-
tion procedure for small sample sets in the early stages
of manufacturing systems. Expert Systems with Appli-
cations, 39:1575–1581.
Diaz-Uriarte, R. and de Andr
´
es, S. A. (2006). Gene selec-
tion and classification of microarray data using ran-
dom forest. BMC Bioinformatics, 7(3).
Dimitrova, E. S. and Vera-Licona, M. P. (2010). Discretiza-
tion of time series data. Journal in Computational Bi-
ology, 17(6):853–868.
Frank, A. and Asuncion, A. (2010). UCI Machine Learning
Repository. University of California, Irvine, School of
Information and Computer Sciences.
Garc
´
ıa, S., Fern
´
andez, A., Luengo, J., and Herrera, F.
(2009). A study statistical techniques and perfor-
mance measures for genetics-based machine learn-
ing: accuracy and interpretability. Soft Computing,
13(10):959–977.
Ihaka, R. and Gentleman, R. (1996). R: A language for data
analysis and graphics. Journal of Computational and
Graphical Statistics, 5(3):299–314.
Jain, A. K. (2000). Statistical pattern recognition: A review.
IEEE Transactions on Pattern Analysis and Machine
Intelligence, 22:4–37.
Kianmehr, K., Alshalalfa, M., and Alhajj, R. (2010). Fuzzy
clustering-based discretization for gene expression
classification. Knowledge and Information Systems,
24:441–465.
Liu, B., Cui, Q., Jiang, T., and Ma, S. (2004). A com-
binational feature selection and ensemble neuralnet-
work method for classification of gene expression
data. BMC Bioinformatics, 5(1).
O’Reilly, C. A. (1982). Variations in decision makers’ use
of information sources: the impact of quality and ac-
cessibility of information. Academy of Management
Journal, 25(4):756–771.
Qureshi, T. and Zighed, D. A. (2009). A soft discretization
technique for fuzzy decision trees using resampling.
Intelligent Data Engineering and Automated Learning
- IDEAL 2009, Lecture Notes in Computer Science,
5788:586–593.
Unler, A. and Murat, A. (2010). A discrete particle swarm
optimization method for feature selection in binary
classification problems. European Journal of Oper-
ational Research, 206:528–539.
Wang, C., Wang, M., She, Z., and Cao, L. (2012). Cd: A
coupled discretization algorithm. Advances in Knowl-
edge Discovery and Data Mining Lecture Notes in
Computer Science, 7302:407–418.
Wang, Y. and Witten, I. (1997). Inducing model trees for
continuous classes. In Proceedings of the poster pa-
ImprovingaFuzzyDiscretizationProcessbyBagging
211