means. Journal of the American Statistical Associa-
tion, 56(293):52–64.
Forman, G. (2003). An extensive empirical study of feature
selection metrics for text classification. The Journal
of Machine Learning Research, 3:1289–1305.
Frank, A. and Asuncion, A. (2010). UCI machine learning
repository.
Friedman, M. (1940). A comparison of alternative tests of
significance for the problem of m rankings. The An-
nals of Mathematical Statistics, 11(1):86–92.
Gangrade, A. and Patel, R. (2013). Privacy preserving
three-layer na¨ıve bayes classifier for vertically parti-
tioned databases. Journal of Information and Com-
puting Science, 8(2):119–129.
Guyon, I., Gunn, S., Nikravesh, M., and Zadeh, L. (2006).
Feature extraction: foundations and applications, vol-
ume 207. Springer.
Kantarcioglu, M. and Clifton, C. (2004). Privacy-
preserving distributed mining of association rules on
horizontally partitioned data. Knowledge and Data
Engineering, IEEE Transactions on, 16(9):1026–
1037.
Liu, H. and Setiono, R. (1995). Chi2: Feature selection
and discretization of numeric attributes. In Tools with
Artificial Intelligence, 1995. Proceedings., Seventh In-
ternational Conference on, pages 388–391. IEEE.
Liu, H. and Setiono, R. (1997). Feature selection via dis-
cretization. Knowledge and Data Engineering, IEEE
Transactions on, 9(4):642–645.
McConnell, S. and Skillicorn, D. (2004). Building predic-
tors from vertically distributed data. In Proceedings of
the 2004 conference of the Centre for Advanced Stud-
ies on Collaborative research, pages 150–162. IBM
Press.
Prodromidis, A., Chan, P., and Stolfo, S. (2000). Meta-
learning in distributed data mining systems: Issues
and approaches. Advances in distributed and paral-
lel knowledge discovery, 3.
Rish, I. (2001). An empirical study of the naive bayes clas-
sifier. In IJCAI 2001 workshop on empirical methods
in artificial intelligence, volume 3, pages 41–46.
Saari, P., Eerola, T., and Lartillot, O. (2011). Generaliz-
ability and simplicity as criteria in feature selection:
application to mood classification in music. Audio,
Speech, and Language Processing, IEEE Transactions
on, 19(6):1802–1812.
Saeys, Y., Inza, I., and Larra˜naga, P. (2007). A review of
feature selection techniques in bioinformatics. Bioin-
formatics, 23(19):2507–2517.
Skillicorn, D. and McConnell, S. (2008). Distributed pre-
diction from vertically partitioned data. Journal of
Parallel and Distributed computing, 68(1):16–36.
Tou, J. and Gonz´alez, R. (1977). Pattern recognition prin-
ciples. Addison-Wesley.
Tsoumakas, G. and Vlahavas, I. (2002). Distributed data
mining of large classifier ensembles. In Proceedings
Companion Volume of the Second Hellenic Confer-
ence on Artificial Intelligence, pages 249–256.
Tsoumakas, G. and Vlahavas, I. (2009). Distributed data
mining. Database Technologies: Concepts, Method-
ologies, Tools, and Applications, 1:157.
Vaidya, J. and Clifton, C. (2004). Privacy preserving na¨ıve
bayes classifier for vertically partitioned data. In 2004
SIAM International Conference on Data Mining, Lake
Buena Vista, Florida, pages 522–526.
Vaidya, J. and Clifton, C. (2005). Privacy-preserving deci-
sion trees over vertically partitioned data. In Data and
Applications Security XIX, pages 139–152. Springer.
Ventura, D. and Martinez, T. (1995). An empirical com-
parison of discretization methods. In Proceedings of
the Tenth International Symposium on Computer and
Information Sciences, pages 443–450.
Wang, J., Luo, Y., Zhao, Y., and Le, J. (2009). A survey on
privacy preserving data mining. In Database Technol-
ogy and Applications, 2009 First International Work-
shop on, pages 111–114. IEEE.
Wirth, R., Borth, M., and Hipp, J. (2001). When distribu-
tion is part of the semantics: A new problem class for
distributed knowledge discovery. In Proceedings of
the PKDD 2001 workshop on ubiquitous data mining
for mobile and distributed environments, pages 56–64.
Citeseer.
Wolpert, D. H. (1992). Stacked generalization. Neural net-
works, 5(2):241–259.
Yao, A. C. (1982). Protocols for secure computations. In
Proceedings of the 23rd Annual Symposium on Foun-
dations of Computer Science, pages 160–164.
Ye, M., Hu, X., and Wu, C. (2010). Privacy preserving
attribute reduction for vertically partitioned data. In
Artificial Intelligence and Computational Intelligence
(AICI), 2010 International Conference on, volume 1,
pages 320–324. IEEE.
Zhao, Z. and Liu, H. (2011). Spectral Feature Selection for
Data Mining. Chapman & Hall/Crc Data Mining and
Knowledge Discovery. Taylor & Francis Group.
LearningonVerticallyPartitionedDatabasedonChi-squareFeatureSelectionandNaiveBayesClassification
357