REFERENCES
Agrawal, R. and Srikant, R. (1994). Fast algorithms for
mining association rules. In Proceedigns of the 20th
International Conference on Very Large Data Bases
(VLDB), volume 1215, pages 487–499.
Bottou, L. and Bousquet, O. (2008). The tradeoffs of large
scale learning. Advances in neural information pro-
cessing systems, 20:161–168.
Caragea, D., Silvescu, A., and Honavar, V. (2001). Analysis
and synthesis of agents that learn from distributed dy-
namic data sources. Emergent neural computational
architectures based on neuroscience, pages 547–559.
Catlett, J. (1991). Megainduction: machine learning on
very large databases. PhD thesis, School of Computer
Science, University of Technology, Sydney, Australia.
Chan, P. and Stolfo, S. (1993). Toward parallel and dis-
tributed learning by meta-learning. In AAAI in Knowl-
edge Discovery in Databases, pages 227–240.
Chawla, N., Hall, L., Bowyer, K., Moore, T., and
Kegelmeyer, W. (2002). Distributed pasting of small
votes. Multiple Classifier Systems, pages 52–61.
D-Lib Magazine (2006). A Research Library Based on
the Historical Collections of the Internet Archive.
http://www.dlib.org/dlib/february06/arms/02arms.html.
[Online; accessed 27-Oct.-2010].
Davies, W., Edwards, P., and Scotland, U. (2000). Dag-
ger: A new approach to combining multiple mod-
els learned from disjoint subsets. Machine Learning,
2000:1–16.
Dietterich, T. (2000). Ensemble methods in machine learn-
ing. Multiple classifier systems, pages 1–15.
Guijarro-Berdi˜nas, B., Mart´ınez-Rego, D., and Fern´andez-
Lorenzo, S. (2009). Privacy-Preserving Distributed
Learning Based on Genetic Algorithms and Artifi-
cial Neural Networks. Distributed Computing, Artifi-
cial Intelligence, Bioinformatics, Soft Computing, and
Ambient Assisted Living, pages 195–202.
Guo, Y. and Sutiwaraphun, J. (1999). Probing knowledge
in distributed data mining. Methodologies for Knowl-
edge Discovery and Data Mining, pages 443–452.
Hansen, L. and Salamon, P. (1990). Neural network en-
sembles. IEEE Transactions on Pattern Analysis and
Machine Intelligence, 12(10):993–1001.
Huber, P. (1997). From large to huge: A statistician’s reac-
tion to KDD and DM. In Proceedings of the 3rd In-
ternational Conference on Knowledge Discovery and
Data Mining (KDD), pages 304–308.
Kargupta, H., Park, B., Hershberger, D., and Johnson, E.
(2000). Collective Data Mining: A New Perspec-
tive Toward Distributed Data Mining. Advances in
Distributed and Parallel Knowledge Discovery, pages
131–178.
Kittler, J. (1998). Combining classifiers: A theoretical
framework. Pattern Analysis & Applications, 1(1):18–
27.
Krishnan, S., Bhattacharyya, C., and Hariharan, R. (2008).
A randomized algorithm for large scale support vector
learning. In Proceedings of Advances in Neural Infor-
mation Processing Systems (NIPS), pages 793–800.
Lazarevic, A. and Obradovic, Z. (2002). Boosting al-
gorithms for parallel and distributed learning. Dis-
tributed and Parallel Databases, 11(2):203–229.
Moretti, C., Steinhaeuser, K., Thain, D., and Chawla, N.
(2008). Scaling up classifiers to cloud computers. In
Proceedings of the 8th IEEE International Conference
on Data Mining (ICDM), pages 472–481.
Provost, F. and Kolluri, V. (1999). A survey of methods
for scaling up inductive algorithms. Data mining and
knowledge discovery, 3(2):131–169.
Raina, R., Madhavan, A., and Ng, A. (2009). Large-scale
deep unsupervised learning using graphics processors.
In Proceedings of the 26th Annual International Con-
ference on Machine Learning (ICML), pages 873–
880.
School of Information and Management and Sys-
tems (2000). How much information?
http://www2.sims.berkeley.edu/research/projects/how-
much-info/internet.html. [Online; accessed 27-
September-2010].
School of Information and Management and Sys-
tems (2003). How much information?
http://www2.sims.berkeley.edu/research/projects/how-
much-info-2003/internet.htm. [Online; accessed
27-Sept.-2010].
Sonnenburg, S., Franc, V., Yom-Tov, E., and Sebag, M.
(2009). PASCAL Large Scale Learning Challenge.
Journal of Machine Learning Research.
Sonnenburg, S., Ratsch, G., and Rieck, K. (2007). Large
scale learning with string kernels. Large Scale Kernel
Machines, pages 73–104.
Tsoumakas, G. (2009). Distributed Data Mining. Database
Technologies: Concepts, Methodologies, Tools, and
Applications, pages 157–171.
Tsoumakas, G., Angelis, L., and Vlahavas, I. (2004a). Clus-
tering classifiers for knowledge discovery from physi-
cally distributed databases. Data & Knowledge Engi-
neering, 49(3):223–242.
Tsoumakas, G., Katakis, I., and Vlahavas, I. (2004b). Ef-
fective voting of heterogeneous classifiers. Machine
Learning: ECML 2004, pages 465–476.
Tsoumakas, G. and Vlahavas, I. (2002). Effective stacking
of distributed classifiers. In ECAI 2002: 15th Euro-
pean Conference on Artificial Intelligence, page 340.
Wolpert, D. (1992). Stacked generalization. Neural net-
works, 5(2):241–259.
DEALING WITH "VERY LARGE" DATASETS - An Overview of a Promising Research Line: Distributed Learning
481