level execution phase but yielding the potential for
rich macro optimization of the overall process simi-
lar to relational query optimization (logical, physical,
distributed/parallel).
Our future work plan includes the following: mi-
grating to the latest version of OGSA-DAI allow-
ing complex workflow graphs, applying the operator
based approach to our earlier work for distributed de-
cision trees (Hofer and Brezany, 2004), extending the
set of data mining operators to cover association rule
mining, and higher level optimization components for
KDD processes.
ACKNOWLEDGMENTS
This work has been supported by the ADMIRE
project which is financed by the European Commis-
sion via Framework Program 7 through contract no.
FP7-ICT-215024. Yan Zhang has been supported by
Project No. 2006AA01A121 of the National High-
Tech. R&D Program of China.
REFERENCES
Abe, H. and Yamaguchi, T. (2004). Constructive meta-
learning with machine learning method repositories.
In IEA/AIE, pages 502–511.
Agrawal, R. and Srikant, R. (1995). Mining sequential pat-
terns. In ICDE, pages 3–14.
Alpdemir, N. M., Mukherjee, A., Gounaris, A., Paton,
N. W., Watson, P., Fernandes, A. A., and Fitzgerald,
D. J. (2004). Ogsa-dqp: A service for distributed
querying on the grid. In Advances in Database Tech-
nology - EDBT 2004, pages 858–861.
Antonioletti, M., Atkinson, M., Baxter, R., Borley, A.,
Hong, N. P. C., Collins, B., Hardman, N., Hume,
A., Knox, A., Jackson, M., Magowan, J., Paton, N.,
Pearson, D., Sugden, T., Watson, P., and Westhead,
M. (2005). The design and implementation of grid
database services in ogsa-dai. Concurrency and Com-
putation: Practice and Experience, 17:357–376.
Atkinson, M., Brezany, P., Corcho, O., Han, L., van
Hemert, J., Hluchy, L., Hume, A., Janciak, I., Krause,
A., Snelling, D., and W¨ohrer, A. (2008). Admire
white paper: Motivation, strategy, overview and im-
pact. http://www.admire-project.eu/docs/ADMIRE-
WhitePaper.pdf.
Bernstein, A., Provost, F., and Hill, S. (2005). Toward
intelligent assistance for a data mining process: An
ontology-based approach for cost-sensitive classifica-
tion. IEEE Transactions on Knowledge and Data En-
gineering, 17(4):503–518.
Botta, M., Boulicaut, J.-F., Masson, C., and Meo, R. (2004).
Query languages supporting descriptive rule mining:
A comparative study. In Database Support for Data
Mining Applications, pages 24–51.
Fernandez-Baizan, M., Ruiz, E. M., Pena-Sanchez, J., and
Pastrana, B. (1998). integrating KDD algorithms and
RDBMS code. In Proceedings of RSCTC’98, pages
210–213.
Geist, I. and Sattler, K.-U. (2004). Towards data mining op-
erators in database systems: Algebra and implementa-
tion. Technical Report 124, University of Magdeburg.
Gounaris, A., Paton, N. W., Fernandes, A. A. A., and Sakel-
lariou, R. (2002). Adaptive query processing: A sur-
vey. In BNCOD, pages 11–25.
Graefe, G. (1993). Query evaluation techniques for large
databases. ACM Computing Surveys, 25(2):73–170.
Graefe, G. and Davison, D. (1993). Encapsulation of par-
allelism and architecture-independence in extensible
database query execution. IEEE Transactions on Soft-
ware Engineering, 19(8):749–764.
Hettich, S. and Bay, S. (1999). The UCI KDD Archive.
Hofer, J. and Brezany, P. (2004). Digidt: Distributed clas-
sifier construction in the grid data mining framework
gridminer-core. In In Proceedings of the Workshop
on Data Mining and the Grid (GM-Grid 2004) held in
conjunction with the 4th IEEE International Confer-
ence on Data Mining (ICDM’04).
Ioannidis, Y. E. (1996). Query optimization. ACM Comput-
ing Surveys, 28(1).
Johnson, T., Lakshmanan, L., and Ng, R. (2000). The 3w
model and algebra for unified data mining. In VLDB,
pages 21–32.
Kossmann, D. (2000). The state of the art in distributed
query processing. ACM Computing Surveys (CSUR),
32(4).
Meo, R., Psaila, G., and Ceri, S. (1996). A new sql-like
operator for mining association rules. In VLDB, pages
122–133.
Witten, I. H., Frank, E., Trigg, L., Hall, M., Holmes, G., and
Cunningham, S. J. (1999). Weka: Practical machine
learning tools and techniques with Java implementa-
tions. In Proceedings of the Workshop on Emerging
Knowledge Engineering and Connectionist-Based In-
formation Systems, pages 192–196.
YUAN, X. (2003). Data mining query language design and
implementation. Master’s thesis, The Chinese Univer-
sity of Hong Kong.
KDIR 2009 - International Conference on Knowledge Discovery and Information Retrieval
248