DATA TYPE MANAGEMENT IN A DATA MINING APPLICATION FRAMEWORK

Lauri Tuovinen, Perttu Laurinen, Juha Röning

2009

Abstract

Building application frameworks is one of the major approaches to code and design reuse in object-oriented software engineering. Some frameworks target a particular application domain, adopting a number of domain-specific problems to be addressed by the framework in such a fashion that there is no need for application developers to devise solutions of their own to those problems. When the target domain is data mining, one interesting domain-specific problem is management of the data types of model parameters and data variables. This is not trivial because the framework must be able to convert parameter and variable values between different representations, and it would be preferable to have these conversions take place transparently, without involving the application programmer. This is not difficult to achieve if the framework restricts the programmer to a predefined set of allowed data types, but if such a restriction is undesirable, the framework needs an extension mechanism in its type management subsystem. Smart Archive, a framework for developing data mining applications in Java or C++, includes such a mechanism, based on a type dictionary document and a type renderer programming interface. These make it possible to handle even highly complex values such as collections of instances of programmer-defined classes in a variety of platform-independent representation formats. The benefits of this approach can be seen in how the framework interfaces with databases through data sinks and in how it exports and imports application configurations.

References

  1. Automated Learning Group (2003). D2K Toolkit User Manual. Technical manual, available at http://alg.ncsa.uiuc.edu.
  2. Berthold, M. R., Cebron, N., Dill, F., di Fatta, G., Gabriel, T. R., Georg, F., Meinl, T., Ohl, P., Sieb, C., and Wiswedel, B. (2006). Knime: The Konstanz information miner. In Proceedings of the 4th Annual Industrial Simulation Conference, Workshop on MultiAgent Systems and Simulation.
  3. Fayad, M. E. and Schmidt, D. C. (1997). Object-oriented application frameworks. Communications of the ACM, 40(10):32-38.
  4. Laurinen, P., Tuovinen, L., Haapalainen, E., Junno, H., Röning, J., and Zettel, D. (2004). Managing and implementing the data mining process using a truly stepwise approach. In Proceedings of the Sixth International Baltic Conference on Databases & Information Systems, pages 246-257.
  5. Laurinen, P., Tuovinen, L., and Röning, J. (2005). Smart Archive: a component-based data mining application framework. In Proceedings of the Fifth International Conference on Intelligent Systems Design and Applications (ISDA 2005), pages 20-25.
  6. Mierswa, I., Wurst, M., Klinkenberg, R., Scholz, M., and Euler, T. (2006). YALE: Rapid prototyping for complex data mining tasks. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 935-940.
  7. Prudsys AG (2007). Xeli's Intro. Introduction to XELOPES. Technical manual, available at http://www.prudsys.com.
  8. Tuovinen, L., Laurinen, P., Juutilainen, I., and Röning, J. (2008). Data mining applications for diverse industrial application domains with Smart Archive. In Proceedings of the IASTED International Conference on Software Engineering (SE 2008), pages 56-61.
Download


Paper Citation


in Harvard Style

Tuovinen L., Laurinen P. and Röning J. (2009). DATA TYPE MANAGEMENT IN A DATA MINING APPLICATION FRAMEWORK . In Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8111-66-1, pages 333-338. DOI: 10.5220/0001658903330338


in Bibtex Style

@conference{icaart09,
author={Lauri Tuovinen and Perttu Laurinen and Juha Röning},
title={DATA TYPE MANAGEMENT IN A DATA MINING APPLICATION FRAMEWORK},
booktitle={Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2009},
pages={333-338},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001658903330338},
isbn={978-989-8111-66-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - DATA TYPE MANAGEMENT IN A DATA MINING APPLICATION FRAMEWORK
SN - 978-989-8111-66-1
AU - Tuovinen L.
AU - Laurinen P.
AU - Röning J.
PY - 2009
SP - 333
EP - 338
DO - 10.5220/0001658903330338


in Harvard Style

Tuovinen L., Laurinen P. and Röning J. (2009). DATA TYPE MANAGEMENT IN A DATA MINING APPLICATION FRAMEWORK.In Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8111-66-1, pages 333-338. DOI: 10.5220/0001658903330338


in Bibtex Style

@conference{icaart09,
author={Lauri Tuovinen and Perttu Laurinen and Juha Röning},
title={DATA TYPE MANAGEMENT IN A DATA MINING APPLICATION FRAMEWORK},
booktitle={Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2009},
pages={333-338},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001658903330338},
isbn={978-989-8111-66-1},
}


in EndNote Style

TY - CONF

JO - Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - DATA TYPE MANAGEMENT IN A DATA MINING APPLICATION FRAMEWORK
SN - 978-989-8111-66-1
AU - Tuovinen L.
AU - Laurinen P.
AU - Röning J.
PY - 2009
SP - 333
EP - 338
DO - 10.5220/0001658903330338