DeVisa - Concepts and Architecture of a Data Mining Models Scoring and Management Web System

Diana Gorea

2008

Abstract

In this paper we describe DeVisa, a Web system for scoring and management of data mining models. The system has been designed to provide unified access to different prediction models using standard technologies based on XML. The prediction models are serialized in PMML format and managed using a native XML database system. The system provides functions such as scoring, model comparison, model selection or sequencing through a web service interface. DeVisa also defines a specialized PMML query language named PMQL used for specifying client requests and interaction with PMML repository. The paper analyzes the system’s architecture and functionality and discusses its use as a tool for researchers.

References

  1. Axis (2007). Apache axis. http://ws.apache.org/axis/.
  2. Boag, S., Chamberlin, D., Fernandez, M., Florescu, D., Robie, J., and Simeon, J. (2007). Xquery 1.0: An xml query language. http://www.w3.org/TR/xquery/.
  3. Chaves, J., Curry, C., Grossman, R. L., Locke, D., and Vejcik, S. (2006). Augustus: the design and architecture of a pmml-based scoring engine. In DMSSP 7806: Proceedings of the 4th international workshop on Data mining standards, services and platforms, pages 38 - 46, New York, NY, USA. ACM.
  4. Chieh-Yuan Tsai, M.-H. T. (2005). A dynamic web service based data mining process system. In The Fifth International Conference on Computer and Information Technology CIT 2005, pages 1033- 1039.
  5. DeVisa (2007). Devisa. http://devisa.sourceforge.net.
  6. DMG (2007). Data mining group. http://www.dmg.org.
  7. eXist (2007). Exist - open source native xml database. http://www.exist-db.org/.
  8. Frawley, W. and Piatetsky-Shapiro, G. (1991). Knowledge Discovery In Databases: An Overview. Knowledge Discovery In Databases. AAAI Press/MIT Press, Cambridge, MA.
  9. GO (2007). The gene ontology.
  10. Gorea, D. (2007). Towards storing and interchanging data mining models. In Proceedings of the 3rd Balkan Conference in Informatics, volume 2, pages 229-236.
  11. Griffiths-Jones, S., Grocock, R., van Dongen, S., Bateman, A., and Enright, A. (2006). mirbase: microrna sequences, targets and gene nomenclature. Nucleic Acids Res., 34:140 - 144.
  12. Grigorios Tsoumakas, I. V. (2007). An interoperable and scalable web-based system for classifier sharing and fusion. Expert Systems with Applications, 33(3):716- 724.
  13. Hand, D., Mannila, H., and Smyth, P. (2001). Principles of Data Mining. The MIT Press.
  14. Ian H. Witten, E. F. (2005). Data Mining Practical Machine Learning Tools and Techniques. Morgan Kaufmann series in data management systems. Elsevier, 2nd edition.
  15. Nam, J.-W. (2005). Human microrna prediction through a probabilistic co-learning model of sequence and structure. Nucleic Acids Research, 33(11):3570-3581.
  16. PMML (2007). Pmml http://www.dmg.org/pmml-v3-2.html.
  17. Ritchie, W., Legendre, M., and Gautheret, D. (2007). Rna stem-loops: To be or not to be cleaved by rnase iii. RNA, 13:457-462.
  18. RuleML (2007). Ruleml. http://www.ruleml.org.
  19. Sewer, A. (2005). dentification of clustered micrornas using an ab initio prediction method. BMC Bioinformatics.
  20. Studer, R., Grimm, S., and Abecker, A., editors (2007). Semantic Web Services. Concepts, Technologies and Applications, chapter 2. Springer.
  21. Sung-Kyu, K. (2006). mitarget: microrna target-gene prediction using a support vector machine. BMC Bioinformatics 2006, 7(1):411.
  22. (2007). W3c semantic http://www.w3.org/2001/sw/.
Download


Paper Citation


in Harvard Style

Gorea D. (2008). DeVisa - Concepts and Architecture of a Data Mining Models Scoring and Management Web System . In Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8111-36-4, pages 276-281. DOI: 10.5220/0001706202760281


in Bibtex Style

@conference{iceis08,
author={Diana Gorea},
title={DeVisa - Concepts and Architecture of a Data Mining Models Scoring and Management Web System},
booktitle={Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2008},
pages={276-281},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001706202760281},
isbn={978-989-8111-36-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - DeVisa - Concepts and Architecture of a Data Mining Models Scoring and Management Web System
SN - 978-989-8111-36-4
AU - Gorea D.
PY - 2008
SP - 276
EP - 281
DO - 10.5220/0001706202760281