XQUAKE - An XQuery-like Language for Mining XML Data

Andrea Romei, Franco Turini

Abstract

The rapid growth of semi-structured sources raises the need of designing and implementing environments for knowledge discovery out of XML data. This paper presents an Inductive Database System in which raw data, mining models and domain knowledge are represented as XML documents, stored inside XML native databases. In particular, we discuss our experiences in the design and development of XQuake, a mining query language that extends XQuery. Features of the language are an intuitive syntax, a good expressiveness and the capability of dealing uniformly with raw data, induced and background knowledge. The language is presented by means of examples and a sketch of its implementations and the evaluation of its performance is given.

References

  1. Agrawal, R. and Srikant, R. (1994). Fast algorithms for mining association rules. In VLDB 7894, pages 487- 499, Santiago de Chile, Chile.
  2. Braga, D., Campi, A., Ceri, S., Klemettinen, M., and Lanzi, P. (2003). Discovering interesting information in XML data with association rules. In SAC 7803, pages 450-454, Melbourne, Florida.
  3. Euler, T., Klinkenberg, R., Mierswa, I., Scholz, M., and Wurst, M. (2006). YALE: rapid prototyping for complex data mining tasks. In KDD 7806, pages 935-940, Philadelphia, PA, USA.
  4. Feng, L. and Dillon, T. S. (2004). Mining Interesting XMLEnabled Association Rules with Templates. In KDID 7804, pages 66-88, Pisa, Italy.
  5. Holupirek, A., GrĂ¼n, C., and Scholl, M. H. (2009). BaseX and DeepFS joint storage for filesystem and database. In EDBT 7809, pages 1108-1111, Saint Petersburg, Russia.
  6. Imielinski, T. and Mannila, H. (1996). A database perspective on knowledge discovery. Comm. Of The Acm, 39(11):58-64.
  7. Meo, R. and Psaila, G. (2006). An XML-based database for knowledge discovery. In EDBT 7806, pages 814-828, Munich, Germany.
  8. Mitchell, T. M. (1997). Machine Learning. McGraw-Hill.
  9. Romei, A., Ruggieri, S., and Turini, F. (2006). KDDML: a middleware language and system for knowledge discovery in databases. Data Knowl. Eng., 57(2):179- 220.
  10. The Data Mining Group (2009). The Predictive Model Markup Language (PMML). Version 4.0. www.dmg.org/v4-0/GeneralStructure.html.
  11. W3C World Wide Web Consortium (2004). OWL Web Ontology Language. W3C Recommendation 10 February 2004. http://www.w3.org/TR/owl-features.
  12. W3C World Wide Web Consortium (2007). XQuery 1.0: An XML Query Language. W3C Recommendation 23 January 2007. http://www.w3.org/TR/Query.
Download


Paper Citation


in Harvard Style

Romei A. and Turini F. (2010). XQUAKE - An XQuery-like Language for Mining XML Data . In Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-674-021-4, pages 20-27. DOI: 10.5220/0002703400200027


in Bibtex Style

@conference{icaart10,
author={Andrea Romei and Franco Turini},
title={XQUAKE - An XQuery-like Language for Mining XML Data},
booktitle={Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2010},
pages={20-27},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002703400200027},
isbn={978-989-674-021-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - XQUAKE - An XQuery-like Language for Mining XML Data
SN - 978-989-674-021-4
AU - Romei A.
AU - Turini F.
PY - 2010
SP - 20
EP - 27
DO - 10.5220/0002703400200027