ON ANALYZING THE DATABASE PERFORMANCE FOR DIFFERENT CLASSES OF XML DOCUMENTS BASED ON THE USED STORAGE APPROACH

Hagen Höpfner, Jörg Schad, Essam Mansour

Abstract

With the increasing popularity of XML data, the need for permanent XML document storage grows as well. Today, a number of different XML storage alternatives exist, ranging from XML-enabled relational database systems, through a new class of hybrid database systems that provide native storage for both XML and relational data, to pure native XML systems. This paper examines how these storage approaches perform with respect to different classes of XML data by devising a new benchmark, HyBe, which gives special consideration to certain features of hybrid database systems. First results indicate that hybrid database systems can deliver performance that is (almost) equivalent to that of native XML database systems, making them the optimal choice for small to mid-size companies that need both XML and relational data storage.



Paper Citation


in Harvard Style

Höpfner H., Schad J. and Mansour E. (2009). ON ANALYZING THE DATABASE PERFORMANCE FOR DIFFERENT CLASSES OF XML DOCUMENTS BASED ON THE USED STORAGE APPROACH. In Proceedings of the 4th International Conference on Software and Data Technologies - Volume 2: ICSOFT, ISBN 978-989-674-010-8, pages 243-248. DOI: 10.5220/0002252802430248


in BibTeX Style

@conference{icsoft09,
author={Hagen Höpfner and Jörg Schad and Essam Mansour},
title={ON ANALYZING THE DATABASE PERFORMANCE FOR DIFFERENT CLASSES OF XML DOCUMENTS BASED ON THE USED STORAGE APPROACH},
booktitle={Proceedings of the 4th International Conference on Software and Data Technologies - Volume 2: ICSOFT},
year={2009},
pages={243-248},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002252802430248},
isbn={978-989-674-010-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Conference on Software and Data Technologies - Volume 2: ICSOFT
TI - ON ANALYZING THE DATABASE PERFORMANCE FOR DIFFERENT CLASSES OF XML DOCUMENTS BASED ON THE USED STORAGE APPROACH
SN - 978-989-674-010-8
AU - Höpfner H.
AU - Schad J.
AU - Mansour E.
PY - 2009
SP - 243
EP - 248
DO - 10.5220/0002252802430248