tively measuring the differences between the storage
alternatives regarding different query types.
Our results indicate that it is important to con-
sider the XML storage approach individually for each
project. A main distinction is whether the database
has to “understand” the data or can just consider them
as one large file. Other important points to consider
are, e.g., read vs. write or the update frequency.
Clobbing and shredding are limited to certain
characteristics and more or less suitable for data-
oriented documents. Hybrid systems are a good
choice for document-oriented documents. Also hy-
brid systems offer often more flexibility as they sup-
port a wider range of features especially in cases
where XML and relational data must be combined.
As the focus of this study was the comparison of
different approaches it is interesting to compare dif-
ferent implementations of those approaches by differ-
ent products such as Oracle’s XML DB, IBM’s DB2
and Microsoft’s SQL Server. Furthermore, we plan to
extend the query set and extend the XML data used
(e.g., different document sizes). Another issue is the
evolution of XML schemata when performing update
queries.
Finally, we plan to observe index related perfor-
mances including speed-up and time for index cre-
ation. Here we assume that those results will vary
significantly across different storage options.
REFERENCES
Barbosa, D., Mendelzon, A. O., Keenleyside, J., and Lyons,
K. (2002). ToXgene: An extensible template-based
data generator for XML. In WebDB 2002 Proc., pages
49–54. ACM.
B
¨
ohme, T. and Rahm, E. (2001). XMach-1: A Benchmark
for XML Data Management. In BTW 2001 Proc.,
pages 264–273. Springer.
Bourret, R. (2005). XML and Databases. online article.
http://www.rpbourret.com/xml/XMLAndDa
tabases.htm.
Carey, M. J., DeWitt, D. J., and Naughton, J. F. (1993). The
007 Benchmark. ACM SIGMOD Record, 22(2):12–
21.
Chen, W.-J., Sammartino, A., Goutev, D., Hendricks, F.,
Komi, I., Wei, M.-P., and Ahuja, R. (2007). DB2 9
pureXML Guide. IBM redbooks.
Cover, R. (2002). FIXML - A Markup Language for
the FIX Application Message Layer. Cover pages.
xml.coverpages.org/fixml.html.
DuCharme, B. (2004). Documents vs. Data, Schemas vs.
Schemas. XML 2004, pages 1554–4648.
Lee, G. (2007). Oracle Database 11g XML DB Technical
Overview. Oracle Corportion.
Li, Y. G., Bressan, S., Dobbie, G., Lacroix, Z., Lee, M. L.,
Nambiar, U., and Wadhwa, B. (2001). XOO7: apply-
ing OO7 benchmark to XML query processing tool.
In CIKM 2001, pages 167–174.
Lu, H., Yu, J., Wang, G., Zheng, S., Jiang, H., Yu, G., and
Zhou, A. (2005). What makes the differences: bench-
marking XML database implementations. TOIT,
5(1):154–194.
Nicola, M., Kogan, I., and Schiefer, B. (2007). An XML
transaction processing benchmark. In SIGMOD 2007,
pages 937–948. ACM.
Nicola, M. and Rodrigues, V. (2006). A performance com-
parison of DB2 9 pureXML and CLOB or shredded
XML storage.
Nicola, M. and van der Linden, B. (2005). Native XML
support in DB2 universal database. In VLDB 2005,
pages 1164–1174.
Runapongsa, K., Patel, J. M., Jagadish, H. V., Chen, Y.,
and Al-Khalifa, S. (2006). The Michigan benchmark:
towards XML query performance diagnostics. Infor-
mation Systems, 31(2):73–97.
Schad, J. (2008). XML-Document Management in
Databases — A Performance Evaluation for Hybrid
Database Systems. Bachelor thesis, IU in Germany,
School of IT, Bruchsal, Germany.
Schmidt, A. R., Waas, F., Kersten, M. L., Florescu, D.,
Manolescu, I., Carey, M. J., and Busse, R. (2001).
The XML Benchmark Project. Technical Report INS-
R0103, CWI, Amsterdam.
Sch
¨
oning, H. (2003). Tamino-Software AG’s Native XML
Server. In XML Data Management, chapter 2.
Addison-Wesley.
Serna, A. and Gerrikagoitia, J. K. (2005). David & Go-
liath: A Comparison Of XML-Enabled and native
XML Data Management Techniques. XML Journal.
xml.sys-con.com/node/104980.
Yao, B. B.,
¨
Ozsu, M. T., and Khandelwal, N. (2004).
XBench Benchmark and Performance Testing of
XML DBMSs. In ICDE 2004, pages 621–632.
ICSOFT 2009 - 4th International Conference on Software and Data Technologies
248