ENHANCING XML-CACHE EFFICIENCY BY XML COMPRESSION

Stefan Böttcher, Lars Fernhomberg, Rita Hartel

2011

Abstract

When a client with limited resources frequently has to retrieve large parts of a huge XML document stored on a remote web server, data exchange from the server to the client over restricted bandwidth may become a serious bottleneck. We present an approach that combines the advantages of caching with those of query-capable and updatable XML compression. Our performance evaluation demonstrates that combining the two techniques, caching and compression, reduces both the data volume to be transferred and the answer time for the user, and that the combination yields an even stronger benefit than either technique alone.



Paper Citation


in Harvard Style

Böttcher S., Fernhomberg L. and Hartel R. (2011). ENHANCING XML-CACHE EFFICIENCY BY XML COMPRESSION. In Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8425-51-5, pages 80-87. DOI: 10.5220/0003354300800087


in Bibtex Style

@conference{webist11,
author={Stefan Böttcher and Lars Fernhomberg and Rita Hartel},
title={ENHANCING XML-CACHE EFFICIENCY BY XML COMPRESSION},
booktitle={Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2011},
pages={80-87},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003354300800087},
isbn={978-989-8425-51-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - ENHANCING XML-CACHE EFFICIENCY BY XML COMPRESSION
SN - 978-989-8425-51-5
AU - Böttcher S.
AU - Fernhomberg L.
AU - Hartel R.
PY - 2011
SP - 80
EP - 87
DO - 10.5220/0003354300800087