PROVIDING SCALABLE ACCESS TO LARGE XML DOCUMENTS

Arno Puder

2007

Abstract

XML documents often tend to be voluminous and accessing them through a DOM (Document Object Model) interface poses particular challenges. All the existing DOM implementations require an XML document to be completely collocated before it can be parsed. This solution does not scale for huge XML documents. In this paper we introduce an architecture, called VDOM (Virtual DOM) that allows scalable access to large XML documents through a DOM interface. In the VDOM architecture, the actively used portions of an XML document are transferred to the application. The application can begin to traverse this portion without requiring that the complete DOM tree is collocated. As the application traverses the DOM tree, portions of the XML document are loaded on-demand. Using the VDOM architecture is transparent to the application which uses a standard DOM interface to access the DOM tree.

References

  1. JDOM (2004). Java DOM-API. http://www.jdom.org/.
  2. Lowe, P. (1977). An approximating polynomial for the computation of saturation vapor pressure. Journal of Applied Meterology, 16:100-103.
  3. San Francisco State University (2003). NetBEAMS - Networked Bay Environmental Assessment Monitoring System. http://www.netbeams.org/.
  4. SAX Project (2004). Simple API for XML (SAX). http://www.saxproject.org/.
  5. Tanenbaum, A. and Woodhull, A. (2006). Operating Systems Design and Implementation. Prentice Hall, third edition.
  6. W3C (2004). Document Object Model (DOM). http://www.w3.org/DOM/.
  7. W3C (2006a). eXtensible Markup Language (XML). http://www.w3.org/XML/.
  8. W3C (2006b). XML Path Language 2.0. http://www.w3.org/TR/xpath/.
  9. Zambrano, B. and Puder, A. (2006). A flexible system for real-time oceanographic monitoring. Extended abstract, San Francisco State University.
Download


Paper Citation


in Harvard Style

Puder A. (2007). PROVIDING SCALABLE ACCESS TO LARGE XML DOCUMENTS . In Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-77-1, pages 178-183. DOI: 10.5220/0001270801780183


in Bibtex Style

@conference{webist07,
author={Arno Puder},
title={PROVIDING SCALABLE ACCESS TO LARGE XML DOCUMENTS},
booktitle={Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2007},
pages={178-183},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001270801780183},
isbn={978-972-8865-77-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - PROVIDING SCALABLE ACCESS TO LARGE XML DOCUMENTS
SN - 978-972-8865-77-1
AU - Puder A.
PY - 2007
SP - 178
EP - 183
DO - 10.5220/0001270801780183