CLUX - Clustering XML Sub-trees

Stefan Böttcher; Rita Hartel; Christoph Krislin

Topics: Middleware Integration; Web Databases

In Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 3: ICEIS, 142-150, 2010, Funchal, Madeira, Portugal

Authors: Stefan Böttcher; Rita Hartel; Christoph Krislin

Affiliation: University of Paderborn, Computer Science, Germany

Keyword(s): XML Compression, Grammar-based Compression, XML Sub-tree Clustering.

Related Ontology Subjects/Areas/Topics: Databases and Information Systems Integration; e-Business; Enterprise Information Systems; Middleware Integration; Middleware Platforms; Technology Platforms; Web Databases

Abstract: XML has become the de facto standard for data exchange in enterprise information systems. But whenever XML data is stored or processed, e.g., in the form of a DOM tree representation, the XML markup causes a huge blow-up in memory consumption compared to the data, i.e., the text and attribute values, contained in the XML document. In this paper, we present CluX, an XML compression approach based on clustering XML sub-trees. CluX uses a grammar for sharing similar substructures within the XML tree structure and a cluster-based heuristic for greedily selecting the best compression options in the grammar. Thereby, CluX allows for storing and exchanging XML data in a space-efficient and still queryable way. We evaluate different strategies for XML structure sharing, and we show that CluX often compresses better than XMill, Gzip, and Bzip2, which makes CluX a promising technique for XML data exchange whenever the exchanged data volume is a bottleneck in enterprise information systems.
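
To make the idea of structure sharing in the abstract concrete, the following is a minimal sketch and not the CluX algorithm itself. It shares only identical sub-trees (plain DAG-style sharing), whereas the abstract describes sharing similar substructures chosen by a cluster-based heuristic. The function name `share_subtrees`, the rule representation, and the sample document are assumptions made for illustration; text and attribute values are ignored, so only the element structure is considered.

```python
import xml.etree.ElementTree as ET


def share_subtrees(root):
    """Hypothetical helper: build one grammar-like rule per distinct sub-tree.

    Returns (rules, start), where rules[i] is (tag, tuple of child rule ids)
    and start is the rule id of the document root. Identical sub-trees map
    to the same rule id, so they are stored only once.
    """
    table = {}  # (tag, child rule ids) -> rule id
    rules = []  # rule id -> (tag, child rule ids)

    def visit(elem):
        # Children are processed first, so a rule only refers to earlier rules.
        key = (elem.tag, tuple(visit(child) for child in elem))
        if key not in table:
            table[key] = len(rules)
            rules.append(key)
        return table[key]

    start = visit(root)
    return rules, start


# Illustrative document (an assumption, not taken from the paper).
doc = ET.fromstring(
    "<lib>"
    "<book><title/><author/></book>"
    "<book><title/><author/></book>"
    "<book><title/></book>"
    "</lib>"
)
rules, start = share_subtrees(doc)
node_count = sum(1 for _ in doc.iter())

print(f"{node_count} element nodes, {len(rules)} shared rules")
for rule_id, (tag, children) in enumerate(rules):
    print(rule_id, tag, children)
```

In this sample, the two structurally identical `book` elements collapse into a single rule, while the third `book`, which lacks an `author` child, needs a rule of its own. Handling such near-duplicates more compactly is the kind of case that sharing similar, rather than identical, substructures is meant to address.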

CC BY-NC-ND 4.0

Paper citation in several formats:

Böttcher, S., Hartel, R. and Krislin, C. (2010). CLUX - Clustering XML Sub-trees. In Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 3: ICEIS; ISBN 978-989-8425-04-1; ISSN 2184-4992, SciTePress, pages 142-150. DOI: 10.5220/0002877901420150

@conference{iceis10,
author={Stefan Böttcher and Rita Hartel and Christoph Krislin},
title={CLUX - Clustering XML Sub-trees},
booktitle={Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 3: ICEIS},
year={2010},
pages={142-150},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002877901420150},
isbn={978-989-8425-04-1},
issn={2184-4992},
}

TY - CONF
JO - Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 3: ICEIS
TI - CLUX - Clustering XML Sub-trees
SN - 978-989-8425-04-1
IS - 2184-4992
AU - Böttcher, S.
AU - Hartel, R.
AU - Krislin, C.
PY - 2010
SP - 142
EP - 150
DO - 10.5220/0002877901420150
PB - SciTePress