AN APPROXIMATION-AWARE ALGEBRA FOR XML FULL-TEXT QUERIES

Giacomo Buratti, Danilo Montesi

2007

Abstract

XQuery Full-Text is the proposed standard language for querying XML documents using either standard or full-text conditions; while full-text conditions can have a boolean or a ranked semantics, standard conditions must be satisfied for an element to be returned. This paper proposes a more general formal model that considers structural, value-based and full-text conditions as desiderata rather than mandatory constraints. The goal is achieved defining a set of relaxation operators that, given a path expression or a selection condition, return a set of relaxed path expressions or selection conditions. Algebraic approximated operators are defined for representing typical queries and returns either elements that perfectly respect the conditions and elements that answer to a relaxed version of the original query. A score reflecting the level of satisfaction of the original query is assigned to each result of the relaxed query.

References

  1. Amer-Yahia, S., Koudas, N., Marian, A., Srivastava, D., and Toman, D. (2005). Structure and Content Scoring for XML. In Proceedings of the 31st International Conference on Very Large Data Bases (VLDB 2005), pages 361-372, Trondheim, Norway.
  2. Amer-Yahia, S., Lakshmanan, L. V. S., and Pandit, S. (2004). FleXPath: Flexible Structure and Full-Text Querying for XML. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 83-94, Paris, France.
  3. Buratti, G. (2007). A Model and an Algebra for Semi-Structured and Full-Text Queries (Ph.D. Thesis). Technical Report UBLCS-2007-03, University of Bologna.
  4. Fagin, R. and Wimmers, E. L. (2000). A Formula for Incorporating Weights into Scoring Rules. Theoretical Computer Science, 239(2):309-338.
  5. INEX (2006). INitiative for the Evaluation of XML Retrieval. http://inex.is.informatik. uni-duisburg.de/2006/ .
  6. Marian, A., Amer-Yahia, S., Koudas, N., and Srivastava, D. (2005). Adaptive Processing of Top-K Queries in XML. In Proceedings of the 21st International Conference on Data Engineering (ICDE 2005), pages 162-173, Tokyo, Japan.
  7. Princeton University, C. S. L. (2007). http://wordnet.princeton.edu/.
  8. W3C (2006). XQuery 1.0 and XPath 2.0 Full-Text, W3C Working Draft. http://www.w3.org/TR/ xquery-full-text/ .
Download


Paper Citation


in Harvard Style

Buratti G. and Montesi D. (2007). AN APPROXIMATION-AWARE ALGEBRA FOR XML FULL-TEXT QUERIES . In Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT, ISBN 978-989-8111-07-4, pages 62-69. DOI: 10.5220/0001327700620069


in Bibtex Style

@conference{icsoft07,
author={Giacomo Buratti and Danilo Montesi},
title={AN APPROXIMATION-AWARE ALGEBRA FOR XML FULL-TEXT QUERIES},
booktitle={Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT,},
year={2007},
pages={62-69},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001327700620069},
isbn={978-989-8111-07-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT,
TI - AN APPROXIMATION-AWARE ALGEBRA FOR XML FULL-TEXT QUERIES
SN - 978-989-8111-07-4
AU - Buratti G.
AU - Montesi D.
PY - 2007
SP - 62
EP - 69
DO - 10.5220/0001327700620069