AN APPROXIMATION-AWARE ALGEBRA FOR XML FULL-TEXT QUERIES
Giacomo Buratti, Danilo Montesi
2007
Abstract
XQuery Full-Text is the proposed standard language for querying XML documents using either standard or full-text conditions; while full-text conditions can have a boolean or a ranked semantics, standard conditions must be satisfied for an element to be returned. This paper proposes a more general formal model that treats structural, value-based and full-text conditions as desiderata rather than mandatory constraints. This goal is achieved by defining a set of relaxation operators that, given a path expression or a selection condition, return a set of relaxed path expressions or selection conditions. Approximated algebraic operators are defined to represent typical queries; they return both elements that fully satisfy the conditions and elements that satisfy a relaxed version of the original query. A score reflecting the level of satisfaction of the original query is assigned to each result of the relaxed query.
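The idea of a relaxation operator that maps one path expression to a set of scored, relaxed path expressions can be illustrated with a small sketch. The abstract does not specify the paper's actual operators or scoring function, so everything below is an assumption made for illustration: the function name `relax_path`, the single relaxation rule (loosening a child step `/` to a descendant step `//`), and the score (the fraction of steps left unrelaxed).

```python
from itertools import combinations


def relax_path(path: str) -> list[tuple[str, float]]:
    """Return (relaxed_path, score) pairs for a simple child-axis path.

    Hypothetical rule, not taken from the paper: every child step '/'
    may be loosened to a descendant step '//'. The score is the
    fraction of steps kept strict, so the original path scores 1.0.
    Assumes the input uses only child steps, e.g. '/book/chapter/title'.
    """
    steps = [s for s in path.split("/") if s]
    n = len(steps)
    if n == 0:
        return []

    results = []
    # Enumerate every subset of step positions to loosen.
    for k in range(n + 1):
        for loosened in combinations(range(n), k):
            parts = [
                ("//" if i in loosened else "/") + step
                for i, step in enumerate(steps)
            ]
            results.append(("".join(parts), (n - k) / n))

    # Best-scoring (least relaxed) expressions first; sort is stable.
    results.sort(key=lambda pair: -pair[1])
    return results


if __name__ == "__main__":
    for expr, score in relax_path("/book/chapter/title"):
        print(f"{score:.2f}  {expr}")
```

In this sketch the original expression is always part of the returned set with the highest score, which mirrors the abstract's statement that both exact and relaxed answers are returned and ranked by how well they satisfy the original query.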
Paper Citation
in Harvard Style
Buratti G. and Montesi D. (2007). AN APPROXIMATION-AWARE ALGEBRA FOR XML FULL-TEXT QUERIES. In Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT, ISBN 978-989-8111-07-4, pages 62-69. DOI: 10.5220/0001327700620069
in Bibtex Style
@conference{icsoft07,
author={Giacomo Buratti and Danilo Montesi},
title={AN APPROXIMATION-AWARE ALGEBRA FOR XML FULL-TEXT QUERIES},
booktitle={Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT},
year={2007},
pages={62-69},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001327700620069},
isbn={978-989-8111-07-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the Second International Conference on Software and Data Technologies - Volume 3: ICSOFT
TI - AN APPROXIMATION-AWARE ALGEBRA FOR XML FULL-TEXT QUERIES
SN - 978-989-8111-07-4
AU - Buratti G.
AU - Montesi D.
PY - 2007
SP - 62
EP - 69
DO - 10.5220/0001327700620069