EXTENDING AN XML MEDIATOR WITH TEXT QUERY

Clément Jamard, Georges Gardarin

Abstract

Supporting full-text queries in an XML mediator is difficult because most data sources provide neither keyword search nor ranking. In this paper, we report on integrating the main functionalities of the emerging XQuery Text standard into XLive, a full XML/XQuery mediator. Our approach is to build keyword indexes over the virtual documents defined in views. Selected virtual documents are mapped to data source objects on demand. The mediator's selection operator is thus efficiently extended to support full-text search on views, integrating both keyword search and result ranking. We rank results using a relevance formula adapted to XPath, based on the number of keywords in elements and the distance from the searched nodes.
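The abstract does not state the relevance formula, so the following is only a minimal sketch of the general idea: keyword hits in an element count for less the farther that element lies from the searched node. The function name `relevance`, the geometric `decay` factor, and the whitespace tokenization are all assumptions made for illustration and are not taken from XLive.

```python
import xml.etree.ElementTree as ET


def relevance(searched, keywords, decay=0.5):
    """Hypothetical score: keyword hits per element, damped by depth distance.

    `searched` is the element matched by the path expression. Each element
    in its subtree contributes (number of keyword hits) * decay ** distance,
    where distance is the number of edges down from `searched`.
    Only an element's leading text is inspected; tail text is ignored.
    """
    wanted = {k.lower() for k in keywords}
    score = 0.0
    stack = [(searched, 0)]
    while stack:
        node, distance = stack.pop()
        words = (node.text or "").lower().split()
        hits = sum(1 for w in words if w in wanted)
        score += hits * decay ** distance
        # Children are one step farther from the searched node.
        stack.extend((child, distance + 1) for child in node)
    return score


doc = ET.fromstring(
    "<book><title>xml mediator</title>"
    "<chapter><para>xml query xml</para></chapter></book>"
)
print(relevance(doc, ["xml"]))  # 1 hit at distance 1 + 2 hits at distance 2 -> 1.0
```

With such a score, candidate results could be sorted in descending order to obtain a ranking; the formula actually used in the paper may weight keyword counts and distance differently.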

Paper Citation


in Harvard Style

Jamard C. and Gardarin G. (2006). EXTENDING AN XML MEDIATOR WITH TEXT QUERY. In Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-46-7, pages 38-45. DOI: 10.5220/0001246000380045


in Bibtex Style

@conference{webist06,
author={Clément Jamard and Georges Gardarin},
title={EXTENDING AN XML MEDIATOR WITH TEXT QUERY},
booktitle={Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2006},
pages={38-45},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001246000380045},
isbn={978-972-8865-46-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - EXTENDING AN XML MEDIATOR WITH TEXT QUERY
SN - 978-972-8865-46-7
AU - Jamard C.
AU - Gardarin G.
PY - 2006
SP - 38
EP - 45
DO - 10.5220/0001246000380045