TUNING SEARCH ENGINE TO FIT XML RETRIEVAL SCENARIO

Gilles Hubert, Josiane Mothe, Kurt Englmeier

Abstract

XML usage is growing to describe documents and consequently systems to search in XML collections are necessary. Various proposals of systems intend to handle XML documents. This paper describes an XML approach based on direct contribution of the components constituting an information need. The search engine is largely configurable in order to be adapted to different context of search. Beyond being globally adapted to a collection of documents an important objective is to define a search engine that can be adapted to different retrieval scenarios and to identify how to adapt it. This paper presents first experiments on INEX testbeds that show how the engine can be adapted to better respond to different retrieval scenarios.

References

  1. Amer-Yahia S., Lakshmanan L., Pandit S., 2004. FleXPath: Flexible Structure and Full-Text Querying for XML, ACM SIGMOD, Paris, pp. 83-94.
  2. Augé, J., Englmeier, K., Hubert, G., Mothe, J., 2003. Catégorisation automatique de textes basée sur des hiérarchies de concepts, BDA'03, 19ièmes Journées de Bases de Données Avancées, Lyon, pp. 69-87.
  3. Bray T., Paoli J., Sperberg-McQueen C. M., Maler E., Yergeau Y., 2004. Extensible, Markup Language (XML) 1.0. (Third Edition), W3C Recommendation.
  4. Carmel D., Maarek Y. S., Mandelbrod M., Mass Y., Soffer A., 2003. Searching XML documents via XML fragments, 26th international conference SIGIR, Toronto, pp. 151-158.
  5. Clark J., DeRose S., 1999. XML Path Language (XPath), W3C Recommendation.
  6. Crouch C. J., Apte S., Bapat H., 2003. An Approach to Structured Retrieval Based on the Extended Vector Model, 2nd INEX Workshop, Dagstuhl, pp. 89-93.
  7. Fuhr N., Großjohann K., 2004. XIRQL: An XML query language based on information retrieval concepts, ACM TOIS, vol. 22, Issue 2, pp. 313-356.
  8. Fuhr N., Maalik S., Lalmas M., 2003. Overview of the INitiative for the Evaluation of XML Retrieval (INEX) 2003, 2nd INEX Workshop, Dagstuhl, pp. 1-7.
  9. Geva S., 2005. GPX - Gardens Point XML Information Retrieval at INEX 2004, LNCS 3493, INEX'04, 3rd International Workshop, Dagstuhl, p. 211-223.
  10. Hubert G., 2005. A voting method for XML retrieval, LNCS 3493, INEX'04, 3rd International Workshop, Dagstuhl, p. 183-196.
  11. Kazaï G., Lalmas M., 2005. INEX 2005 Evaluation Metrics, Pre Proceedings of the 4th INEX Workshop, pp. 401-406.
  12. Liu S., Zou O., Chu W. W., 2004. Configurable indexing and ranking for XML information. 27th International Conference SIGIR, Sheffield, pp. 88-95.
  13. Ogilvie P., Callan J., 2003. Using Language Models for Flat Text Queries in XML Retrieval, 2nd INEX Workshop, Dagstuhl, pp. 12-18.
  14. Pehcevski J., Thom J. A., Tahaghoghi S. M. M., 2005. Hybrid XML Retrieval Revisited, LNCS 3493, INEX'04, 3rd International Workshop, Dagstuhl, pp. 153-167.
  15. Piwowarski B., Vu H.-T., Gallinari P., 2003. Bayesian Networks and INEX'03, 2nd INEX Workshop, Dagstuhl, pp. 33-37.
  16. Ponte J. M., Croft W. B., 1998. A Language Modeling Approach to Information Retrieval, 21st International Conference SIGIR, Melbourne, pp. 275-281.
  17. Salton G., Wong A., Yang C. S., 1975. A vector space model for automatic indexing, Communication of the ACM, vol. 18, Issue 11, pp. 613-620.
  18. Sigurbjörnsson B., Kamps J., de Rijke M., 2005. Mixture Models, Overlap, and Structural Hints in XML Element Retrieval, LNCS 3493, INEX'04, 3rd International Workshop, Dagstuhl, pp. 196-210.
Download


Paper Citation


in Harvard Style

Hubert G., Mothe J. and Englmeier K. (2007). TUNING SEARCH ENGINE TO FIT XML RETRIEVAL SCENARIO . In Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-77-1, pages 228-233. DOI: 10.5220/0001278802280233


in Bibtex Style

@conference{webist07,
author={Gilles Hubert and Josiane Mothe and Kurt Englmeier},
title={TUNING SEARCH ENGINE TO FIT XML RETRIEVAL SCENARIO},
booktitle={Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2007},
pages={228-233},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001278802280233},
isbn={978-972-8865-77-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - TUNING SEARCH ENGINE TO FIT XML RETRIEVAL SCENARIO
SN - 978-972-8865-77-1
AU - Hubert G.
AU - Mothe J.
AU - Englmeier K.
PY - 2007
SP - 228
EP - 233
DO - 10.5220/0001278802280233