STRUCTURE-BASED INTERROGATION AND AUTOMATIC QUERY REFORMULATION

Mohamed Ben Aouicha, Ines Kamoun, Mohamed Tmar, Abdelmajid Ben Hamadou

Abstract

This paper presents an information retrieval model on XML documents based on tree matching. Queries and documents are represented by extended trees. An extended tree is built starting from the original tree, with additional weighted virtual links between each node and its indirect descendants allowing to directly reach each descendant. Therefore only one level separates between each node and its indirect descendants. This allows to compare the user query and the document with flexibility and with respect to the query structural hints. The content of each node is however very important to decide whether a document element is relevant or not, thus the content should be taken into account in the retrieval process. We separate between the structurebased and the content-based retrieval processes. The structure notion should be taken into account during the retrieval process as well as during automatic query reformulation. We propose an approach to automatic query reformulation starting from the original query on one hand and the fragments judged relevant by the user on the other. Structure hints analysis allows us to identify nodes that match the user query and to rebuild it during the automatic query reformulation step. The main goal of this paper is to show the impact of structural hints in XML retrieval and XML query optimization. Some experiments have been undertaken into a dataset provided by INEXa to show the effectiveness of our proposals. aINitiative for the Evaluation of XML retrieval, an evaluation forum that aims at promoting retrieval capabilities on XML documents.

References

  1. Balog, K., Bron, M., and de Rijke, M. (2010). Categorybased query modeling for entity search. 32nd European Conference on Information Retrieval, ECIR, pages 319-331.
  2. Ben Aouicha, M., Tmar, M., Boughanem, M., and Abid, M. (2006). Vers une stratgie de recherche d'information structure base sur la comparaison d'arbres. CORIA, pages 65-72.
  3. Ben Aouicha, M., Tmar, M., Boughanem, M., and Abid, M. (2009). Experiments on element and document statics for xml retrieval based on tree matching. International Journal of Computer and Information Science and Engineering, IJCISE, 3(1):7-16.
  4. Bordogna, G. and Pasi, G. (2000). Flexible querying of structured documents. Proc. of the fourth International Conference on Flexible Query Answering Systems(FQAS).
  5. Fuhr, N. and Grossjohann, K. (2001). Xirql: A query language for information retrieval in xml documents. Proc. of the 24th annual ACM SIGIR conference on research and development in Information Retrieval, New Orleans, USA, pages 172-180.
  6. Piwowarski, B. and Dupret, G. (2006). Evaluation in (xml) information retrieval: Expected precision-recall with user modelling (eprum). Proc. of the 29th annual ACM SIGIR conference on research and development in Information Retrieval, pages 260-267.
  7. Rocchio, J. (1971). Relevance feedback in information retrieval. Prentice Hall Inc., englewood cliffs, nj edition.
  8. Schlieder, T. and Meuss, H. (2002). Querying and ranking xml documents. Journal of the American Society for Information Science and Technology, 6(53):489-503.
  9. Selkow, S. M. (1977). The tree-to-tree edition problem. Information processing letters, pages 184-186.
Download


Paper Citation


in Harvard Style

Ben Aouicha M., Kamoun I., Tmar M. and Ben Hamadou A. (2011). STRUCTURE-BASED INTERROGATION AND AUTOMATIC QUERY REFORMULATION . In Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2011) ISBN 978-989-8425-81-2, pages 123-128. DOI: 10.5220/0003624601230128


in Bibtex Style

@conference{kmis11,
author={Mohamed Ben Aouicha and Ines Kamoun and Mohamed Tmar and Abdelmajid Ben Hamadou},
title={STRUCTURE-BASED INTERROGATION AND AUTOMATIC QUERY REFORMULATION},
booktitle={Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2011)},
year={2011},
pages={123-128},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003624601230128},
isbn={978-989-8425-81-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2011)
TI - STRUCTURE-BASED INTERROGATION AND AUTOMATIC QUERY REFORMULATION
SN - 978-989-8425-81-2
AU - Ben Aouicha M.
AU - Kamoun I.
AU - Tmar M.
AU - Ben Hamadou A.
PY - 2011
SP - 123
EP - 128
DO - 10.5220/0003624601230128