QUERY EXPANSION WITH MATRIX CORRELATION TECHNIQUES - A Systematic Approach

Claudio Biancalana, Antonello Lapolla, Alessandro Micarelli

2008

Abstract

This paper presents an Information Retrieval system that employs techniques based on Personalization and Query Expansion (QE). The system was developed in an incremental and iterative way, starting from a simpler system and reaching a more complex one, to the point that it is possible to talk about several systems each based on a specific, deeply analyzed approach: four systems sharing the concept of term co-occurrence. Starting from a simple system based on bigrams, we moved onto a system based on term proximity, through an approach known in the literature as Hyperspace Analogue to Language (HAL), and eventually developing a solution based on co-occurrence at page level. The latter presents a hybrid approach based on term proximity. This novel architecture is shown here for the first time to our knowledge.

References

  1. Anick, P. (2003). Using terminological feedback for web search refinement: a log-based study. In SIGIR 7803: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, pages 88-95, New York, NY, USA. ACM Press.
  2. Bai, J., Song, D., Bruza, P., Nie, J.-Y., and Cao, G. (2005). Query expansion using term relationships in language models for information retrieval. In CIKM, pages 688-695.
  3. Bruza, P. D. and Song, D. (2002). Inferring query models by computing information flow. In CIKM 7802: Proceedings of the eleventh international conference on Information and knowledge management, pages 260- 269, New York, NY, USA. ACM Press.
  4. Burgess, C., Livesay, K., and Lund, K. (1999). Exploration in Context Space: Words, Sentences, Discourse. Discourse Processes, 25(2&3):211-257.
  5. Gao, J., Nie, J.-Y., Wu, G., and Cao, G. (2004). Dependence language model for information retrieval. In SIGIR 7804: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, pages 170-177, New York, NY, USA. ACM Press.
  6. Gasparetti, F. and Micarelli, A. (2007). Personalized search based on a memory retrieval theory. International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI): Special Issue on Personalization Techniques for Recommender Systems and Intelligent User Interfaces, 21(2):207-224.
  7. Jansen, B. J., Spink, A., and Saracevic, T. (2000). Real life, real users, and real needs: a study and analysis of user queries on the web. Information Processing and Management, 36(2):207-227.
  8. Porter, M. F. (1997). An algorithm for suffix stripping. pages 313-316.
  9. Radlinski, F. and Joachims, T. (2005). Query chains: Learning to rank from implicit feedback.
  10. Salton, G. and Buckley, C. (1997). Improving retrieval performance by relevance feedback. pages 355-364.
  11. Salton, G., Wong, A., and Yang, C. S. (1975). A vector space model for automatic indexing. Commun. ACM, 18(11):613-620.
  12. Schütze, H. and Pedersen, J. O. (1997). A cooccurrencebased thesaurus and two applications to information retrieval. Inf. Process. Manage., 33(3):307-318.
  13. Teevan, J., Dumais, S. T., and Horvitz, E. (2005). Personalizing search via automated analysis of interests and activities. In SIGIR 7805: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 449-456, New York, NY, USA. ACM Press.
Download


Paper Citation


in Harvard Style

Biancalana C., Lapolla A. and Micarelli A. (2008). QUERY EXPANSION WITH MATRIX CORRELATION TECHNIQUES - A Systematic Approach . In Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-8111-27-2, pages 34-41. DOI: 10.5220/0001520000340041


in Bibtex Style

@conference{webist08,
author={Claudio Biancalana and Antonello Lapolla and Alessandro Micarelli},
title={QUERY EXPANSION WITH MATRIX CORRELATION TECHNIQUES - A Systematic Approach},
booktitle={Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2008},
pages={34-41},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001520000340041},
isbn={978-989-8111-27-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - QUERY EXPANSION WITH MATRIX CORRELATION TECHNIQUES - A Systematic Approach
SN - 978-989-8111-27-2
AU - Biancalana C.
AU - Lapolla A.
AU - Micarelli A.
PY - 2008
SP - 34
EP - 41
DO - 10.5220/0001520000340041