A PERSONALIZED INFORMATION SEARCH ASSISTANT

M. Elena Renda

2010

Abstract

A common characteristic of most of the traditional search and retrieval systems is that they are oriented towards a generic user, often failing to connect people with what they are really looking for. In this paper we present PI SA, a Personalized Information Search Assistant, which, rather than relying on the unrealistic assumption that the user will precisely specify what she is really looking for when searching, leverages implicit information about the user’s interests. PI SA is a desktop application which provides the user with a highly personalized information space where she can create, manage and organize folders (similarly to email programs), and manage documents retrieved by the system into her folders to best fit her needs. Furthermore, PI SA offers different mechanisms to search the Web, and the possibility of personalizing result delivery and visualization. PI SA learns user and folder profiles from user’s choices, and uses these profiles to improve retrieval effectiveness in searching by selecting the relevant resources to query and filtering the results accordingly. A working prototype has been developed, tested and evaluated. Preliminary user evaluation and experimental results are very promising, showing that the personalized search environment PI SA provides considerably increases effectiveness and user satisfaction in the searching process.

References

  1. Albayrak, S., Wollny, S., Varone, N., Lommatzsch, A., and Milosevic, D. (2005). Agent technology for personalized information filtering: the pia-system. In Proc. of the 2005 ACM Symposium on Applied Computing, pages 54-59, Santa Fe, New Mexico. ACM.
  2. Amato, G. and Straccia, U. (1999). User profile and applications to digital libraries. In Proc. of the 3rd European Conference on Research and Advanced Technology for Digital Libraries, pages 184-197, Paris, France. Springer-Verlag.
  3. Aslam, J. A., Pavlu, V., and Yilmaz, E. (2005). Measurebased metasearch. In Proc. of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil. ACM Press.
  4. Baillie, M., Crestani, F., and Landoni, M. (2006). Peng: integrated search of distributed news archives. In Proc. of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 607-608, Seattle, WA, USA. ACM.
  5. Belkin, N. and Croft, B. W. (1992). Information filtering and information retrieval: Two sides of the same coin? Communications of the ACM, 35(12):29-38.
  6. Callan, J. (1998). Learning while filtering documents. In Proc. of the 21th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 224-231, Melbourne, Australia.
  7. Callan, J. (2000). Distributed information retrieval. In Croft, W., editor, Advances in Information Retrieval, pages 127-150. Kluwer Academic Publishers, Hingham, MA, USA.
  8. Callan, J. and Connell, M. (2001). Query-based sampling of text databases. ACM Transactions on Information Systems, 19(2):97-130.
  9. Callan, J., Lu, Z., and Croft, B. W. (1995). Searching distributed collections with inference networks. In Proc. of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 21-28, Seattle, WA, USA.
  10. Carpineto, C., De Mori, R., Romano, G., and Bigi, B. (2001). An information-theoretic approach to automatic query expansion. ACM Transactions on Information Systems, 19(1):1-27.
  11. Chen, L. and Chue, W. L. (2005). Using web structure and summarisation techniques for web content mining. Information Processing and Management, 41(5):1225- 42.
  12. Dhamankar, R., Lee, Y., Doan, A., Halevy, A., and Domingos, P. (2004). iMAP: discovering complex semantic matches between database schemas. In Proc. of the ACM SIGMOD International Conference on Management of Data, pages 383-394. ACM Press.
  13. Dwork, C., Kumar, R., Noar, M., and Sivakumar, D. (2001). Rank aggregation methods for the web. In Proc. of the 10th International Conference on the World Wide Web, pages 613-622. ACM Press and Addison Wesley.
  14. Faensen, D., Faulstich, L., Schweppe, H., Hinze, A., and Steidinger, A. (2001). Hermes: a notification service for digital libraries. In ACM/IEEE Joint Conference on Digital Libraries, pages 373-380.
  15. Fernandez, L., Sanchez, J., and Garcia, A. (2000). MiBiblio: personal spaces in a digital library universe. In ACM DL, pages 232-233.
  16. Huang, R., Casanova, H., and Chien, A. A. (2007). Automatic resource specification generation for resource selection. In Proc. of the 2007 ACM/IEEE conference on Supercomputing, pages 1-11, New York, NY, USA. ACM.
  17. Lee, J. H. (1997). Analysis of multiple evidence combination. In Proc. of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 267-276, Philadelphia.
  18. Liu, X. and Croft, W. (2002). Passage retrieval based on language models. In Proc. of the 11th International Conference on Information and Knowledge Management, pages 375-382. ACM Press.
  19. Meng, W., Wu, Z., Yu, C., and Li, Z. (2001). A higly scalable and effective method for metasearch. ACM Transaction on Information Systems, 19(3):310-335.
  20. Nottelmann, H. and Fuhr, N. (2003). Evaluating different methods of estimating retrieval quality for resource selection. In Proc. of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.
  21. Renda, M. E. and Straccia, U. (2003). Web metasearch: Rank vs. score based rank aggregation methods. In Proc. 18th Annual ACM Symposium on Applied Computing, pages 841-846, Melbourne, Florida, USA. ACM.
  22. Renda, M. E. and Straccia, U. (2005). A personalized collaborative digital library environment: a model and an application. Information Processing & Management, 41(1):5-21.
  23. Renda, M. E. and Straccia, U. (2006). Automatic structured query transformation over distributed digital libraries. In Proc. of the 21st Annual ACM Symposium on Applied Computing, pages 1078-1083, Dijon, France. ACM Press.
  24. Salton, G. and McGill, J. M. (1983). Introduction to Modern Information Retrieval. Addison Wesley Publ. Co., Reading, Massachussetts.
  25. Sparck Jones, K. (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28:11-21.
  26. Teevan, J., Dumais, S. T., and Horvitz, E. (2005). Personalizing search via automated analysis of interests and activities. In Proc. of the 28th annual international ACM SIGIR Conference on Research and Development in Information Retrieval, pages 449-456, New York, NY, USA. ACM.
Download


Paper Citation


in Harvard Style

Elena Renda M. (2010). A PERSONALIZED INFORMATION SEARCH ASSISTANT . In Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST, ISBN 978-989-674-025-2, pages 29-39. DOI: 10.5220/0002793200290039


in Bibtex Style

@conference{webist10,
author={M. Elena Renda},
title={A PERSONALIZED INFORMATION SEARCH ASSISTANT},
booktitle={Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST,},
year={2010},
pages={29-39},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002793200290039},
isbn={978-989-674-025-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST,
TI - A PERSONALIZED INFORMATION SEARCH ASSISTANT
SN - 978-989-674-025-2
AU - Elena Renda M.
PY - 2010
SP - 29
EP - 39
DO - 10.5220/0002793200290039