ACCESS RIGHTS IN ENTERPRISE FULL-TEXT SEARCH - Searching Large Intranets Effectively using Virtual Terms

Jan Kasprzak, Michal Brandejs, Matěj Čuhel, Tomaš Obšivač

Abstract

One of the toughest problems in deploying an enterprise-wide full-text search system is handling the access rights of documents and intranet web pages correctly and efficiently. Post-processing the results of a general-purpose full-text search engine (filtering out the documents inaccessible to the user who sent the query) can be an expensive operation, especially in large document collections. We discuss various approaches to this problem and propose a novel method which employs virtual tokens to encode the access rights directly into the search index. We then evaluate this approach in an intranet system with several million documents and a complex set of access rights and access rules.
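
The general idea of encoding access rights as virtual tokens can be illustrated with a toy inverted index. The sketch below is not the authors' implementation: the class `TinyIndex`, the `("acl", group)` key scheme, the whitespace tokenizer, and the group names are all assumptions made for illustration. Each document is indexed under its ordinary text terms plus one virtual term per group allowed to read it, so a query can be restricted to readable documents inside the index lookup itself rather than by filtering results afterwards.

```python
from collections import defaultdict


class TinyIndex:
    """Toy inverted index mapping (kind, term) keys to sets of document ids."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, text, allowed_groups):
        # Ordinary terms taken from the document text.
        for term in text.lower().split():
            self.postings[("text", term)].add(doc_id)
        # Virtual terms: one per group that may read the document.
        # They live under a separate key kind, so document text can
        # never collide with them (a hypothetical convention).
        for group in allowed_groups:
            self.postings[("acl", group)].add(doc_id)

    def search(self, query, user_groups):
        terms = query.lower().split()
        if not terms:
            return set()
        # AND over the query terms.
        hits = set.intersection(
            *(self.postings.get(("text", t), set()) for t in terms)
        )
        # OR over the virtual terms matching the user's groups.
        visible = set().union(
            *(self.postings.get(("acl", g), set()) for g in user_groups)
        )
        # Only documents that match the query AND are readable remain.
        return hits & visible


index = TinyIndex()
index.add(1, "budget report 2010", {"finance"})
index.add(2, "budget meeting notes", {"staff", "finance"})
index.add(3, "public budget summary", {"everyone"})

print(index.search("budget", {"staff", "everyone"}))
```

In this sketch a user in the groups `staff` and `everyone` never sees document 1, because its posting list is intersected away before any result is returned. A real system would also need to handle changes to access rights and a far richer rule set than plain group membership, which this example leaves out.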



Paper Citation


in Harvard Style

Kasprzak J., Brandejs M., Čuhel M. and Obšivač T. (2010). ACCESS RIGHTS IN ENTERPRISE FULL-TEXT SEARCH - Searching Large Intranets Effectively using Virtual Terms. In Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8425-04-1, pages 32-39. DOI: 10.5220/0002896900320039


in Bibtex Style

@conference{iceis10,
author={Jan Kasprzak and Michal Brandejs and Matěj Čuhel and Tomaš Obšivač},
title={ACCESS RIGHTS IN ENTERPRISE FULL-TEXT SEARCH - Searching Large Intranets Effectively using Virtual Terms},
booktitle={Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2010},
pages={32-39},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002896900320039},
isbn={978-989-8425-04-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - ACCESS RIGHTS IN ENTERPRISE FULL-TEXT SEARCH - Searching Large Intranets Effectively using Virtual Terms
SN - 978-989-8425-04-1
AU - Kasprzak J.
AU - Brandejs M.
AU - Čuhel M.
AU - Obšivač T.
PY - 2010
SP - 32
EP - 39
DO - 10.5220/0002896900320039