DISAMBIGUATING WEB SEARCH RESULTS BY TOPIC AND TEMPORAL CLUSTERING - A Proposal

Ricardo Campos, Gaël Dias, Alípio Mário Jorge

2009

Abstract

With so much information available on the web, finding relevant documents has become a difficult task. Temporal features play an important role here: adding a time dimension makes it possible to restrict a search by time and so recreate a particular moment of a set of web pages. Despite its importance, temporal information is still under-considered by current search engines, which limit themselves to capturing the most recent snapshot of the information. In this paper, we describe the architecture of a temporal search engine that uses timelines to browse search results. More specifically, we intend to add a time measure for clustering web page results by analyzing web page contents, supporting the search for both temporal and non-temporal information embedded in web documents.
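As a rough illustration of what clustering results by a time measure could look like, the sketch below groups search results by the years mentioned in their text and keeps results without any date in a separate non-temporal group. It is only a minimal sketch under stated assumptions, not the architecture described in the paper: the function name `cluster_by_year`, the year pattern, and the sample results are all hypothetical.

```python
import re
from collections import defaultdict

# Assumption: a "temporal expression" is simply a four-digit year (1900-2099).
# Real temporal tagging would need to handle far richer expressions.
YEAR_PATTERN = re.compile(r"\b(?:19|20)\d{2}\b")


def cluster_by_year(results):
    """Group (title, text) results by each year mentioned in their text.

    Results whose text contains no recognizable year are stored under
    the key None, standing in for the non-temporal group.
    """
    timeline = defaultdict(list)
    for title, text in results:
        years = sorted({int(y) for y in YEAR_PATTERN.findall(text)})
        if not years:
            timeline[None].append(title)
        for year in years:
            timeline[year].append(title)
    return dict(timeline)


# Hypothetical search results, invented for illustration only.
results = [
    ("Page A", "The festival was first held in 1998 and returned in 2004."),
    ("Page B", "A general introduction with no dates."),
    ("Page C", "Results of the 2004 election."),
]
timeline = cluster_by_year(results)
```

A page that mentions several years appears in several groups, which matches the idea of browsing results along a timeline; a real system would also need to combine this with topic clustering.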



Paper Citation


in Harvard Style

Campos R., Dias G. and Mário Jorge A. (2009). DISAMBIGUATING WEB SEARCH RESULTS BY TOPIC AND TEMPORAL CLUSTERING - A Proposal. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009) ISBN 978-989-674-011-5, pages 292-296. DOI: 10.5220/0002301102920296


in Bibtex Style

@conference{kdir09,
author={Ricardo Campos and Gaël Dias and Alípio Mário Jorge},
title={DISAMBIGUATING WEB SEARCH RESULTS BY TOPIC AND TEMPORAL CLUSTERING - A Proposal},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)},
year={2009},
pages={292-296},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002301102920296},
isbn={978-989-674-011-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)
TI - DISAMBIGUATING WEB SEARCH RESULTS BY TOPIC AND TEMPORAL CLUSTERING - A Proposal
SN - 978-989-674-011-5
AU - Campos R.
AU - Dias G.
AU - Mário Jorge A.
PY - 2009
SP - 292
EP - 296
DO - 10.5220/0002301102920296