The Difficulty of Path Traversal in Information Networks

Frank W. Takes, Walter A. Kosters

Abstract

This paper introduces a set of classification techniques for determining the difficulty — for a human — of path traversal in an information network. In order to ensure the generalizability of our approach, we do not use ontologies or concepts of expected semantic relatedness, but rather focus on local and global structural graph properties and measures to determine the difficulty of finding a certain path. Using a large corpus of over two million traversed paths on Wikipedia, we demonstrate how our techniques are able to accurately assess the human difficulty of finding a path between two articles within an information network.

References

  1. Agarwal, R., Veer Arya, K., and Shekhar, S. (2010). An architectural framework for web information retrieval based on user's navigational pattern. In Proceedings of the 5th International Conference on Industrial and Information Systems, pages 195-200.
  2. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., and Ives, Z. (2007). DBpedia: A nucleus for a web of open data. In Proceedings of 6th International Semantic Web Conference, pages 722-735.
  3. Bizer, C., Heath, T., and Berners-Lee, T. (2009). Linked data-the story so far. International Journal on Semantic Web and Information Systems, 5(3):1-22.
  4. Brandes, U. (2001). A faster algorithm for betweenness centrality. Journal of Mathematical Sociology, 25(2):163-177.
  5. He, B., Patel, M., Zhang, Z., and Chang, K. (2007). Accessing the deep web. Communications of the ACM, 50(5):94-101.
  6. Hsieh-Yee, I. (2001). Research on web search behavior. Library & Information Science, 23(2):167-185.
  7. Hu, J., Wang, G., Lochovsky, F., Sun, J., and Chen, Z. (2009). Understanding user's query intent with Wikipedia. In Proceedings of the 18th International World Wide Web Conference, pages 471-480.
  8. Kentsch, A. M., Kosters, W., van der Putten, P., and Takes, F. (2011). Exploratory recommendations using Wikipedia's linking structure. In Proceedings of the 20th Belgian Netherlands Conference on Machine Learning, pages 61-68.
  9. Milne, D. and Witten, I. (2008). Learning to link with Wikipedia. In Proceedings of the 17th Conference on Information and Knowledge Management, pages 509- 518.
  10. Teevan, J., Alvarado, C., Ackerman, M., and Karger, D. (2004). The perfect search engine is not enough: A study of orienteering behavior in directed search. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pages 415-422.
  11. Watts, D. and Strogatz, S. (1998). Collective dynamics of small-world-networks. Nature, 393(6684):440-442.
  12. West, R. and Leskovec, J. (2012a). Automatic versus human navigation in information networks. In Proceedings of the International Conference on Weblogs and Social Media. To appear.
  13. West, R. and Leskovec, J. (2012b). Human wayfinding in information networks. In Proceedings of the 21st World Wide Web Conference, pages 619-628.
Download


Paper Citation


in Harvard Style

W. Takes F. and A. Kosters W. (2012). The Difficulty of Path Traversal in Information Networks . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012) ISBN 978-989-8565-29-7, pages 138-144. DOI: 10.5220/0004104201380144


in Bibtex Style

@conference{kdir12,
author={Frank W. Takes and Walter A. Kosters},
title={The Difficulty of Path Traversal in Information Networks},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)},
year={2012},
pages={138-144},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004104201380144},
isbn={978-989-8565-29-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)
TI - The Difficulty of Path Traversal in Information Networks
SN - 978-989-8565-29-7
AU - W. Takes F.
AU - A. Kosters W.
PY - 2012
SP - 138
EP - 144
DO - 10.5220/0004104201380144