tion performance. However it lacks - as reported in
this paper - the labeling or definition of the clusters.
So again the user has to check by reading at least some
snippets inside a cluster (Cucerzan, 2007).
7 CONCLUSIONS AND
OUTLOOK
We presented an approach of guided interactive topic
graph extraction for exploration of web content. The
initial information request is issued online by a user
to the system in the form of a query topic description.
Instead of directly computing and presenting a topic
graph for the user query, possible senses of the query
are identified and enumerated by referring to an exter-
nal knowledge base, Wikipedia in our case. All found
readings are then sorted and presented to the user and
the user is asked to select her preferred one. The user–
selected sense is then used for constructing an initial
topic graph from a set of web snippets returned by a
standard search engine. At this point, the topic graph
already represents a graph of strongly correlated rel-
evant entities and terms. The topic graph is then dis-
played on a tablet computer (in our case an iPad) as
touch–sensitive graph. The user can then request fur-
ther detailed information through multiple iterations.
Experimental results achieved by means of an au-
tomatic evaluation procedure demonstrate the benefit
of the disambiguation method for exploratory search
strategies. The automatic evaluation has been ap-
proved by another human evaluation. Currently, the
main problem of our approach arises when an am-
biguous query cannot be found in Wikipedia using our
strategy. For example, the query “Famous Jim Clark”
would not be found as we require that all words of
the query occur in an Wikipedia article’s title. Even
if we could cope with this using a modified fuzzy
search strategy we still would not find out ambiguities
in queries that simply are not present in Wikipedia.
However, in the running system we plan to give some
feedback to the user by changing the color of the
search entry. Then the user knows that there may
be more then just one meaning for her query. An-
other open question is whether an improvement of
our rather simple way of expanding the query using
Wikipedia abstracts will lead to significant improve-
ments of the disambiguation results. We are planning
to do some research on this.
ACKNOWLEDGEMENTS
This research was conducted in the context of the
project Deependance (funded by the German Fed-
eral Ministry of Education and Research, contract
01IW11003)
REFERENCES
Akhavi, M. S., Rahmati, M., and Amini, N. N. (2007).
3d visualization of hierarchical clustered web search
results. In Proceedings of the Computer Graphics,
Imaging and Visualisation, CGIV ’07, pages 441–
446, Washington, DC, USA. IEEE Computer Society.
Breese, J. S., Heckerman, D., and Kadie, C. (1998). Empir-
ical analysis of predictive algorithms for collaborative
filtering. pages 43–52. Morgan Kaufmann.
Chirita, P. A., Nejdl, W., Paiu, R., and Kohlschütter, C.
(2005). Using odp metadata to personalize search.
In Proceedings of the 28th annual international ACM
SIGIR conference on Research and development in in-
formation retrieval, SIGIR ’05, pages 178–185, New
York, NY, USA. ACM.
Cucerzan, S. (2007). Large-scale named entity disambigua-
tion based on wikipedia data. In In Proc. 2007 Joint
Conference on EMNLP and CNLL, pages 708–716.
Di Giacomo, E., Didimo, W., Grilli, L., and Liotta, G.
(2007). Graph visualization techniques for web clus-
tering engines. IEEE Transactions on Visualization
and Computer Graphics, 13:294–304.
Fader, A., Soderland, S., and Etzioni, O. (2011). Identifying
relations for open information extraction. In Proceed-
ings of the 2011 Conference on Empirical Methods in
Natural Language Processing, EMNLP 2011, pages
1535–1545.
Gauch, S., Chaffee, J., and Pretschner, A. (2003). Ontology-
based personalized search and browsing. Web Intelli.
and Agent Sys., 1:219–234.
Gimenez, J. and Marquez., L. (2004). Svmtool: A gen-
eral pos tagger generator based on support vector ma-
chines. In In Proceedings of the 4th International
Conference on Language Resources and Evaluation
(LREC’04), vol. I, pages 43 - 46. Lisbon, Portugal,
2004. (ISBN 2-9517408-1-6).
Haveliwala, T. H. (2002). Topic-sensitive pagerank. In
Proceedings of the 11th international conference on
World Wide Web, WWW ’02, pages 517–526, New
York, NY, USA. ACM.
Hearst, M. A. (2009). Search User Interfaces. Cambridge
University Press.
Hoeber, O. and Yang, X. D. (2006). Interactive web in-
formation retrieval using wordbars. In Proceedings
of the 2006 IEEE/WIC/ACM International Conference
on Web Intelligence, WI ’06, pages 875–882, Wash-
ington, DC, USA. IEEE Computer Society.
Jeh, G. and Widom, J. (2003). Scaling personalized web
search. In Proceedings of the 12th international con-
ference on World Wide Web, WWW ’03, pages 271–
279, New York, NY, USA. ACM.
GuidedExploratorySearchontheMobileWeb
73