4 CURRENT STATUS
The current prototype can be seen at the URL
http://mine.uni.lu/
˜
weires/serebif/search.php. To the
user, SEREBIF looks just like a usual simple search
engine (see figure 4), even though it in fact only redi-
rects the queries to the underlying search engine. The
Figure 4: SEREBIF user interface.
implementation of SEREBIF is currently able to cap-
ture the user interaction as described. As underlying
search engine, we are using the Google SOAP Search
API
1
.
Since the prototype is running just for a few weeks
now, the recorded data set is not very big yet. We pro-
moted our system mainly to fellow researchers and
captured user information for about 150 sessions, in-
cluding more than 650 queries and a total of over 1900
query terms up to now. We expect this amount of data
to be steadily growing while we continue realizing our
storage architecture.
5 CONCLUSIONS
In this paper, we described the SEREBIF system,
which captures implicit feedback collected from the
users of an existing search engine. This information
is used to enhance the results of later search queries.
To a certain extent, the search system adapts to the
behaviour of the majority of the users, which is sup-
posed to increase the quality of the results.
Our main future work currently lies in finishing
and evaluating our prototype as described. There are
also some other problems that we want to deal with in
the future. We have to define how to merge the feed-
back information with the (unknown) ranking func-
tion of the underlying search engine in an appropriate
1
http://code.google.com/apis/soapsearch/
way. Since we do not know the detailed ranking val-
ues of the results returned by the search engine, it is
difficult to determine how much feedback from the
users is needed to push a result up the list a certain
amount. However, we plan to develop an own search
engine to base SEREBIF upon, to be more indepen-
dent of external systems. In this case, we would be
able to access the exact ranking values for the result
list, which makes this problem much easier to deal
with.
Another problem of our approach is the vulner-
ability against fake feedback, which could possibly
be exploited to artificially influence the ranking. If
such a user-based approach would be included into a
large commercial search engine, people could try to
automatically generate false feedback for the system
to push certain pages up the result list. This could e.g.
be done by writing programs which submit a certain
query to SEREBIF and always request the result in the
result list which is desired to be promoted. Solutions
have to be found to cope with this, too.
REFERENCES
Gong, Z., Cheang, C. W., and U, L. H. (2005). Web query
expansion by wordnet. In Andersen, K. V., Deben-
ham, J. K., and Wagner, R., editors, DEXA, volume
3588 of Lecture Notes in Computer Science, pages
166–175. Springer.
Joachims, T. (2002). Optimizing search engines using click-
through data. In KDD, pages 133–142. ACM.
Kelly, D. and Belkin, N. J. (2004). Display time as implicit
feedback: understanding task effects. In Sanderson,
M., J
¨
arvelin, K., Allan, J., and Bruza, P., editors, SI-
GIR, pages 377–384. ACM.
Kelly, D. and Teevan, J. (2003). Implicit feedback for in-
ferring user preference: a bibliography. SIGIR Forum,
37(2):18–28.
Miller, G. A. (1995). Wordnet: A lexical database for en-
glish. Commun. ACM, 38(11):39–41.
Radlinski, F. and Joachims, T. (2005). Query chains: learn-
ing to rank from implicit feedback. In Grossman, R.,
Bayardo, R., and Bennett, K. P., editors, KDD, pages
239–248. ACM.
Salton, G. and Buckley, C. (1990). Improving retrieval per-
formance by relevance feedback. JASIS, 41(4):288–
297.
Schommer, C. and Schroeder, B. (2005). Anima: Associate
memories for categorical data streams. In Proceed-
ings of the 3rd International Conference on Computer
Science and its Applications (ICCSA).
Schroeder, B., Hilker, M., and Weires, R. (2007). Dynamic
association networks in information management. In
Proceedings of the 4th International Conference on
Machine Learning and Data Analysis (MLDA).
WEBIST 2008 - International Conference on Web Information Systems and Technologies
266