For testing the model, we chose Google as our
ranking expert and compared the performance of
HITS and XHITS in relation to it. The gains of
XHITS’ model over HITS’ are substantial as shown
in the experimental result, over 200 % gain of qual-
ity. One promising direction for future work that we
are exploring is to extend this work by changing the
benchmark and apply the XHITS to GOV2 collection
and compare the performance with others ranking al-
gorithms already explored and reported in the litera-
ture.
ACKNOWLEDGEMENTS
Part of this work is supported by Brazilian Army
Technology Center - CTEx. We give our thanks to
all people who have contributed to this research and
development.
REFERENCES
Agichtein, E., Brill, E., and Dumais, S. (2006). Improv-
ing web search ranking by incorporating user behav-
ior information. In SIGIR ’06: Proceedings of the
29th annual international ACM SIGIR conference on
Research and development in information retrieval,
pages 19–26, New York, NY, USA. ACM.
Agosti, M. and Pretto, L. (2005). A theoretical study of a
generalized version of kleinberg’s hits algorithm. Inf.
Retr., 8(2):219–243.
Borodin, A., Roberts, G. O., Rosenthal, J. S., and Tsaparas,
P. (2001). Finding authorities and hubs from link
structures on the world wide web.
Chakrabarti, S., Joshi, M., and Tawde, V. (2001). Enhanced
topic distillation using text, markup tags, and hyper-
links. pages 208–216.
Cohn, D. and Chang, H. (2000). Learning to probabilisti-
cally identify authoritative documents.
Ding, C., He, X., Husbands, P., Zha, H., and Simon, H. D.
(2002a). Pagerank, HITS and a unified framework for
link analysis. In Proceedings of the 25th Annual In-
ternational ACM SIGIR Conference on Research and
Development in Information Retrieval, Poster session,
pages 353–354.
Ding, C., Zha, H., Simon, H., and He, X. (2002b). Link
analysis: Hubs and authorities on the world wide web.
Filho, F. B. (2005). Xhits: Extending the hits algorithm
for distillation of broad search topic on www. Mas-
ter’s thesis, Pontif
´
ıcia Universidade Cat
´
olica do Rio
de Janeiro, Rio de Janeiro, Rio de Janeiro, Brazil.
Fowler, R. H. and Karadayi, T. (2002). Visualizing the web
as hubs and authorities richard H. fowler and tarkan
karadayi.
Giles, C. L., Flake, G. W., and Lawrence, S. (2000). Effi-
cient identification of web communities.
Kalaba, R., Spingarn, K., and Tesfatsion, L. (1981). Vari-
ational equations for the eigenvalues and eigenvectors
of nonsymmetric matrices. Journal of Optimization
Theory and Applications: Vol. 33, No. 1.
Kleinberg, J. M. (1999). Hubs, authorities, and communi-
ties. ACM Computing Surveys (CSUR), 31(4es):5.
Lempel, R. and Moran, S. (2001). SALSA: the stochastic
approach for link-structure analysis. ACM Transac-
tions on Information Systems, 19(2):131–160.
Mendelzon, A. O. and Rafiei, D. (2000). What is this page
known for? computing web page reputations.
Mizzaro, S. and Robertson, S. (2007). Hits hits trec: ex-
ploring ir evaluation results with network analysis. In
SIGIR ’07: Proceedings of the 30th annual interna-
tional ACM SIGIR conference on Research and devel-
opment in information retrieval, pages 479–486, New
York, NY, USA. ACM.
Searle, S. R. (1982). Matrix Algebra Useful for Statistics.
John Wiley & Sons, NY, USA.
yu Kao, H., ming Ho, J., syan Chen, M., and hua Lin, S.
(2003). Entropy-based link analysis for mining web
informative structures.
XHITS - Multiple Roles in a Hyperlinked Structure
195