DRANK+: A DIRECTORY BASED PAGERANK PREDICTION METHOD FOR FAST PAGERANK CONVERGENCE

Hung-Yu Kao, Chia-Sheng Liu, Yu-Chuan Tsai, Chia-Chun Shih, Tse-Ming Tse-Ming

Abstract

In recent years, most part of search engines use link analysis algorithms to measure the importance of web pages. The most famous link analysis algorithm is PageRank algorithm. However, previous researches in recent years have found that there exists an inherent bias against newly created pages in PageRank. In the previous work, a new ranking algorithm called DRank has been proposed to solve this issue. It utilizes the cluster phenomenon of PageRank in a directory to predict the possible importance of pages in the future and to diminish the inherent bias of search engines to new pages. In this paper, we modify the original DRank algorithm to complement the weaker part of DRank which could fail while the number of pages in directory is not enough. In our experiments, the augmented algorithm, i.e., DRank+ algorithm, obtains more accuracy in predicting the importance score of pages at next time stage than the original DRank algorithm. DRank+ not only alleviates the bias of newly created pages successfully but also reaches more accuracy than Page Quality and original DRank in predicting the importance of newly created pages.

References

  1. Abiteboul, S., Preda, M., and Cobna, G., 2003. Adaptive on-line page importance computation. In Proceedings of the International World-Wide Web Conference.
  2. Brin, S. and Page, L., 1998. The anatomy of a large-scale hypertextual web search engine. In Proceedings of WWW Conference.
  3. Cho, J., Roy, S. and Adams, R. E., 2005. Page Quality: In Search of an Unbiased Web Ranking. In Proc. of the SIGMOD Conference.
  4. Cho, J. and Roy, S., 2004. Impact of Search Engines on Page Popularity. In Proceedings of the International World-Wide Web Conference.
  5. Eiron, N. and McCurley, K. S., 2003. Locality, Hierarchy, and Bidirectionality on the Web. In Workshop on Web Algorithms and Models.
  6. Haveliwala, T. H., 2002. Topic-sensitive pagerank. In Proceedings of the International World-Wide Web Conference.
  7. Jiang, X. M., Xue, G. R., Zeng, H. J., Chen, Z., Song, W.- G. and Ma, W.-Y., 2004. Exploiting PageRank Analysis at Different Block Level. In Proceedings of Conference of WISE.
  8. Kamvar, S., Haveliwala, T., Manning, C., and Golub, G., 2003. Extrapolation methods for accelerating pagerank computations. In Proceedings of the International World-Wide Web Conference.
  9. Kumar, R., Raghavan, P., Rajagopalan, S. and Sivakumar, D., 2000. Stochastic models for the Web graph. In Proceedings of the 41st Annual Symposium on Foundations of Computer Science.
  10. Kao, H.-Y. and Lin, S.-F., 2007. A Fast PageRank Convergence Method based on the Cluster Prediction. In Proceedings of IEEE/WIC/ACM International Conference on Web Intelligence.
  11. Salton, G. and McGill, M. J., 1983. Introduction to modern information retrieval. McGraw-Hill.
  12. Wang, W., Yang, J., Muntz, R., 1997. STING: A Statistical Information Grid Approach to Spatial Data Ming. In Proceedings of the 23rd VLDB Conference.
  13. Xing, W. and Ghorbani, A., 2004. Weighted PageRank Algorithm. In Proceedings of the 2nd Annual Conference on Communication Networks and Services Research. 305-314.
  14. Xue, G.-R, Yang, Q., Zeng, H.-J., Yu, Y., Chen, Z., 2005. Exploiting the Hierarchical Structure for Link Analysis. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.
  15. Yates, R. B., Castillo, C. and Jean, F. S., 2002. Web Dynamics, Structure, and Page Quality. In Proceedings of SPIRE Conference.
Download


Paper Citation


in Harvard Style

Kao H., Liu C., Tsai Y., Shih C. and Tse-Ming T. (2008). DRANK+: A DIRECTORY BASED PAGERANK PREDICTION METHOD FOR FAST PAGERANK CONVERGENCE . In Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-8111-27-2, pages 175-180. DOI: 10.5220/0001521701750180


in Bibtex Style

@conference{webist08,
author={Hung-Yu Kao and Chia-Sheng Liu and Yu-Chuan Tsai and Chia-Chun Shih and Tse-Ming Tse-Ming},
title={DRANK+: A DIRECTORY BASED PAGERANK PREDICTION METHOD FOR FAST PAGERANK CONVERGENCE},
booktitle={Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2008},
pages={175-180},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001521701750180},
isbn={978-989-8111-27-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - DRANK+: A DIRECTORY BASED PAGERANK PREDICTION METHOD FOR FAST PAGERANK CONVERGENCE
SN - 978-989-8111-27-2
AU - Kao H.
AU - Liu C.
AU - Tsai Y.
AU - Shih C.
AU - Tse-Ming T.
PY - 2008
SP - 175
EP - 180
DO - 10.5220/0001521701750180