COVERAGE AND INDEPENDENCE - Defining Quality in Web Search Results

Panagiotis Metaxas, Lilia Ivanova

Abstract

Web search results enjoy an increasingly greater importance in our daily lives. But what can be said about their quality, especially when querying a controversial issue? The traditional information retrieval metrics of precision and recall do not provide much insight in the case of the web. In this paper we examine new ways of evaluating quality in search results: coverage and independence. We give examples on how these new metrics can be calculated and what their values reveal regarding the two major search engines, Google and Yahoo.

References

  1. Amento, B., Terveen, L., and Hill, W. (2000). Does authority mean quality? Predicting expert quality ratings of web documents. In Proceedings of the TwentyThird Annual International ACM SIGIR Conference Berenson, A. (2000). On hair-trigger wall street, a stock plunges on fake news. New York Times.
  2. Brin, S. and Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117.
  3. Google (2003). The Google http://code.google.com/apis/.
  4. Graham, L. and Metaxas, P. T. (2003). “Of course it's true; i saw it on the internet!”: Critical thinking in the internet era. Commun. ACM, 46(5):70-75.
  5. Gyuongyi, Z. and Garcia-Molina, H. (2005). Web spam taxonomy. In Proceedings of the First International Workshop on Adversarial Information Retrieval on the Web, Chiba, Japan.
  6. Manning, C., Raghavan, P., and Schultze, H. (2008). Introduction to Information Retrieval. Cambridge Press, Cambridge, UK, (forthcoming) edition.
  7. Metaxas, P. T. and Destefano, J. (2005). Web spam, propaganda and trust. In Proceedings of the First International Workshop on Adversarial Information Retrieval on the Web, Chiba, Japan.
  8. Moran, M. and Hunt, B. (2006). Search Engine Marketing. IBM Press, New Jersey, USA.
  9. Ntoulas, A., Cho, J., and Olston, C. (2004). What's new on the web? the evolution of the web from a search engine perspective. In Proceedings of the WWW 2004 Conference, New York, NY.
  10. Silverstein, C., Marais, H., Henzinger, M., and Moricz, M. (1999). Analysis of a very large web search engine query log. SIGIR Forum, 33(1):6-12.
  11. Vedder, A. (2001). Misinformation through the internet: Epistemology and ethics. Intersentia, Antwerpen, Gronigen, Oxford.
  12. Yahoo (2006). The Yahoo search API, yahoo, inc. http://developer.yahoo.com/search/.
Download


Paper Citation


in Harvard Style

Metaxas P. and Ivanova L. (2008). COVERAGE AND INDEPENDENCE - Defining Quality in Web Search Results . In Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-8111-27-2, pages 106-113. DOI: 10.5220/0001529201060113


in Bibtex Style

@conference{webist08,
author={Panagiotis Metaxas and Lilia Ivanova},
title={COVERAGE AND INDEPENDENCE - Defining Quality in Web Search Results},
booktitle={Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2008},
pages={106-113},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001529201060113},
isbn={978-989-8111-27-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - COVERAGE AND INDEPENDENCE - Defining Quality in Web Search Results
SN - 978-989-8111-27-2
AU - Metaxas P.
AU - Ivanova L.
PY - 2008
SP - 106
EP - 113
DO - 10.5220/0001529201060113