practice, this will not be the case, and our experimen-
tal results must be considered a best case. Neverthe-
less, we are optimistic that classification errors can be
kept small. In particular, documents could be clas-
sified during indexing, when considerably more in-
formation than just the result set is available. And,
over time, users are likely to learn the classification
ontology and increase the frequency of choosing the
correct class.
REFERENCES
Bar-Ilan, J., Zhu, Z., and Levene, M. (2009). Topic-specific
analysis of search queries. In WSCD ’09: Proceed-
ings of the 2009 workshop on Web Search Click Data,
pages 35–42, New York, NY, USA. ACM.
Beitzel, S. M., Jensen, E. C., Frieder, O., Grossman, D.,
Lewis, D. D., Chowdhury, A., and Kolcz, A. (2005).
Automatic web query classification using labeled and
unlabeled training data. In SIGIR ’05: Proceedings
of the 28th annual international ACM SIGIR confer-
ence on Research and development in information re-
trieval, pages 581–582, New York, NY, USA. ACM.
Broder, A. Z., Fontoura, M., Gabrilovich, E., Joshi, A., Josi-
fovski, V., and Zhang, T. (2007). Robust classification
of rare queries using web knowledge. In SIGIR ’07:
Proceedings of the 30th annual international ACM SI-
GIR conference on Research and development in in-
formation retrieval, pages 231–238, New York, NY,
USA. ACM.
Cao, H., Hu, D. H., Shen, D., Jiang, D., Sun, J.-T., Chen,
E., and Yang, Q. (2009). Context-aware query classi-
fication. In SIGIR ’09: Proceedings of the 32nd inter-
national ACM SIGIR conference on Research and de-
velopment in information retrieval, pages 3–10, New
York, NY, USA. ACM.
Carpineto, C., Osi´nski, S., Romano, G., and Weiss, D.
(2009). A survey of web clustering engines. ACM
Comput. Surv., 41(3):1–38.
Chen, H. and Dumais, S. (2000). Bring order to the web:
Automatically categorizing search results. In CHI ’00:
Proceedings of the SIGCHI conference on Human fac-
tors in computing systems, pages 145–152, New York,
NY, USA. ACM Press.
Joachims, T., Granka, L., Pan, B., Hembrooke, H., and Gay,
G. (2005). Accurately interpreting clickthrough data
as implicit feedback. In SIGIR ’05: Proceedings of the
28th annual international ACM SIGIR conference on
Research and development in information retrieval,
pages 154–161, New York, NY, USA. ACM.
Li, Y. and Zheng, Z. (2005). Kdd cup 2005. Online at
http://www.acm.org/sigs/sigkdd/kdd2005/kddcup.html.
Lin, C.-J., Weng, R. C., and Keerthi, S. S. (2007). Trust
region newton methods for large-scale logistic regres-
sion. In ICML ’07: Proceedings of the 24th inter-
national conference on Machine learning, pages 561–
568, New York, NY, USA. ACM.
Liu, Y., Fu, Y., Zhang, M., Ma, S., and Ru, L. (2007).
Automatic search engine performance evaluation with
click-through data analysis. In WWW ’07: Proceed-
ings of the 16th international conference on World
Wide Web, pages 1133–1134, New York, NY, USA.
ACM.
Shen, D., Pan, R., Sun, J.-T., Pan, J. J., Wu, K., Yin, J., and
Yang, Q. (2006). Query enrichment for web-query
classification. ACM Trans. Inf. Syst., 24(3):320–352.
Zamir, O. and Etzioni, O. (1999). Grouper: A dynamic
clustering interface to web search results. Computer
Networks, pages 1361–1374.
Zeng, H. J., He, Q. C., Chen, Z., Ma, W. Y., and Ma, J. W.
(2004). Learning to cluster web search results. In SI-
GIR ’04: Proceedings of the 27th annual international
ACM SIGIR conference on Research and development
in information retrieval, pages 210–217, New York,
USA. ACM Press.
Zhu, Z., Cox, I. J., and Levene, M. (2008). Ranked-listed or
categorized results in ir: 2 is better than 1. In NLDB
’08: Proceedings of the 13th international conference
on Natural Language and Information Systems, pages
111–123, Berlin, Heidelberg. Springer-Verlag.
APPENDIX
The list of classes, See below:
[Art, Auto, Companies & Business, Computing, Di-
rectories, Education, Employment, Entertainment, Fi-
nance & Economy, Food & Drink, Games, Gov-
ernment Organization, Health & Medicine, Holiday,
Home, Law & Legislation, Nature, News, People,
Places, Pornography, Religion, Science, Shopping,
Society & Community, Sports, Technology]
RANKING CLASSES OF SEARCH ENGINE RESULTS
301