recognition performance and and less run time,
specially with larger and difficult datasets. Note
in scenarios 2 & 4, performance of IF is around
40% while that of MH is below 5%. This suggests
that MH is more suited to near duplicate detection
applications.
7. The overall performance of BoW methods is still
disappointing. For 400K images, the recognition
rate is less than 40% for some scenarios. This sug-
gests that more research is needed to improve the
performance of BoW. Possible directions include
better ways to generate the visual words, better
ways to incorporate geometric information, and
to combine information from different features or
dictionaries.
ACKNOWLEDGEMENTS
This work was supported by ONR grant N00173-09-
C-4005.
REFERENCES
Aly, M., Munich, M., and Perona, P. (2011). Indexing in
large scale image collections: Scaling properties and
benchmark. In WACV.
Aly, M., Welinder, P., Munich, M., and Perona, P. (2009a).
Scaling object recognition: Benchmark of current
state of the art techniques. In ICCV Workshop WS-
LAVD.
Aly, M., Welinder, P., Munich, M., and Perona, P. (2009b).
Towards Automated Large Scale Discovery of Image
Families. In CVPR Workshop on Internet Vision.
Arya, S., Mount, D., Netanyahu, N., Silverman, R., and
Wu, A. (1998). An optimal algorithm for approxi-
mate nearest neighbor searching. Journal of the ACM,
45:891–923.
Baeza-Yates, R. and Ribeiro-Neto, B. (1999). Modern In-
formation Retrieval. ACM Press.
Broder, A. (1997). On the resemblance and containment of
documents. In Proc. Compression and Complexity of
Sequences 1997, pages 21–29.
Broder, A., Charikar, M., and Mizenmacher, M. (2000).
Min-wise independent permutations. Journal of Com-
puter and System Sciences, 60:630–659.
Broder, A. Z., Glassman, S. C., Manasse, M. S., and Zweig,
G. (1997). Syntactic clustering of the web. Computer
Networks and ISDN Systems, 29:8–13.
Chum, O., Perdoch, M., and Matas, J. (2009). Geometric
min-hashing: Finding a (thick) needle in a haystack.
In CVPR.
Chum, O., Philbin, J., Isard, M., and Zisserman, A. (2007a).
Scalable near identical image and shot detection. In
CIVR, pages 549–556.
Chum, O., Philbin, J., Sivic, J., Isard, M., and Zisserman,
A. (2007b). Total recall: Automatic query expansion
with a generative feature model for object retrieval. In
ICCV.
Chum, O., Philbin, J., and Zisserman, A. (2008). Near du-
plicate image detection: min-hash and tf-idf weight-
ing. In British Machine Vision Conference.
Forsyth, D. and Ponce, J. (2002). Computer Vision: A mod-
ern approach. Prentice Hall.
Geusebroek, J. M., Burghouts, G. J., and Smeulders, A.
W. M. (2005). The amsterdam library of object im-
ages. IJCV, 61:103–112.
J
´
egou, H., Douze, M., and Schmid, C. (2008). Hamming
embedding and weak geometric consistency for large
scale image search. In ECCV.
J
´
egou, H., Douze, M., and Schmid, C. (2009). Packing bag-
of-features. In ICCV.
Lowe, D. (2004). Distinctive image features from scale-
invariant keypoints. IJCV.
Mikolajczyk, K. and Schmid, C. (2004). Scale and affine
invariant interest point detectors. IJCV.
Nister, D. and Stewenius, H. (2006). Scalable recognition
with a vocabulary tree. CVPR.
Philbin, J., Chum, O., Isard, M., Sivic, J., and Zisserman, A.
(2007). Object retrieval with large vocabularies and
fast spatial matching. CVPR.
Philbin, J., Chum, O., Isard, M., Sivic, J., and Zisserman,
A. (2008). Lost in quantization: Improving particu-
lar object retrieval in large scale image databases. In
CVPR.
Zobel, J. and Moffat, A. (2006). Inverted files for text search
engines. ACM Comput. Surv.
VISAPP 2011 - International Conference on Computer Vision Theory and Applications
306