Large-scale Image Retrieval based on the Vocabulary Tree

Bo Cheng, Li Zhuo, Pei Zhang, Jing Zhang

2014

Abstract

In this paper, a vocabulary-tree-based large-scale image retrieval scheme is proposed that can achieve higher accuracy and speed. The novelty of this paper can be summarized as follows. First, because traditional Scale Invariant Feature Transform (SIFT) descriptors are excessively concentrated in some areas of an image, the SIFT extraction process is optimized to reduce the number of descriptors. Second, a color histogram in the Hue, Saturation, Value (HSV) color space is extracted as an additional image feature and combined with the optimized SIFT features. Third, Local Fisher Discriminant Analysis (LFDA) is applied to reduce the dimensionality of the SIFT and color features, which shortens the feature-clustering time. Finally, the dimension-reduced features are used to generate vocabulary trees, which are then used for large-scale image retrieval. Experimental results on several image datasets show that the proposed method achieves satisfactory retrieval precision.
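The last step of the pipeline can be illustrated with a minimal sketch of a vocabulary tree built by hierarchical k-means, the usual construction for such trees. This is not the authors' implementation: the function names (`build_tree`, `quantize`, `bag_of_words`), the farthest-point initialization, and the toy 2-D descriptors are assumptions made for illustration. In the paper the inputs would be the LFDA-reduced SIFT and HSV features.

```python
# Illustrative vocabulary tree via hierarchical k-means (pure Python).
# All names and the toy data are hypothetical, not taken from the paper.

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]

def init_centers(points, k):
    """Deterministic farthest-point initialization (an assumption of this sketch)."""
    centers = [list(points[0])]
    while len(centers) < k:
        far = max(points, key=lambda p: min(sq_dist(p, c) for c in centers))
        centers.append(list(far))
    return centers

def kmeans(points, k, iters=10):
    """Plain Lloyd iterations; returns the centers and the last assignment."""
    centers = init_centers(points, k)
    groups = [[] for _ in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centers[i]))
            groups[nearest].append(p)
        # Keep the old center when a cluster ends up empty.
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

def build_tree(points, branch, depth, leaves):
    """Recursively split descriptors into `branch` clusters, `depth` levels deep.
    Each leaf is one visual word; `leaves` collects them in id order."""
    node = {"children": []}
    if depth == 0 or len(points) < branch:
        node["word"] = len(leaves)
        leaves.append(node)
        return node
    centers, groups = kmeans(points, branch)
    for center, group in zip(centers, groups):
        if group:
            child = build_tree(group, branch, depth - 1, leaves)
            child["center"] = center
            node["children"].append(child)
    return node

def quantize(tree, descriptor):
    """Descend the tree, following the nearest child center at each level."""
    node = tree
    while node["children"]:
        node = min(node["children"],
                   key=lambda ch: sq_dist(descriptor, ch["center"]))
    return node["word"]

def bag_of_words(tree, descriptors, n_words):
    """Histogram of visual-word occurrences for one image."""
    hist = [0] * n_words
    for d in descriptors:
        hist[quantize(tree, d)] += 1
    return hist

# Toy usage: two well-separated groups of 2-D "descriptors".
train = [[0, 0], [1, 0], [0, 1], [100, 100], [101, 100], [100, 101]]
leaves = []
tree = build_tree(train, branch=2, depth=1, leaves=leaves)
print(bag_of_words(tree, [[0, 0], [1, 1], [99, 99]], len(leaves)))
```

Quantizing a descriptor costs roughly `branch * depth` distance computations rather than one per visual word, which is what makes a tree attractive for large vocabularies. Shorter descriptors make each of those distance computations, and the clustering itself, cheaper.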



Paper Citation


in Harvard Style

Cheng B., Zhuo L., Zhang P. and Zhang J. (2014). Large-scale Image Retrieval based on the Vocabulary Tree. In Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014), ISBN 978-989-758-004-8, pages 299-304. DOI: 10.5220/0004661802990304


in Bibtex Style

@conference{visapp14,
author={Bo Cheng and Li Zhuo and Pei Zhang and Jing Zhang},
title={Large-scale Image Retrieval based on the Vocabulary Tree},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014)},
year={2014},
pages={299-304},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004661802990304},
isbn={978-989-758-004-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014)
TI - Large-scale Image Retrieval based on the Vocabulary Tree
SN - 978-989-758-004-8
AU - Cheng B.
AU - Zhuo L.
AU - Zhang P.
AU - Zhang J.
PY - 2014
SP - 299
EP - 304
DO - 10.5220/0004661802990304