7 CONCLUSION & FUTURE
WORK
In this paper, we have applied different compression
techniques on Fisher vectors derived from restricted
Boltzmann machine to make them amenable for large
scale retrieval problems. We explored four different
dimensionality reduction techniques for Fisher vec-
tors: PCA, spectral hashing, parametric t-SNE and
autoencoder,and found that parametric t-SNE outper-
forms all the other techniques on high dimensional
Fisher vectors. Moreover, the Max-Min normalisa-
tion scheme improves the accuracy of the linear clas-
sifier in Euclidian space.
In the future, we would extend our experiments
to other large scale data sets like PASCAL-VOC (Ev-
eringham et al., 2015) and ImageNet (Russakovsky
et al., 2015) and test the classification performance of
compressed Fisher scores with other competitive clas-
sifiers like support vector machines (SVM). In addi-
tion, we shall also explore if feature selection meth-
ods are much apt than feature compression schemes
to reduce the dimensionality of deep Fisher vectors
for retrieval tasks.
ACKNOWLEDGEMENT
This research was supported by Higher Education
Commission of Pakistan (SRGP: 21-402) & NVIDIA
(Ref.: 281400) with a valuable donation of Titan-X
graphics card.
REFERENCES
Azim, T. and Niranjan, M. (2013). Inducing Discrimina-
tion in Biologically Inspired Models of Visual Scene
Recognition. In IEEE International Workshop on Ma-
chine Learning for Signal Processing (MLSP).
Boureau, Y., Bach, F., LeCun, Y., and Ponce, J. (2010).
Learning Mid-level Features for Recognition. In IEEE
Computer Society Conference on Computer Vision
and Pattern Recognition.
Chatfield, K., Lempitsky, V., Vedaldi, A., and Zisserman, A.
(2011). The Devil is in the Details: An Evaluation of
Recent Feature Encoding Methods. In Proceedings of
the British Machine Vision Conference. BMVA Press.
Csurka, G., Dance, C., Fan, L., Willamowski, J., and Bray,
C. (2004). Visual Categorization with Bags of Key-
points. In Workshop on Statistical Learning in Com-
puter Vision, ECCV. Prague.
Everingham, M., Eslami, S., Van Gool, L., Williams, C.,
Winn, J., and Zisserman, A. (2015). The Pascal Visual
Object Classes Challenge: A Retrospective. Interna-
tional Journal of Computer Vision, 111(1):98–136.
Farquhar, J., Szedmak, S., Meng, H., and Shawe-Taylor, J.
(2005). Improving Bag-of-Keypoints Image Categori-
sation: Generative Models and PDF-Kernels.
Hinton, G. (2002). Training Products of Experts by Mini-
mizing Contrastive Divergence. Neural Computation,
14:1771–1800.
Hinton, G. (2010). A Practical Guide to Training Restricted
Boltzmann Machines. Technical report.
Hinton, G. and Salakhutdinov, R. (2006). Reducing the Di-
mensionality of Data with Neural Networks. Science.
Jaakkola, T. and Haussler, D. (1998). Exploiting Genera-
tive Models in Discriminative Classifiers. In Advances
in Neural Information Processing Systems 11, pages
487–493. MIT Press.
Jayalakshmi, T. and Santhakumaran, A. (2011). Statistical
Normalization and Back Propagation for Classifica-
tion. International Journal of Computer Theory and
Engineering.
Lazebnik, S., Schmid, C., and Ponce, J. (2006). Beyond
Bags of Features: Spatial Pyramid Matching for Rec-
ognizing Natural Scene Categories. In IEEE Com-
puter Society Conference on Computer Vision and
Pattern Recognition (CVPR’06). IEEE.
Maaten, L. (2009). Learning a Parametric Embedding by
Preserving Local Structure. RBM.
Maaten, L. (2011). Learning Discriminative Fisher Kernels.
In Getoor, L. and Scheffer, T., editors, Proceedings of
the 28th International Conference on Machine Learn-
ing, ICML 2011, Bellevue, Washington, USA, June 28
- July 2, 2011, pages 217–224. Omnipress.
Pearson, K. (1901). On Lines and Planes of Closest Fit to
Systems of Points in Space. Philosophical Magazine,
2:559–572.
Perronnin, F. and Dance, C. (2007). Fisher Kernels on Vi-
sual Vocabularies for Image Categorization. In IEEE
Conference on Computer Vision and Pattern Recogni-
tion. IEEE.
Perronnin, F., Dance, C., Csurka, G., and Bressan, M.
(2006). Adapted Vocabularies for Generic Visual Cat-
egorization. Springer Berlin Heidelberg.
Perronnin, F. and Larlus, D. (2015). Fisher Vectors Meet
Neural Networks: A Hybrid Classification Architec-
ture. In Proceedings of the IEEE Conference on Com-
puter Vision and Pattern Recognition. CVPR.
Perronnin, F., S´anchez, J., and Mensink, T. (2010). Improv-
ing the Fisher Kernel for Large-scale Image Classifi-
cation. In European Conference on Computer Vision.
Springer.
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh,
S., Sean, M., Zhiheng, H., Karpathy, A., Khosla, A.,
Bernstein, M., Berg, A., and Fei-Fei, L. (2015). Im-
ageNet Large Scale Visual Recognition Challenge.
International Journal of Computer Vision (IJCV),
115(3):211–252.
Sanchez, J., Perronnin, F., Mensink, T., and Verbeek, J.
(2013). Compressed Fisher Vectors for Large-scale
Image Classification. IJCV.
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., and Gong, Y.
(2010). Locality-constrained Linear Coding for Im-