responding and non-corresponding patches properly.
We conclude that not always a CNN based model
will be able to find a spatial binary tests distribu-
tion to minimize the distances between correspon-
ding keypoints and maximize the distances between
non-corresponding keypoints. The results presented
do not prove the impossibility of using Convolutio-
nal Neural Networks to select binary tests, but they
clarify some limitations when using the local pixel’s
neighborhood.
ACKNOWLEDGEMENTS
The authors would like to thank the agencies CAPES,
CNPq, and FAPEMIG for funding different parts of
this work.
REFERENCES
Balntas, V., Tang, L., and Mikolajczyk, K. (2015). Bold
- binary online learned descriptor for efficient image
matching. In The IEEE Conference on Computer Vi-
sion and Pattern Recognition (CVPR).
Bay, H., Tuytelaars, T., and Gool, L. V. (2006). Surf: Spee-
ded up robust features. In In ECCV, pages 404–417.
Calonder, M., Lepetit, V., Strecha, C., and Fua, P. (2010).
Brief: Binary robust independent elementary features.
In Proceedings of the 11th European Conference on
Computer Vision, pages 778–792.
Chollet, F. et al. (2015). Keras.
Dalal, N. and Triggs, B. (2005). Histograms of oriented
gradients for human detection. In Proceedings of the
2005 IEEE Computer Society Conference on Compu-
ter Vision and Pattern Recognition (CVPR’05) - Vo-
lume 1 - Volume 01, pages 886–893.
Duan, Y., Lu, J., Wang, Z., Feng, J., and Zhou, J.
(2017). Learning deep binary descriptor with multi-
quantization. In The IEEE Conference on Computer
Vision and Pattern Recognition (CVPR).
Leutenegger, S., Chli, M., and Siegwart, R. Y. (2011).
Brisk: Binary robust invariant scalable keypoints. In
Proceedings of the 2011 International Conference on
Computer Vision, pages 2548–2555.
Lin, K., Lu, J., Chen, C.-S., and Zhou, J. (2016). Learning
compact binary descriptors with unsupervised deep
neural networks. In IEEE Conference on Computer
Vision and Pattern Recognition (CVPR).
Lowe, D. G. (2004). Distinctive image features from scale-
invariant keypoints. Int. J. Comput. Vision, pages 91–
110.
Ortiz, R. (2012). Freak: Fast retina keypoint. In Pro-
ceedings of the 2012 IEEE Conference on Computer
Vision and Pattern Recognition (CVPR), pages 510–
517.
Ramnath, K. and Zitnick, C. L. (2011). Edge foci inte-
rest points. In 2011 IEEE International Conference
on Computer Vision (ICCV 2011)(ICCV), pages 359–
366.
Rublee, E., Rabaud, V., Konolige, K., and Bradski, G.
(2011). Orb: An efficient alternative to sift or surf.
In Proceedings of the 2011 International Conference
on Computer Vision, pages 2564–2571.
Simo-Serra, E., Trulls, E., Ferraz, L., Kokkinos, I., Fua, P.,
and Moreno-Noguer, F. (2015). Discriminative lear-
ning of deep convolutional feature point descriptors.
In 2015 IEEE International Conference on Computer
Vision (ICCV), pages 118–126.
Simonyan, K. and Zisserman, A. (2015). Very deep convo-
lutional networks for large-scale image recognition. In
In Proc. of International Conference on Learning Re-
presentations (ICLR).
Theano Development Team (2016). Theano: A Python fra-
mework for fast computation of mathematical expres-
sions.
Winder, S. and Brown, M. (2007). Learning local image
descriptors. In Proceedings of the International Con-
ference on Computer Vision and Pattern Recognition
(CVPR07).
Xu, X., Tian, L., Feng, J., and Zhou, J. (2014). OSRI: A
rotationally invariant binary descriptor. IEEE Trans.
Image Processing, pages 2983–2995.
Yi, K. M., Verdie, Y., Fua, P., and Lepetit, V. (2016). Le-
arning to Assign Orientations to Feature Points. In
Proceedings of the Computer Vision and Pattern Re-
cognition.
Exploring the Limitations of the Convolutional Neural Networks on Binary Tests Selection for Local Features
271