ages, which might be the reason for SIFT exhibiting
inferior results to all of the other descriptors. Due to
multi-scale detection being applied in this case, the
associated Hough parameter space needs one more
dimension to be able to detect objects, compared to
the hough space generated for the other single scale
descriptors, which leads to lower pose estimate ac-
curacy. HexLDB3 works a little better than HexIDB3
due to the extra comparison information from the gra-
dient map. In summary, except SIFT, all of the other
descriptors gave close results in terms of pose estima-
tion, exhibiting an error of approximately half a pixel.
6 CONCLUSION
In this paper, we present two new HBD: HexIDB
and HexLDB descriptors. The new sampling struc-
ture of HBD reduces redundant information being en-
coded by decreasing the frequency of the same image
area being sampled, and produces shorter feature de-
scriptors for the third level of the feature hierarchy,
as compared to HexBinary descriptors. Moreover, a
gradient map is also employed to generate the binary
bits in the same way as the intensity map is encoded.
However, it is not a wise choice to use the gradient
map when the gradient information representing im-
age features has a low SNR. The HBD outperforms
SHexBinary and achieves very promising results com-
pared to fixed-scale U-FREAK and USO-SIFT de-
scriptors (no orientation normalisation). HBD is also
compared to the standard SIFT and a fixed-scale ex-
tracted descriptor US-SIFT within an object pose esti-
mation application. Although the parameters used in
this application are not learned from the training data,
HBD still produces much better performance than the
standard SIFT and shows competitive performance
compared to US-SIFT. In future work, we would like
to investigate dimensionality reduction methods for
HBD to decrease feature storage requirements and
improve its discriminability. We would also like to in-
vestigate the relationship between such hand-crafted
descriptors and those derived through learning tech-
niques, such as Deep Convolutional Neural Networks.
REFERENCES
Alahi, A., Ortiz, R., and Vandergheynst, P. (2012). Freak:
Fast retina keypoint. In Computer Vision and Pat-
tern Recognition (CVPR), 2012 IEEE Conference on,
pages 510–517. IEEE.
Bay, H., Tuytelaars, T., and Van Gool, L. (2006). Surf:
Speeded up robust features. In Computer Vision–
ECCV 2006, pages 404–417. Springer.
Calonder, M., Lepetit, V., Strecha, C., and Fua, P. (2010).
Brief: binary robust independent elementary features.
In Computer Vision–ECCV 2010, pages 778–792.
Springer.
Dalal, N. and Triggs, B. (2005). Histograms of oriented gra-
dients for human detection. In Computer Vision and
Pattern Recognition, 2005. CVPR 2005. IEEE Com-
puter Society Conference on, volume 1, pages 886–
893. IEEE.
Geusebroek, J.-M., Burghouts, G. J., and Smeulders, A. W.
(2005). The amsterdam library of object images. In-
ternational Journal of Computer Vision, 61(1):103–
112.
Ke, Y. and Sukthankar, R. (2004). Pca-sift: A more
distinctive representation for local image descriptors.
In Computer Vision and Pattern Recognition, 2004.
CVPR 2004. Proceedings of the 2004 IEEE Computer
Society Conference on, volume 2, pages II–506. IEEE.
Leutenegger, S., Chli, M., and Siegwart, R. Y. (2011).
Brisk: Binary robust invariant scalable keypoints. In
Computer Vision (ICCV), 2011 IEEE International
Conference on, pages 2548–2555. IEEE.
Liu, Y., Aragon-Camarasa, G., and Siebert, J. P. (2014).
Object edge contour localisation based on hexbinary
feature matching. In 2014 IEEE International Con-
ference on Robotics and Biomimetics (ROBIO), pages
99–106.
Liu, Y. and Siebert, J. P. (2014). Contour localization based
on matching dense hexhog descriptors. In Interna-
tional Conference on Computer Vision Theory and
Applications (VISAPP 2014), pages 656–666.
Lowe, D. G. (2004). Distinctive image features from scale-
invariant keypoints. International journal of computer
vision, 60(2):91–110.
Mikolajczyk, K. and Schmid, C. (2005). A perfor-
mance evaluation of local descriptors. Pattern Analy-
sis and Machine Intelligence, IEEE Transactions on,
27(10):1615–1630.
Rosten, E., Porter, R., and Drummond, T. (2010). Faster and
better: A machine learning approach to corner detec-
tion. Pattern Analysis and Machine Intelligence, IEEE
Transactions on, 32(1):105–119.
Rublee, E., Rabaud, V., Konolige, K., and Bradski, G.
(2011). Orb: an efficient alternative to sift or surf.
In Computer Vision (ICCV), 2011 IEEE International
Conference on, pages 2564–2571. IEEE.
Tola, E., Lepetit, V., and Fua, P. (2010). Daisy: An efficient
dense descriptor applied to wide-baseline stereo. Pat-
tern Analysis and Machine Intelligence, IEEE Trans-
actions on, 32(5):815–830.
Yang, X. and Cheng, K.-T. (2012). Ldb: An ultra-fast fea-
ture for scalable augmented reality on mobile devices.
In Mixed and Augmented Reality (ISMAR), 2012 IEEE
International Symposium on, pages 49–57. IEEE.
VISAPP 2016 - International Conference on Computer Vision Theory and Applications
182