K = 2 K = 4 K = 6
SIFTSIFTextSIFTSIFText
texturednon-textured
Figure 10: Confusion matrices generated using the algorithm
proposed in chapter 3, for various values of the parameter
K
.
As descriptor algorithm SIFT and SIFText were used.
textured non-textured
SIFTSIFText
Figure 11: Confusion matrices generated using RANSAC
to calculate the homography. As descriptor algorithm SIFT
and SIFText were used. The color coding is the same used
in figure 10.
5 CONCLUSIONS
In this paper we have implemented and tested a de-
scriptor architecture, which was designed to detect
non-textured objects as well as textured objects, by
combining edge, shape and color description tech-
niques and weighting them by a locally calculated
value, derived from the texture of the image. Further-
more, a matching algorithm was implemented, that
was designed to mitigate the weak points of the de-
scriptor. In table 1 we have shown that for each object
the number of detected features was increased signifi-
cantly compared to SIFT, while the number of correct
correspondences was increased as well. This applies
to both, the textured and non-textured dataset. We
have further shown, that the precision of detecting
an object’s position and orientation in two images is
increased compared to RANSAC, if the matching al-
gorithm, which was developed here, is used (cf. figure
10 and 11). But even in the case RANSAC is used, the
descriptor surpasses SIFT.
By choosing the weighting factor locally and
adapting the descriptor accordingly features that were
matched incorrectly by the partial descriptors are as-
signed correctly. This architecture is, therefore, well
suited to detect textured and non-textured objects, as
well as objects, that possess textured and non-textured
parts.
REFERENCES
Bay, H., Tuytelaars, T., and van Gool, L. (2006). SURF:
Speeded Up Robust Features. Computer Vision ECCV
2006, 3951:404–417.
Bober, M. (2001). Mpeg-7 visual shape descriptors. Circuits
and Systems for Video Technology, IEEE Transactions
on, 11(6):716–719.
Bulla, C. and Hosten, P. (2013). Detection of false feature
correspondences in feature based object detection sys-
tems. In Proceedings of International Conference on
Image and Vision Computing New Zealand, Wellington,
New Zealand.
Fischler, M. A. and Bolles, R. C. (1981). Random sample
consensus: A paradigm for model fitting with appli-
cations to image analysis and automated cartography.
Commun. ACM, 24(6):381–395.
Forssen, P.-E. and Lowe, D.G. (2007). Shape descriptors
for maximally stable extremal regions. In Proceedings
of IEEE 11th International Conference on Computer
Vision, pages 1–8.
Geusebroek, J.-M., Burghouts, G.J., and Smeulders, A.W.M.
(2005). The amsterdam library of object images. Inter-
national Journal of Computer Vision, 61(1):103–112.
Ljubisa, M. and Sladjana, S. (2006). Orthonormal decompo-
sition of fractal interpolating functions. Series Mathe-
matics and Informatics, 21:1–11.
Lowe, D.G. (2004). Distinctive Image Features from Scale-
Invariant Keypoints. International Journal of Com-
puter Vision, 60(2):91–110.
Matas, J., Chum, O., Urban, M., and Pajdla, T. (2004).
Robust wide-baseline stereo from maximally stable
extremal regions. Image and Vision Computing,
22(10):761 – 767.
Naik, S. and Murthy, C. A. (2007). Distinct multicolored
region descriptors for object recognition. IEEE Trans-
actions on Pattern Analysis and Machine Intelligence,
29(7):1291–1296.
Tombari, F., Franchi, A., and Di Stefano, L. (2013). Bold
features to detect texture-less objects. In Proceedings
VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications
272