Table 2: Number of matches on SIFT and locmax-SIFT descriptors.
Graffiti +Boat +Bark + Bikes
#descr. on image 900+1083 +844 +1255 +597
SIFT 94% (402) 87% (394) 87% (394) 83% (386)
SIFT (PCA) 81% (555) 76% (589) 60% (760) 57% (769)
locmax 89% (411) 83% (415) 78% (426) 74% (432)
Figure 3: Samples from used image dataset [4] (Graffiti, Boat, Bark, Bikes).
method, but to create a lower dimensional descriptor
which is stable enough.
The most of false matches came from textured
regions because the locmax vectors contain only the
most significant parts of SIFT features (e.g. Bark,
see Figure 3). Certainly the dimension reduction
causes loss of information, thus there will be similar
local maxima positions from such areas. Another
problem is the uncertainty of good and false
matches.
In summary, higher precision leads to better
detection rates for object retrieval (Schügerl et al,
2007). The proposed method can improve the
precision rate in the reduced dimensions.
4 CONCLUSIONS
This paper introduced a new type of error-distance
calculation on SIFT descriptors with decreased
dimension. The method uses only a dynamic set of
local maxima of standard feature vectors, and after
calculating a weighted position distance we use the
DTW algorithm for comparing locmax features, and
get a novel metric on descriptors. This makes
possible the unsupervised (non-linear) dimension
reduction which is the key step to construct an
effective search tree in the future.
In future works we will focus on perfecting the
weight function to improve the matching scores,
according to other structural properties of SIFT
descriptors. Another plan is to integrate it in a lower
dimensional tree structure.
REFERENCES
Brown, M. and Lowe, D. G., 2003. Recognizing
Panorama, Proceedings of Ninth IEEE International
Conference on Computer Vision Vol.2 pp. 1218- 1225
Hua, G., Brown, M., Winder, S., 2007. Discriminant
Embedding for Local Image Descriptors, IEEE
International Conference on Computer Vision, Rio de
Janeiro, Brazil, pp. 1-8.
Joliffe, I.T., 1986. Principal Component Analysis.
Springer-Verlag.
Ke, Y., and Sukthankar, R., 2004. PCA-SIFT: A More
Distinctive Representation for Local Image
Descriptors. CVPR 2004. Proceedings of the 2004
IEEE Computer Society Conference on, Vol.2 , pp.
506-513.
Lowe, D., 1999. Object recognition from local scale-
invariant features. In Proceedings of International
Conference on Computer Vision, pp. 1150–1157.
Lowe, D., 2004. Distinctive image features from scale-
invariant keypoints. International Journal of
Computer Vision. pp. 91-110.
Mikolajczyk, K. and Schmid, C., 2001. Indexing based on
scale invariant interest points. In Proceedings of
International Conference on Computer Vision, pp.
525–531.
Mikolajczyk, K. and Schmid, C., 2005. A performance
evaluation of local descriptors. IEEE Transaction on
Pattern Analysis and Machine Intelligence, pp.
27(10):1615–1630.
Myers, C. S. and Rabiner, L. R. 1981. A comparative
study of several dynamic time-warping algorithms for
connected word recognition. The Bell System
Technical Journal, pp. 60(7):1389-1409.
Schügerl, P., Sorschag, R., Bailer, W., Thallinger, G.,
,2007. Object Re-detection Using SIFT and MPEG-7
Color Descriptors, Multimedia Content Analysis and
Mining, Springer-Verlag, pp. 305-314.
LOCMAX SIFT - Non-Statistical Dimension Reduction on Invariant Descriptors
195