AN ALTERNATIVE TO SCALE-SPACE REPRESENTATION
FOR EXTRACTING LOCAL FEATURES IN IMAGE RECOGNITION
Hans Jørgen Andersen and Giang Phuong Nguyen
Media Technology Section, Department of Architecture, Design and Media Technology, Aalborg University,
Niels Jernes Vej 14, Aalborg, Denmark
Keywords:
Local Descriptors, Image Features, Triangular Representation, Image Retrieval, Image Recognition.
Abstract:
In image recognition, the common approach for extracting local features using a scale-space representation
has usually three main steps; first interest points are extracted at different scales, next from a patch around
each interest point the rotation is calculated with corresponding orientation and compensation, and finally a
descriptor is computed for the derived patch (i.e. feature of the patch). To avoid the memory and computational
intensive process of constructing the scale-space, we use a method where no scale-space is required This is
done by dividing the given image into a number of triangles with sizes dependent on the content of the image,
at the location of each triangle. In this paper, we will demonstrate that by rotation of the interest regions at the
triangles it is possible in grey scale images to achieve a recognition precision comparable with that of MOPS.
The test of the proposed method is performed on two data sets of buildings.
1 INTRODUCTION
The use of local features has during the last years
proven as an powerful method for recognition of ob-
jects, places and navigation (Zhou et al., 2009; Koeck
et al., 2005; Zhi et al., 2009; Huang et al., 2009)
The advantages of using local features lead to an in-
creasing number of researches exploring these and a
comprehensive overviews can be found in (Mikola-
jczyk and Schmid, 2005; Tuytelaars and Mikolajczyk,
2008).In this study we will concentrate on its use for
building recognition, using the method recently intro-
duced by the authors (Nguyen and Andersen, 2010).
The ambition is to develop a method that is less com-
putational expense compared to scale-space methods
and hence suitable for implementation on resource
limited devices as mobile phones or tablets.
Up to now, local features is mostly known as
descriptors extracted from areas located at interest-
ing points (Tuytelaars and Mikolajczyk, 2008). This
means that, existing methods first detect interesting
points, for example using Harris corner detector (Har-
ris and Stephens, 1988). Then a patch is drawn which
is centered at the corresponding interest point, and de-
scriptors are computed from each patch. So, the main
issue is how to define the size of the patch. In other
words, how to make these descriptors scale invariant.
To satisfy this requirement, these methods need to lo-
cate a given image at different scales, or so called the
scale-space approach. A given image is represented in
a scale-space using difference of Gaussian and down
sampling (Lowe, 2004; Brown et al., 2005; Nguyen
and Andersen, 2008). The size of a patch depends on
the corresponding scale where the interest point is de-
tected.
In short, the common approach for extracting lo-
cal features using a scale-space representation have
usually three main steps; first extract interest points
at each scale, next from a patch around each interest
point calculate the rotation and compensate for this,
and finally compute a descriptor at each interest point
(i.e. feature of the patch).
In the paper (Nguyen and Andersen, 2010) we
proposed an alternative to the first step approximat-
ing the scale-space using bTree triangular encoding
as introduced by (Distasi et al., 1997). This method
divides the image into triangles of ”homogenous” re-
gions. The process is done automatically and if an
object appears at different scales, the triangular rep-
resentation will adapt to draw the corresponding tri-
angle size. Each triangle may be view upon as an
interest region or ”point”. In our previous study, we
investigated the use of gray scale or color information
for triangle descriptors. We demonstrated that using
color we could achieve an performance in accordance
with the MOPS method (Brown et al., 2005) and that
341
Jørgen Andersen H. and Phuong Nguyen G..
AN ALTERNATIVE TO SCALE-SPACE REPRESENTATION FOR EXTRACTING LOCAL FEATURES IN IMAGE RECOGNITION.
DOI: 10.5220/0003836203410345
In Proceedings of the International Conference on Computer Vision Theory and Applications (VISAPP-2012), pages 341-345
ISBN: 978-989-8565-03-7
Copyright
c
2012 SCITEPRESS (Science and Technology Publications, Lda.)