2 RELATED WORK
Some recent works where shape descriptors are ex-
tracted using all the pixel information within a shape
region include Zernike moments (Kim and Kim,
2000), Legendre moments (Chong et al., 2004), and
generic Fourier descriptor (Zhang and Lu, 2002). The
main limitation of region-based approaches resides in
that only global shape characteristics are captured,
without taking into account important shape details.
Hence, the discriminative power of these approaches
is limited in applications with large intra-class varia-
tions or with databases of considerable size.
Curvature scale space (CSS) (Mokhtarian and
Bober, 2003), multi-scale convexity concavity (MCC)
(Adamek and O’Connor, 2004) and multi-scale
Fourier-based descriptor (Direkoglu and Nixon,
2011) are shape descriptors defined in a multi-scale
space. In CSS and MCC, by changing the sizes
of Gaussian kernels in contour convolution, several
shape approximations of the shape contour at differ-
ent scales are obtained. CSS uses the number of
zero-crossing points at these different scale levels.
In MCC, a curvature measure based on the relative
displacement of a contour point between every two
consecutive scale levels is proposed. The multi-scale
Fourier-based descriptor uses a low-pass Gaussian fil-
ter and a high-pass Gaussian filter, separately, at dif-
ferent scales. The main drawback of multi-scale space
approaches is that determining the optimal parameter
of each scale is a very difficult and application depen-
dent task.
Geometric relationships between sampled contour
points have been exploited effectively for shape de-
scription. Shape context (SC) (Belongie et al., 2002)
finds the vectors of every sample point to all the other
boundary points. The length and orientation of the
vectors are quantized to create a histogram map which
is used to represent each point. To make the histogram
more sensitive to nearby points than to points far-
ther away, these vectors are put into log-polar space.
The triangle-area representation (TAR) (Alajlan et al.,
2007) signature is computed from the area of the tri-
angles formed by the points on the shape boundary.
TAR measures the convexity or concavity of each
sample contour point using the signed areas of trian-
gles formed by contour points at different scales. In
these approaches, the contour of each object is repre-
sented by a fixed number of sample points and when
comparing two shapes, both contours must be repre-
sented by the same fixed number of points. Hence,
how these approaches work under occluded or un-
completed contours is not well-defined. Also, most
of these kind of approaches can only deal with closed
contours and/or assume a one-to-one correspondence
in the matching step.
In addition to shape representations, in order
to improve the performance of shape matching, re-
searchers have also proposed alternative matching
methods designed to get the most out of their shape
representations. In (McNeill and Vijayakumar, 2006),
the authors proposed a hierarchical segment-based
matching method that proceeds in a global to local
direction. The locally constrained diffusion process
proposed in (Yang et al., 2009) uses a diffusion pro-
cess to propagate the beneficial influence that offer
other shapes in the similarity measure of each pair
of shapes. (Bai et al., 2010) replace the original dis-
tances between two shapes with distances induced by
geodesic paths in the shape manifold.
Shape descriptors which only use global or local
information will probably fail in presence of trans-
formations and perturbations of shape contour. Local
descriptors are accurate to represent local shape fea-
tures, however, are very sensitive to noise. On the
other hand, global descriptors are robust to local de-
formations, but can not capture the local details of
the shape contour. In order to balance discrimina-
tive power and robustness, in this work we use lo-
cal features (contour fragments) for shape representa-
tion; later, in the matching step, in a global manner,
the structure and spatial relationships between the ex-
tracted local features are taken into account to com-
pute shapes similarity. To improve matching perfor-
mance, specific characteristics such as scale and ori-
entation of the extracted features are used. The extrac-
tion, description and matching processes are invariant
to rotation, translation and scale changes. In addition,
there is not restriction about only dealing with closed
contours or silhouettes, i.e. the method also extract
features from open contours.
The shape representation method used to de-
scribed our extracted contour fragments is similar to
that of shape context (Belongie et al., 2002). Besides
locality, the main difference between these descrip-
tors is that in (Belongie et al., 2002) the authors ob-
tain a histogram for each point in the contour, while
we only use one histogram for each contour fragment,
i.e. our representation is more compact. Unlike our
proposed method, shape context assumes a one-to-
one correspondence between points in the matching
step, which makes it more sensitive to occlusion.
The main contribution of this paper is a local
shape features extraction, description and matching
schema that i) is invariant to rotation, translation and
scaling, ii) provides a balance between distinctiveness
and robustness thanks to the local character of the ex-
tracted features, which are later matched using global
ICPRAM2014-InternationalConferenceonPatternRecognitionApplicationsandMethods
430