2 RELATED WORKS
A considerable amount of work has been done on ro-
bust local feature descriptors, studying invariance to
orientation, scale, affine transformation, and, recently,
also to generic deformation.
In the work of Mikolajczyck and Schmid (Miko-
lajczyk and Schmid, 2005; Mikolajczyk et al., 2005)
a performance evaluation of several of these local
descriptors is performed both with respect to view-
points and lighting conditions. As final result, this
work reports that the SIFT descriptor, proposed by
Lowe (Lowe, 2004), has the best performance with
images of flat scenes and affine transformations.
These conclusions have been supported also in the
paper by Moreels and Perona (Moreels and Perona,
2005), which generalizes these results to 3D scenes
using images of 3D objects viewed under different
scales, viewpoints, and lighting conditions.
All the local invariant descriptors, investigated so
far, are based on the hypothesis of perspective de-
formation being properly approximated, locally, by
affine transformation; recently, Ling and Jacobs (Ling
and Jacobs, 2005) demonstrated that it is possible also
to construct descriptors invariant to generic deforma-
tion of image subject (e.g., a moving flag). In their
proposal, they suggest to treat the intensity image as
a surface embedded in 3D space, with the third coor-
dinate being proportional to the intensity values, and
then build the descriptor by deformation invariance
geodesic distance in this 3D space.
Stimulated by their work, in this paper we want
demonstrate that it is possible to improve the perfor-
mance of the Geodesic Intensity Histogram (GIH) de-
scriptor by introducing color information. Although
color seems to be a fundamental clue for object recog-
nition in everyday life only few color invariant de-
scriptors have been proposed in the literature. The
work of Van De Weijer and Schmid (van de Weijer
and Schmid, 2006) is an important example from this
point of view. Their results lead to the encouraging
conclusions that a pure color-based approach outper-
forms a shape-based approach only for colorful ob-
jects, while, for the general case, it is possible any
way to outperforms a pure shape-based approach us-
ing a combination of shape and color.
In our work we verify the results achieved by Van
De Weijer and Schmid also for non affine transforma-
tions with the idea that shape-based descriptors can
fail when dealing with generic deformations and com-
bining it with color can improve recognition rates..
3 COLORING GEODESIC
INVARIANT FEATURES
In the geodesic framework, an image can be inter-
preted as a 2D surface in a 3D space, with the third
coordinate being proportional to the pixels intensity
value, with an aspect weight α → 1, and the first two
coordinates proportional to (x, y) (image pixel coordi-
nates) with weight 1 − α.
We define a geodesic level curve as the set of
points at the same geodesic distance from a given in-
terest point; it is possible to capture the joint distribu-
tion of intensity and geodesic distances and summa-
rize it into the so called GIH histogram-based descrip-
tor by sampling pixels with constant geodesic step ∆..
An efficient scheme for the geodesic level curves
computation on discrete pixel grids, was provided
by Sethian with the name of fast marching algo-
rithm (Sethian, 1999). A marching speed F(x,y) is
associated to each pixel x,y and the geodesic distance
T (x,y) can be estimated solving locally the equation
|
∇T
|
F = 1, where
F(x,y) =
1
f (x, y)
=
1
q
(1 − α)
2
+ α
2
I
2
x
+ α
2
I
2
y
. (1)
Although the shape of the resulting region is irregular,
it is covariant with deformation and it has shown in-
teresting results for generic continuous deformations.
Sometimes worst behaviors may occur in correlation
with the presence of isotropic and anisotropic scale
transformations, causing a resampling of pattern con-
tours, but for uniform intensity region, the expansion
is independent from image gradient, mainly depend-
ing on 1 − α value.
3.1 Fast Marching Algorithm in RGB
Space
The first improvement proposed in this paper aims at
modifying region expansion in the fast marching al-
gorithm considering color information
1
. We take into
account each RGB channel separately, computing 3
different inverse marching speeds one for each chan-
nel:
f
r
(x,y)
2
= (1 − α)
2
+ α
2
R
2
x
+ α
2
R
2
y
, (2)
f
g
(x,y)
2
= (1 − α)
2
+ α
2
G
2
x
+ α
2
G
2
y
, (3)
f
b
(x,y)
2
= (1 − α)
2
+ α
2
B
2
x
+ α
2
B
2
y
. (4)
1
We present in this paper only the RGB implementation
for the new fast marching algorithm; we tested also other
color spaces with no meaningful improvements thus we de-
cided to use the most efficient one.
VISAPP 2008 - International Conference on Computer Vision Theory and Applications
228